Agent skills
ai-video-generation

Agent skill

ai-video-generation

Generate AI videos with Google Veo, Seedance, Wan, Grok and 40+ models via inference.sh CLI. Models: Veo 3.1, Veo 3, Seedance 1.5 Pro, Wan 2.5, Grok Imagine Video, OmniHuman, Fabric, HunyuanVideo. Capabilities: text-to-video, image-to-video, lipsync, avatar animation, video upscaling, foley sound. Use for: social media videos, marketing content, explainer videos, product demos, AI avatars. Triggers: video generation, ai video, text to video, image to video, veo, animate image, video from image, ai animation, video generator, generate video, t2v, i2v, ai video maker, create video with ai, runway alternative, pika alternative, sora alternative, kling alternative

View SKILL.md on GitHub Repository

Stars 247

Forks 46

Install this agent skill to your Project

npx add-skill https://github.com/inference-sh/skills/tree/main/tools/video/ai-video-generation

SKILL.md

AI Video Generation

Generate videos with 40+ AI models via inference.sh CLI.

AI Video Generation

Quick Start

Requires inference.sh CLI (infsh). Install instructions

bash

infsh login

# Generate a video with Veo
infsh app run google/veo-3-1-fast --input '{"prompt": "drone shot flying over a forest"}'

Available Models

Text-to-Video

Model	App ID	Best For
Veo 3.1 Fast	`google/veo-3-1-fast`	Fast, with optional audio
Veo 3.1	`google/veo-3-1`	Best quality, frame interpolation
Veo 3	`google/veo-3`	High quality with audio
Veo 3 Fast	`google/veo-3-fast`	Fast with audio
Veo 2	`google/veo-2`	Realistic videos
P-Video	`pruna/p-video`	Fast, economical, with audio support
WAN-T2V	`pruna/wan-t2v`	Economical 480p/720p
Grok Video	`xai/grok-imagine-video`	xAI, configurable duration
Seedance 1.5 Pro	`bytedance/seedance-1-5-pro`	With first-frame control
Seedance 1.0 Pro	`bytedance/seedance-1-0-pro`	Up to 1080p

Image-to-Video

Model	App ID	Best For
Wan 2.5	`falai/wan-2-5`	Animate any image
Wan 2.5 I2V	`falai/wan-2-5-i2v`	High quality i2v
WAN-I2V	`pruna/wan-i2v`	Economical 480p/720p
P-Video	`pruna/p-video`	Fast i2v with audio
Seedance Lite	`bytedance/seedance-1-0-lite`	Lightweight 720p

Avatar / Lipsync

Model	App ID	Best For
OmniHuman 1.5	`bytedance/omnihuman-1-5`	Multi-character
OmniHuman 1.0	`bytedance/omnihuman-1-0`	Single character
Fabric 1.0	`falai/fabric-1-0`	Image talks with lipsync
PixVerse Lipsync	`falai/pixverse-lipsync`	Realistic lipsync

Utilities

Tool	App ID	Description
HunyuanVideo Foley	`infsh/hunyuanvideo-foley`	Add sound effects to video
Topaz Upscaler	`falai/topaz-video-upscaler`	Upscale video quality
Media Merger	`infsh/media-merger`	Merge videos with transitions

Browse All Video Apps

bash

infsh app list --category video

Examples

Text-to-Video with Veo

bash

infsh app run google/veo-3-1-fast --input '{
  "prompt": "A timelapse of a flower blooming in a garden"
}'

Grok Video

bash

infsh app run xai/grok-imagine-video --input '{
  "prompt": "Waves crashing on a beach at sunset",
  "duration": 5
}'

Image-to-Video with Wan 2.5

bash

infsh app run falai/wan-2-5 --input '{
  "image_url": "https://your-image.jpg"
}'

AI Avatar / Talking Head

bash

infsh app run bytedance/omnihuman-1-5 --input '{
  "image_url": "https://portrait.jpg",
  "audio_url": "https://speech.mp3"
}'

Fabric Lipsync

bash

infsh app run falai/fabric-1-0 --input '{
  "image_url": "https://face.jpg",
  "audio_url": "https://audio.mp3"
}'

PixVerse Lipsync

bash

infsh app run falai/pixverse-lipsync --input '{
  "image_url": "https://portrait.jpg",
  "audio_url": "https://speech.mp3"
}'

Video Upscaling

bash

infsh app run falai/topaz-video-upscaler --input '{"video_url": "https://..."}'

Add Sound Effects (Foley)

bash

infsh app run infsh/hunyuanvideo-foley --input '{
  "video_url": "https://silent-video.mp4",
  "prompt": "footsteps on gravel, birds chirping"
}'

Merge Videos

bash

infsh app run infsh/media-merger --input '{
  "videos": ["https://clip1.mp4", "https://clip2.mp4"],
  "transition": "fade"
}'

Related Skills

bash

# Full platform skill (all 150+ apps)
npx skills add inference-sh/skills@infsh-cli

# Pruna P-Video (fast & economical)
npx skills add inference-sh/skills@p-video

# Google Veo specific
npx skills add inference-sh/skills@google-veo

# AI avatars & lipsync
npx skills add inference-sh/skills@ai-avatar-video

# Text-to-speech (for video narration)
npx skills add inference-sh/skills@text-to-speech

# Image generation (for image-to-video)
npx skills add inference-sh/skills@ai-image-generation

# Twitter (post videos)
npx skills add inference-sh/skills@twitter-automation

Browse all apps: infsh app list

Documentation

Running Apps - How to run apps via CLI
Streaming Results - Real-time progress updates
Content Pipeline Example - Building media workflows

Maintainer

inference-sh Core maintainer

Source details

Full Name: inference-sh/skills
Branch: main
Path in repo: tools/video/ai-video-generation

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

inference-sh/skills

agent-ui

Batteries-included agent component for React/Next.js from ui.inference.sh. One component with runtime, tools, streaming, approvals, and widgets built in. Capabilities: drop-in agent, human-in-the-loop, client-side tools, form filling. Use for: building AI chat interfaces, agentic UIs, SaaS copilots, assistants. Triggers: agent component, agent ui, chat agent, shadcn agent, react agent, agentic ui, ai assistant ui, copilot ui, inference ui, human in the loop

247 46

Explore

inference-sh/skills

chat-ui

Chat UI building blocks for React/Next.js from ui.inference.sh. Components: container, messages, input, typing indicators, avatars. Capabilities: chat interfaces, message lists, input handling, streaming. Use for: building custom chat UIs, messaging interfaces, AI assistants. Triggers: chat ui, chat component, message list, chat input, shadcn chat, react chat, chat interface, messaging ui, conversation ui, chat building blocks

247 46

Explore

inference-sh/skills

tools-ui

Tool lifecycle UI components for React/Next.js from ui.inference.sh. Display tool calls: pending, progress, approval required, results. Capabilities: tool status, progress indicators, approval flows, results display. Use for: showing agent tool calls, human-in-the-loop approvals, tool output. Triggers: tool ui, tool calls, tool status, tool approval, tool results, agent tools, mcp tools ui, function calling ui, tool lifecycle, tool pending

247 46

Explore

inference-sh/skills

widgets-ui

Declarative UI widgets from JSON for React/Next.js from ui.inference.sh. Render rich interactive UIs from structured agent responses. Capabilities: forms, buttons, cards, layouts, inputs, selects, checkboxes. Use for: agent-generated UIs, dynamic forms, data display, interactive cards. Triggers: widgets, declarative ui, json ui, widget renderer, agent widgets, dynamic ui, form widgets, card widgets, shadcn widgets, structured output ui

247 46

Explore

inference-sh/skills

web-search

Web search and content extraction with Tavily and Exa via inference.sh CLI. Apps: Tavily Search, Tavily Extract, Exa Search, Exa Answer, Exa Extract. Capabilities: AI-powered search, content extraction, direct answers, research. Use for: research, RAG pipelines, fact-checking, content aggregation, agents. Triggers: web search, tavily, exa, search api, content extraction, research, internet search, ai search, search assistant, web scraping, rag, perplexity alternative

247 46

Explore

inference-sh/skills

ai-rag-pipeline

Build RAG (Retrieval Augmented Generation) pipelines with web search and LLMs. Tools: Tavily Search, Exa Search, Exa Answer, Claude, GPT-4, Gemini via OpenRouter. Capabilities: research, fact-checking, grounded responses, knowledge retrieval. Use for: AI agents, research assistants, fact-checkers, knowledge bases. Triggers: rag, retrieval augmented generation, grounded ai, search and answer, research agent, fact checking, knowledge retrieval, ai research, search + llm, web grounded, perplexity alternative, ai with sources, citation, research pipeline

247 46

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

SKILL.md

AI Video Generation

Quick Start

Available Models

Text-to-Video

Image-to-Video

Avatar / Lipsync

Utilities

Browse All Video Apps

Examples

Text-to-Video with Veo

Grok Video

Image-to-Video with Wan 2.5

AI Avatar / Talking Head

Fabric Lipsync

PixVerse Lipsync

Video Upscaling

Add Sound Effects (Foley)

Merge Videos

Related Skills

Documentation

Recommended Agent Skills

agent-ui

chat-ui

tools-ui

widgets-ui

web-search

ai-rag-pipeline