Agent skill
ai-video-generation
Generate AI videos with Google Veo, Seedance, Wan, Grok and 40+ models via inference.sh CLI. Models: Veo 3.1, Veo 3, Seedance 1.5 Pro, Wan 2.5, Grok Imagine Video, OmniHuman, Fabric, HunyuanVideo. Capabilities: text-to-video, image-to-video, lipsync, avatar animation, video upscaling, foley sound. Use for: social media videos, marketing content, explainer videos, product demos, AI avatars. Triggers: video generation, ai video, text to video, image to video, veo, animate image, video from image, ai animation, video generator, generate video, t2v, i2v, ai video maker, create video with ai, runway alternative, pika alternative, sora alternative, kling alternative
Install this agent skill to your Project
npx add-skill https://github.com/inference-sh/skills/tree/main/tools/video/ai-video-generation
SKILL.md
AI Video Generation
Generate videos with 40+ AI models via inference.sh CLI.
Quick Start
Requires inference.sh CLI (
infsh). Install instructions
infsh login
# Generate a video with Veo
infsh app run google/veo-3-1-fast --input '{"prompt": "drone shot flying over a forest"}'
Available Models
Text-to-Video
| Model | App ID | Best For |
|---|---|---|
| Veo 3.1 Fast | google/veo-3-1-fast |
Fast, with optional audio |
| Veo 3.1 | google/veo-3-1 |
Best quality, frame interpolation |
| Veo 3 | google/veo-3 |
High quality with audio |
| Veo 3 Fast | google/veo-3-fast |
Fast with audio |
| Veo 2 | google/veo-2 |
Realistic videos |
| P-Video | pruna/p-video |
Fast, economical, with audio support |
| WAN-T2V | pruna/wan-t2v |
Economical 480p/720p |
| Grok Video | xai/grok-imagine-video |
xAI, configurable duration |
| Seedance 1.5 Pro | bytedance/seedance-1-5-pro |
With first-frame control |
| Seedance 1.0 Pro | bytedance/seedance-1-0-pro |
Up to 1080p |
Image-to-Video
| Model | App ID | Best For |
|---|---|---|
| Wan 2.5 | falai/wan-2-5 |
Animate any image |
| Wan 2.5 I2V | falai/wan-2-5-i2v |
High quality i2v |
| WAN-I2V | pruna/wan-i2v |
Economical 480p/720p |
| P-Video | pruna/p-video |
Fast i2v with audio |
| Seedance Lite | bytedance/seedance-1-0-lite |
Lightweight 720p |
Avatar / Lipsync
| Model | App ID | Best For |
|---|---|---|
| OmniHuman 1.5 | bytedance/omnihuman-1-5 |
Multi-character |
| OmniHuman 1.0 | bytedance/omnihuman-1-0 |
Single character |
| Fabric 1.0 | falai/fabric-1-0 |
Image talks with lipsync |
| PixVerse Lipsync | falai/pixverse-lipsync |
Realistic lipsync |
Utilities
| Tool | App ID | Description |
|---|---|---|
| HunyuanVideo Foley | infsh/hunyuanvideo-foley |
Add sound effects to video |
| Topaz Upscaler | falai/topaz-video-upscaler |
Upscale video quality |
| Media Merger | infsh/media-merger |
Merge videos with transitions |
Browse All Video Apps
infsh app list --category video
Examples
Text-to-Video with Veo
infsh app run google/veo-3-1-fast --input '{
"prompt": "A timelapse of a flower blooming in a garden"
}'
Grok Video
infsh app run xai/grok-imagine-video --input '{
"prompt": "Waves crashing on a beach at sunset",
"duration": 5
}'
Image-to-Video with Wan 2.5
infsh app run falai/wan-2-5 --input '{
"image_url": "https://your-image.jpg"
}'
AI Avatar / Talking Head
infsh app run bytedance/omnihuman-1-5 --input '{
"image_url": "https://portrait.jpg",
"audio_url": "https://speech.mp3"
}'
Fabric Lipsync
infsh app run falai/fabric-1-0 --input '{
"image_url": "https://face.jpg",
"audio_url": "https://audio.mp3"
}'
PixVerse Lipsync
infsh app run falai/pixverse-lipsync --input '{
"image_url": "https://portrait.jpg",
"audio_url": "https://speech.mp3"
}'
Video Upscaling
infsh app run falai/topaz-video-upscaler --input '{"video_url": "https://..."}'
Add Sound Effects (Foley)
infsh app run infsh/hunyuanvideo-foley --input '{
"video_url": "https://silent-video.mp4",
"prompt": "footsteps on gravel, birds chirping"
}'
Merge Videos
infsh app run infsh/media-merger --input '{
"videos": ["https://clip1.mp4", "https://clip2.mp4"],
"transition": "fade"
}'
Related Skills
# Full platform skill (all 150+ apps)
npx skills add inference-sh/skills@infsh-cli
# Pruna P-Video (fast & economical)
npx skills add inference-sh/skills@p-video
# Google Veo specific
npx skills add inference-sh/skills@google-veo
# AI avatars & lipsync
npx skills add inference-sh/skills@ai-avatar-video
# Text-to-speech (for video narration)
npx skills add inference-sh/skills@text-to-speech
# Image generation (for image-to-video)
npx skills add inference-sh/skills@ai-image-generation
# Twitter (post videos)
npx skills add inference-sh/skills@twitter-automation
Browse all apps: infsh app list
Documentation
- Running Apps - How to run apps via CLI
- Streaming Results - Real-time progress updates
- Content Pipeline Example - Building media workflows
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
agent-ui
Batteries-included agent component for React/Next.js from ui.inference.sh. One component with runtime, tools, streaming, approvals, and widgets built in. Capabilities: drop-in agent, human-in-the-loop, client-side tools, form filling. Use for: building AI chat interfaces, agentic UIs, SaaS copilots, assistants. Triggers: agent component, agent ui, chat agent, shadcn agent, react agent, agentic ui, ai assistant ui, copilot ui, inference ui, human in the loop
chat-ui
Chat UI building blocks for React/Next.js from ui.inference.sh. Components: container, messages, input, typing indicators, avatars. Capabilities: chat interfaces, message lists, input handling, streaming. Use for: building custom chat UIs, messaging interfaces, AI assistants. Triggers: chat ui, chat component, message list, chat input, shadcn chat, react chat, chat interface, messaging ui, conversation ui, chat building blocks
tools-ui
Tool lifecycle UI components for React/Next.js from ui.inference.sh. Display tool calls: pending, progress, approval required, results. Capabilities: tool status, progress indicators, approval flows, results display. Use for: showing agent tool calls, human-in-the-loop approvals, tool output. Triggers: tool ui, tool calls, tool status, tool approval, tool results, agent tools, mcp tools ui, function calling ui, tool lifecycle, tool pending
widgets-ui
Declarative UI widgets from JSON for React/Next.js from ui.inference.sh. Render rich interactive UIs from structured agent responses. Capabilities: forms, buttons, cards, layouts, inputs, selects, checkboxes. Use for: agent-generated UIs, dynamic forms, data display, interactive cards. Triggers: widgets, declarative ui, json ui, widget renderer, agent widgets, dynamic ui, form widgets, card widgets, shadcn widgets, structured output ui
web-search
Web search and content extraction with Tavily and Exa via inference.sh CLI. Apps: Tavily Search, Tavily Extract, Exa Search, Exa Answer, Exa Extract. Capabilities: AI-powered search, content extraction, direct answers, research. Use for: research, RAG pipelines, fact-checking, content aggregation, agents. Triggers: web search, tavily, exa, search api, content extraction, research, internet search, ai search, search assistant, web scraping, rag, perplexity alternative
ai-rag-pipeline
Build RAG (Retrieval Augmented Generation) pipelines with web search and LLMs. Tools: Tavily Search, Exa Search, Exa Answer, Claude, GPT-4, Gemini via OpenRouter. Capabilities: research, fact-checking, grounded responses, knowledge retrieval. Use for: AI agents, research assistants, fact-checkers, knowledge bases. Triggers: rag, retrieval augmented generation, grounded ai, search and answer, research agent, fact checking, knowledge retrieval, ai research, search + llm, web grounded, perplexity alternative, ai with sources, citation, research pipeline
Didn't find tool you were looking for?