tools/video/ai-video-generation/SKILL.md
Generate AI videos with Google Veo, Seedance 2.0, HappyHorse, Wan, Grok and 40+ models via inference.sh CLI. Models: Veo 3.1, Veo 3, Seedance 2.0, HappyHorse 1.0, Wan 2.5, Grok Imagine Video, OmniHuman, Fabric, HunyuanVideo. Capabilities: text-to-video, image-to-video, reference-to-video, video editing, lipsync, avatar animation, video upscaling, foley sound. Use for: social media videos, marketing content, explainer videos, product demos, AI avatars. Triggers: video generation, ai video, text to video, image to video, veo, animate image, video from image, ai animation, video generator, generate video, t2v, i2v, ai video maker, create video with ai, runway alternative, pika alternative, sora alternative, kling alternative, seedance, happyhorse
npx skillsauth add inference-sh-0/skills ai-video-generationInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Install the belt CLI skill:
npx skills add belt-sh/cli
Generate videos with 40+ AI models via inference.sh CLI.

Requires inference.sh CLI (
belt). Install instructions
belt login
# Generate a video with Veo
belt app run google/veo-3-1-fast --input '{"prompt": "drone shot flying over a forest"}'
| Model | App ID | Best For |
|-------|--------|----------|
| Veo 3.1 Fast | google/veo-3-1-fast | Fast, with optional audio |
| Veo 3.1 | google/veo-3-1 | Best quality, frame interpolation |
| Veo 3 | google/veo-3 | High quality with audio |
| Veo 3 Fast | google/veo-3-fast | Fast with audio |
| Veo 2 | google/veo-2 | Realistic videos |
| P-Video | pruna/p-video | Fast, economical, with audio support |
| WAN-T2V | pruna/wan-t2v | Economical 480p/720p |
| Grok Video | xai/grok-imagine-video | xAI, configurable duration |
| Seedance 2.0 | bytedance/seedance-2-0 | Text/image/ref-to-video with sync audio, up to 1080p |
| Seedance 2.0 Fast | bytedance/seedance-2-0-fast | Fast variant, same capabilities |
| HappyHorse T2V | alibaba/happyhorse-1-0-t2v | Physically realistic, up to 15s |
| Model | App ID | Best For |
|-------|--------|----------|
| Wan 2.5 | falai/wan-2-5 | Animate any image |
| Wan 2.5 I2V | falai/wan-2-5-i2v | High quality i2v |
| WAN-I2V | pruna/wan-i2v | Economical 480p/720p |
| P-Video | pruna/p-video | Fast i2v with audio |
| Seedance 2.0 | bytedance/seedance-2-0 | Animate images with sync audio, up to 1080p |
| Seedance 2.0 Fast | bytedance/seedance-2-0-fast | Fast variant, same capabilities |
| HappyHorse I2V | alibaba/happyhorse-1-0-i2v | Animate images, up to 1080P/15s |
| HappyHorse R2V | alibaba/happyhorse-1-0-r2v | Character-preserving from references |
| Model | App ID | Best For |
|-------|--------|----------|
| OmniHuman 1.5 | bytedance/omnihuman-1-5 | Multi-character |
| OmniHuman 1.0 | bytedance/omnihuman-1-0 | Single character |
| Fabric 1.0 | falai/fabric-1-0 | Image talks with lipsync |
| PixVerse Lipsync | falai/pixverse-lipsync | Realistic lipsync |
| Model | App ID | Best For |
|-------|--------|----------|
| HappyHorse Edit | alibaba/happyhorse-1-0-video-edit | Natural language video editing |
| Tool | App ID | Description |
|------|--------|-------------|
| HunyuanVideo Foley | infsh/hunyuanvideo-foley | Add sound effects to video |
| Topaz Upscaler | falai/topaz-video-upscaler | Upscale video quality |
| Media Merger | infsh/media-merger | Merge videos with transitions |
belt app store --category video
belt app run google/veo-3-1-fast --input '{
"prompt": "A timelapse of a flower blooming in a garden"
}'
belt app run xai/grok-imagine-video --input '{
"prompt": "Waves crashing on a beach at sunset",
"duration": 5
}'
belt app run falai/wan-2-5 --input '{
"image_url": "https://your-image.jpg"
}'
belt app run bytedance/omnihuman-1-5 --input '{
"image_url": "https://portrait.jpg",
"audio_url": "https://speech.mp3"
}'
belt app run falai/fabric-1-0 --input '{
"image_url": "https://face.jpg",
"audio_url": "https://audio.mp3"
}'
belt app run bytedance/seedance-2-0 --input '{
"prompt": "a jazz band performing in a dimly lit club",
"generate_audio": true,
"duration": 10
}'
belt app run bytedance/seedance-2-0 --input '{
"image": "https://your-image.jpg",
"prompt": "gentle camera movement, leaves rustling in the wind",
"generate_audio": true
}'
belt app run bytedance/seedance-2-0 --input '{
"prompt": "A person who looks like the reference walking through a garden",
"reference_image": "https://portrait.jpg",
"generate_audio": true
}'
belt app run alibaba/happyhorse-1-0-t2v --input '{
"prompt": "a golden retriever running through autumn leaves, slow motion",
"duration": 10,
"resolution": "1080P"
}'
belt app run alibaba/happyhorse-1-0-video-edit --input '{
"video": "https://your-video.mp4",
"prompt": "change the background to a snowy mountain landscape"
}'
belt app run falai/pixverse-lipsync --input '{
"image_url": "https://portrait.jpg",
"audio_url": "https://speech.mp3"
}'
belt app run falai/topaz-video-upscaler --input '{"video_url": "https://..."}'
belt app run infsh/hunyuanvideo-foley --input '{
"video_url": "https://silent-video.mp4",
"prompt": "footsteps on gravel, birds chirping"
}'
belt app run infsh/media-merger --input '{
"videos": ["https://clip1.mp4", "https://clip2.mp4"],
"transition": "fade"
}'
# Full platform skill (all 250+ apps)
npx skills add inference-sh/skills@infsh-cli
# Pruna P-Video (fast & economical)
npx skills add inference-sh/skills@p-video
# Google Veo specific
npx skills add inference-sh/skills@google-veo
# Seedance 2.0
npx skills add inference-sh/skills@seedance
# HappyHorse 1.0
npx skills add inference-sh/skills@happyhorse
# AI avatars & lipsync
npx skills add inference-sh/skills@ai-avatar-video
# Text-to-speech (for video narration)
npx skills add inference-sh/skills@text-to-speech
# Image generation (for image-to-video)
npx skills add inference-sh/skills@ai-image-generation
# Twitter (post videos)
npx skills add inference-sh/skills@twitter-automation
Browse all apps: belt app store
data-ai
Generate multi-person talking head podcast videos from scratch using AI — character creation, TTS, avatar animation, and video stitching. Use when the user wants to create a podcast, talking head video, or multi-speaker conversation video.
tools
Generate videos with ByteDance Seedance 2.0 via inference.sh CLI. Unified model for text-to-video, image-to-video, and reference-to-video with synchronized audio, up to 1080p, 4-15s duration. Pro and Fast variants. Studio variants with private asset library for portrait consistency. Use for: social media videos, music videos, product demos, animated content, AI video with sound. Triggers: seedance, seedance 2, bytedance video, seedance t2v, seedance i2v, seedance r2v, video with audio, seedance 2.0, bytedance seedance, seedance studio
tools
Generate talking head avatar videos with Pruna P-Video-Avatar via inference.sh CLI. Turn a portrait image into a realistic speaking video with built-in TTS. 18x faster and 6x cheaper than competitors. Models: P-Video-Avatar, P-Image (for portrait generation). Capabilities: text-to-avatar, audio-driven avatars, 30 voices, 10 languages, 720p/1080p, built-in TTS, dynamic backgrounds, full-body control. Use for: AI presenters, product demos, explainer videos, virtual influencers, marketing, education, multilingual content, UGC, gaming avatars. Triggers: avatar video, talking head, ai avatar, p-video-avatar, pruna avatar, video avatar, ai presenter, digital human, virtual presenter, lipsync, talking avatar, ai spokesperson, heygen alternative, synthesia alternative, veed alternative, fabric alternative, omnihuman alternative
tools
Generate and edit videos with Alibaba HappyHorse 1.0 models via inference.sh CLI. Models: HappyHorse T2V, I2V, R2V, Video Edit. Capabilities: text-to-video, image-to-video, reference-to-video, video editing with natural language, character preservation, 720P/1080P, up to 15 seconds. Use for: physically realistic video, video editing, character-consistent content, product demos, social media. Triggers: happyhorse, happy horse, alibaba video, happyhorse 1.0, dashscope video, alibaba happyhorse, video editing ai, ai video editor