tools/video/image-to-video/SKILL.md
Still-to-video conversion guide: model selection, motion prompting, and camera movement. Covers Wan 2.5 i2v, Seedance, Fabric, Grok Video with when to use each. Use for: animating images, creating video from stills, adding motion, product animations. Triggers: image to video, i2v, animate image, still to video, add motion to image, image animation, photo to video, animate still, wan i2v, image2video, bring image to life, animate photo, motion from image
npx skillsauth add inferencesh/skills image-to-videoInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Convert still images to animated videos via inference.sh CLI.
Requires inference.sh CLI (
belt). Install instructions
belt login
# Generate a still image
belt app run falai/flux-dev-lora --input '{
"prompt": "serene mountain lake at sunset, snow-capped peaks reflected in still water, golden hour light, landscape photography",
"width": 1248,
"height": 832
}'
# Animate it
belt app run falai/wan-2-5-i2v --input '{
"prompt": "gentle ripples on the lake surface, clouds slowly drifting, warm light shifting, birds flying in the distance",
"image": "path/to/lake-image.png"
}'
| Model | App ID | Best For | Motion Style |
|-------|--------|----------|-------------|
| Wan 2.5 i2v | falai/wan-2-5-i2v | Realistic motion, natural movement | Photorealistic, subtle |
| WAN-I2V (Pruna) | pruna/wan-i2v | Economical, fast, 480p/720p | Natural, efficient |
| Seedance 1.5 Pro | bytedance/seedance-1-5-pro | Stylized, creative, animation-like | Artistic, expressive |
| Seedance 1.0 Pro | bytedance/seedance-1-0-pro | General purpose, good quality | Balanced |
| Fabric 1.0 | falai/fabric-1-0 | Cloth, fabric, liquid, flowing materials | Physics-based flow |
| Grok Imagine Video | xai/grok-imagine-video | General animation, text-guided | Versatile |
| Scenario | Best Model | Why | |----------|-----------|-----| | Landscape with water/clouds | Wan 2.5 i2v | Best at natural, realistic motion | | Portrait with subtle expression | Wan 2.5 i2v | Maintains face fidelity | | Product with fabric/cloth | Fabric 1.0 | Specialized in material physics | | Flag waving, curtain flowing | Fabric 1.0 | Cloth simulation | | Illustrated/artistic image | Seedance | Matches stylized content | | General "bring to life" | Seedance 1.5 Pro | Good all-rounder | | Quick test/iteration | Seedance 1.0 Lite | Fastest, 720p |
| Movement | Prompt Keyword | Effect | |----------|---------------|--------| | Push in / Dolly forward | "slow dolly forward", "camera pushes in" | Increasing intimacy/focus | | Pull out / Dolly back | "camera pulls back", "slow zoom out" | Reveal, context | | Pan left/right | "camera pans slowly to the right" | Scanning, following | | Tilt up/down | "camera tilts upward" | Revealing height | | Orbit | "camera orbits around the subject" | 3D exploration | | Crane up | "camera rises upward" | Grand reveal | | Static | (no camera movement prompt) | Subject motion only |
| Type | Prompt Examples | |------|----------------| | Natural elements | "water rippling", "clouds drifting", "leaves rustling in breeze" | | Hair/clothing | "hair blowing gently in wind", "dress fabric flowing" | | Atmospheric | "fog slowly rolling", "dust particles floating in light beams" | | Character | "person slowly turns to camera", "subtle breathing motion" | | Mechanical | "gears turning", "clock hands moving" | | Liquid | "coffee steam rising", "paint dripping", "water pouring" |
AI video models produce better results with gentle, subtle motion than dramatic action. Requesting too much movement causes distortion and artifacts.
❌ "person running and jumping over obstacles while the camera spins"
✅ "person slowly walking forward, gentle breeze, camera follows alongside"
❌ "explosion with debris flying everywhere"
✅ "candle flame flickering gently, warm ambient light shifting"
❌ "fast zoom into the eyes with dramatic camera shake"
✅ "slow dolly forward toward the subject, subtle focus shift"
[Camera movement] + [Subject motion] + [Atmospheric effects] + [Mood/pace]
# Landscape animation
belt app run falai/wan-2-5-i2v --input '{
"prompt": "gentle camera pan right, water reflecting moving clouds, trees swaying slightly in breeze, warm golden light, peaceful and slow",
"image": "landscape.png"
}'
# Portrait animation
belt app run falai/wan-2-5-i2v --input '{
"prompt": "subtle breathing motion, slight head turn, natural eye blink, hair moving gently, soft ambient lighting shifts",
"image": "portrait.png"
}'
# Product shot animation
belt app run bytedance/seedance-1-5-pro --input '{
"prompt": "slow 360 degree orbit around the product, gentle spotlight movement, subtle reflections shifting, premium product showcase, smooth motion",
"image": "product.png"
}'
# Fabric/cloth animation
belt app run falai/fabric-1-0 --input '{
"prompt": "fabric flowing and rippling in gentle wind, natural cloth physics, soft movement",
"image": "fabric-scene.png"
}'
# Architectural visualization
belt app run falai/wan-2-5-i2v --input '{
"prompt": "slow dolly forward through the entrance, slight camera tilt upward, ambient light filtering through windows, dust particles in light beams",
"image": "building-interior.png"
}'
| Duration | Quality | Use For | |----------|---------|---------| | 2-3 seconds | Highest quality | GIFs, looping backgrounds, cinemagraphs | | 4-5 seconds | High quality | Social media posts, product reveals | | 6-8 seconds | Good quality | Short clips, transitions | | 10+ seconds | Quality degrades | Avoid unless stitching shorter clips |
For longer videos, generate multiple short clips and stitch:
# Generate 3 clips from the same image with progressive motion
belt app run falai/wan-2-5-i2v --input '{
"prompt": "slow pan left, gentle water motion",
"image": "scene.png"
}' --no-wait
belt app run falai/wan-2-5-i2v --input '{
"prompt": "continuing pan, clouds shifting, light changing",
"image": "scene.png"
}' --no-wait
# Stitch together
belt app run infsh/media-merger --input '{
"media": ["clip1.mp4", "clip2.mp4"]
}'
# 1. Generate source image (best quality)
belt app run bytedance/seedream-4-5 --input '{
"prompt": "cinematic landscape, misty mountains at dawn, lake in foreground, dramatic clouds, golden hour, 4K quality, professional photography",
"size": "2K"
}'
# 2. Animate the image
belt app run falai/wan-2-5-i2v --input '{
"prompt": "gentle mist rolling through the valley, lake surface rippling, clouds slowly moving, birds in distance, warm light shifting",
"image": "landscape.png"
}'
# 3. Upscale video if needed
belt app run falai/topaz-video-upscaler --input '{
"video": "animated-landscape.mp4"
}'
# 4. Add ambient audio
belt app run infsh/hunyuanvideo-foley --input '{
"video": "animated-landscape.mp4",
"prompt": "gentle nature ambience, distant birds, soft wind, water lapping"
}'
# 5. Merge video with audio
belt app run infsh/video-audio-merger --input '{
"video": "upscaled-landscape.mp4",
"audio": "ambient-audio.mp3"
}'
A cinemagraph is a still photo where only one element moves (e.g., waterfall moving in an otherwise frozen scene). To achieve this:
belt app run falai/wan-2-5-i2v --input '{
"prompt": "only the waterfall is moving, everything else remains perfectly still, water cascading smoothly, rest of scene frozen",
"image": "waterfall-scene.png"
}'
| Mistake | Problem | Fix | |---------|---------|-----| | Too much motion requested | Distortion, artifacts, warping | Subtle > dramatic, always | | Wrong model for content type | Poor results | Use selection guide above | | Clips too long (10s+) | Quality degrades significantly | Keep to 3-5 seconds, stitch if needed | | No camera movement specified | Random/unpredictable motion | Always specify camera behavior | | Conflicting motion directions | Chaotic, unnatural | One primary motion direction | | Low-res source image | Low-res video output | Start with highest quality source | | Complex action scenes | Models can't handle | Keep motion simple and natural |
npx skills add inference-sh/skills@ai-video-generation
npx skills add inference-sh/skills@ai-image-generation
npx skills add inference-sh/skills@p-video
npx skills add inference-sh/skills@video-prompting-guide
npx skills add inference-sh/skills@prompt-engineering
Browse all apps: belt app list
tools
Build multi-step AI content creation pipelines combining image, video, audio, and text. Workflow examples: generate image -> animate -> add voiceover -> merge with music. Tools: FLUX, Veo, Kokoro TTS, OmniHuman, media merger, upscaling. Use for: YouTube videos, social media content, marketing materials, automated content. Triggers: content pipeline, ai workflow, content creation, multi-step ai, content automation, ai video workflow, generate and edit, ai content factory, automated content creation, ai production pipeline, media pipeline, content at scale
tools
Build automated AI workflows combining multiple models and services. Patterns: batch processing, scheduled tasks, event-driven pipelines, agent loops. Tools: inference.sh CLI, bash scripting, Python SDK, webhook integration. Use for: content automation, data processing, monitoring, scheduled generation. Triggers: ai automation, workflow automation, batch processing, ai pipeline, automated content, scheduled ai, ai cron, ai batch job, automated generation, ai workflow, content at scale, automation script, ai orchestration
data-ai
Generate multi-person talking head podcast videos from scratch using AI — character creation, TTS, avatar animation, and video stitching. Use when the user wants to create a podcast, talking head video, or multi-speaker conversation video.
tools
Generate AI images with GPT-Image-2, FLUX, Gemini, Grok, Seedream, Reve and 50+ models via inference.sh CLI. Models: GPT-Image-2, FLUX Dev LoRA, FLUX.2 Klein LoRA, Gemini 3 Pro Image, Grok Imagine, Seedream 4.5, Reve, ImagineArt. Capabilities: text-to-image, image-to-image, inpainting, LoRA, image editing, upscaling, text rendering. Use for: AI art, product mockups, concept art, social media graphics, marketing visuals, illustrations. Triggers: flux, image generation, ai image, text to image, stable diffusion, generate image, ai art, midjourney alternative, dall-e alternative, text2img, t2i, image generator, ai picture, create image with ai, generative ai, ai illustration, grok image, gemini image, gpt image, openai image, chatgpt image