guides/video/ai-marketing-videos/SKILL.md
Create AI marketing videos for ads, promos, product launches, and brand content. Models: Veo, Seedance, Wan, FLUX for visuals, Kokoro for voiceover. Types: product demos, testimonials, explainers, social ads, brand videos. Use for: Facebook ads, YouTube ads, product launches, brand awareness. Triggers: marketing video, ad video, promo video, commercial, brand video, product video, explainer video, ad creative, video ad, facebook ad video, youtube ad, instagram ad, tiktok ad, promotional video, launch video
npx skillsauth add skill-zero/s ai-marketing-videosInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Install the belt CLI skill:
npx skills add belt-sh/cli
Create professional marketing videos via inference.sh CLI.

Requires inference.sh CLI (
belt). Install instructions
belt login
# Generate a product promo video
belt app run google/veo-3-1-fast --input '{
"prompt": "Sleek product reveal video, smartphone emerging from light particles, premium tech aesthetic, commercial quality"
}'
| Type | Duration | Platform | |------|----------|----------| | Bumper Ad | 6 seconds | YouTube | | Short Ad | 15 seconds | Instagram, Facebook | | Standard Ad | 30 seconds | YouTube, TV | | Explainer | 60-90 seconds | Website, YouTube | | Product Demo | 30-60 seconds | All platforms |
# Dramatic product reveal
belt app run google/veo-3 --input '{
"prompt": "Cinematic product launch video, premium tech device floating in space, dramatic lighting, particles and light effects, Apple-style reveal, commercial quality"
}'
# Emotional brand narrative
belt app run google/veo-3-1 --input '{
"prompt": "Brand story video showing diverse people connecting through technology, warm color grading, lifestyle montage, emotional and inspiring, commercial"
}'
# Focus on specific feature
belt app run bytedance/seedance-2-0 --input '{
"prompt": "Close-up product feature demonstration, hands interacting with device, clean background, informative, tech commercial style",
"generate_audio": true
}'
# Talking head testimonial
belt app run google/veo-3-1-fast --input '{
"prompt": "Customer testimonial style video, person speaking to camera, neutral office background, professional lighting, authentic feel"
}'
# Transformation reveal
belt app run google/veo-3-1-fast --input '{
"prompt": "Before and after transformation video, split screen transition, dramatic reveal, satisfying comparison, commercial style"
}'
# 1. Opening hook (0-3s)
belt app run google/veo-3-1-fast --input '{
"prompt": "Attention-grabbing opening, product silhouette in dramatic lighting, building anticipation"
}' > hook.json
# 2. Problem statement (3-8s)
belt app run google/veo-3-1-fast --input '{
"prompt": "Frustrated person dealing with common problem, relatable everyday situation, documentary style"
}' > problem.json
# 3. Solution reveal (8-15s)
belt app run google/veo-3-1-fast --input '{
"prompt": "Product reveal with features highlighted, clean demonstration, solving the problem shown before"
}' > solution.json
# 4. Benefits showcase (15-25s)
belt app run google/veo-3-1-fast --input '{
"prompt": "Happy customer using product, lifestyle integration, multiple quick cuts showing benefits"
}' > benefits.json
# 5. Call to action (25-30s)
belt app run google/veo-3-1-fast --input '{
"prompt": "Product hero shot with space for text overlay, professional lighting, commercial ending"
}' > cta.json
# 6. Generate voiceover
belt app run infsh/kokoro-tts --input '{
"prompt": "Tired of [problem]? Introducing [Product]. [Key benefit 1]. [Key benefit 2]. [Key benefit 3]. Get yours today.",
"voice": "af_nicole"
}' > voiceover.json
# 7. Merge all clips with voiceover
belt app run infsh/media-merger --input '{
"videos": ["<hook>", "<problem>", "<solution>", "<benefits>", "<cta>"],
"audio_url": "<voiceover>",
"transition": "crossfade"
}'
# Vertical format, fast-paced
belt app run google/veo-3-1-fast --input '{
"prompt": "Fast-paced product showcase, vertical 9:16, quick cuts, trending style, hook in first 2 seconds, satisfying visually, Gen-Z aesthetic"
}'
# Add trendy music
belt app run infsh/media-merger --input '{
"video_url": "<video>",
"audio_url": "https://trending-music.mp3"
}'
# 1. Write script
belt app run openrouter/claude-sonnet-45 --input '{
"prompt": "Write a 60-second explainer video script for a SaaS product. Include: hook, problem, solution, 3 key features, social proof, CTA. Make it conversational."
}' > script.json
# 2. Generate visuals for each section
SECTIONS=("hook" "problem" "solution" "feature1" "feature2" "feature3" "social_proof" "cta")
for section in "${SECTIONS[@]}"; do
belt app run google/veo-3-1-fast --input "{
\"prompt\": \"Explainer video scene for $section, motion graphics style, clean modern aesthetic, SaaS product\"
}" > "$section.json"
done
# 3. Generate professional voiceover
belt app run infsh/kokoro-tts --input '{
"prompt": "<full-script>",
"voice": "am_michael"
}' > voiceover.json
# 4. Assemble final video
belt app run infsh/media-merger --input '{
"videos": ["<hook>", "<problem>", "<solution>", ...],
"audio_url": "<voiceover>",
"transition": "fade"
}'
belt app run google/veo-3-1-fast --input '{
"prompt": "Square format product video 1:1, eye-catching visuals, works without sound, text-friendly, scroll-stopping"
}'
belt app run google/veo-3-1-fast --input '{
"prompt": "YouTube ad style, skip button awareness (hook in 5 seconds), 16:9, professional commercial quality"
}'
belt app run google/veo-3-1-fast --input '{
"prompt": "Professional B2B product video, corporate style, clean and modern, business audience, subtle motion"
}'
belt app run google/veo-3-1-fast --input '{
"prompt": "TikTok native style ad, vertical 9:16, raw authentic feel, not overly polished, trendy, user-generated content aesthetic"
}'
belt app run google/veo-3-1-fast --input '{
"prompt": "Opening hook: [choose one]
- Surprising visual transformation
- Bold statement text animation
- Relatable problem scenario
- Curiosity gap visual
- Satisfying action"
}'
# Add appropriate music
belt app run infsh/ai-music --input '{
"prompt": "Upbeat commercial background music, modern, energetic, 30 seconds"
}' > music.json
belt app run infsh/media-merger --input '{
"video_url": "<ad-video>",
"audio_url": "<music>",
"audio_volume": 0.5
}'
# Generate multiple creative variants
HOOKS=(
"Problem-focused opening"
"Product reveal opening"
"Testimonial opening"
"Statistic opening"
)
for hook in "${HOOKS[@]}"; do
belt app run google/veo-3-1-fast --input "{
\"prompt\": \"Marketing video with $hook, professional commercial quality\"
}" > "variant_${hook// /_}.json"
done
# Video generation
npx skills add inference-sh/skills@ai-video-generation
# Image generation for thumbnails
npx skills add inference-sh/skills@ai-image-generation
# Text-to-speech for voiceover
npx skills add inference-sh/skills@text-to-speech
# Social media content
npx skills add inference-sh/skills@ai-social-media-content
# Full platform skill
npx skills add inference-sh/skills@infsh-cli
Browse all apps: belt app store
data-ai
Generate multi-person talking head podcast videos from scratch using AI — character creation, TTS, avatar animation, and video stitching. Use when the user wants to create a podcast, talking head video, or multi-speaker conversation video.
development
Build and deploy applications on inference.sh. Use when getting started, understanding the platform, creating apps, configuring resources, or needing an overview of inference.sh app development. Supports both Python and Node.js. Triggers: inference.sh app, belt app, inf.yml, inference.py, inference.js, deploy app, app development, build app, create app, GPU app, VRAM, app resources, app secrets, app integrations, multi-function app
data-ai
Generate multi-person talking head podcast videos from scratch using AI — character creation, TTS, avatar animation, and video stitching. Use when the user wants to create a podcast, talking head video, or multi-speaker conversation video.
tools
Generate videos with ByteDance Seedance 2.0 via inference.sh CLI. Unified model for text-to-video, image-to-video, and reference-to-video with synchronized audio, up to 1080p, 4-15s duration. Pro and Fast variants. Studio variants with private asset library for portrait consistency. Use for: social media videos, music videos, product demos, animated content, AI video with sound. Triggers: seedance, seedance 2, bytedance video, seedance t2v, seedance i2v, seedance r2v, video with audio, seedance 2.0, bytedance seedance, seedance studio