Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

inference-sh/video-ad-specs

Name: video-ad-specs
Author: inference-sh

guides/video/video-ad-specs/SKILL.md

npx skillsauth add inference-sh/agent-skills video-ad-specs

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Video Ad Specs

Create platform-specific video ads via inference.sh CLI.

Quick Start

Requires inference.sh CLI (belt). Install instructions

belt login

# Generate a vertical video ad scene
belt app run bytedance/seedance-1-5-pro --input '{
  "prompt": "vertical video, person excitedly unboxing a product, clean modern room, bright natural lighting, social media ad style, authentic feeling, 9:16 format"
}'

Platform Specifications

TikTok

| Spec | Value | |------|-------| | Aspect ratio | 9:16 (vertical) | | Resolution | 1080 x 1920 px | | Duration | 5-60 seconds (15-30s recommended) | | File size | Max 500 MB | | Format | MP4, MOV | | Sound | On by default (design with sound) | | Text safe zone | 150px from all edges | | Hook window | 1 second — first frame must grab attention |

Instagram Reels

| Spec | Value | |------|-------| | Aspect ratio | 9:16 (vertical) | | Resolution | 1080 x 1920 px | | Duration | Up to 90 seconds (15-30s for ads) | | Cover image | Separate upload, shows in grid | | Sound | On by default | | Caption area | Bottom 20% reserved for text overlay |

Instagram Stories

| Spec | Value | |------|-------| | Aspect ratio | 9:16 | | Resolution | 1080 x 1920 px | | Duration | Up to 15 seconds per segment | | Swipe-up/Link | Available for ads | | Top/bottom | 14% top and 20% bottom = unsafe for key content |

YouTube

| Format | Aspect | Duration | Skip | |--------|--------|----------|------| | Bumper | 16:9 | 6 seconds exactly | Non-skippable | | Non-skippable | 16:9 | 15 seconds | Non-skippable | | Skippable (TrueView) | 16:9 | Any length | Skip after 5 seconds | | Shorts | 9:16 | Up to 60 seconds | N/A |

Resolution: 1920 x 1080 (16:9) or 1080 x 1920 (Shorts)

Facebook Feed

| Spec | Value | |------|-------| | Aspect ratio | 1:1 (square) or 4:5 (recommended for mobile) | | Resolution | 1080 x 1080 or 1080 x 1350 | | Duration | Up to 240 min (15-30s recommended) | | Autoplay | Silent — captions are essential | | Sound | 85% of Facebook video is watched without sound |

| Spec | Value | |------|-------| | Aspect ratio | 1:1 or 16:9 | | Resolution | 1080 x 1080 or 1920 x 1080 | | Duration | 3 seconds to 10 minutes (15-30s for ads) | | Tone | Professional | | Autoplay | Silent in feed |

AIDA Framework for Video Ads

| Phase | Time | Goal | Technique | |-------|------|------|-----------| | Attention | 0-3s | Stop the scroll | Pattern interrupt, bold visual, question | | Interest | 3-10s | Keep watching | State the problem, show relevance | | Desire | 10-20s | Want the solution | Show the product/outcome, social proof | | Action | Final 3-5s | Click/buy/sign up | Clear CTA, urgency, offer |

Hook Techniques (First 3 Seconds)

| Technique | Example | |-----------|---------| | Bold statement | "This tool replaced my entire marketing team" | | Question | "Why are you still doing this manually?" | | Surprising visual | Unexpected transformation, before/after reveal | | Pattern interrupt | Start mid-action, unusual angle, bright color | | Social proof | "2 million people switched to this" | | Pain point | "If you hate [common frustration], watch this" |

Creating Video Ads

Vertical (TikTok, Reels, Stories, Shorts)

# Hook scene (0-3s)
belt app run google/veo-3-1-fast --input '{
  "prompt": "vertical 9:16 video, close-up of hands struggling with tangled cables and messy desk, frustrated energy, shaky handheld camera, authentic social media style, bright lighting"
}'

# Solution reveal (3-15s)
belt app run bytedance/seedance-1-5-pro --input '{
  "prompt": "vertical video, smooth product reveal, clean wireless charging station on minimalist desk, satisfying organization transformation, bright modern room, social media ad aesthetic"
}'

# Add voiceover
belt app run falai/dia-tts --input '{
  "prompt": "[S1] Stop wasting time with this mess. This one product changed my entire setup. Everything charges. Everything is organized. Link in bio."
}'

# Merge video + audio
belt app run infsh/video-audio-merger --input '{
  "video": "solution-reveal.mp4",
  "audio": "voiceover.mp3"
}'

# Add captions (critical for silent autoplay)
belt app run infsh/caption-videos --input '{
  "video": "ad-with-audio.mp4",
  "caption_file": "captions.srt"
}'

Square (Facebook, LinkedIn Feed)

belt app run google/veo-3-1-fast --input '{
  "prompt": "square 1:1 video, professional person at desk discovering a new software tool, laptop screen showing clean dashboard, natural office lighting, corporate commercial style, satisfied expression"
}'

YouTube Bumper (6 Seconds)

# 6-second bumper: one message, one visual, one CTA
belt app run google/veo-3-1-fast --input '{
  "prompt": "6 second product ad, quick montage of a sleek app being used on phone, fast cuts, modern, energetic, brand logo reveal at end, punchy and dynamic, wide 16:9"
}'

# Keep it tight
belt app run falai/dia-tts --input '{
  "prompt": "[S1] Your reports. Automated. Try DataFlow free."
}'

Captions Are Mandatory

85% of Facebook and 40%+ of Instagram video is watched on mute.

Caption Best Practices

| Rule | Reason | |------|--------| | Always add captions | Silent viewing is the default on most platforms | | Large, readable font | Small text is invisible on mobile | | High contrast | White text with dark outline/background | | Centered or bottom-third | Standard viewing position | | Max 2 lines at a time | More text = can't be read fast enough | | Key words in bold/color | Draws eye to important words |

# Generate captions from audio
# (create SRT file from your script, then burn in)
belt app run infsh/caption-videos --input '{
  "video": "ad-video.mp4",
  "caption_file": "ad-captions.srt"
}'

Ad Structure Templates

Testimonial Ad (15-30s)

| Time | Content | |------|---------| | 0-3s | Customer states the problem they had | | 3-15s | How they discovered and tried the product | | 15-25s | The specific result they achieved | | 25-30s | Product name + CTA |

Demo Ad (15-30s)

| Time | Content | |------|---------| | 0-3s | The problem (text or visual) | | 3-20s | Product demo showing the solution | | 20-25s | Key result/benefit | | 25-30s | CTA + offer |

Before/After Ad (15s)

| Time | Content | |------|---------| | 0-3s | "Before" state (messy, slow, frustrating) | | 3-5s | Transition / product introduction | | 5-12s | "After" state (clean, fast, satisfying) | | 12-15s | CTA |

Common Mistakes

| Mistake | Problem | Fix | |---------|---------|-----| | No hook in first 1-3s | Viewer scrolls past | Open with pattern interrupt | | Landscape video on TikTok/Reels | Letterboxed, looks amateur | Use 9:16 for vertical platforms | | No captions | Most viewers watch silent | Always add captions | | CTA too late | Viewers already left | Clear CTA within last 5 seconds | | Too long for platform | Forced skip or dropout | Match platform duration norms | | Same ad for all platforms | Wrong specs, wrong tone | Create platform-specific versions | | Logo in first 3s | Feels like a commercial, gets skipped | Save branding for the end | | Text in unsafe zones | Cut off by platform UI | Check safe zone per platform |

Checklist

[ ] Correct aspect ratio for target platform
[ ] Hook in first 1-3 seconds
[ ] Captions added (readable, high contrast)
[ ] CTA clear and within final 5 seconds
[ ] Duration matches platform norms
[ ] Text outside platform unsafe zones
[ ] Audio designed for both sound-on and sound-off
[ ] Platform-specific version (not one-size-fits-all)

Related Skills

npx skills add inference-sh/skills@ai-video-generation
npx skills add inference-sh/skills@video-prompting-guide
npx skills add inference-sh/skills@text-to-speech
npx skills add inference-sh/skills@prompt-engineering

Browse all apps: belt app list

inference-sh/video-ad-specs

guides/video/video-ad-specs/SKILL.md

Video ad creation with exact platform-specific specs for TikTok, Instagram, YouTube, Facebook, LinkedIn. Covers dimensions, duration limits, AIDA framework, and caption requirements. Use for: video ads, social media ads, paid media creative, video marketing, ad production. Triggers: video ad, social media ad, tiktok ad, instagram ad, youtube ad, facebook ad, linkedin ad, video creative, ad specs, paid media, video marketing, ad production, reels ad, stories ad, pre roll, bumper ad

362 stars

development

Updated Apr 23, 2026

$ install --global

skillsauth

npx skillsauth add inference-sh/agent-skills video-ad-specs

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 23, 2026, 4:16 AM242.5s1 file scanned

SKILL.md

name:: video-ad-specs
description:: Video ad creation with exact platform-specific specs for TikTok, Instagram, YouTube, Facebook, LinkedIn. Covers dimensions, duration limits, AIDA framework, and caption requirements. Use for: video ads, social media ads, paid media creative, video marketing, ad production. Triggers: video ad, social media ad, tiktok ad, instagram ad, youtube ad, facebook ad, linkedin ad, video creative, ad specs, paid media, video marketing, ad production, reels ad, stories ad, pre roll, bumper ad
allowed-tools:: Bash(belt *)

Video Ad Specs

Create platform-specific video ads via inference.sh CLI.

Quick Start

Requires inference.sh CLI (belt). Install instructions

belt login

# Generate a vertical video ad scene
belt app run bytedance/seedance-1-5-pro --input '{
  "prompt": "vertical video, person excitedly unboxing a product, clean modern room, bright natural lighting, social media ad style, authentic feeling, 9:16 format"
}'

Platform Specifications

TikTok

Instagram Reels

Instagram Stories

YouTube

Resolution: 1920 x 1080 (16:9) or 1080 x 1920 (Shorts)

Facebook Feed

AIDA Framework for Video Ads

Hook Techniques (First 3 Seconds)

Creating Video Ads

Vertical (TikTok, Reels, Stories, Shorts)

# Hook scene (0-3s)
belt app run google/veo-3-1-fast --input '{
  "prompt": "vertical 9:16 video, close-up of hands struggling with tangled cables and messy desk, frustrated energy, shaky handheld camera, authentic social media style, bright lighting"
}'

# Solution reveal (3-15s)
belt app run bytedance/seedance-1-5-pro --input '{
  "prompt": "vertical video, smooth product reveal, clean wireless charging station on minimalist desk, satisfying organization transformation, bright modern room, social media ad aesthetic"
}'

# Add voiceover
belt app run falai/dia-tts --input '{
  "prompt": "[S1] Stop wasting time with this mess. This one product changed my entire setup. Everything charges. Everything is organized. Link in bio."
}'

# Merge video + audio
belt app run infsh/video-audio-merger --input '{
  "video": "solution-reveal.mp4",
  "audio": "voiceover.mp3"
}'

# Add captions (critical for silent autoplay)
belt app run infsh/caption-videos --input '{
  "video": "ad-with-audio.mp4",
  "caption_file": "captions.srt"
}'

Square (Facebook, LinkedIn Feed)

belt app run google/veo-3-1-fast --input '{
  "prompt": "square 1:1 video, professional person at desk discovering a new software tool, laptop screen showing clean dashboard, natural office lighting, corporate commercial style, satisfied expression"
}'

YouTube Bumper (6 Seconds)

# 6-second bumper: one message, one visual, one CTA
belt app run google/veo-3-1-fast --input '{
  "prompt": "6 second product ad, quick montage of a sleek app being used on phone, fast cuts, modern, energetic, brand logo reveal at end, punchy and dynamic, wide 16:9"
}'

# Keep it tight
belt app run falai/dia-tts --input '{
  "prompt": "[S1] Your reports. Automated. Try DataFlow free."
}'

Captions Are Mandatory

85% of Facebook and 40%+ of Instagram video is watched on mute.

Caption Best Practices

# Generate captions from audio
# (create SRT file from your script, then burn in)
belt app run infsh/caption-videos --input '{
  "video": "ad-video.mp4",
  "caption_file": "ad-captions.srt"
}'

Ad Structure Templates

Testimonial Ad (15-30s)

Demo Ad (15-30s)

| Time | Content | |------|---------| | 0-3s | The problem (text or visual) | | 3-20s | Product demo showing the solution | | 20-25s | Key result/benefit | | 25-30s | CTA + offer |

Before/After Ad (15s)

Common Mistakes

Checklist

[ ] Correct aspect ratio for target platform
[ ] Hook in first 1-3 seconds
[ ] Captions added (readable, high contrast)
[ ] CTA clear and within final 5 seconds
[ ] Duration matches platform norms
[ ] Text outside platform unsafe zones
[ ] Audio designed for both sound-on and sound-off
[ ] Platform-specific version (not one-size-fits-all)

Related Skills

npx skills add inference-sh/skills@ai-video-generation
npx skills add inference-sh/skills@video-prompting-guide
npx skills add inference-sh/skills@text-to-speech
npx skills add inference-sh/skills@prompt-engineering

Browse all apps: belt app list

Related Skills

inference-sh/remotion-render

development

VerifiedTrustedCommunity

Render videos from React/Remotion component code via inference.sh. Pass TSX code, get MP4. Supports all Remotion APIs: useCurrentFrame, useVideoConfig, spring, interpolate, AbsoluteFill, Sequence. Configurable resolution, FPS, duration, codec. Use for: programmatic video generation, animated graphics, motion design, data-driven videos, React animations to video. Triggers: remotion, render video from code, tsx to video, react video, programmatic video, remotion render, code to video, animated video, motion graphics code, react animation video

362SKILL.mdUpdated Apr 21, 2026

inference-sh/remotion-render

inference-sh/p-video

tools

VerifiedTrustedCommunity

Generate videos with Pruna P-Video and WAN models via inference.sh CLI. Models: P-Video, WAN-T2V, WAN-I2V. Capabilities: text-to-video, image-to-video, audio support, 720p/1080p, fast inference. Pruna optimizes models for speed without quality loss. Triggers: pruna video, p-video, pruna ai video, fast video generation, optimized video, wan t2v, wan i2v, economic video generation, cheap video generation, pruna text to video, pruna image to video

362SKILL.mdUpdated Apr 21, 2026

inference-sh/image-to-video

documentation

VerifiedTrustedCommunity

Still-to-video conversion guide: model selection, motion prompting, and camera movement. Covers Wan 2.5 i2v, Seedance, Fabric, Grok Video with when to use each. Use for: animating images, creating video from stills, adding motion, product animations. Triggers: image to video, i2v, animate image, still to video, add motion to image, image animation, photo to video, animate still, wan i2v, image2video, bring image to life, animate photo, motion from image

362SKILL.mdUpdated Apr 21, 2026

inference-sh/image-to-video

inference-sh/google-veo

tools

VerifiedTrustedCommunity

Generate videos with Google Veo models via inference.sh CLI. Models: Veo 3.1, Veo 3.1 Fast, Veo 3, Veo 3 Fast, Veo 2. Capabilities: text-to-video, cinematic output, high quality video generation. Triggers: veo, google veo, veo 3, veo 2, veo 3.1, vertex ai video, google video generation, google video ai, veo model, veo video

362SKILL.mdUpdated Apr 21, 2026

inference-sh/google-veo

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/inference-sh/agent-skills.git

# Copy into Claude Code skills folder (global)
cp -r agent-skills/guides/video/video-ad-specs ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

inference-sh/agent-skills

362 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT