Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

inference-sh/youtube-thumbnail-design

Name: youtube-thumbnail-design
Author: inference-sh

guides/design/youtube-thumbnail-design/SKILL.md

npx skillsauth add inference-sh/agent-skills youtube-thumbnail-design

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

YouTube Thumbnail Design

Create high-CTR YouTube thumbnails with AI image generation via inference.sh CLI.

Quick Start

Requires inference.sh CLI (belt). Install instructions

belt login

# Generate a thumbnail
belt app run falai/flux-dev-lora --input '{
  "prompt": "YouTube thumbnail style, close-up of a person with surprised excited expression looking at a glowing laptop screen, vibrant blue and orange color scheme, dramatic studio lighting, shallow depth of field, high contrast, cinematic",
  "width": 1280,
  "height": 720
}'

Specifications

| Spec | Value | |------|-------| | Dimensions | 1280 x 720 px (minimum) | | Recommended | 1920 x 1080 px | | Aspect ratio | 16:9 | | Max file size | 2 MB | | Formats | JPG, GIF, PNG |

The 120px Test

Your thumbnail appears at roughly 120px wide on mobile — that's how most viewers first see it.

At 120px, viewers must be able to identify:

The mood/emotion (from colors and expression)
The general subject (from composition)
The text (if any — only if large enough)

Test: view your thumbnail at 120px width. If it's a muddy blur, redesign.

Safe Zones

┌─────────────────────────────────────────────┐
│                                             │
│   ✅ SAFE FOR TEXT AND KEY ELEMENTS         │
│                                             │
│                                             │
│                                             │
│                                             │
│                                       ┌───┐ │
│                                       │ ⏱ │ │ ← Timestamp overlay
│                              ┌────────┴───┘ │    (bottom-right)
│   ┌────┐                     │  DURATION    │
│   │ CH │ Chapter marker      └──────────────│
└───┴────┴────────────────────────────────────┘
     ↑ Bottom-left: chapter/progress markers

Avoid placing critical elements in:

Bottom-right corner (video duration timestamp)
Bottom-left corner (chapter markers, progress bar)
Extreme edges (cropping varies by device)

Color Strategy

High-Contrast Pairs That Work

| Combination | Mood | Best For | |-------------|------|----------| | Yellow + Black | Urgency, attention | Tech, business, lists | | Red + White | Energy, excitement | Entertainment, reactions | | Blue + Orange | Professional contrast | Education, tutorials | | Green + White | Growth, money | Finance, success stories | | Purple + Yellow | Premium, creative | Design, art, creativity | | White + Dark | Clean, minimal | Luxury, minimalist channels |

Color Rules

Background and text/subject should be complementary or high-contrast
Avoid same-temperature colors touching (red on orange = mud)
Use 3 colors maximum per thumbnail
Saturate more than real life — thumbnails compete with bright UI

Text on Thumbnails

When to Use Text

Lists/numbers: "7 Tips", "Top 10"
Strong opinions: "STOP Doing This"
Results: "$10K in 30 Days"
Comparisons: "vs" between two things

When NOT to Use Text

The video title already says it (redundant)
The emotion/visual tells the story
You can't make it large enough to read at 120px

Text Rules

| Rule | Reason | |------|--------| | Max 6 words | Readability at thumbnail size | | Min 60pt equivalent | Must be legible at 120px width | | Bold sans-serif font | Thin fonts disappear at small sizes | | Contrast stroke/shadow | Ensures readability on any background | | No small text | If it's not readable small, cut it |

Face Expression Psychology

Thumbnails with faces get higher CTR than faceless thumbnails. Expression matters:

| Expression | CTR Impact | Best For | |------------|-----------|----------| | Surprise/shock | Highest | Reaction, reveal, discovery content | | Curiosity | High | Tutorial, how-to, tips | | Excitement | High | Unboxing, reviews, announcements | | Concern/worry | Medium-high | Warning, mistake, problem content | | Confidence | Medium | Expert advice, authority content | | Neutral | Lowest | Avoid unless your brand is minimalist |

Face Composition Rules

Face should fill 30-50% of the thumbnail
Eyes looking toward the text or subject (directs viewer attention)
Eyes looking at camera = connection. Eyes looking at object = curiosity.
Place face on one side (usually left), text or subject on the other

# Generate a face-forward thumbnail
belt app run falai/flux-dev-lora --input '{
  "prompt": "close-up portrait of a man with genuinely surprised expression, mouth slightly open, raised eyebrows, looking at camera, left side of frame, vibrant teal background, dramatic rim lighting, YouTube thumbnail style, high contrast, cinematic",
  "width": 1280,
  "height": 720
}'

# Generate a face-looking-at-subject thumbnail
belt app run bytedance/seedream-4-5 --input '{
  "prompt": "person looking amazed at a glowing holographic chart showing upward growth, dramatic blue and green lighting, right side profile view, dark background, tech aesthetic, high energy",
  "size": "2K"
}'

Thumbnail Patterns by Content Type

Tutorial / How-To

belt app run falai/flux-dev-lora --input '{
  "prompt": "overhead flat lay of organized workspace with laptop showing code editor, colorful sticky notes, coffee cup, clean bright background, professional setup, tutorial style composition, warm lighting",
  "width": 1280,
  "height": 720
}'

Before/After

belt app run falai/flux-dev-lora --input '{
  "prompt": "split composition, left side dark and messy disorganized desk, right side bright clean organized minimalist workspace, dramatic contrast between chaos and order, clear dividing line in center, high contrast",
  "width": 1280,
  "height": 720
}'

Product Review / Comparison

belt app run falai/flux-dev-lora --input '{
  "prompt": "two products facing each other with dramatic lighting and sparks between them, competition battle concept, dark background with colorful rim lighting, versus comparison style, high energy, product photography",
  "width": 1280,
  "height": 720
}'

Listicle / Number

belt app run falai/flux-dev-lora --input '{
  "prompt": "dynamic arrangement of 7 different colorful objects floating in space against dark gradient background, each item distinct and clearly separated, energetic composition, vibrant saturated colors, studio lighting",
  "width": 1280,
  "height": 720
}'

A/B Testing

Test one variable at a time:

| Variable | Test A vs B | |----------|-------------| | Face vs No face | Same composition, with/without person | | Expression | Surprise vs curiosity | | Color scheme | Warm vs cool palette | | Text vs No text | With/without text overlay | | Background | Bright vs dark | | Composition | Left-facing vs right-facing subject |

# Generate variant A
belt app run falai/flux-dev-lora --input '{
  "prompt": "..., bright yellow background, ...",
  "width": 1280, "height": 720
}' --no-wait

# Generate variant B (same prompt, different background)
belt app run falai/flux-dev-lora --input '{
  "prompt": "..., dark navy background, ...",
  "width": 1280, "height": 720
}' --no-wait

Thumbnail Checklist

[ ] 1280x720 minimum (1920x1080 preferred)
[ ] Under 2MB file size
[ ] Passes the 120px squint test
[ ] No critical elements in bottom-right (timestamp) or bottom-left (chapter)
[ ] Max 3 colors, high contrast
[ ] Text (if any) is max 6 words, bold, with contrast stroke
[ ] Face expression matches content energy (if applicable)
[ ] Doesn't duplicate the video title
[ ] Stands out from surrounding thumbnails (check your niche)
[ ] Works on both light and dark YouTube backgrounds

Common Mistakes

| Mistake | Problem | Fix | |---------|---------|-----| | Too much text | Unreadable at thumbnail size | Max 6 words or no text | | Low contrast | Disappears in the feed | Use complementary colors | | Cluttered composition | Eye doesn't know where to look | One focal point | | Generic stock photo feel | No personality, gets skipped | Authentic expressions, unique angles | | Tiny details | Lost at 120px | Bold, simple shapes | | Same style every video | Viewer fatigue | Vary within brand guidelines | | Misleading thumbnail | Kills trust, hurts retention | Match the actual content |

Related Skills

npx skills add inference-sh/skills@ai-image-generation
npx skills add inference-sh/skills@image-upscaling
npx skills add inference-sh/skills@prompt-engineering

Browse all apps: belt app list

inference-sh/youtube-thumbnail-design

guides/design/youtube-thumbnail-design/SKILL.md

YouTube thumbnail design with specific dimensions, contrast rules, and mobile preview optimization. Covers safe zones, text placement, face expression psychology, and A/B testing. Use for: YouTube thumbnails, video cover images, click-through optimization. Triggers: youtube thumbnail, thumbnail design, video thumbnail, click through rate, ctr optimization, youtube cover, video cover image, thumbnail maker, thumbnail tips, youtube design, video preview image

362 stars

tools

Updated Apr 23, 2026

$ install --global

skillsauth

npx skillsauth add inference-sh/agent-skills youtube-thumbnail-design

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 23, 2026, 4:07 AM292.4s1 file scanned

SKILL.md

name:: youtube-thumbnail-design
description:: YouTube thumbnail design with specific dimensions, contrast rules, and mobile preview optimization. Covers safe zones, text placement, face expression psychology, and A/B testing. Use for: YouTube thumbnails, video cover images, click-through optimization. Triggers: youtube thumbnail, thumbnail design, video thumbnail, click through rate, ctr optimization, youtube cover, video cover image, thumbnail maker, thumbnail tips, youtube design, video preview image
allowed-tools:: Bash(belt *)

YouTube Thumbnail Design

Create high-CTR YouTube thumbnails with AI image generation via inference.sh CLI.

Quick Start

Requires inference.sh CLI (belt). Install instructions

belt login

# Generate a thumbnail
belt app run falai/flux-dev-lora --input '{
  "prompt": "YouTube thumbnail style, close-up of a person with surprised excited expression looking at a glowing laptop screen, vibrant blue and orange color scheme, dramatic studio lighting, shallow depth of field, high contrast, cinematic",
  "width": 1280,
  "height": 720
}'

Specifications

| Spec | Value | |------|-------| | Dimensions | 1280 x 720 px (minimum) | | Recommended | 1920 x 1080 px | | Aspect ratio | 16:9 | | Max file size | 2 MB | | Formats | JPG, GIF, PNG |

The 120px Test

Your thumbnail appears at roughly 120px wide on mobile — that's how most viewers first see it.

At 120px, viewers must be able to identify:

The mood/emotion (from colors and expression)
The general subject (from composition)
The text (if any — only if large enough)

Test: view your thumbnail at 120px width. If it's a muddy blur, redesign.

Safe Zones

┌─────────────────────────────────────────────┐
│                                             │
│   ✅ SAFE FOR TEXT AND KEY ELEMENTS         │
│                                             │
│                                             │
│                                             │
│                                             │
│                                       ┌───┐ │
│                                       │ ⏱ │ │ ← Timestamp overlay
│                              ┌────────┴───┘ │    (bottom-right)
│   ┌────┐                     │  DURATION    │
│   │ CH │ Chapter marker      └──────────────│
└───┴────┴────────────────────────────────────┘
     ↑ Bottom-left: chapter/progress markers

Avoid placing critical elements in:

Bottom-right corner (video duration timestamp)
Bottom-left corner (chapter markers, progress bar)
Extreme edges (cropping varies by device)

Color Strategy

High-Contrast Pairs That Work

Color Rules

Background and text/subject should be complementary or high-contrast
Avoid same-temperature colors touching (red on orange = mud)
Use 3 colors maximum per thumbnail
Saturate more than real life — thumbnails compete with bright UI

Text on Thumbnails

When to Use Text

Lists/numbers: "7 Tips", "Top 10"
Strong opinions: "STOP Doing This"
Results: "$10K in 30 Days"
Comparisons: "vs" between two things

When NOT to Use Text

The video title already says it (redundant)
The emotion/visual tells the story
You can't make it large enough to read at 120px

Text Rules

Face Expression Psychology

Thumbnails with faces get higher CTR than faceless thumbnails. Expression matters:

Face Composition Rules

Face should fill 30-50% of the thumbnail
Eyes looking toward the text or subject (directs viewer attention)
Eyes looking at camera = connection. Eyes looking at object = curiosity.
Place face on one side (usually left), text or subject on the other

# Generate a face-forward thumbnail
belt app run falai/flux-dev-lora --input '{
  "prompt": "close-up portrait of a man with genuinely surprised expression, mouth slightly open, raised eyebrows, looking at camera, left side of frame, vibrant teal background, dramatic rim lighting, YouTube thumbnail style, high contrast, cinematic",
  "width": 1280,
  "height": 720
}'

# Generate a face-looking-at-subject thumbnail
belt app run bytedance/seedream-4-5 --input '{
  "prompt": "person looking amazed at a glowing holographic chart showing upward growth, dramatic blue and green lighting, right side profile view, dark background, tech aesthetic, high energy",
  "size": "2K"
}'

Thumbnail Patterns by Content Type

Tutorial / How-To

belt app run falai/flux-dev-lora --input '{
  "prompt": "overhead flat lay of organized workspace with laptop showing code editor, colorful sticky notes, coffee cup, clean bright background, professional setup, tutorial style composition, warm lighting",
  "width": 1280,
  "height": 720
}'

Before/After

belt app run falai/flux-dev-lora --input '{
  "prompt": "split composition, left side dark and messy disorganized desk, right side bright clean organized minimalist workspace, dramatic contrast between chaos and order, clear dividing line in center, high contrast",
  "width": 1280,
  "height": 720
}'

Product Review / Comparison

belt app run falai/flux-dev-lora --input '{
  "prompt": "two products facing each other with dramatic lighting and sparks between them, competition battle concept, dark background with colorful rim lighting, versus comparison style, high energy, product photography",
  "width": 1280,
  "height": 720
}'

Listicle / Number

belt app run falai/flux-dev-lora --input '{
  "prompt": "dynamic arrangement of 7 different colorful objects floating in space against dark gradient background, each item distinct and clearly separated, energetic composition, vibrant saturated colors, studio lighting",
  "width": 1280,
  "height": 720
}'

A/B Testing

Test one variable at a time:

# Generate variant A
belt app run falai/flux-dev-lora --input '{
  "prompt": "..., bright yellow background, ...",
  "width": 1280, "height": 720
}' --no-wait

# Generate variant B (same prompt, different background)
belt app run falai/flux-dev-lora --input '{
  "prompt": "..., dark navy background, ...",
  "width": 1280, "height": 720
}' --no-wait

Thumbnail Checklist

[ ] 1280x720 minimum (1920x1080 preferred)
[ ] Under 2MB file size
[ ] Passes the 120px squint test
[ ] No critical elements in bottom-right (timestamp) or bottom-left (chapter)
[ ] Max 3 colors, high contrast
[ ] Text (if any) is max 6 words, bold, with contrast stroke
[ ] Face expression matches content energy (if applicable)
[ ] Doesn't duplicate the video title
[ ] Stands out from surrounding thumbnails (check your niche)
[ ] Works on both light and dark YouTube backgrounds

Common Mistakes

Related Skills

npx skills add inference-sh/skills@ai-image-generation
npx skills add inference-sh/skills@image-upscaling
npx skills add inference-sh/skills@prompt-engineering

Browse all apps: belt app list

Related Skills

inference-sh/remotion-render

development

VerifiedTrustedCommunity

Render videos from React/Remotion component code via inference.sh. Pass TSX code, get MP4. Supports all Remotion APIs: useCurrentFrame, useVideoConfig, spring, interpolate, AbsoluteFill, Sequence. Configurable resolution, FPS, duration, codec. Use for: programmatic video generation, animated graphics, motion design, data-driven videos, React animations to video. Triggers: remotion, render video from code, tsx to video, react video, programmatic video, remotion render, code to video, animated video, motion graphics code, react animation video

362SKILL.mdUpdated Apr 21, 2026

inference-sh/remotion-render

inference-sh/p-video

tools

VerifiedTrustedCommunity

Generate videos with Pruna P-Video and WAN models via inference.sh CLI. Models: P-Video, WAN-T2V, WAN-I2V. Capabilities: text-to-video, image-to-video, audio support, 720p/1080p, fast inference. Pruna optimizes models for speed without quality loss. Triggers: pruna video, p-video, pruna ai video, fast video generation, optimized video, wan t2v, wan i2v, economic video generation, cheap video generation, pruna text to video, pruna image to video

362SKILL.mdUpdated Apr 21, 2026

inference-sh/image-to-video

documentation

VerifiedTrustedCommunity

Still-to-video conversion guide: model selection, motion prompting, and camera movement. Covers Wan 2.5 i2v, Seedance, Fabric, Grok Video with when to use each. Use for: animating images, creating video from stills, adding motion, product animations. Triggers: image to video, i2v, animate image, still to video, add motion to image, image animation, photo to video, animate still, wan i2v, image2video, bring image to life, animate photo, motion from image

362SKILL.mdUpdated Apr 21, 2026

inference-sh/image-to-video

inference-sh/google-veo

tools

VerifiedTrustedCommunity

Generate videos with Google Veo models via inference.sh CLI. Models: Veo 3.1, Veo 3.1 Fast, Veo 3, Veo 3 Fast, Veo 2. Capabilities: text-to-video, cinematic output, high quality video generation. Triggers: veo, google veo, veo 3, veo 2, veo 3.1, vertex ai video, google video generation, google video ai, veo model, veo video

362SKILL.mdUpdated Apr 21, 2026

inference-sh/google-veo

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/inference-sh/agent-skills.git

# Copy into Claude Code skills folder (global)
cp -r agent-skills/guides/design/youtube-thumbnail-design ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

inference-sh/agent-skills

362 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT