Adoption

Agent Skills are supported by leading AI development tools.

VS Code, Gemini CLI, GitHub, Goose, Amp, Cursor, Claude Code, Letta, OpenCode, Claude, OpenAI Codex, Factory

FacuM/youtube-step-extractor

Name: youtube-step-extractor
Author: FacuM

.claude/skills/youtube-step-extractor/SKILL.md

Extract frames from a YouTube video and analyze them to identify a sequence of steps. Use when user provides a YouTube URL and wants to understand the process, tutorial, or workflow shown in the video by examining its visual content frame-by-frame. Triggers on "extract steps from video", "what steps does this video show", "analyze YouTube tutorial", "screenshot a video", "figure out the steps".

documentation

Updated Apr 17, 2026

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

npx skillsauth add FacuM/yolo-agent youtube-step-extractor

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners in report:

Trivy: Clean, 95%. Container and dependency vulnerability scanner.
Semgrep: Clean, 95%. Static code analysis for vulnerabilities.
mcp-scan (Snyk): Clean, 95%. Model Context Protocol security validation.
Snyk (dep): Skipped, 50%. Open source security scanning.
Socket.dev: Skipped, 50%. Supply chain security analysis.
VirusTotal: Skipped, 50%. Multi-engine malware detection.
CrowdStrike: Skipped, 50%. Advanced threat intelligence.
OSV-Scanner: Skipped, 50%. Open Source Vulnerability database check.
OWASP Dep-Check: Skipped, 50%.

Last scanned: Apr 17, 2026, 2:49 AM (5.3s, 4 files scanned)

SKILL.md

name: youtube-step-extractor
description: Extract frames from a YouTube video and analyze them to identify a sequence of steps. Use when user provides a YouTube URL and wants to understand the process, tutorial, or workflow shown in the video by examining its visual content frame-by-frame. Triggers on "extract steps from video", "what steps does this video show", "analyze YouTube tutorial", "screenshot a video", "figure out the steps".

YouTube Step Extractor

Download a YouTube video, extract frames at regular intervals, and analyze them to identify a specific sequence of steps from the visual content.

Prerequisites

Requires yt-dlp and ffmpeg:

# Ubuntu/Debian
sudo apt-get install -y ffmpeg
pip install yt-dlp

# macOS
brew install ffmpeg yt-dlp
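
Before starting the workflow, it can help to confirm that both tools are actually on the PATH. A minimal sketch, assuming a Bash-compatible shell; the helper name missing_tools is hypothetical and not part of the skill:

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of the skill): print each named command
# that cannot be found on PATH, one per line.
missing_tools() {
  local tool
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
  done
}

# Example: prints nothing when both prerequisites are installed.
# missing_tools yt-dlp ffmpeg
```

Any name it prints still needs to be installed with one of the commands above.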

Workflow

Step 1: Download the YouTube video

yt-dlp -f "bestvideo[height<=1080]+bestaudio/best[height<=1080]" \
  -o "/tmp/yt_video.mp4" \
  --merge-output-format mp4 \
  "YOUTUBE_URL"

For faster download (lower quality is fine for frame analysis):

yt-dlp -f "bestvideo[height<=720]+bestaudio/best[height<=720]" \
  -o "/tmp/yt_video.mp4" \
  --merge-output-format mp4 \
  "YOUTUBE_URL"

Step 2: Extract frames

Use the bundled script:

{baseDir}/scripts/extract_frames.sh /tmp/yt_video.mp4 /tmp/yt_frames 1

Arguments:

video_path (required): Path to the downloaded video
output_dir (optional): Where to save frames. Default: ./frames_<video_name>
fps (optional): Frames per second. Default: 1 (one frame per second)
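
The bundled script's source is not shown here. As a rough sketch of what a frame extractor taking these three arguments could look like, the function below wraps ffmpeg's fps filter. The function name, the JPEG quality setting, and the way the default directory name is derived from the video filename are all assumptions, not the skill's actual implementation:

```shell
#!/usr/bin/env bash
# Sketch only: NOT the skill's extract_frames.sh.
extract_frames() {
  local video="$1"
  local name
  name=$(basename "${video%.*}")            # assumed meaning of <video_name>
  local out_dir="${2:-./frames_$name}"
  local fps="${3:-1}"

  [ -f "$video" ] || { echo "video not found: $video" >&2; return 1; }
  mkdir -p "$out_dir" || return 1

  # The fps filter keeps $fps frames per second of video. -q:v 2 is an
  # assumed JPEG quality; %03d yields frame_001.jpg, frame_002.jpg, ...
  ffmpeg -hide_banner -loglevel error -i "$video" \
    -vf "fps=$fps" -q:v 2 "$out_dir/frame_%03d.jpg"
}

# Example:
# extract_frames /tmp/yt_video.mp4 /tmp/yt_frames 0.5
```

With the three-digit padding used here, names stop sorting in time order past 999 frames, which is one more reason to lower fps on long videos.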

For longer videos, reduce fps to avoid too many frames:

# 1 frame every 2 seconds for videos > 5 min
{baseDir}/scripts/extract_frames.sh /tmp/yt_video.mp4 /tmp/yt_frames 0.5

# 1 frame every 5 seconds for videos > 15 min
{baseDir}/scripts/extract_frames.sh /tmp/yt_video.mp4 /tmp/yt_frames 0.2

Or use the all-in-one script:

{baseDir}/scripts/download_and_extract.sh "YOUTUBE_URL" /tmp/yt_frames 1

Step 3: Analyze the frames

1. List the extracted frames: ls /tmp/yt_frames/
2. Read key frames using the image viewing tool
3. For comprehensive analysis, sample frames at regular intervals (e.g., every 5th frame)
4. Identify distinct steps by looking for:
   - Scene/screen transitions
   - UI changes (new dialogs, menus, pages)
   - Text overlays, titles, or captions
   - Actions being performed (clicks, typing, navigation)
   - Before/after states
5. Build a numbered step-by-step summary of the process shown
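
To sample every Nth frame as suggested above, one option is a small helper that lists the frame files in name order and keeps every Nth one. This is a sketch; sample_frames is a hypothetical name, and the frame_*.jpg pattern assumes the naming described in the Output section:

```shell
#!/usr/bin/env bash
# Hypothetical helper: print every Nth frame path (default: every 5th),
# starting with the first frame.
sample_frames() {
  local dir="$1"
  local step="${2:-5}"
  # Glob expansion is sorted by name; zero-padded names keep time order
  # as long as the frame count fits the padding width.
  set -- "$dir"/frame_*.jpg
  [ -e "$1" ] || { echo "no frames found in $dir" >&2; return 1; }
  printf '%s\n' "$@" | awk -v n="$step" '(NR - 1) % n == 0'
}

# Example:
# sample_frames /tmp/yt_frames 5
```

The printed paths can then be opened one by one with the image viewing tool.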

Step 4: (Optional) Extract transcript for context

Subtitles add context to what's visible in the frames:

yt-dlp --write-auto-sub --sub-lang en --skip-download --sub-format vtt \
  -o "/tmp/yt_transcript" "YOUTUBE_URL"

Clean to plain text:

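# Strip blank lines, numeric cue lines, timing lines, and inline tags;
# awk then drops repeated caption lines while preserving their order.
sed -e '/^$/d' -e '/^[0-9]/d' -e '/-->/d' -e 's/<[^>]*>//g' \
  /tmp/yt_transcript.en.vtt | awk '!seen[$0]++' > /tmp/yt_transcript.txt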

Tips

Short videos (<2 min): Use fps=1, review all frames
Medium videos (2-10 min): Use fps=0.5, sample every 3-5 frames
Long videos (>10 min): Use fps=0.2, focus on scene changes
Tutorials/screencasts: Higher fps (1-2) captures more UI transitions
Presentations/talks: Lower fps (0.2-0.5) is sufficient
Combine frame analysis with transcript for best results
Look for: screen transitions, text changes, button clicks, new panels/dialogs
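
The duration-based tips above can be encoded as a small lookup. The sketch below is illustrative only: suggest_fps is a hypothetical helper, and it uses the thresholds from this Tips list (under 2 minutes, 2 to 10 minutes, over 10 minutes). The duration in seconds can be read with ffprobe, which ships with ffmpeg.

```shell
#!/usr/bin/env bash
# Hypothetical helper: map a video duration in seconds to the fps value
# suggested in the Tips list.
suggest_fps() {
  awk -v s="$1" 'BEGIN {
    if (s < 120)       print "1"     # short: under 2 min
    else if (s <= 600) print "0.5"   # medium: 2-10 min
    else               print "0.2"   # long: over 10 min
  }'
}

# Example (ffprobe prints the duration in seconds):
# duration=$(ffprobe -v error -show_entries format=duration \
#   -of default=noprint_wrappers=1:nokey=1 /tmp/yt_video.mp4)
# suggest_fps "$duration"
```

For tutorials or screencasts, the result may be worth overriding with a higher value, as noted in the list.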

Output

The extracted frames are numbered sequentially: frame_001.jpg, frame_002.jpg, etc.

Each frame filename corresponds to its position in time:

At fps=1: frame_001.jpg = ~1s, frame_060.jpg = ~60s
At fps=0.5: frame_001.jpg = ~2s, frame_030.jpg = ~60s
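
The mapping above amounts to dividing the frame number by the fps value. A small sketch of that arithmetic, using a hypothetical helper frame_seconds; the result is approximate, like the "~" values in the examples, and the exact offset of the first frame depends on how the frames were extracted:

```shell
#!/usr/bin/env bash
# Hypothetical helper: approximate timestamp (whole seconds) of a frame,
# computed as frame_number / fps.
frame_seconds() {
  local file
  file=$(basename "$1")
  local num="${file##*_}"   # "frame_060.jpg" -> "060.jpg"
  num="${num%.*}"           # "060.jpg"       -> "060"
  awk -v n="$num" -v fps="$2" 'BEGIN { printf "%.0f\n", n / fps }'
}

# Examples:
# frame_seconds frame_060.jpg 1     # 60
# frame_seconds frame_030.jpg 0.5   # 60
```

This makes it easier to cite roughly where in the video each identified step occurs.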

Cleanup

rm -rf /tmp/yt_video.mp4 /tmp/yt_frames /tmp/yt_transcript*

Related Skills

FacuM/writing-skills (testing): Use when creating new skills, editing existing skills, or verifying skills work before deployment. Updated Apr 17, 2026.

FacuM/Writing Hookify Rules (documentation): This skill should be used when the user asks to "create a hookify rule", "write a hook rule", "configure hookify", "add a hookify rule", or needs guidance on hookify rule syntax and patterns. Updated Apr 17, 2026.

FacuM/writing-plans (development): Use when you have a spec or requirements for a multi-step task, before touching code. Updated Apr 17, 2026.

FacuM/working-with-claude-code (tools): Use when working with Claude Code CLI, plugins, hooks, MCP servers, skills, configuration, or any Claude Code feature. Provides comprehensive official documentation for all aspects of Claude Code. Updated Apr 17, 2026.

Install manually

# Clone the repo
git clone https://github.com/FacuM/yolo-agent.git

# Copy into Claude Code skills folder (global)
cp -r yolo-agent/.claude/skills/youtube-step-extractor ~/.claude/skills/

Repository

FacuM/yolo-agent

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT
