Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

rameerez/transcribe-video

Name: transcribe-video
Author: rameerez

skills/transcribe-video/SKILL.md

npx skillsauth add rameerez/claude-code-startup-skills transcribe-video

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Error

VirusTotalMulti-engine malware detection

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Video Transcription Skill

Generate subtitles and transcripts from $ARGUMENTS (a video or audio file path, optionally followed by a language code like en-US or es-ES) using AWS Transcribe.

Outputs .srt, .vtt, and .txt files next to the source file.

Process

Verify prerequisites - check ffmpeg and aws CLI are installed and configured
Extract audio from the video as MP3 using ffmpeg
Create temporary S3 bucket, upload audio
Run AWS Transcribe job with SRT and VTT subtitle output
Download results and generate plain text transcript
Clean up all AWS resources - delete S3 bucket, Transcribe job, and temp files. No recurring costs.

Prerequisites

ffmpeg installed (brew install ffmpeg)
aws CLI installed and configured with valid credentials (brew install awscli && aws configure)
AWS credentials need permissions for: s3:* (create/delete buckets), transcribe:* (start/delete jobs)

Step-by-Step

Step 1: Extract audio

ffmpeg -i "input.mp4" -vn -acodec mp3 -q:a 2 "/tmp/transcribe-audio.mp3" -y

Step 2: Create temp S3 bucket and upload

BUCKET="tmp-transcribe-$(date +%s)"
aws s3 mb "s3://$BUCKET" --region us-east-1
aws s3 cp "/tmp/transcribe-audio.mp3" "s3://$BUCKET/audio.mp3"

Step 3: Start transcription job

JOB_NAME="tmp-job-$(date +%s)"
aws transcribe start-transcription-job \
  --transcription-job-name "$JOB_NAME" \
  --language-code en-US \
  --media-format mp3 \
  --media "MediaFileUri=s3://$BUCKET/audio.mp3" \
  --subtitles "Formats=srt,vtt" \
  --output-bucket-name "$BUCKET" \
  --region us-east-1

Language codes: en-US, es-ES, fr-FR, de-DE, pt-BR, ja-JP, zh-CN, it-IT, ko-KR, etc. Default to en-US if not specified.

Step 4: Poll until complete

while true; do
  STATUS=$(aws transcribe get-transcription-job \
    --transcription-job-name "$JOB_NAME" \
    --region us-east-1 \
    --query 'TranscriptionJob.TranscriptionJobStatus' \
    --output text)
  if [ "$STATUS" = "COMPLETED" ] || [ "$STATUS" = "FAILED" ]; then break; fi
  sleep 5
done

Step 5: Download subtitle files

Save .srt and .vtt next to the original file:

aws s3 cp "s3://$BUCKET/$JOB_NAME.srt" "/path/to/input.srt"
aws s3 cp "s3://$BUCKET/$JOB_NAME.vtt" "/path/to/input.vtt"

Step 6: Generate plain text transcript

Download the JSON result and extract the full transcript text:

aws s3 cp "s3://$BUCKET/$JOB_NAME.json" "/tmp/transcribe-result.json"

Then use a tool to extract the .results.transcripts[0].transcript field from the JSON and save it as a .txt file next to the original.

Step 7: Clean up everything

IMPORTANT: Always clean up to avoid recurring S3 storage costs.

# Delete S3 bucket and all contents
aws s3 rb "s3://$BUCKET" --force --region us-east-1

# Delete the transcription job
aws transcribe delete-transcription-job --transcription-job-name "$JOB_NAME" --region us-east-1

# Delete temp audio file
rm -f "/tmp/transcribe-audio.mp3" "/tmp/transcribe-result.json"

Real-World Results (Reference)

From actual transcription runs:

| Video | Duration | Audio Size | Transcribe Time | Subtitle Segments | |-------|----------|------------|-----------------|-------------------| | X/Twitter clip | 2:40 | 2.5 MB | ~20 seconds | 83 | | Screen recording | 18:45 | 11.4 MB | ~60 seconds | 500+ |

Key Insights

AWS Transcribe is fast - even 19-minute videos complete in about a minute
Short-form content (tweets, reels) transcribes almost instantly
Cost is negligible - AWS Transcribe charges ~$0.024/min, so a 19-min video costs ~$0.46
Cleanup is critical - always delete the S3 bucket to avoid storage charges
SRT is most compatible - works with most video players and editors; VTT is better for web

Output Files

original-video.mp4
original-video.srt          # Subtitles with timestamps (most compatible)
original-video.vtt          # Web-optimized subtitles (for HTML5 <track>)
original-video.txt          # Plain text transcript (no timestamps)

After Transcription

Verify all output files exist: ls -lh /path/to/original-video.{srt,vtt,txt}
Report the number of subtitle segments and total duration
Confirm all AWS resources have been cleaned up (no S3 buckets, no Transcribe jobs remaining)

rameerez/transcribe-video

skills/transcribe-video/SKILL.md

Generate subtitles (SRT/VTT) and plain text transcripts from video or audio files using AWS Transcribe. Use when creating captions, extracting spoken content, generating transcripts for notes, or making video content searchable.

19 stars

devops

Updated Mar 20, 2026

$ install --global

skillsauth

npx skillsauth add rameerez/claude-code-startup-skills transcribe-video

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Error

VirusTotalMulti-engine malware detection

70%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Mar 20, 2026, 5:42 PM226.5s1 file scanned

SKILL.md

name:: transcribe-video
description:: Generate subtitles (SRT/VTT) and plain text transcripts from video or audio files using AWS Transcribe. Use when creating captions, extracting spoken content, generating transcripts for notes, or making video content searchable.
argument-hint:: [file] [language-code]
allowed-tools:: Bash(ffmpeg:*), Bash(aws:*), Bash(ls:*), Bash(rm:*), Bash(which:*)

Video Transcription Skill

Generate subtitles and transcripts from $ARGUMENTS (a video or audio file path, optionally followed by a language code like en-US or es-ES) using AWS Transcribe.

Outputs .srt, .vtt, and .txt files next to the source file.

Process

Verify prerequisites - check ffmpeg and aws CLI are installed and configured
Extract audio from the video as MP3 using ffmpeg
Create temporary S3 bucket, upload audio
Run AWS Transcribe job with SRT and VTT subtitle output
Download results and generate plain text transcript
Clean up all AWS resources - delete S3 bucket, Transcribe job, and temp files. No recurring costs.

Prerequisites

ffmpeg installed (brew install ffmpeg)
aws CLI installed and configured with valid credentials (brew install awscli && aws configure)
AWS credentials need permissions for: s3:* (create/delete buckets), transcribe:* (start/delete jobs)

Step-by-Step

Step 1: Extract audio

ffmpeg -i "input.mp4" -vn -acodec mp3 -q:a 2 "/tmp/transcribe-audio.mp3" -y

Step 2: Create temp S3 bucket and upload

BUCKET="tmp-transcribe-$(date +%s)"
aws s3 mb "s3://$BUCKET" --region us-east-1
aws s3 cp "/tmp/transcribe-audio.mp3" "s3://$BUCKET/audio.mp3"

Step 3: Start transcription job

JOB_NAME="tmp-job-$(date +%s)"
aws transcribe start-transcription-job \
  --transcription-job-name "$JOB_NAME" \
  --language-code en-US \
  --media-format mp3 \
  --media "MediaFileUri=s3://$BUCKET/audio.mp3" \
  --subtitles "Formats=srt,vtt" \
  --output-bucket-name "$BUCKET" \
  --region us-east-1

Language codes: en-US, es-ES, fr-FR, de-DE, pt-BR, ja-JP, zh-CN, it-IT, ko-KR, etc. Default to en-US if not specified.

Step 4: Poll until complete

while true; do
  STATUS=$(aws transcribe get-transcription-job \
    --transcription-job-name "$JOB_NAME" \
    --region us-east-1 \
    --query 'TranscriptionJob.TranscriptionJobStatus' \
    --output text)
  if [ "$STATUS" = "COMPLETED" ] || [ "$STATUS" = "FAILED" ]; then break; fi
  sleep 5
done

Step 5: Download subtitle files

Save .srt and .vtt next to the original file:

aws s3 cp "s3://$BUCKET/$JOB_NAME.srt" "/path/to/input.srt"
aws s3 cp "s3://$BUCKET/$JOB_NAME.vtt" "/path/to/input.vtt"

Step 6: Generate plain text transcript

Download the JSON result and extract the full transcript text:

aws s3 cp "s3://$BUCKET/$JOB_NAME.json" "/tmp/transcribe-result.json"

Then use a tool to extract the .results.transcripts[0].transcript field from the JSON and save it as a .txt file next to the original.

Step 7: Clean up everything

IMPORTANT: Always clean up to avoid recurring S3 storage costs.

# Delete S3 bucket and all contents
aws s3 rb "s3://$BUCKET" --force --region us-east-1

# Delete the transcription job
aws transcribe delete-transcription-job --transcription-job-name "$JOB_NAME" --region us-east-1

# Delete temp audio file
rm -f "/tmp/transcribe-audio.mp3" "/tmp/transcribe-result.json"

Real-World Results (Reference)

From actual transcription runs:

Key Insights

AWS Transcribe is fast - even 19-minute videos complete in about a minute
Short-form content (tweets, reels) transcribes almost instantly
Cost is negligible - AWS Transcribe charges ~$0.024/min, so a 19-min video costs ~$0.46
Cleanup is critical - always delete the S3 bucket to avoid storage charges
SRT is most compatible - works with most video players and editors; VTT is better for web

Output Files

original-video.mp4
original-video.srt          # Subtitles with timestamps (most compatible)
original-video.vtt          # Web-optimized subtitles (for HTML5 <track>)
original-video.txt          # Plain text transcript (no timestamps)

After Transcription

Verify all output files exist: ls -lh /path/to/original-video.{srt,vtt,txt}
Report the number of subtitle segments and total duration
Confirm all AWS resources have been cleaned up (no S3 buckets, no Transcribe jobs remaining)

Related Skills

rameerez/x-post

content-media

VerifiedTrustedCommunity

Post to X (Twitter) from the command line. Text, images, and video.

19SKILL.mdUpdated Mar 20, 2026

rameerez/download-video

testing

VerifiedTrustedCommunity

Download videos from social media URLs (X/Twitter, YouTube, Instagram, TikTok, etc.) using yt-dlp. Use when saving a video locally, extracting content for transcription, or archiving video references.

19SKILL.mdUpdated Mar 20, 2026

rameerez/download-video

rameerez/customer-empathy

content-media

VerifiedTrustedCommunity

Deep-dive into customer empathy and user journey thinking. Use when designing onboarding, improving UX, planning features, or trying to understand how to delight users faster.

19SKILL.mdUpdated Mar 20, 2026

rameerez/customer-empathy

rameerez/compress-images

development

VerifiedTrustedCommunity

Compress images for web/SEO performance using cwebp. Use when optimizing images for faster page loads, reducing file sizes, or converting JPG/PNG to WebP format.

19SKILL.mdUpdated Mar 20, 2026

rameerez/compress-images

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/rameerez/claude-code-startup-skills.git

# Copy into Claude Code skills folder (global)
cp -r claude-code-startup-skills/skills/transcribe-video ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

rameerez/claude-code-startup-skills

19 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT