skills/transcribe-clip/SKILL.md
Transcribe a video clip using Gemini to get timestamped segments for captions
npx skillsauth add nuva-lab/vibecut transcribe-clip
Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
Use this skill to generate a timestamped transcript of a video clip for captions. Run the script with the video path and, optionally, a path for the output JSON:
python skills/transcribe-clip/transcribe.py <video_path> [output_json]
# Example
python skills/transcribe-clip/transcribe.py clip.mp4
python skills/transcribe-clip/transcribe.py clip.mp4 transcript.json
{
  "segments": [
    {"start": 0.0, "end": 3.5, "text": "First sentence of dialogue."},
    {"start": 3.5, "end": 7.2, "text": "Second sentence continues here."}
  ],
  "full_text": "Complete transcript..."
}
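Since the transcript is meant for captions, the sketch below shows one way the `segments` array could be turned into SRT caption text. It is not part of the skill: the helper names `format_timestamp` and `segments_to_srt` are hypothetical, and it assumes `start` and `end` are given in seconds, as the sample values suggest.

```python
def format_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(transcript: dict) -> str:
    """Build SRT text from a transcript dict shaped like the JSON above.

    Assumes each segment has "start", "end" (seconds) and "text" keys.
    """
    blocks = []
    for index, seg in enumerate(transcript["segments"], start=1):
        start = format_timestamp(seg["start"])
        end = format_timestamp(seg["end"])
        # One SRT cue: counter, time range, caption text.
        blocks.append(f"{index}\n{start} --> {end}\n{seg['text'].strip()}\n")
    # Cues are separated by a blank line.
    return "\n".join(blocks)


# Sample data copied from the output format shown above.
sample = {
    "segments": [
        {"start": 0.0, "end": 3.5, "text": "First sentence of dialogue."},
        {"start": 3.5, "end": 7.2, "text": "Second sentence continues here."},
    ],
    "full_text": "Complete transcript...",
}

srt_text = segments_to_srt(sample)
print(srt_text)
```

To use it on real output, load the file with `json.load` (for example the `transcript.json` written in the usage example) and pass the resulting dict to `segments_to_srt`.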