skills/chunk-process/SKILL.md
# Chunk Process Skill Smart video chunking and MLX-accelerated transcription for long-form content. ## Problem Solved - Raw footage too long for single Gemini upload (~47 min = 5GB+) - Need word-level timestamps for precise cutting - Fixed-length chunks break mid-sentence ## Smart Chunking Instead of fixed 5-minute segments, `smart_chunk.py` finds natural break points: ```bash python skills/chunk-process/smart_chunk.py raw_footage.mp4 -o chunks/ ``` **How it works:** 1. Detect silence regi
npx skillsauth add nuva-lab/vibecut skills/chunk-processInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Smart video chunking and MLX-accelerated transcription for long-form content.
Instead of fixed 5-minute segments, smart_chunk.py finds natural break points:
python skills/chunk-process/smart_chunk.py raw_footage.mp4 -o chunks/
How it works:
chunk_001.mp4, chunk_002.mp4, ...Options:
--min-chunk 150: Minimum chunk length (seconds)--max-chunk 210: Maximum chunk length (seconds)--silence-thresh -40: Silence detection threshold (dB)mlx_transcribe.py uses MLX-accelerated Qwen3-ASR for fast transcription on Mac:
# Single file
python skills/chunk-process/mlx_transcribe.py audio.wav -o transcript.json
# Batch process chunks
python skills/chunk-process/mlx_transcribe.py chunks/ --batch --word-timestamps
Features:
transcript.json with segments + words# 1. Smart chunk the video
python skills/chunk-process/smart_chunk.py raw.mp4 -o chunks/
# 2. Transcribe all chunks
python skills/chunk-process/mlx_transcribe.py chunks/ --batch --word-timestamps
# Output: chunks/transcript.json (merged from all chunks)
{
"language": "English",
"segments": [
{"text": "First sentence.", "start": 0.0, "end": 2.5},
{"text": "Second sentence.", "start": 2.5, "end": 5.0}
],
"words": [
{"text": "First", "start": 0.0, "end": 0.3},
{"text": "sentence", "start": 0.3, "end": 0.8}
],
"full_text": "First sentence. Second sentence..."
}
| Fixed Chunks | Smart Chunks | |--------------|--------------| | Breaks mid-word | Breaks at pauses | | 5 min arbitrary | 2.5-3.5 min natural | | Hard cuts | Clean transitions | | Timestamp gaps | Continuous timeline |
tools
Generate voiceover scripts in Joyce's style for video clips
tools
Clone a voice using qwen3-tts and generate speech from text
development
# Validate Media Skill Pre-flight media validation and diagnostics using ffprobe. ## Purpose Check video/audio files for common issues before rendering: - Duration mismatches between video and audio tracks - Missing audio tracks - Codec compatibility - Volume levels - Potential freeze points ## Usage ```bash python skills/validate-media/validate.py <video_file> [--verbose] ``` ## Output JSON report with issues and recommendations: ```json { "file": "video.mp4", "video_duration": 35.1
tools
Transcribe a video clip using Gemini to get timestamped segments for captions