skills/voice-clone/SKILL.md
Clone a voice using qwen3-tts and generate speech from text
npx skillsauth add nuva-lab/vibecut voice-clone
Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
Use this skill to clone a speaker's voice and generate text-to-speech audio.
python skills/voice-clone/clone.py <audio_sample.wav> [--transcript "text"]
Creates a speaker embedding file that can be reused.
python skills/voice-clone/speak.py <embedding.safetensors> "Text to speak"
Generates audio using the cloned voice.
assets/outputs/voice_embeddings/<name>_embedding.safetensors - Reusable voice model
assets/outputs/audio/<name>_speech.wav - Generated audio
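The two commands can be chained from Python. The sketch below only builds the argument lists from the usage lines above and predicts where the embedding should land. It assumes that `<name>` in the output path is the sample file's stem, which the skill does not state. The helper names (`clone_command`, `speak_command`, `expected_embedding_path`, `clone_and_speak`) are hypothetical and not part of the skill.

```python
import subprocess
import sys
from pathlib import Path

# Output directory as listed in the skill's outputs.
EMBEDDING_DIR = Path("assets/outputs/voice_embeddings")


def clone_command(sample, transcript=None):
    """Build the argv for clone.py; --transcript is optional, as in the usage line."""
    cmd = [sys.executable, "skills/voice-clone/clone.py", str(sample)]
    if transcript is not None:
        cmd += ["--transcript", transcript]
    return cmd


def speak_command(embedding, text):
    """Build the argv for speak.py: embedding file first, then the text to speak."""
    return [sys.executable, "skills/voice-clone/speak.py", str(embedding), text]


def expected_embedding_path(sample):
    # Assumption: <name> is the stem of the audio sample (not confirmed by the skill).
    return EMBEDDING_DIR / f"{Path(sample).stem}_embedding.safetensors"


def clone_and_speak(sample, text, transcript=None):
    """Run both steps from the repository root; raises CalledProcessError on failure."""
    subprocess.run(clone_command(sample, transcript), check=True)
    embedding = expected_embedding_path(sample)
    subprocess.run(speak_command(embedding, text), check=True)
    return embedding
```

Calling `clone_and_speak("samples/joyce.wav", "Hello there")` from the repository root would run both scripts in order. If the skill names its embedding differently, pass the actual path to `speak_command` instead of relying on `expected_embedding_path`.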