skills/speech-video-transcriber/SKILL.md
Transcribes a local video or audio file into a markdown transcript using Whisper or OpenAI cloud. Use when the user wants a transcript from a video, audio, or voice note.
npx skillsauth add psycho-baller/ai-agents-config speech-video-transcriber
Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
turn a local media file into a markdown transcript with minimal local compute.
the intended path is:
- ffmpeg
- ../transcriptions/ relative to the skills/ directory
for this repo, that means outputs land in:
/Users/rami/Documents/life-os/ai-agents-config/transcriptions/
use this skill when the user wants any of the following from a local media file:
if the user asks for speaking feedback but no transcript exists yet, use this skill first so later steps can work from a clean markdown source.
local (default when user mentions whisper or offline):
- --local
- small (uses ~/.cache/whisper/small.pt; no download needed if already cached)
cloud (default otherwise):
- gpt-4o-transcribe
- OPENAI_API_KEY
- gpt-4o-mini-transcribe if the user explicitly wants the cheaper model
- ai-agents-config/transcriptions/
from /Users/rami/Documents/life-os/ai-agents-config/skills:
uv pip install -r speech-video-transcriber/scripts/requirements.txt
the machine also needs:
- ffmpeg
- OPENAI_API_KEY
- --language
- --prompt with those tokens to improve recognition
local (no API key):
cd /Users/rami/Documents/life-os/ai-agents-config/skills
uv run python speech-video-transcriber/scripts/transcribe_video.py "/absolute/path/to/video.mov" --local
cloud (OpenAI API):
cd /Users/rami/Documents/life-os/ai-agents-config/skills
uv run python speech-video-transcriber/scripts/transcribe_video.py "/absolute/path/to/video.mov"
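the choice between the local and cloud commands above can be sketched as a small helper. this is an illustration only: `choose_mode` and `choose_cloud_model` are hypothetical names that are not part of the skill's script, and the substring check is an assumption about how "mentions whisper or offline" might be detected.

```python
def choose_mode(user_request: str) -> str:
    """Return "local" when the request mentions whisper or offline, else "cloud"."""
    text = user_request.lower()
    # Assumption: a plain substring check is enough to detect these mentions.
    if "whisper" in text or "offline" in text:
        return "local"
    return "cloud"


def choose_cloud_model(wants_cheaper: bool = False) -> str:
    """Pick one of the two cloud models named in this skill."""
    # gpt-4o-mini-transcribe only when the user explicitly asks for the cheaper model.
    return "gpt-4o-mini-transcribe" if wants_cheaper else "gpt-4o-transcribe"
```

an agent would add `--local` to the command only when `choose_mode` returns "local"; otherwise it would run the cloud command as written.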
common options:
# local with language hint
uv run python speech-video-transcriber/scripts/transcribe_video.py \
"/absolute/path/to/video.mov" \
--local \
--language en
# cloud with jargon hint
uv run python speech-video-transcriber/scripts/transcribe_video.py \
"/absolute/path/to/video.mov" \
--language en \
--prompt "rami, chalant, purpose os, posthog" \
--model gpt-4o-transcribe
# save to specific path
uv run python speech-video-transcriber/scripts/transcribe_video.py \
"/absolute/path/to/video.mov" \
--local \
--output "/path/to/output.md"
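the option combinations above can also be assembled programmatically. the sketch below builds the argument list using only the flags shown in this document (`--local`, `--language`, `--prompt`, `--model`, `--output`); `build_command` and its parameter names are hypothetical and do not come from the script itself.

```python
from typing import List, Optional

SCRIPT = "speech-video-transcriber/scripts/transcribe_video.py"


def build_command(
    media_path: str,
    local: bool = False,
    language: Optional[str] = None,
    prompt: Optional[str] = None,
    model: Optional[str] = None,
    output: Optional[str] = None,
) -> List[str]:
    """Assemble the argv list for one run of the transcription script."""
    cmd = ["uv", "run", "python", SCRIPT, media_path]
    if local:
        cmd.append("--local")
    # Each optional flag is added only when a value was supplied.
    if language:
        cmd += ["--language", language]
    if prompt:
        cmd += ["--prompt", prompt]
    if model:
        cmd += ["--model", model]
    if output:
        cmd += ["--output", output]
    return cmd
```

the resulting list could be passed to `subprocess.run(cmd, cwd=..., check=True)` with `cwd` set to the skills directory. passing a list rather than a shell string avoids quoting problems with paths that contain spaces.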
the script writes one markdown file to the shared transcriptions directory and prints the final path.
the markdown includes:
if a file with the same name already exists, the script appends a timestamp suffix instead of overwriting it.
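the no-overwrite behavior can be sketched as follows. this is not the script's actual code: `resolve_output_path` is a hypothetical helper, and the `YYYYMMDD-HHMMSS` suffix format is an assumption, since the document does not state the format the script uses.

```python
from datetime import datetime
from pathlib import Path
from typing import Optional


def resolve_output_path(out_dir: Path, stem: str, now: Optional[datetime] = None) -> Path:
    """Return out_dir/stem.md, or a timestamp-suffixed name if that file exists."""
    candidate = out_dir / f"{stem}.md"
    if not candidate.exists():
        return candidate
    # Assumption: suffix format YYYYMMDD-HHMMSS; the real script may differ.
    stamp = (now or datetime.now()).strftime("%Y%m%d-%H%M%S")
    return out_dir / f"{stem}-{stamp}.md"
```

the optional `now` argument exists only so the behavior can be checked with a fixed time.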
example 1
user request:
transcribe /Users/rami/Documents/life-os/speech/founder-story-take-01.mov
run:
cd /Users/rami/Documents/life-os/ai-agents-config/skills
uv run python speech-video-transcriber/scripts/transcribe_video.py \
"/Users/rami/Documents/life-os/speech/founder-story-take-01.mov"
example 2
user request:
make a transcript of /Users/rami/Documents/life-os/speech/camera-practice/clarity.mp4 and keep the names right. the language is english.
run:
cd /Users/rami/Documents/life-os/ai-agents-config/skills
uv run python speech-video-transcriber/scripts/transcribe_video.py \
"/Users/rami/Documents/life-os/speech/camera-practice/clarity.mp4" \
--language en \
--prompt "rami, chalant, purpose os"
- if OPENAI_API_KEY is missing, try running source .env to load it; if it still fails, stop and ask for it
- if ffmpeg is missing, stop and report that dependency clearly
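the two failure cases above could be checked before running the script. the sketch below is a hypothetical preflight helper (`missing_requirements` is not part of the skill) and assumes the API key is needed only in cloud mode, as the "local (no API key)" label suggests.

```python
import os
import shutil
from typing import List, Mapping, Optional


def missing_requirements(local: bool, env: Optional[Mapping[str, str]] = None) -> List[str]:
    """List the requirements that are absent for the chosen mode."""
    env = os.environ if env is None else env
    problems = []
    # ffmpeg must be on PATH in both modes.
    if shutil.which("ffmpeg") is None:
        problems.append("ffmpeg")
    # Assumption: the API key matters only for cloud runs.
    if not local and not env.get("OPENAI_API_KEY"):
        problems.append("OPENAI_API_KEY")
    return problems
```

an empty list means the run can proceed; otherwise the agent would report the missing items and stop, after first trying `source .env` if the key is the problem.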