skills/video-summarize/SKILL.md
Ingest a YouTube URL or local video via steipete's `summarize` CLI — returns transcript (with timestamps), scene-detected slides (PNG + optional OCR), and an LLM narrative summary with section headings. Generic sibling to llm-wiki/ingest-youtube and llm-wiki/ingest-video (those persist to a vault; this one just produces artefacts and leaves placement to the caller).
npx skillsauth add RonanCodes/ronan-skills video-summarizeInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Thin wrapper around summarize that standardises where the outputs go so callers (like an ingest flow or a research workflow) can consume them.
For a single input (YouTube URL or local video file), this skill produces:
<out>/<slug>-transcript.md — full transcript with timestamps, via summarize --extract --timestamps<out>/<slug>-summary.md — narrative summary with section headings, via summarize --slides --timestamps --length <...><out>/slides/*.png — scene-detected slides, each filename encoding its timestamp (e.g. slide_0003_96.07s.png)<out>/slides/slides.json — machine-readable slide manifest (timestamps, paths, OCR text if enabled)<out>/<slug>-summarize-log.txt — verbose log from the run (provider used, token counts, cost estimate)/ro:video-summarize https://www.youtube.com/watch?v=XXXX # defaults: xxl summary, 6 slides
/ro:video-summarize https://www.youtube.com/watch?v=XXXX --out ./my-research # custom output dir
/ro:video-summarize /path/to/meeting.mp4 --slides-max 20 --ocr # local file, more slides, run OCR
/ro:video-summarize https://youtu.be/XXXX --length long --model openai/gpt-4.1 # different length + model
/ro:video-summarize https://youtu.be/XXXX --skip-summary # transcript + slides only, no LLM cost
summarize — brew install summarizeffmpeg, yt-dlp, tesseract — pulled in automatically by the summarize formulaOPENAI_API_KEY, ANTHROPIC_API_KEY, GEMINI_API_KEY, XAI_API_KEY (or others summarize supports) must be available in the environment. Keys typically live in ~/.claude/.env (see the user memory on this).which summarize >/dev/null 2>&1 || {
echo "Installing summarize..." >&2
brew install summarize
}
Source API keys from ~/.claude/.env if present:
set -a && source "$(ro context env)" && set +a
unset GITHUB_TOKEN GH_TOKEN # important — see feedback_github_token_env memory
Derive a kebab-case slug from the video:
yt-dlp --get-title) truncated to ~50 charsCreate:
OUT="${OUT_DIR:-./video-output}"
mkdir -p "$OUT/slides"
summarize "$INPUT" \
--extract --timestamps \
> "$OUT/$SLUG-transcript.md"
summarize "$INPUT" \
--slides \
--slides-dir "$OUT/slides" \
--slides-max "${SLIDES_MAX:-6}" \
${OCR:+--slides-ocr} \
--timestamps \
--length "${LENGTH:-xxl}" \
${MODEL:+--model "$MODEL"} \
--verbose \
> "$OUT/$SLUG-summary.md" \
2> "$OUT/$SLUG-summarize-log.txt"
Skip this step entirely if --skip-summary.
Print to the caller:
... · $X.XXXX · <model> · ...)# Video summarize: <title> (<duration>)
Transcript: <abs path>
Summary: <abs path>
Slides: <N> slides in <abs path>/slides/
Log: <abs path>
Cost: $<X.XXXX> via <model>
~/.summarize/cache.sqlite, so re-running is nearly instant and free.--slides-max to 3 or use --skip-summary and lean on the transcript.llm-wiki/.claude/skills/ingest-youtube — llm-wiki's transcript-only YouTube handler (no slides). Prefer /ro:video-summarize when you need slides/screenshots.llm-wiki/.claude/skills/ingest-video — llm-wiki's local-video handler (ffmpeg keyframes + whisper transcript, no LLM summary).steipete/summarize — the underlying CLI. Source cloned at ~/Dev/.reference/steipete-summarize./ro:trend-scan — upstream: use to find videos worth deep-diving.development
--- name: worktree description: Coordinate multiple agents on one repo via a worktree-lock pool, so two agents never clobber each other's working tree. Acquire the first free slot (main, then beta/gamma… worktrees, created on demand), work there on your own branch, release when you've pushed. Use before modifying any repo that might be in use by another agent (factory, dataforce, etc.), or whenever you're told a repo is being worked on. Backed by `ro worktree`. category: development argument-hin
testing
--- name: ship description: Ship a feature branch the local-CI-first way — run the full local gate, push, open a PR, squash-merge, then deploy, without waiting on GitHub Actions. Use when a branch is ready for main and you want it merged and deployed now. Reads CI policy from `ro ci` (default skips remote CI because GitHub Actions billing keeps hitting limits). Sibling to /ro:gh-ship (waits on GitHub checks) and /ro:cf-ship (the deploy half). Triggers on "ship it", "ship this", "merge and deploy
testing
--- name: setup-logging description: Set up (or audit) the observability stack in a TanStack Start + Cloudflare Workers app so it is "diagnosable by default" — structured logging (logtape) with a request context carrying trace_id + userId + tenant/orgId, a trace_id propagated FE→BE→logs→Sentry→PostHog, Cloudflare Workers observability enabled, and Sentry + PostHog wired. Two modes: `setup` (wire it into an app) and `audit` (check an existing app + report gaps). Use when scaffolding a new app, wh
development
Manage credentials INSIDE the active ~/.claude/.env file — read which token/account to use for a given app (Simplicity vs Dataforce vs Ronan-personal), add or update a secret WITHOUT it passing through the chat (an interactive Terminal window prompts for it), and track secrets that were exposed in a transcript so they get rotated. Sibling to /ro:context (which switches WHICH env file is active). Use when the user wants to add an API key/token/secret, asks "which credential do I use for X", needs the env organized/labelled, or a secret was pasted into the chat and should be rotated.