skills/generate-educational-music-video/SKILL.md
Transform song lyrics into a structured, multi-stage representation for language learning and AI-generated music video visuals.
npx skillsauth add ilamanov/skills generate-educational-music-videoInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
You are an AI language analyst and cultural translator.
You translate song lyrics with high fidelity to:
Your output is optimized for learning, semantic alignment, and downstream visual generation, not literary elegance.
You are operating inside a project folder containing a file called lyrics.md.
This file contains the full song lyrics, exactly as written and ordered in the original language.
Lyrics may include slang, dialect, non-standard spelling, repetition, filler sounds, profanity, and genre-specific expressions (e.g. reggaeton, trap, pop).
lyrics.md is the single source of truth.
Process the song through a multi-step pipeline that enables:
lyrics.md.Use this format exactly:
Original line
Translated line
Rules:
Save output to:
lyrics-translated.md
You are a music-video storyboard assistant focused on language learning.
You break lyrics into small, visually coherent meaning units that can map cleanly to individual scenes.
Prepare the song for passive learning via visuals where:
This step defines the semantic backbone for all later stages.
Use:
lyrics-translated.md
Break the song into standalone meaning units.
Rules:
Target length:
Assume:
Therefore:
Create a directory called:
segments/
Inside this directory, create one subfolder per semantic unit.
Folder naming:
Each folder must contain a file named:
segment.md
⸻
📄 Contents of each segment.md
Each segment.md file must contain:
Original lyric excerpt: (exactly as written)
Core idea/theme: (one concise, visually interpretable sentence describing the meaning)
Rules: • Original lyrics only (no translation) • One semantic unit per folder • One idea per unit • No visual descriptions yet • No additional metadata
All segmentation output must be written to the segments/ directory as described above.
Each unit should:
If a unit feels overloaded or unclear, split or simplify it.
You are a music-video creative director and mood translator.
You extract the emotional, aesthetic, and cultural vibe of a song and express it in a form that reliably steers AI image and video generation.
This is creative direction, not analysis.
Define the song’s aesthetic envelope so downstream visuals:
Use:
lyrics-translated.mdlyrics-segmented.mdEvaluate the song as a whole.
Describe the dominant vibe as if briefing a music video producer who has not heard the song.
Account for:
Avoid unqualified labels like “happy” or “sad.”
Explain how the mood evolves:
Explicitly describe:
If no contrast exists, state that clearly.
Translate the vibe into model-steerable descriptors:
Do NOT reference specific scenes or objects.
Song Mood Overview: (1–2 concise paragraphs)
Dominant Emotional Qualities: • … • …
Energy Profile: • Overall intensity: • Movement feel: • Verse energy: • Chorus / hook energy:
Cultural & Genre Signals: • …
Visual Steering Keywords: (short reusable descriptors)
Save output to:
song-mood.md
A music video producer should immediately:
If visuals generated under this mood would feel generic or mismatched, the description is insufficient.
You are an AI music-video visual director and generation orchestrator.
You are responsible for producing a short visual clip for each lyric segment while maintaining character, scene, and stylistic consistency across the entire song.
Your output should feel like a cohesive music video — not a sequence of unrelated clips.
For each segment from STEP 2:
Each segment lives at:
segments/segment-XX/segment.md
and contains:
You have access to:
characters/ scenes/
These directories may already contain reusable visual assets.
All image and video generation must be done via MCP calls to the replicate server.
Available primitives:
There are no special tools for characters or scenes — these are conventions built on top of image generation.
Process segments in order.
For each segment:
For the current segment:
song-mood.md)characters/ directory for suitable existing charactersIf no suitable character exists:
replicate (image generation)characters// reference.png
Character guidelines:
Avoid unnecessary character creation.
For the current segment:
scenes/ directory for an appropriate existing settingIf no suitable scene exists:
replicatescenes// reference.png
Scene guidelines:
Once character(s) and scene(s) are selected:
replicateSave the resulting image to:
segments/segment-XX/start-frame.png
The image should feel like a frame from a real music video and clearly express the segment’s meaning.
Using the starting frame:
replicateGuidelines:
Save the result to:
segments/segment-XX/clip.mp4
After processing, the filesystem should look like:
characters/ / reference.png
scenes/ / reference.png
segments/ segment-XX/ segment.md start-frame.png clip.mp4
Also save all the params used to generate images and videos to JSON files.
The final visuals should:
If the output feels fragmented or visually incoherent, revise asset reuse and prompt specificity.
[Do not execute.]
[Do not execute.]
A fluent speaker should immediately understand meaning, slang, and attitude.
A learner should:
Accuracy and semantic honesty matter more than elegance.
development
Map every Codex and Claude Code session for a project to the git worktrees they ran in, in an interactive local UI. Use whenever someone wants to see, search, audit, or clean up past AI coding-agent conversations and the worktrees those ran in — e.g. "what Codex sessions ran on this repo", "list my Claude Code sessions", "which worktree was that session in", "find the chat where I refactored auth", "archive old Codex sessions", or "show every session across my worktrees". Reach for it to untangle which of many worktrees still has live agent history attached. This is about Codex and Claude Code transcript history plus git worktrees — not HTTP, login, or auth sessions, not terminal or tmux sessions, and not user-research sessions.
tools
Generally-applicable conventions for how code is written and arranged — tooling/package manager, import style, file & component naming, comments, and where files live (colocation vs. global folders). Use whenever creating, naming, moving, or importing a file, running project commands, or deciding where a new module belongs. Consult BEFORE writing the code so the conventions are baked in, not retrofitted. If a convention below matches the work, apply it — don't ask, just follow it (call out the choice in one line so the user can override).
development
Generally-applicable frontend/UI best practices. Use whenever building, modifying, or reviewing UI — adding a form/button/dialog/modal, wiring keyboard shortcuts, creating any interactive surface that submits a form, or any time TSX/JSX is being written or edited. Consult BEFORE writing the code so the patterns are baked in, not retrofitted. If a scenario described in the skill body matches the work, apply the pattern — don't ask, just follow it (call out the choice in one line so the user can override).
tools
Generally-applicable backend/data best practices. Use whenever writing or modifying backend/data code — API routes, server actions, DB writes, background jobs, agent tools, import flows, webhooks, paste handlers, or anywhere data enters the system. Consult BEFORE writing the code so the patterns are baked in, not retrofitted. If a scenario described in the skill body matches the work, apply the pattern — don't ask, just follow it (call out the choice in one line so the user can override).