skills/audio-mix/SKILL.md
Mix multiple audio tracks (voice, music, SFX) into a single output file via ffmpeg. Layer narration over background music, add sound effects at timestamps, adjust volumes.
npx skillsauth add RonanCodes/ronan-skills audio-mixInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Combine voice, background music, and sound effects into a single mixed audio file using ffmpeg. The glue between tts-elevenlabs, music-elevenlabs, and sfx-elevenlabs.
/audio-mix --voice narration.mp3 --music background.mp3 [--sfx intro-whoosh.mp3@0s] [--output final.mp3]
/audio-mix --voice narration.mp3 --music bg.mp3 --music-volume 0.15 --sfx [email protected] --sfx chime.mp3@30s
--voice — primary voice/narration track. Determines the output duration unless --duration is set.--music — background music track. Auto-looped if shorter than voice. Auto-faded out at the end.--sfx <file>@<timestamp> — sound effect placed at a specific timestamp. Repeatable.--music-volume — volume level for music (0.0–1.0). Default: 0.15 (subtle background).--sfx-volume — volume level for SFX (0.0–1.0). Default: 0.6.--voice-volume — volume level for voice (0.0–1.0). Default: 1.0.--fade-in — fade in duration in seconds for music. Default: 2.--fade-out — fade out duration in seconds for music. Default: 3.--output — output file path. Default: /tmp/audio-mix-<timestamp>.mp3.--duration — force output duration in seconds. Default: auto from voice track.which ffmpeg >/dev/null 2>&1 || {
echo "Installing ffmpeg…"
brew install ffmpeg
}
VOICE_DURATION=$(ffprobe -v quiet -show_entries format=duration -of csv=p=0 "$VOICE_FILE" | cut -d. -f1)
MUSIC_DURATION=$(ffprobe -v quiet -show_entries format=duration -of csv=p=0 "$MUSIC_FILE" | cut -d. -f1)
OUTPUT_DURATION="${DURATION:-$VOICE_DURATION}"
echo "Voice: ${VOICE_DURATION}s"
echo "Music: ${MUSIC_DURATION}s"
echo "Output: ${OUTPUT_DURATION}s"
echo "SFX cues: ${#SFX_ENTRIES[@]}"
The filter graph layers tracks with volume control, looping, and fading:
# Base inputs
INPUTS="-i \"$VOICE_FILE\" -i \"$MUSIC_FILE\""
FILTER_PARTS=()
INPUT_IDX=2 # 0=voice, 1=music, 2+=sfx
# Voice: volume adjust
FILTER_PARTS+=("[0:a]volume=${VOICE_VOL:-1.0}[voice]")
# Music: loop if needed, volume, fade in/out
if [ "$MUSIC_DURATION" -lt "$OUTPUT_DURATION" ]; then
LOOPS=$(( (OUTPUT_DURATION / MUSIC_DURATION) + 1 ))
FILTER_PARTS+=("[1:a]aloop=loop=${LOOPS}:size=$(( MUSIC_DURATION * 44100 )),atrim=0:${OUTPUT_DURATION},volume=${MUSIC_VOL:-0.15},afade=t=in:d=${FADE_IN:-2},afade=t=out:st=$(( OUTPUT_DURATION - ${FADE_OUT:-3} )):d=${FADE_OUT:-3}[music]")
else
FILTER_PARTS+=("[1:a]atrim=0:${OUTPUT_DURATION},volume=${MUSIC_VOL:-0.15},afade=t=in:d=${FADE_IN:-2},afade=t=out:st=$(( OUTPUT_DURATION - ${FADE_OUT:-3} )):d=${FADE_OUT:-3}[music]")
fi
# SFX: place each at its timestamp
MIX_INPUTS="[voice][music]"
MIX_COUNT=2
for entry in "${SFX_ENTRIES[@]}"; do
SFX_FILE="${entry%@*}"
SFX_TIME="${entry#*@}"
SFX_TIME="${SFX_TIME%s}" # strip trailing 's'
INPUTS+=" -i \"$SFX_FILE\""
FILTER_PARTS+=("[${INPUT_IDX}:a]volume=${SFX_VOL:-0.6},adelay=${SFX_TIME}s:all=1[sfx${INPUT_IDX}]")
MIX_INPUTS+="[sfx${INPUT_IDX}]"
MIX_COUNT=$(( MIX_COUNT + 1 ))
INPUT_IDX=$(( INPUT_IDX + 1 ))
done
# Final mix
FILTER_PARTS+=("${MIX_INPUTS}amix=inputs=${MIX_COUNT}:duration=first:dropout_transition=0[out]")
FILTER=$(printf "%s;" "${FILTER_PARTS[@]}")
FILTER="${FILTER%;}" # remove trailing semicolon
OUTPUT="${OUTPUT_PATH:-/tmp/audio-mix-$(date +%s).mp3}"
eval ffmpeg $INPUTS \
-filter_complex "$FILTER" \
-map "[out]" \
-codec:a libmp3lame -qscale:a 2 \
"$OUTPUT" -y 2>/dev/null
FINAL_DURATION=$(ffprobe -v quiet -show_entries format=duration -of csv=p=0 "$OUTPUT" | cut -d. -f1)
SIZE=$(wc -c < "$OUTPUT" | tr -d ' ')
✅ Audio mixed
Voice: <voice file> (<N>s)
Music: <music file> (<N>s, vol: <vol>)
SFX: <N> cues
Duration: <N>s
Size: <N> KB
Output: <output path>
Play: open <output path>
# 1. Generate narration
/tts-elevenlabs "Welcome to our product demo..." --output narration.mp3
# 2. Generate background music
/music-elevenlabs "calm modern corporate ambient" --duration 60 --instrumental --output bg-music.mp3
# 3. Generate sound effects
/sfx-elevenlabs "cinematic swoosh transition" --duration 2 --output swoosh.mp3
/sfx-elevenlabs "gentle notification chime" --duration 1 --output chime.mp3
# 4. Mix everything together
/audio-mix \
--voice narration.mp3 \
--music bg-music.mp3 --music-volume 0.12 \
--sfx swoosh.mp3@0s \
--sfx chime.mp3@15s \
--sfx chime.mp3@30s \
--fade-in 3 --fade-out 5 \
--output final-audio.mp3
None — this skill uses only ffmpeg (local, free).
tts-elevenlabs — generates the voice trackmusic-elevenlabs — generates the background musicsfx-elevenlabs — generates individual sound effectsgenerate-video — uses the mixed audio as the soundtrack for Remotion compositionsgenerate-podcast — could use this for podcast intros with music + voicedevelopment
Close the loop on a Linear ticket when its work ships - move the status and post a deploy comment with the PR link, what shipped, and a try-it link, mentioning the collaborator. Used as the tail of /ro:linear-nightshift for every merged mirror, or manually after an ad-hoc build. Triggers on "linear update", "update the linear ticket", "mark NUT-x done", "tell eoin it shipped", "/ro:linear-update".
devops
Run a night-shift against a collaborator's Linear board. Pulls the team's Grilled tickets (/ro:linear-grill moves a ticket to Grilled once its questions are answered), VERIFIES the questions were actually answered (unanswered → bounce the ticket to the "Question for <name>" state), mirrors verified tickets to ephemeral GitHub issues with ready-for-agent, then runs the standard /ro:night-shift machinery on GitHub. Tail-calls /ro:linear-update for everything that merged + deployed. Triggers on "linear nightshift", "nightshift linear", "drain the linear board", "run the shift off linear", "/ro:linear-nightshift".
development
Grill a collaborator's Linear tickets and move every processed ticket to where it belongs. Resolves the board from the repo's .ro-linear.json, reads the collaborator's Backlog / Ready-for-agent issues, then per ticket either posts 3-5 decision-extracting questions (state moves to "Question for <name>") or confirms it build-ready (state moves to "Grilled", the gate /ro:linear-nightshift consumes); shipped-and-confirmed tickets close as Done. The async-collaborator counterpart of /ro:day-shift for people who never touch GitHub. Triggers on "grill linear", "grill eoin's tickets", "linear grill", "add questions to the linear tickets", "/ro:linear-grill".
development
--- name: about-page description: Add a standard About page to any web app, what it is, the tech stack, and an FAQ, wired into a footer link with a sticky footer. Built with Spartan + Tailwind (the canonical component layer) and falls back to semantic HTML so it ships reliably. Use whenever building, polishing, or shipping an app, every app should have one. Triggers on "add an about page", "about page", "footer about link", or as a standard step in app build/polish. category: frontend argument-h