skills/media/songsee/SKILL.md
Generate spectrograms and audio feature visualizations (mel, chroma, MFCC, tempogram, etc.) from audio files via CLI. Useful for audio analysis, music production debugging, and visual documentation.
npx skillsauth add garrettroi/open-manus songseeInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Generate spectrograms and multi-panel audio feature visualizations from audio files.
Requires Go:
go install github.com/steipete/songsee/cmd/songsee@latest
Optional: ffmpeg for formats beyond WAV/MP3.
# Basic spectrogram
songsee track.mp3
# Save to specific file
songsee track.mp3 -o spectrogram.png
# Multi-panel visualization grid
songsee track.mp3 --viz spectrogram,mel,chroma,hpss,selfsim,loudness,tempogram,mfcc,flux
# Time slice (start at 12.5s, 8s duration)
songsee track.mp3 --start 12.5 --duration 8 -o slice.jpg
# From stdin
cat track.mp3 | songsee - --format png -o out.png
Use --viz with comma-separated values:
| Type | Description |
|------|-------------|
| spectrogram | Standard frequency spectrogram |
| mel | Mel-scaled spectrogram |
| chroma | Pitch class distribution |
| hpss | Harmonic/percussive separation |
| selfsim | Self-similarity matrix |
| loudness | Loudness over time |
| tempogram | Tempo estimation |
| mfcc | Mel-frequency cepstral coefficients |
| flux | Spectral flux (onset detection) |
Multiple --viz types render as a grid in a single image.
| Flag | Description |
|------|-------------|
| --viz | Visualization types (comma-separated) |
| --style | Color palette: classic, magma, inferno, viridis, gray |
| --width / --height | Output image dimensions |
| --window / --hop | FFT window and hop size |
| --min-freq / --max-freq | Frequency range filter |
| --start / --duration | Time slice of the audio |
| --format | Output format: jpg or png |
| -o | Output file path |
ffmpegvision_analyze for automated audio analysisdevelopment
# Voice Sanitizer This skill cleans up text before it is sent to the Text-to-Speech (TTS) engine. It removes technical jargon, code blocks, and long URLs to ensure the agent sounds natural and conversational in voice chat. ## Usage To sanitize text for speech, run the following command in the terminal: ```bash python3 /app/skills/voice_sanitizer/sanitizer.py "Your long, technical text with `code` and https://links.com/long-url" ``` ### Example Output ```text Your long, technical text with a
tools
Professional AI video production workflow. Use when creating videos, short films, commercials, or any video content using AI generation tools.
tools
Secure API key access from the centralized vault. Fetch keys on-demand without storing them in environment variables.
testing
# Task Board — Persistent Task Tracking for Open Manus This skill provides a shared task board backed by Redis. Harmony uses it to track delegated work across all agents, and agents use it to report progress and completion. ## When to Use - **Harmony**: Use this whenever you delegate a task to an agent. Add the task to the board, then check the board periodically to follow up. - **Worker Agents**: Use this to update your task status or mark tasks as complete. ## Commands ### Add a new task