Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

garrettroi/songsee

Name: songsee
Author: garrettroi

skills/media/songsee/SKILL.md

npx skillsauth add garrettroi/open-manus songsee

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

songsee

Generate spectrograms and multi-panel audio feature visualizations from audio files.

Prerequisites

Requires Go:

go install github.com/steipete/songsee/cmd/songsee@latest

Optional: ffmpeg for formats beyond WAV/MP3.

Quick Start

# Basic spectrogram
songsee track.mp3

# Save to specific file
songsee track.mp3 -o spectrogram.png

# Multi-panel visualization grid
songsee track.mp3 --viz spectrogram,mel,chroma,hpss,selfsim,loudness,tempogram,mfcc,flux

# Time slice (start at 12.5s, 8s duration)
songsee track.mp3 --start 12.5 --duration 8 -o slice.jpg

# From stdin
cat track.mp3 | songsee - --format png -o out.png

Visualization Types

Use --viz with comma-separated values:

| Type | Description | |------|-------------| | spectrogram | Standard frequency spectrogram | | mel | Mel-scaled spectrogram | | chroma | Pitch class distribution | | hpss | Harmonic/percussive separation | | selfsim | Self-similarity matrix | | loudness | Loudness over time | | tempogram | Tempo estimation | | mfcc | Mel-frequency cepstral coefficients | | flux | Spectral flux (onset detection) |

Multiple --viz types render as a grid in a single image.

Common Flags

| Flag | Description | |------|-------------| | --viz | Visualization types (comma-separated) | | --style | Color palette: classic, magma, inferno, viridis, gray | | --width / --height | Output image dimensions | | --window / --hop | FFT window and hop size | | --min-freq / --max-freq | Frequency range filter | | --start / --duration | Time slice of the audio | | --format | Output format: jpg or png | | -o | Output file path |

Notes

WAV and MP3 are decoded natively; other formats require ffmpeg
Output images can be inspected with vision_analyze for automated audio analysis
Useful for comparing audio outputs, debugging synthesis, or documenting audio processing pipelines

garrettroi/songsee

skills/media/songsee/SKILL.md

Generate spectrograms and audio feature visualizations (mel, chroma, MFCC, tempogram, etc.) from audio files via CLI. Useful for audio analysis, music production debugging, and visual documentation.

tools

Updated May 16, 2026

$ install --global

skillsauth

npx skillsauth add garrettroi/open-manus songsee

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 3, 2026, 11:29 AM282.9s1 file scanned

SKILL.md

name:: songsee
description:: Generate spectrograms and audio feature visualizations (mel, chroma, MFCC, tempogram, etc.) from audio files via CLI. Useful for audio analysis, music production debugging, and visual documentation.
version:: 1.0.0
author:: community
license:: MIT
tags:: [Audio, Visualization, Spectrogram, Music, Analysis]
homepage:: https://github.com/steipete/songsee

songsee

Generate spectrograms and multi-panel audio feature visualizations from audio files.

Prerequisites

Requires Go:

go install github.com/steipete/songsee/cmd/songsee@latest

Optional: ffmpeg for formats beyond WAV/MP3.

Quick Start

# Basic spectrogram
songsee track.mp3

# Save to specific file
songsee track.mp3 -o spectrogram.png

# Multi-panel visualization grid
songsee track.mp3 --viz spectrogram,mel,chroma,hpss,selfsim,loudness,tempogram,mfcc,flux

# Time slice (start at 12.5s, 8s duration)
songsee track.mp3 --start 12.5 --duration 8 -o slice.jpg

# From stdin
cat track.mp3 | songsee - --format png -o out.png

Visualization Types

Use --viz with comma-separated values:

Multiple --viz types render as a grid in a single image.

Common Flags

Notes

WAV and MP3 are decoded natively; other formats require ffmpeg
Output images can be inspected with vision_analyze for automated audio analysis
Useful for comparing audio outputs, debugging synthesis, or documenting audio processing pipelines

Related Skills

garrettroi/skills/voice_sanitizer

development

VerifiedTrustedCommunity

# Voice Sanitizer This skill cleans up text before it is sent to the Text-to-Speech (TTS) engine. It removes technical jargon, code blocks, and long URLs to ensure the agent sounds natural and conversational in voice chat. ## Usage To sanitize text for speech, run the following command in the terminal: ```bash python3 /app/skills/voice_sanitizer/sanitizer.py "Your long, technical text with `code` and https://links.com/long-url" ``` ### Example Output ```text Your long, technical text with a

SKILL.mdUpdated May 22, 2026

garrettroi/skills/voice_sanitizer

garrettroi/video-generator

tools

VerifiedTrustedCommunity

Professional AI video production workflow. Use when creating videos, short films, commercials, or any video content using AI generation tools.

SKILL.mdUpdated May 22, 2026

garrettroi/video-generator

garrettroi/vault_client

tools

VerifiedTrustedCommunity

Secure API key access from the centralized vault. Fetch keys on-demand without storing them in environment variables.

SKILL.mdUpdated May 22, 2026

garrettroi/vault_client

garrettroi/skills/task_board

testing

VerifiedTrustedCommunity

# Task Board — Persistent Task Tracking for Open Manus This skill provides a shared task board backed by Redis. Harmony uses it to track delegated work across all agents, and agents use it to report progress and completion. ## When to Use - **Harmony**: Use this whenever you delegate a task to an agent. Add the task to the board, then check the board periodically to follow up. - **Worker Agents**: Use this to update your task status or mark tasks as complete. ## Commands ### Add a new task

SKILL.mdUpdated May 22, 2026

garrettroi/skills/task_board

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/garrettroi/open-manus.git

# Copy into Claude Code skills folder (global)
cp -r open-manus/skills/media/songsee ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

garrettroi/open-manus

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT