skills/audio-extract/SKILL.md
Extracts the audio track from a video file. Use when you need to get audio from video, prepare audio for transcription, or separate audio from video content. Runs locally with no API key required.
npx skillsauth add agntswrm/agent-media audio-extract
Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
Extracts the audio track from a video file. This is a local operation that uses the bundled ffmpeg binary; no API keys or external services are required.
npx agent-media@latest audio extract --in <path> [options]
| Option | Required | Description |
|--------|----------|-------------|
| --in | Yes | Input video file path or URL (supports mp4, webm, mkv, avi, mov) |
| --format | No | Output audio format: mp3 (default) or wav |
| --out | No | Output path, either a filename or a directory (default: ./) |
Returns a JSON object with the extracted audio file:
{
  "ok": true,
  "media_type": "audio",
  "action": "extract",
  "provider": "local",
  "output_path": "extracted_123_abc.mp3",
  "mime": "audio/mpeg",
  "bytes": 24779
}
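A script that calls this action usually needs the `ok` flag and the `output_path` value from the result. The sketch below shows one way to read them in plain shell. It assumes the result is a flat JSON object like the sample and that values contain no escaped quotes. `json_field` is a hypothetical helper, not part of agent-media, and the sample is pasted inline so the snippet runs without invoking the CLI.

```shell
#!/usr/bin/env bash

# json_field NAME: print the value of a top-level string or boolean field
# read from stdin. Hypothetical helper; assumes flat JSON with no escaped quotes.
json_field() {
  local name="$1"
  tr -d '\n' |
    sed -n "s/.*\"$name\"[[:space:]]*:[[:space:]]*\"\{0,1\}\([^\",}]*\)\"\{0,1\}.*/\1/p"
}

# Sample result copied from the example output. In real use this might be
# captured with: result="$(npx agent-media@latest audio extract --in video.mp4)"
# (assumption: the CLI prints the JSON result on stdout).
result='{"ok": true, "media_type": "audio", "action": "extract", "provider": "local", "output_path": "extracted_123_abc.mp3", "mime": "audio/mpeg", "bytes": 24779}'

if [ "$(printf '%s' "$result" | json_field ok)" = "true" ]; then
  echo "audio written to $(printf '%s' "$result" | json_field output_path)"
else
  echo "extraction failed" >&2
fi
```

If `jq` is available, `jq -r .output_path` is a more robust way to read the same field, since it handles escaping and nested values.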
Extract audio as MP3 (default):
npx agent-media@latest audio extract --in video.mp4
Extract audio as WAV:
npx agent-media@latest audio extract --in video.mp4 --format wav
Custom output directory:
npx agent-media@latest audio extract --in video.mp4 --out ./audio-files
Since transcription services work best with audio files (smaller uploads, faster processing), use this workflow:
# Step 1: Extract audio from video (local, instant)
npx agent-media@latest audio extract --in interview.mp4 --format mp3
# Output: extracted_xxx.mp3
# Step 2: Transcribe the audio (cloud API)
npx agent-media@latest audio transcribe --in extracted_xxx.mp3 --provider fal
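The two steps above can be chained so the generated filename does not have to be copied by hand. This is a minimal sketch: `extract_and_transcribe` is a hypothetical helper, not part of agent-media, and it assumes the extract command prints its JSON result on stdout with an `output_path` that contains no escaped quotes. The cloud step may also need provider credentials, which are not covered here.

```shell
#!/usr/bin/env bash

# extract_and_transcribe VIDEO: hypothetical helper that runs both steps.
extract_and_transcribe() {
  local video="$1" result audio

  # Step 1: extract the audio track locally.
  result="$(npx agent-media@latest audio extract --in "$video" --format mp3)" || return 1

  # Pull "output_path" out of the flat JSON result.
  audio="$(printf '%s' "$result" | tr -d '\n' |
    sed -n 's/.*"output_path"[[:space:]]*:[[:space:]]*"\([^"]*\)".*/\1/p')"
  if [ -z "$audio" ]; then
    echo "could not find output_path in: $result" >&2
    return 1
  fi

  # Step 2: transcribe the extracted file with the cloud provider.
  npx agent-media@latest audio transcribe --in "$audio" --provider fal
}
```

After sourcing the file, it could be called as `extract_and_transcribe interview.mp4`.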
This action uses the local provider with bundled ffmpeg (via ffmpeg-static). No API keys required.