tools/audio/elevenlabs-music/SKILL.md
ElevenLabs AI music generation - create original music from text prompts via inference.sh CLI. Capabilities: text-to-music, custom duration up to 10 minutes, genre/mood/instrument control, royalty-free commercial use. Use for: background music, soundtracks, jingles, podcasts, video scores, game audio. Triggers: elevenlabs music, eleven labs music, ai music, generate music, music generation, compose music, ai composer, create song, soundtrack, background music, jingle, elevenlabs compose, music ai
npx skillsauth add inference-sh-5/skills elevenlabs-musicInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
Security scan pending...
This skill is queued for security scanning. Results will appear when the scan completes.
Install the belt CLI skill:
npx skills add belt-sh/cli
Generate original music from text prompts via inference.sh CLI.

Requires inference.sh CLI (
belt). Install instructions
belt login
# Generate music
belt app run elevenlabs/music --input '{"prompt": "Upbeat electronic dance track with driving synths"}'
| Parameter | Type | Default | Description |
|-----------|------|---------|-------------|
| prompt | string | required | Description of desired music (max 2000 chars) |
| duration_seconds | number | 30 | Duration in seconds (5-600, max 10 minutes) |
# Lo-fi study beats
belt app run elevenlabs/music --input '{
"prompt": "Lo-fi hip hop beat, chill study music, vinyl crackle, mellow piano",
"duration_seconds": 120
}'
# Corporate background
belt app run elevenlabs/music --input '{
"prompt": "Light corporate background music, positive, motivational, clean",
"duration_seconds": 60
}'
# Epic trailer
belt app run elevenlabs/music --input '{
"prompt": "Epic cinematic orchestral score, dramatic build-up, brass and strings, trailer music",
"duration_seconds": 45
}'
# Suspense
belt app run elevenlabs/music --input '{
"prompt": "Dark suspenseful score, tension building, minimal piano, horror atmosphere",
"duration_seconds": 60
}'
# Jazz
belt app run elevenlabs/music --input '{
"prompt": "Smooth jazz quartet, saxophone lead, walking bass, brushed drums",
"duration_seconds": 90
}'
# Electronic
belt app run elevenlabs/music --input '{
"prompt": "Techno beat, 128 BPM, driving bass, atmospheric synths, club music",
"duration_seconds": 120
}'
# Acoustic
belt app run elevenlabs/music --input '{
"prompt": "Acoustic guitar folk song, fingerpicking, warm and intimate",
"duration_seconds": 60
}'
# Podcast intro (10 seconds)
belt app run elevenlabs/music --input '{
"prompt": "Podcast intro jingle, professional, tech-themed, catchy",
"duration_seconds": 10
}'
# Social media clip
belt app run elevenlabs/music --input '{
"prompt": "Trendy upbeat pop, social media vibe, energetic, youthful",
"duration_seconds": 15
}'
# Notification sound
belt app run elevenlabs/music --input '{
"prompt": "Short positive notification melody, clean, satisfying",
"duration_seconds": 5
}'
# Battle theme
belt app run elevenlabs/music --input '{
"prompt": "Intense battle music, fast tempo, orchestral with electric guitar, boss fight",
"duration_seconds": 120
}'
# Exploration theme
belt app run elevenlabs/music --input '{
"prompt": "Peaceful exploration music, fantasy RPG, harp and flute, magical atmosphere",
"duration_seconds": 180
}'
# Menu music
belt app run elevenlabs/music --input '{
"prompt": "Ambient menu screen music, sci-fi, ethereal synths, space theme",
"duration_seconds": 60
}'
Genre: pop, rock, electronic, jazz, classical, hip-hop, lo-fi, ambient, orchestral, folk, R&B, metal, country
Mood: happy, sad, energetic, calm, dramatic, epic, mysterious, uplifting, dark, romantic, tense
Instruments: piano, guitar, synth, drums, bass, strings, brass, choir, saxophone, violin, flute, harp
Tempo: slow, moderate, fast, 80 BPM, 120 BPM, 140 BPM
Style: cinematic, minimal, layered, atmospheric, rhythmic, melodic, ambient
# 1. Generate background music
belt app run elevenlabs/music --input '{
"prompt": "Soft ambient background music for narration, non-intrusive",
"duration_seconds": 60
}' > music.json
# 2. Generate voiceover
belt app run elevenlabs/tts --input '{
"text": "Welcome to our product tour. Let me show you what we have built.",
"voice": "george"
}' > voice.json
# 3. Merge together
belt app run infsh/media-merger --input '{
"media": ["<music-url>", "<voice-url>"]
}'
# ElevenLabs sound effects (combine with music)
npx skills add inference-sh/skills@elevenlabs-sound-effects
# ElevenLabs TTS (add voice over music)
npx skills add inference-sh/skills@elevenlabs-tts
# Other music models (Diffrythm, Tencent)
npx skills add inference-sh/skills@ai-music-generation
# Full platform skill (all 250+ apps)
npx skills add inference-sh/skills@infsh-cli
Browse all audio apps: belt app store --category audio
data-ai
Generate multi-person talking head podcast videos from scratch using AI — character creation, TTS, avatar animation, and video stitching. Use when the user wants to create a podcast, talking head video, or multi-speaker conversation video.
tools
Generate videos with ByteDance Seedance 2.0 via inference.sh CLI. Unified model for text-to-video, image-to-video, and reference-to-video with synchronized audio, up to 1080p, 4-15s duration. Pro and Fast variants. Studio variants with private asset library for portrait consistency. Use for: social media videos, music videos, product demos, animated content, AI video with sound. Triggers: seedance, seedance 2, bytedance video, seedance t2v, seedance i2v, seedance r2v, video with audio, seedance 2.0, bytedance seedance, seedance studio
tools
Generate talking head avatar videos with Pruna P-Video-Avatar via inference.sh CLI. Turn a portrait image into a realistic speaking video with built-in TTS. 18x faster and 6x cheaper than competitors. Models: P-Video-Avatar, P-Image (for portrait generation). Capabilities: text-to-avatar, audio-driven avatars, 30 voices, 10 languages, 720p/1080p, built-in TTS, dynamic backgrounds, full-body control. Use for: AI presenters, product demos, explainer videos, virtual influencers, marketing, education, multilingual content, UGC, gaming avatars. Triggers: avatar video, talking head, ai avatar, p-video-avatar, pruna avatar, video avatar, ai presenter, digital human, virtual presenter, lipsync, talking avatar, ai spokesperson, heygen alternative, synthesia alternative, veed alternative, fabric alternative, omnihuman alternative
tools
Generate and edit videos with Alibaba HappyHorse 1.0 models via inference.sh CLI. Models: HappyHorse T2V, I2V, R2V, Video Edit. Capabilities: text-to-video, image-to-video, reference-to-video, video editing with natural language, character preservation, 720P/1080P, up to 15 seconds. Use for: physically realistic video, video editing, character-consistent content, product demos, social media. Triggers: happyhorse, happy horse, alibaba video, happyhorse 1.0, dashscope video, alibaba happyhorse, video editing ai, ai video editor