tools/audio/elevenlabs-sound-effects/SKILL.md
Generate AI sound effects from text descriptions with ElevenLabs via inference.sh CLI. Capabilities: text-to-sound-effect, custom duration, royalty-free audio. Use for: video production, game audio, podcasts, films, presentations, social media. Triggers: sound effects, sfx, sound generation, ai sound effects, generate sound, foley, audio effects, sound design, text to sound, elevenlabs sound, eleven labs sfx, ambient sound, cinematic sound, game sound effects
npx skillsauth add 1nfsh/skills elevenlabs-sound-effectsInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
Security scan pending...
This skill is queued for security scanning. Results will appear when the scan completes.
Install the belt CLI skill:
npx skills add belt-sh/cli
Generate sound effects from text descriptions via inference.sh CLI.

Requires inference.sh CLI (
belt). Install instructions
belt login
# Generate a sound effect
belt app run elevenlabs/sound-effects --input '{"text": "Thunder rumbling in the distance"}'
| Parameter | Type | Description |
|-----------|------|-------------|
| text | string | Description of the sound effect (max 1000 chars) |
| duration_seconds | number | Duration 0.5-22 seconds (optional, auto if omitted) |
| prompt_influence | number | 0-1, how literal to interpret prompt (default: 0.3) |
# Epic trailer hit
belt app run elevenlabs/sound-effects --input '{"text": "Cinematic braam, deep bass impact"}'
# Suspense drone
belt app run elevenlabs/sound-effects --input '{
"text": "Dark atmospheric drone, tension building, horror",
"duration_seconds": 10
}'
# Whoosh transition
belt app run elevenlabs/sound-effects --input '{
"text": "Fast cinematic whoosh transition",
"duration_seconds": 1.5
}'
# Rain
belt app run elevenlabs/sound-effects --input '{
"text": "Heavy rain on a tin roof with occasional thunder",
"duration_seconds": 15
}'
# Forest ambience
belt app run elevenlabs/sound-effects --input '{
"text": "Forest ambience with birds chirping and gentle wind",
"duration_seconds": 20
}'
# Ocean waves
belt app run elevenlabs/sound-effects --input '{
"text": "Ocean waves crashing on a beach, calming",
"duration_seconds": 15
}'
# Power-up
belt app run elevenlabs/sound-effects --input '{
"text": "Retro game power-up sound, ascending tones",
"duration_seconds": 1
}'
# Explosion
belt app run elevenlabs/sound-effects --input '{
"text": "Sci-fi laser explosion, futuristic",
"duration_seconds": 3
}'
# UI click
belt app run elevenlabs/sound-effects --input '{
"text": "Soft UI button click, subtle and clean",
"duration_seconds": 0.5
}'
# Doorbell
belt app run elevenlabs/sound-effects --input '{"text": "Classic doorbell ring"}'
# Typing
belt app run elevenlabs/sound-effects --input '{
"text": "Mechanical keyboard typing, fast, clicky",
"duration_seconds": 5
}'
# Notification
belt app run elevenlabs/sound-effects --input '{
"text": "Pleasant notification chime, positive",
"duration_seconds": 1
}'
Control how literally the model interprets your description:
| Value | Effect | Best For | |-------|--------|----------| | 0.0 | Very loose interpretation | Creative, surprising results | | 0.3 | Balanced (default) | General purpose | | 0.7 | Close to description | Specific sound needs | | 1.0 | Very literal | Exact sound reproduction |
# Loose interpretation - creative result
belt app run elevenlabs/sound-effects --input '{
"text": "Magical fairy dust sparkle",
"prompt_influence": 0.1
}'
# Literal interpretation - precise result
belt app run elevenlabs/sound-effects --input '{
"text": "Single gunshot, pistol, indoor range",
"prompt_influence": 0.8
}'
Be specific: "Heavy rain on metal roof" > "rain sound"
Include context: "Footsteps on gravel, slow walking pace" > "footsteps"
Describe mood: "Eerie wind howling through abandoned building" > "wind"
Specify material: "Glass shattering on concrete floor" > "breaking glass"
# 1. Generate sound effect
belt app run elevenlabs/sound-effects --input '{
"text": "Dramatic reveal swoosh with bass drop",
"duration_seconds": 2
}' > sfx.json
# 2. Merge with video
belt app run infsh/media-merger --input '{
"media": ["video.mp4", "<sfx-url>"]
}'
# ElevenLabs music generation
npx skills add inference-sh/skills@elevenlabs-music
# ElevenLabs TTS (combine voice with effects)
npx skills add inference-sh/skills@elevenlabs-tts
# AI music generation (Diffrythm, Tencent)
npx skills add inference-sh/skills@ai-music-generation
# Full platform skill (all 250+ apps)
npx skills add inference-sh/skills@infsh-cli
Browse all audio apps: belt app store --category audio
data-ai
Generate multi-person talking head podcast videos from scratch using AI — character creation, TTS, avatar animation, and video stitching. Use when the user wants to create a podcast, talking head video, or multi-speaker conversation video.
development
Build and deploy applications on inference.sh. Use when getting started, understanding the platform, creating apps, configuring resources, or needing an overview of inference.sh app development. Supports both Python and Node.js. Triggers: inference.sh app, belt app, inf.yml, inference.py, inference.js, deploy app, app development, build app, create app, GPU app, VRAM, app resources, app secrets, app integrations, multi-function app
data-ai
Generate multi-person talking head podcast videos from scratch using AI — character creation, TTS, avatar animation, and video stitching. Use when the user wants to create a podcast, talking head video, or multi-speaker conversation video.
tools
Generate videos with ByteDance Seedance 2.0 via inference.sh CLI. Unified model for text-to-video, image-to-video, and reference-to-video with synchronized audio, up to 1080p, 4-15s duration. Pro and Fast variants. Studio variants with private asset library for portrait consistency. Use for: social media videos, music videos, product demos, animated content, AI video with sound. Triggers: seedance, seedance 2, bytedance video, seedance t2v, seedance i2v, seedance r2v, video with audio, seedance 2.0, bytedance seedance, seedance studio