tools/audio/elevenlabs-sound-effects/SKILL.md
Generate AI sound effects from text descriptions with ElevenLabs via inference.sh CLI. Capabilities: text-to-sound-effect, custom duration, royalty-free audio. Use for: video production, game audio, podcasts, films, presentations, social media. Triggers: sound effects, sfx, sound generation, ai sound effects, generate sound, foley, audio effects, sound design, text to sound, elevenlabs sound, eleven labs sfx, ambient sound, cinematic sound, game sound effects
npx skillsauth add inference-sh/agent-skills elevenlabs-sound-effectsInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Generate sound effects from text descriptions via inference.sh CLI.

Requires inference.sh CLI (
belt). Install instructions
belt login
# Generate a sound effect
belt app run elevenlabs/sound-effects --input '{"text": "Thunder rumbling in the distance"}'
| Parameter | Type | Description |
|-----------|------|-------------|
| text | string | Description of the sound effect (max 1000 chars) |
| duration_seconds | number | Duration 0.5-22 seconds (optional, auto if omitted) |
| prompt_influence | number | 0-1, how literal to interpret prompt (default: 0.3) |
# Epic trailer hit
belt app run elevenlabs/sound-effects --input '{"text": "Cinematic braam, deep bass impact"}'
# Suspense drone
belt app run elevenlabs/sound-effects --input '{
"text": "Dark atmospheric drone, tension building, horror",
"duration_seconds": 10
}'
# Whoosh transition
belt app run elevenlabs/sound-effects --input '{
"text": "Fast cinematic whoosh transition",
"duration_seconds": 1.5
}'
# Rain
belt app run elevenlabs/sound-effects --input '{
"text": "Heavy rain on a tin roof with occasional thunder",
"duration_seconds": 15
}'
# Forest ambience
belt app run elevenlabs/sound-effects --input '{
"text": "Forest ambience with birds chirping and gentle wind",
"duration_seconds": 20
}'
# Ocean waves
belt app run elevenlabs/sound-effects --input '{
"text": "Ocean waves crashing on a beach, calming",
"duration_seconds": 15
}'
# Power-up
belt app run elevenlabs/sound-effects --input '{
"text": "Retro game power-up sound, ascending tones",
"duration_seconds": 1
}'
# Explosion
belt app run elevenlabs/sound-effects --input '{
"text": "Sci-fi laser explosion, futuristic",
"duration_seconds": 3
}'
# UI click
belt app run elevenlabs/sound-effects --input '{
"text": "Soft UI button click, subtle and clean",
"duration_seconds": 0.5
}'
# Doorbell
belt app run elevenlabs/sound-effects --input '{"text": "Classic doorbell ring"}'
# Typing
belt app run elevenlabs/sound-effects --input '{
"text": "Mechanical keyboard typing, fast, clicky",
"duration_seconds": 5
}'
# Notification
belt app run elevenlabs/sound-effects --input '{
"text": "Pleasant notification chime, positive",
"duration_seconds": 1
}'
Control how literally the model interprets your description:
| Value | Effect | Best For | |-------|--------|----------| | 0.0 | Very loose interpretation | Creative, surprising results | | 0.3 | Balanced (default) | General purpose | | 0.7 | Close to description | Specific sound needs | | 1.0 | Very literal | Exact sound reproduction |
# Loose interpretation - creative result
belt app run elevenlabs/sound-effects --input '{
"text": "Magical fairy dust sparkle",
"prompt_influence": 0.1
}'
# Literal interpretation - precise result
belt app run elevenlabs/sound-effects --input '{
"text": "Single gunshot, pistol, indoor range",
"prompt_influence": 0.8
}'
Be specific: "Heavy rain on metal roof" > "rain sound"
Include context: "Footsteps on gravel, slow walking pace" > "footsteps"
Describe mood: "Eerie wind howling through abandoned building" > "wind"
Specify material: "Glass shattering on concrete floor" > "breaking glass"
# 1. Generate sound effect
belt app run elevenlabs/sound-effects --input '{
"text": "Dramatic reveal swoosh with bass drop",
"duration_seconds": 2
}' > sfx.json
# 2. Merge with video
belt app run infsh/media-merger --input '{
"media": ["video.mp4", "<sfx-url>"]
}'
# ElevenLabs music generation
npx skills add inference-sh/skills@elevenlabs-music
# ElevenLabs TTS (combine voice with effects)
npx skills add inference-sh/skills@elevenlabs-tts
# AI music generation (Diffrythm, Tencent)
npx skills add inference-sh/skills@ai-music-generation
# Full platform skill (all 250+ apps)
npx skills add inference-sh/skills@infsh-cli
Browse all audio apps: belt app list --category audio
development
Render videos from React/Remotion component code via inference.sh. Pass TSX code, get MP4. Supports all Remotion APIs: useCurrentFrame, useVideoConfig, spring, interpolate, AbsoluteFill, Sequence. Configurable resolution, FPS, duration, codec. Use for: programmatic video generation, animated graphics, motion design, data-driven videos, React animations to video. Triggers: remotion, render video from code, tsx to video, react video, programmatic video, remotion render, code to video, animated video, motion graphics code, react animation video
tools
Generate videos with Pruna P-Video and WAN models via inference.sh CLI. Models: P-Video, WAN-T2V, WAN-I2V. Capabilities: text-to-video, image-to-video, audio support, 720p/1080p, fast inference. Pruna optimizes models for speed without quality loss. Triggers: pruna video, p-video, pruna ai video, fast video generation, optimized video, wan t2v, wan i2v, economic video generation, cheap video generation, pruna text to video, pruna image to video
documentation
Still-to-video conversion guide: model selection, motion prompting, and camera movement. Covers Wan 2.5 i2v, Seedance, Fabric, Grok Video with when to use each. Use for: animating images, creating video from stills, adding motion, product animations. Triggers: image to video, i2v, animate image, still to video, add motion to image, image animation, photo to video, animate still, wan i2v, image2video, bring image to life, animate photo, motion from image
tools
Generate videos with Google Veo models via inference.sh CLI. Models: Veo 3.1, Veo 3.1 Fast, Veo 3, Veo 3 Fast, Veo 2. Capabilities: text-to-video, cinematic output, high quality video generation. Triggers: veo, google veo, veo 3, veo 2, veo 3.1, vertex ai video, google video generation, google video ai, veo model, veo video