skills/skill-audio/SKILL.md
Generate high-quality audio using ElevenLabs, OpenAI TTS, and Google Text-to-Speech APIs. Support for text-to-speech, voice cloning, multiple languages, and various voice options.
npx skillsauth add hasna/skills audioInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
This skill provides a unified interface for generating audio from text using multiple AI-powered text-to-speech providers.
This CLI is API-backed. Set SKILL_API_KEY when routing through the hosted skills/connectors runtime; provider-specific keys are managed by that runtime.
bun run src/index.ts generate --provider elevenlabs --text "Hello world" --voice rachel --output ./output.mp3
bun run src/index.ts generate --provider openai --text "Hello world" --voice nova --output ./output.mp3
bun run src/index.ts generate --provider google --text "Hello world" --output ./output.mp3
bun run src/index.ts voices --provider elevenlabs
bun run src/index.ts voices --provider openai
bun run src/index.ts voices --provider google
Set up API keys as environment variables:
export ELEVENLABS_API_KEY=your_elevenlabs_key
export OPENAI_API_KEY=your_openai_key
export GOOGLE_API_KEY=your_google_key
tools
Generate hosted voiceover variants and short jingles
tools
Generate premium video highlight packages with clip plans, captions, thumbnails, chapter markers, social copy, edit decisions, and manifest metadata.
testing
Generate high-quality articles using parallel AI agents. Supports research, writing, and optional cover image generation. Write single articles or batch process multiple topics with configurable parallelism.
testing
Generate videos using OpenAI Sora, Minimax Hailuo, Gemini Veo, or Seedance through the hosted Skills runtime with provider-cost pricing.