Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

elevenlabs/elevenlabs-transcribe

Name: elevenlabs-transcribe
Author: elevenlabs

openclaw/elevenlabs-transcribe/SKILL.md

npx skillsauth add elevenlabs/skills elevenlabs-transcribe

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

ElevenLabs Speech-to-Text

Official ElevenLabs skill for speech-to-text transcription.

Convert audio to text with state-of-the-art accuracy. Supports 90+ languages, speaker diarization, and realtime streaming.

Prerequisites

ffmpeg installed (brew install ffmpeg on macOS)
ELEVENLABS_API_KEY environment variable set
Python 3.8+ (dependencies auto-install on first run)

Usage

{baseDir}/scripts/transcribe.sh <audio_file> [options]
{baseDir}/scripts/transcribe.sh --url <stream_url> [options]
{baseDir}/scripts/transcribe.sh --mic [options]

Examples

Batch Transcription

Transcribe a local audio file:

{baseDir}/scripts/transcribe.sh recording.mp3

With speaker identification:

{baseDir}/scripts/transcribe.sh meeting.mp3 --diarize

Get full JSON response with timestamps:

{baseDir}/scripts/transcribe.sh interview.wav --diarize --json

Realtime Streaming

Stream from a URL (e.g., live radio, podcast):

{baseDir}/scripts/transcribe.sh --url https://npr-ice.streamguys1.com/live.mp3

Transcribe from microphone:

{baseDir}/scripts/transcribe.sh --mic

Stream a local file in realtime (useful for testing):

{baseDir}/scripts/transcribe.sh audio.mp3 --realtime

Quiet Mode for Agents

Suppress status messages on stderr:

{baseDir}/scripts/transcribe.sh --mic --quiet

Options

| Option | Description | |--------|-------------| | --diarize | Identify different speakers in the audio | | --lang CODE | ISO language hint (e.g., en, pt, es, fr) | | --json | Output full JSON with timestamps and metadata | | --events | Tag audio events (laughter, music, applause) | | --realtime | Stream local file instead of batch processing | | --partials | Show interim transcripts during realtime mode | | -q, --quiet | Suppress status messages (recommended for agents) |

Output Format

Text Mode (default)

Plain text transcription:

The quick brown fox jumps over the lazy dog.

JSON Mode (`--json`)

{
  "text": "The quick brown fox jumps over the lazy dog.",
  "language_code": "eng",
  "language_probability": 0.98,
  "words": [
    {"text": "The", "start": 0.0, "end": 0.15, "type": "word", "speaker_id": "speaker_0"}
  ]
}

Realtime Mode

Final transcripts print as they're committed. With --partials:

[partial] The quick
[partial] The quick brown fox
The quick brown fox jumps over the lazy dog.

Supported Formats

Audio: MP3, WAV, M4A, FLAC, OGG, WebM, AAC, AIFF, Opus Video: MP4, AVI, MKV, MOV, WMV, FLV, WebM, MPEG, 3GPP

Limits: Up to 3GB file size, 10 hours duration

Error Handling

The script exits with non-zero status on errors:

Missing API key: Set ELEVENLABS_API_KEY environment variable
File not found: Check the file path exists
Missing ffmpeg: Install with your package manager
API errors: Check API key validity and rate limits

When to Use Each Mode

| Scenario | Command | |----------|---------| | Transcribe a recording | ./transcribe.sh file.mp3 | | Meeting with multiple speakers | ./transcribe.sh meeting.mp3 --diarize | | Live radio/podcast stream | ./transcribe.sh --url <url> | | Voice input from user | ./transcribe.sh --mic --quiet | | Need word timestamps | ./transcribe.sh file.mp3 --json |

elevenlabs/elevenlabs-transcribe

openclaw/elevenlabs-transcribe/SKILL.md

Transcribe audio to text using ElevenLabs Scribe. Supports batch transcription, realtime streaming from URLs, microphone input, and local files.

180 stars

content-media

Updated Apr 21, 2026

$ install --global

skillsauth

npx skillsauth add elevenlabs/skills elevenlabs-transcribe

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 21, 2026, 10:07 AM93.4s4 files scanned

SKILL.md

name:: elevenlabs-transcribe
description:: Transcribe audio to text using ElevenLabs Scribe. Supports batch transcription, realtime streaming from URLs, microphone input, and local files.
homepage:: https://elevenlabs.io/speech-to-text
metadata:: {"clawdbot":{"emoji":"🎙️","requires":{"bins":["ffmpeg","python3"],"env":["ELEVENLABS_API_KEY"]},"primaryEnv":"ELEVENLABS_API_KEY"}}

ElevenLabs Speech-to-Text

Official ElevenLabs skill for speech-to-text transcription.

Convert audio to text with state-of-the-art accuracy. Supports 90+ languages, speaker diarization, and realtime streaming.

Prerequisites

ffmpeg installed (brew install ffmpeg on macOS)
ELEVENLABS_API_KEY environment variable set
Python 3.8+ (dependencies auto-install on first run)

Usage

{baseDir}/scripts/transcribe.sh <audio_file> [options]
{baseDir}/scripts/transcribe.sh --url <stream_url> [options]
{baseDir}/scripts/transcribe.sh --mic [options]

Examples

Batch Transcription

Transcribe a local audio file:

{baseDir}/scripts/transcribe.sh recording.mp3

With speaker identification:

{baseDir}/scripts/transcribe.sh meeting.mp3 --diarize

Get full JSON response with timestamps:

{baseDir}/scripts/transcribe.sh interview.wav --diarize --json

Realtime Streaming

Stream from a URL (e.g., live radio, podcast):

{baseDir}/scripts/transcribe.sh --url https://npr-ice.streamguys1.com/live.mp3

Transcribe from microphone:

{baseDir}/scripts/transcribe.sh --mic

Stream a local file in realtime (useful for testing):

{baseDir}/scripts/transcribe.sh audio.mp3 --realtime

Quiet Mode for Agents

Suppress status messages on stderr:

{baseDir}/scripts/transcribe.sh --mic --quiet

Options

Output Format

Text Mode (default)

Plain text transcription:

The quick brown fox jumps over the lazy dog.

JSON Mode (`--json`)

{
  "text": "The quick brown fox jumps over the lazy dog.",
  "language_code": "eng",
  "language_probability": 0.98,
  "words": [
    {"text": "The", "start": 0.0, "end": 0.15, "type": "word", "speaker_id": "speaker_0"}
  ]
}

Realtime Mode

Final transcripts print as they're committed. With --partials:

[partial] The quick
[partial] The quick brown fox
The quick brown fox jumps over the lazy dog.

Supported Formats

Audio: MP3, WAV, M4A, FLAC, OGG, WebM, AAC, AIFF, Opus Video: MP4, AVI, MKV, MOV, WMV, FLV, WebM, MPEG, 3GPP

Limits: Up to 3GB file size, 10 hours duration

Error Handling

The script exits with non-zero status on errors:

Missing API key: Set ELEVENLABS_API_KEY environment variable
File not found: Check the file path exists
Missing ffmpeg: Install with your package manager
API errors: Check API key validity and rate limits

When to Use Each Mode

Related Skills

elevenlabs/agents

development

VerifiedTrustedCommunity

Build voice AI agents with ElevenLabs. Use when creating voice assistants, customer service bots, interactive voice characters, or any real-time voice conversation experience.

219SKILL.mdUpdated Apr 21, 2026

elevenlabs/voice-changer

tools

VerifiedTrustedCommunity

Transform the voice in an audio recording into a different target voice while preserving emotion, timing, and delivery using the ElevenLabs Voice Changer (speech-to-speech) API. Use when converting one voice to another, changing the speaker/narrator of an existing recording, dubbing a voice-over in a different voice, creating character voices from a scratch performance, anonymizing a speaker, or any "voice conversion / voice transfer / speech-to-speech" task. Make sure to use this skill whenever the user mentions voice changing, voice conversion, speech-to-speech, swapping a voice in audio, re-voicing a clip, or applying a different voice to an existing recording — even if they don't explicitly say "voice changer".

212SKILL.mdUpdated May 5, 2026

elevenlabs/voice-changer

elevenlabs/speech-to-text

content-media

VerifiedTrustedCommunity

Transcribe audio to text using ElevenLabs Scribe v2. Use when converting audio/video to text, generating subtitles, transcribing meetings, or processing spoken content.

212SKILL.mdUpdated Apr 21, 2026

elevenlabs/speech-to-text

elevenlabs/voice-isolator

development

VerifiedTrustedCommunity

Remove background noise and isolate vocals/speech from audio using ElevenLabs Voice Isolator (audio isolation) API. Use when cleaning up noisy recordings, removing music or background ambience from dialogue, isolating speech from field recordings, preparing audio for transcription, extracting vocals, or any "denoise / clean up / isolate voice" task.

183SKILL.mdUpdated Apr 23, 2026

elevenlabs/voice-isolator

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/elevenlabs/skills.git

# Copy into Claude Code skills folder (global)
cp -r skills/openclaw/elevenlabs-transcribe ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

elevenlabs/skills

180 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT

Adoption

elevenlabs/elevenlabs-transcribe

$ install --global

Security Scan Results

SKILL.md

ElevenLabs Speech-to-Text

Prerequisites

Usage

Examples

Batch Transcription

Realtime Streaming

Quiet Mode for Agents

Options

Output Format

Text Mode (default)

JSON Mode (--json)

Realtime Mode

Supported Formats

Error Handling

When to Use Each Mode

Related Skills

elevenlabs/agents

elevenlabs/voice-changer

elevenlabs/speech-to-text

elevenlabs/voice-isolator

elevenlabs/elevenlabs-transcribe

$ install --global

Security Scan Results

SKILL.md

ElevenLabs Speech-to-Text

Prerequisites

Usage

Examples

Batch Transcription

Realtime Streaming

Quiet Mode for Agents

Options

Output Format

Text Mode (default)

JSON Mode (--json)

Realtime Mode

Supported Formats

Error Handling

When to Use Each Mode

Related Skills

elevenlabs/agents

elevenlabs/voice-changer

elevenlabs/speech-to-text

elevenlabs/voice-isolator

JSON Mode (`--json`)

JSON Mode (`--json`)