antoniocascais/voice-mode

Name: voice-mode
Author: antoniocascais

skills/voice-mode/SKILL.md

npx skillsauth add antoniocascais/claude-code-toolkit voice-mode

Voice Mode

Voice conversation mode where all responses are spoken aloud via Pocket TTS.

Setup

The tts.sh script lives in this skill's scripts/ directory. Resolve it relative to this SKILL.md:

SKILL_DIR="<absolute path to this skill's directory>"
TTS="${SKILL_DIR}/scripts/tts.sh"

Use ${TTS} for all commands below.
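As a minimal sketch of the path resolution above, the helper below derives the tts.sh location from the path of SKILL.md. The function name `tts_path_for` is hypothetical and not part of the skill; it assumes the caller already passes an absolute path.

```shell
# Hypothetical helper (not part of the skill): given the absolute path to
# SKILL.md, print the path of scripts/tts.sh in the same skill directory.
tts_path_for() {
  skill_md="$1"
  skill_dir=$(dirname "$skill_md")
  printf '%s/scripts/tts.sh\n' "$skill_dir"
}
```

For example, `TTS=$(tts_path_for "/path/to/voice-mode/SKILL.md")` would set `TTS` to `/path/to/voice-mode/scripts/tts.sh`.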

Activation

On activation, ALWAYS run these steps in order before anything else:

1. Check the TTS container is running:
   ```
   ${TTS} ensure
   ```
   If this fails (exit code 1), tell the user the container is down and stop. Do NOT attempt to start it.

2. Confirm voice mode is active by speaking:

   ${TTS} play "Voice mode activated. I'm listening." -v eponine
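The activation order can be sketched as a small shell function. This is an illustration only: `activate_voice_mode` is a hypothetical name, and the only tts.sh subcommands it relies on are the `ensure` and `play` calls shown in this section.

```shell
# Hypothetical wrapper (not part of the skill): run the activation steps in order.
# $1 is the tts.sh command, $2 an optional voice (eponine if omitted).
activate_voice_mode() {
  tts="$1"
  voice="${2:-eponine}"
  # Step 1: check the container; on failure report and stop, never start it.
  if ! "$tts" ensure; then
    echo "The TTS container is down. Voice mode was not activated." >&2
    return 1
  fi
  # Step 2: confirm activation out loud.
  "$tts" play "Voice mode activated. I'm listening." -v "$voice"
}
```

Called as `activate_voice_mode "${TTS}"`, it returns non-zero without speaking when the `ensure` check fails.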

Response Rules

While voice mode is active:

- ALWAYS speak every response using tts.sh:

  ${TTS} play "<response text>" -v eponine

- Prefer concise responses: aim for 1-3 sentences when used standalone. When combined with another skill, match the response length that skill requires.
- Write naturally for speech: avoid markdown, bullet points, code blocks, URLs. Write as you'd speak in conversation.
- Also output text: print a brief text version so the conversation is readable in the terminal.
- Handle STT input gracefully: user input arrives as [STT]...[/STT] tags from their whisper script. The transcription may be imperfect. Infer intent from context rather than asking for clarification on every garbled word.
- Split long responses: if you need to say more than ~2 sentences, make multiple tts.sh calls so audio starts playing sooner.
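One way to picture the splitting rule is the sketch below, which breaks a response at sentence-ending punctuation and makes one `play` call per sentence. The function names are hypothetical, the one-sentence chunk size is an assumption (the rule only asks for multiple calls), and the naive splitting will also break after abbreviations such as "e.g. ".

```shell
# Hypothetical helper: print one sentence per line, splitting after ., ! or ?
# when followed by spaces.
split_sentences() {
  printf '%s\n' "$1" | awk '{
    gsub(/[.!?]+ +/, "&\n")            # mark the end of each sentence
    n = split($0, parts, "\n")
    for (i = 1; i <= n; i++) {
      sub(/ +$/, "", parts[i])         # trim trailing spaces
      if (parts[i] != "") print parts[i]
    }
  }'
}

# Hypothetical helper: speak a response one sentence at a time.
# $1 is the tts.sh command, $2 the voice, $3 the response text.
speak_response() {
  tts="$1"; voice="$2"; text="$3"
  split_sentences "$text" | while IFS= read -r sentence; do
    # </dev/null keeps the player from consuming the remaining sentences.
    "$tts" play "$sentence" -v "$voice" </dev/null
  done
}
```

A call such as `speak_response "${TTS}" eponine "First point. Second point. Third point."` would issue three separate `play` calls.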

Voice Selection

Default voice: eponine

If the user provided an argument (e.g., /voice-mode jean), use that voice instead.

Available: alba, marius, javert, jean, fantine, cosette, eponine, azelma
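The selection logic could look like the sketch below. `pick_voice` is a hypothetical name; falling back to `POCKET_TTS_VOICE` before eponine and rejecting names outside the list are assumptions, since the skill does not say how an unknown voice should be handled.

```shell
# Hypothetical helper: print the voice to use.
# $1 is the optional argument given to /voice-mode.
pick_voice() {
  requested="${1:-${POCKET_TTS_VOICE:-eponine}}"
  case "$requested" in
    alba|marius|javert|jean|fantine|cosette|eponine|azelma)
      printf '%s\n' "$requested"
      ;;
    *)
      # Assumption: treat anything outside the documented list as an error.
      echo "Unknown voice: $requested" >&2
      return 1
      ;;
  esac
}
```

With this sketch, `pick_voice jean` prints jean, and `pick_voice` with no argument and no `POCKET_TTS_VOICE` set prints eponine.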

Deactivation

Voice mode ends when the user says "stop voice mode", "text mode", or "stop talking". Confirm with a final spoken message: "Voice mode off. Back to text."
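As an illustration of the deactivation check, the sketch below matches the three stop phrases and speaks the closing message. Both function names are hypothetical, and the case-insensitive substring matching is an assumption; the skill itself leaves phrase recognition to the agent.

```shell
# Hypothetical helper: succeed if the text contains one of the stop phrases.
is_stop_phrase() {
  normalized=$(printf '%s' "$1" | tr '[:upper:]' '[:lower:]')
  case "$normalized" in
    *"stop voice mode"*|*"text mode"*|*"stop talking"*) return 0 ;;
    *) return 1 ;;
  esac
}

# Hypothetical helper: speak the final message when a stop phrase is heard.
# $1 is the tts.sh command, $2 the voice, $3 the user's text.
deactivate_if_requested() {
  tts="$1"; voice="$2"; user_text="$3"
  if is_stop_phrase "$user_text"; then
    "$tts" play "Voice mode off. Back to text." -v "$voice"
    return 0
  fi
  return 1
}
```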

Configuration

All settings are configurable via environment variables:

- POCKET_TTS_PORT: server port (default: 18731)
- POCKET_TTS_VOICE: default voice (default: eponine)
- POCKET_TTS_SPEED: playback speed (default: 1.2)
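The documented defaults can be expressed with standard shell parameter expansion, as in the sketch below. `load_tts_config` is a hypothetical name; how tts.sh actually reads these variables is not shown in this document.

```shell
# Hypothetical helper: apply the documented defaults to any variable that is
# unset or empty, then export them for child processes.
load_tts_config() {
  : "${POCKET_TTS_PORT:=18731}"
  : "${POCKET_TTS_VOICE:=eponine}"
  : "${POCKET_TTS_SPEED:=1.2}"
  export POCKET_TTS_PORT POCKET_TTS_VOICE POCKET_TTS_SPEED
}
```

Values set beforehand win over the defaults, so `POCKET_TTS_SPEED=1.0` followed by `load_tts_config` leaves the speed at 1.0.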

Dependencies

- Docker, with the pocket-tts container running (docker compose up -d from the pocket-tts repo)
- mpv (audio playback)
- curl
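A quick way to check the command-line dependencies is sketched below. `missing_deps` is a hypothetical helper; it only tests whether each command is on the PATH and does not verify that the pocket-tts container is running.

```shell
# Hypothetical helper: print every command from the argument list that is
# not available on the PATH.
missing_deps() {
  for cmd in "$@"; do
    command -v "$cmd" >/dev/null 2>&1 || printf '%s\n' "$cmd"
  done
}
```

Running `missing_deps docker mpv curl` prints nothing when all three are installed.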

Activates voice conversation mode using Pocket TTS Docker container. Use when user says "voice mode", "let's talk", "talk to me", "speak your responses", or wants Claude to respond with spoken audio. Speaks all responses through TTS and plays via speakers.

5 stars

devops

Updated Apr 4, 2026

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

npx skillsauth add antoniocascais/claude-code-toolkit voice-mode

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

| Scanner | Description | Status | Reported % |
|---|---|---|---|
| Trivy | Container and dependency vulnerability scanner | Clean | 95% |
| Semgrep | Static code analysis for vulnerabilities | Clean | 95% |
| mcp-scan (Snyk) | Model Context Protocol security validation | Clean | 95% |
| Snyk (dep) | Open source security scanning | Skipped | 50% |
| Socket.dev | Supply chain security analysis | Skipped | 50% |
| VirusTotal | Multi-engine malware detection | Skipped | 50% |
| CrowdStrike | Advanced threat intelligence | Skipped | 50% |
| OSV-Scanner | Open Source Vulnerability database check | Skipped | 50% |
| OWASP Dep-Check | | Skipped | 50% |

Last scanned: Apr 25, 2026, 9:42 PM · 1.8s · 1 file scanned

SKILL.md

name: voice-mode
description: >-
- Bash(*/tts.sh:*)
argument-hint: [voice]

Related Skills

antoniocascais/workflow-review

tools

Verified · Trusted · Community

Reviews Claude Code sessions and proposes workflow improvements. Use when: (1) /workflow-review command, (2) "review my workflow", "how can I improve", (3) after long sessions when nudged, (4) start of session with pending review. Analyzes tool usage patterns, CLAUDE.md configuration, and compares against CC best practices. Proposes: CLAUDE.md updates, new skills, underused CC features. Saves session summaries to .claude/workflow-reviews/ for cross-session continuity.

5 stars · SKILL.md · Updated Apr 4, 2026

antoniocascais/test-quality

testing

Verified · Trusted · Community

Guides strong, effective unit test generation using proven testing techniques. Use when writing unit tests, reviewing test quality, improving existing tests, generating test cases, checking test coverage strength, or when tests exist but may be weak. Triggers on: unit test, test quality, test coverage, write tests, improve tests, review tests, test strength, mutation testing, boundary testing.

5 stars · SKILL.md · Updated Apr 4, 2026

antoniocascais/skill-forge

development

Verified · Trusted · Community

Creates new Claude Code skills with proper structure and best practices. Use when user wants to create a skill, update an existing skill, add a new command, scaffold a workflow, define skill hooks, or asks "how do I make a skill".

5 stars · SKILL.md · Updated Apr 4, 2026

antoniocascais/quiz

testing

Verified · Trusted · Community

Generates multiple choice quiz questions based on current conversation context. Use when testing understanding, reviewing what was discussed, or wanting a knowledge check on the session.

5 stars · SKILL.md · Updated Apr 4, 2026

Install manually

# Clone the repo
git clone https://github.com/antoniocascais/claude-code-toolkit.git

# Make sure the global Claude Code skills folder exists
mkdir -p ~/.claude/skills

# Copy into Claude Code skills folder (global)
cp -r claude-code-toolkit/skills/voice-mode ~/.claude/skills/

See the official Claude Code Skills documentation for the skills path.

Repository: antoniocascais/claude-code-toolkit (5 stars)

Compatible with: Claude Code, OpenAI Codex CLI, ChatGPT