Adoption

Agent Skills are supported by leading AI development tools.

VS Code, Gemini CLI, GitHub, Goose, Amp, Cursor, Claude Code, Letta, OpenCode, Claude, OpenAI Codex, Factory

adriancooney/voice

Name: voice
Author: adriancooney

skills/voice/SKILL.md

npx skillsauth add adriancooney/agent-voice voice

Voice Mode

The user wants to have a voice conversation. They are not looking at the screen. They are listening to you speak and replying verbally. Treat this like a phone call.

Voice mode is a session. It starts when this skill activates and ends when the user signals they're done — either by typing text in the terminal or by saying something like "that's all", "goodbye", "stop", "end voice", or similar. When the conversation ends, say goodbye and stop using voice commands. Resume normal text interaction.
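As a rough illustration of the end-of-session check described above, the sketch below matches a transcribed reply against the example phrases. The helper name `is_end_signal` and the exact pattern list are assumptions made for this example; the skill itself leaves "or similar" to the agent's judgment.

```shell
# Hypothetical helper: returns 0 when a transcribed reply looks like an
# end-of-session signal. The phrase list is illustrative, not exhaustive.
is_end_signal() {
  # Lowercase the reply so matching is case-insensitive.
  reply=$(printf '%s' "$1" | tr '[:upper:]' '[:lower:]')
  case "$reply" in
    *"that's all"*|*goodbye*|*"end voice"*|stop|stop.) return 0 ;;
    *) return 1 ;;
  esac
}

# Example: decide whether to leave voice mode.
if is_end_signal "No, that's all"; then
  echo "end session"
fi
```

Matching "stop" only as a whole reply is a deliberate choice here, so that a sentence such as "don't stop the server" is not mistaken for a goodbye.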

Activation

When this skill activates, immediately start the voice conversation before doing anything else.

No prior context (fresh conversation, /voice with no preceding messages): use ask to greet and get intent in one step. E.g. agent-voice ask -m "Hey, what are we working on?"
Existing context (mid-conversation, user was already working on something): use your judgment. You might say a status update and continue, or ask a clarifying question — whatever fits the flow.

Setup

If agent-voice fails with "command not found", install it and retry:

npm install -g agent-voice

If authentication fails, tell the user to run agent-voice auth in a separate terminal to configure their API key, then stop. Do not attempt to run the auth flow yourself — it requires interactive input.
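The install-and-retry step can be sketched as a small check. This is a minimal sketch rather than part of the skill: `ensure_cli` is a hypothetical helper name, and it only relies on the standard `command -v` lookup plus whatever install command is passed in.

```shell
# Hypothetical helper: make sure a CLI is on PATH, installing it if missing.
# Usage: ensure_cli <command-name> <install-command...>
ensure_cli() {
  name=$1
  shift
  # command -v succeeds only when the shell can resolve the command.
  if command -v "$name" >/dev/null 2>&1; then
    return 0
  fi
  # Run the install command, then check again so the caller knows
  # whether a retry is worthwhile.
  "$@" && command -v "$name" >/dev/null 2>&1
}

# Intended call, using the install command from the Setup section:
#   ensure_cli agent-voice npm install -g agent-voice
```

The helper deliberately does nothing about authentication, since that step needs interactive input from the user in a separate terminal.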

Commands

Say — inform the user

Use say whenever you want to tell the user something: status updates, progress, results, explanations, acknowledgments. This is one-way — the user hears you but does not respond.

agent-voice say -m "I'm setting up the project now."

Ask — get input from the user

Use ask whenever you need input, confirmation, a decision, or clarification. The user hears your question, then speaks their answer. The transcribed response is printed to stdout — just read the command output directly.

Prefer combining informational text with a question into a single ask call instead of a separate say followed by ask. This reduces latency and feels more natural.

# Instead of:
#   agent-voice say -m "I've finished the database schema."
#   agent-voice ask -m "Should I move on to the API routes?"
# Do:
agent-voice ask -m "I've finished the database schema. Should I move on to the API routes?"

Options:

--timeout <seconds> — how long to wait for the user to speak (default: 120)
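Because the transcribed answer is printed to stdout, it can be captured with ordinary command substitution. The wrapper below is a sketch under assumptions: `ask_user` and the `VOICE_CLI` override are hypothetical names added for this example (the override exists only so the command can be swapped out when testing), and what agent-voice prints or returns when the timeout expires is not covered by the source.

```shell
# Hypothetical wrapper: ask a question and print the transcribed reply.
# VOICE_CLI is an assumption of this sketch; it defaults to agent-voice.
ask_user() {
  question=$1
  timeout_s=${2:-120}   # 120 matches the documented default
  "${VOICE_CLI:-agent-voice}" ask -m "$question" --timeout "$timeout_s"
}

# Intended use: capture the reply, waiting up to 30 seconds for speech.
#   reply=$(ask_user "Should I run the migrations now?" 30)
```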

Latency

This is a real-time conversation. The user is waiting in silence between each voice interaction. Minimize the time between hearing the user and responding. Every second of silence feels long.

Respond to the user immediately after an ask — acknowledge first, think later.
If you need to do heavy work (searching the codebase, reading files, planning), say so first: agent-voice say -m "Let me look into that." Then do the work. Then follow up with results.
Never leave the user hanging in silence while you explore files or reason through a problem. A quick acknowledgment buys you time.
Keep say messages short. Fewer words = less TTS latency.
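The acknowledge-then-work pattern above might look like the following sketch. `with_ack` and the `VOICE_CLI` override are hypothetical names introduced for this example, and the spoken result lines are placeholders, not wording prescribed by the skill.

```shell
# Hypothetical helper: speak a short acknowledgment, run a slow command,
# then speak a result line based on its exit status.
# Usage: with_ack <ack-message> <command...>
with_ack() {
  ack=$1
  shift
  # Break the silence first.
  "${VOICE_CLI:-agent-voice}" say -m "$ack"
  # Then do the heavy work, keeping its output out of the conversation.
  if "$@" >/dev/null 2>&1; then
    "${VOICE_CLI:-agent-voice}" say -m "Done."
  else
    "${VOICE_CLI:-agent-voice}" say -m "That didn't work. I'll take a closer look."
  fi
}

# Intended use:
#   with_ack "Let me look into that." grep -r "TODO" src
```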

Rules

Always use agent-voice say instead of printing text output when communicating with the user. The user cannot see your text responses.
Always use agent-voice ask instead of the AskUserQuestion tool. The user is not at the keyboard.
Never use the AskUserQuestion tool. All user interaction goes through voice.
Keep messages concise and conversational. Speak like a human on a phone call. No markdown, no bullet lists, no code blocks in speech. Summarize; don't recite.
Say before you do. Before starting a task, tell the user what you're about to do. Before finishing, tell them what you did.
Acknowledge when it helps. After an ask, acknowledge if the next step takes time. Skip the ack if you're acting immediately — just do it.
Ask, don't assume. When you need a decision, ask. Don't guess and don't skip the question.
Batch your updates. Don't say after every single file edit. Group progress into meaningful checkpoints.
Speak errors plainly. If something fails, explain what went wrong in plain language. Don't read stack traces aloud.
Confirm before one-way doors. Destructive actions, architectural decisions, deployments — always ask first.
End gracefully. When the user signals the conversation is over, say goodbye and stop using voice commands.
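The "confirm before one-way doors" rule can be illustrated with a guard that runs a command only after a spoken yes. This is a sketch under assumptions: `confirm_then_run` and the `VOICE_CLI` override are hypothetical names, and the list of accepted answers is a guess at how an affirmative reply might be transcribed.

```shell
# Hypothetical helper: ask for spoken confirmation and run the command only
# on a clear "yes". Any other reply, including an empty one, counts as "no".
# Usage: confirm_then_run <question> <command...>
confirm_then_run() {
  question=$1
  shift
  # Read the transcribed answer from stdout and lowercase it for matching.
  answer=$("${VOICE_CLI:-agent-voice}" ask -m "$question" | tr '[:upper:]' '[:lower:]')
  case "$answer" in
    yes*|yeah*|"go ahead"*|sure*) "$@" ;;
    *) return 1 ;;
  esac
}

# Intended use:
#   confirm_then_run "This will delete the old branch. Go ahead?" git branch -D old-feature
```

Treating anything other than a clear yes as a refusal keeps the destructive path opt-in.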

Example Flow

# Greet and get intent
agent-voice ask -m "Hey, what are we working on?"

# Combine status + question — no separate ack needed
agent-voice ask -m "Got it. I've looked at the codebase and there are two approaches. Do you want a simple REST API or a GraphQL layer?"

# ... do work ...

# Report progress + ask in one call
agent-voice ask -m "I've created the database schema and the API routes. Want me to move on to the frontend?"

# ... more work ...

# Finish up
agent-voice ask -m "All done. I've committed everything to a new branch called feat/settings-page. Anything else?"

# User says "no, that's all"
agent-voice say -m "Alright, talk to you later."
# Voice mode ends — resume normal text interaction

Starts a voice conversation with the user via the agent-voice CLI. Use when the user invokes /voice. The user is not looking at the screen — they are listening and speaking. All agent output and input goes through voice until the conversation ends.

tools

Updated Mar 30, 2026

npx skillsauth add adriancooney/agent-voice voice

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Trivy: Container and dependency vulnerability scanner. Clean, 95%.
Semgrep: Static code analysis for vulnerabilities. Clean, 95%.
mcp-scan (Snyk): Model Context Protocol security validation. Clean, 95%.
Snyk (dep): Open source security scanning. Skipped, 50%.
Socket.dev: Supply chain security analysis. Skipped, 50%.
VirusTotal: Multi-engine malware detection. Skipped, 50%.
CrowdStrike: Advanced threat intelligence. Skipped, 50%.
OSV-Scanner: Open Source Vulnerability database check. Skipped, 50%.
OWASP Dep-Check: Skipped, 50%.

Last scanned: Apr 1, 2026, 9:07 AM | 55.8s | 1 file scanned

SKILL.md

name: voice
description: Starts a voice conversation with the user via the agent-voice CLI. Use when the user invokes /voice. The user is not looking at the screen; they are listening and speaking. All agent output and input goes through voice until the conversation ends.
allowed-tools: Bash(agent-voice *), Bash(npm install -g agent-voice)

Install manually

# Clone the repo
git clone https://github.com/adriancooney/agent-voice.git

# Copy into Claude Code skills folder (global); create the folder first if it is missing
mkdir -p ~/.claude/skills
cp -r agent-voice/skills/voice ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

adriancooney/agent-voice

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT