workspace/skills/webex-voice-interface/SKILL.md
Respond to WebEx voice clips with both text and an MP3 voice reply using edge-tts. Voice IN is already handled by OpenClaw transcription. Use when a user sends a voice message in WebEx, you need to reply with audio, or you want to generate a spoken MP3 response.
npx skillsauth add automateyournetwork/netclaw webex-voice-interfaceInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
User sends voice clip in WebEx space
|
v
OpenClaw transcribes automatically (built-in)
|
v
NetClaw processes with full skill set
(pyATS, NetBox, ServiceNow, all 43 MCP servers)
|
v
python3 $MCP_CALL "python3 -u $TTS_MCP_SCRIPT" text_to_speech -> MP3 file
|
v
Upload MP3 to WebEx thread + post text response
Treat the transcribed voice message identically to a typed text message. Use the full NetClaw skill set -- pyATS, NetBox, ServiceNow, etc.
After composing your text response, call text_to_speech:
python3 $MCP_CALL "python3 -u $TTS_MCP_SCRIPT" text_to_speech '{"text":"R1 has 3 OSPF neighbors, all in FULL state on Area 0...","voice":"en-US-GuyNeural"}'
This returns JSON with an output_path to the generated MP3 file.
To list available voices:
python3 $MCP_CALL "python3 -u $TTS_MCP_SCRIPT" list_voices '{"language":"en"}'
Post the text response in the WebEx space/thread AND upload the MP3 file as an attachment via the Messages API (multipart/form-data with files parameter):
Voice Response [MP3 audio file attached]
R1 has 3 OSPF neighbors, all in FULL state on Area 0:
- 2.2.2.2 (R2) via Gi1 -- FULL/DR
- 3.3.3.3 (R3) via Gi2 -- FULL/BDR
Always deliver text AND voice. Text is primary (searchable, accessible). Voice is supplementary.
| Voice | Description | |-------|-------------| | en-US-GuyNeural | Professional male -- default | | en-US-JennyNeural | Professional female | | en-US-AriaNeural | Conversational female | | en-GB-RyanNeural | British male |
Users can request a voice change:
Call list_voices to see all 300+ available voices.
| Phase | Latency | |-------|---------| | edge-tts synthesis | 1-2 seconds | | WebEx MP3 upload | < 1 second |
Voice synthesis adds minimal overhead to the response time.
If TTS fails, deliver the text response immediately. Do not block on voice.
WebEx Messages API supports file attachments via multipart upload:
POST https://webexapis.com/v1/messages
Content-Type: multipart/form-data
- roomId: <space-id>
- parentId: <thread-parent-message-id> (if threading)
- text: "Voice Response: R1 has 3 OSPF neighbors..."
- files: @/tmp/netclaw-tts/response.mp3
Files up to 100 MB are supported. MP3 voice responses are typically under 1 MB.
Record voice interactions in the GAIT audit trail:
Input: Voice clip from @user (transcript: "What are your interfaces?")
Action: Queried R1 interfaces via pyATS
Output: 4 interfaces found -- text + voice response delivered to WebEx
testing
Human-in-the-loop escalation via HumanRail — route low-confidence agent decisions, pre-destructive operation approvals, and ambiguous incident tickets to real human engineers. Human answers are verified and returned as structured output. Workers are paid via Lightning Network. Use when the agent is uncertain, when a destructive change needs explicit human sign-off beyond a ServiceNow CR, or when an ambiguous ticket requires human triage before automated handling.
testing
Manage EVE-NG node lifecycle. Use when listing nodes, checking runtime state, creating or deleting nodes, starting or stopping nodes or whole labs, verifying node details, or wiping node NVRAM back to factory defaults.
development
Manage EVE-NG labs and platform inventory. Use when listing labs, checking lab metadata, creating or deleting labs, importing or exporting lab archives, checking EVE-NG health or auth, or verifying available node images before build work.
tools
Execute live CLI commands on running EVE-NG nodes over telnet console. Use when running show commands, making live config changes, verifying protocol state, testing connectivity, checking console readiness, or interacting with IOS, Junos, VPCS, EOS, or NX-OS nodes.