skills/qwen-tts/SKILL.md
Text-to-speech using Qwen3-TTS CustomVoice MLX model. Supports 9 speakers and multiple emotion/style instructions. Optimized for Apple Silicon. Use when user wants audio speech output.
npx skillsauth add stvlynn/skills qwen-ttsInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Text-to-speech using Qwen3-TTS CustomVoice model, running locally on Apple Silicon via MLX.
mlx-community/Qwen3-TTS-12Hz-0.6B-CustomVoice-4bit (4-bit quantized, ~600MB)cd /path/to/skills/skills/qwen-tts
python3 -m venv venv
source venv/bin/activate
模型从 ModelScope 镜像下载(国内更快):
pip install -r scripts/requirements.txt
首次运行时会自动下载模型。如需提前下载:
source venv/bin/activate
export HF_ENDPOINT="https://hf-mirror.com"
python3 -c "from mlx_audio.tts.utils import load_model; load_model('mlx-community/Qwen3-TTS-12Hz-0.6B-CustomVoice-4bit')"
source venv/bin/activate
python3 scripts/tts.py "你好,这是一段测试语音。" --output /tmp
# Should produce a WAV file in /tmp/
| 问题 | 解决方法 |
|------|----------|
| ModuleNotFoundError: mlx | 确认使用 Apple Silicon Mac,MLX 不支持 Intel Mac |
| 模型下载缓慢 | 设置 export HF_ENDPOINT="https://hf-mirror.com" |
| 内存不足 | 4-bit 模型约需 1.5GB 内存,关闭其他大型应用 |
| 无声音输出 | 检查输出文件是否为 0 字节,可能是文本过短 |
⚠️ 以下为示例默认值。请根据实际使用场景修改 speaker 和 instruct。
Serena(示例)撒娇语气(示例)Chinese1.0| Speaker | Language |
|---------|----------|
| Serena | Chinese |
| Vivian | Chinese |
| Uncle_Fu | Chinese |
| Eric | Chinese |
| Dylan | Chinese |
| Ryan | English |
| Aiden | English |
| Ono_Anna | Japanese |
| Sohee | Korean |
撒娇语气 — coquettish冷静分析 — calm analysis惊讶 — surprised兴奋 — excited神秘 — mysterious开心 — happy委屈 — wronged/sadAlso supports free-form natural language instructions, e.g. 用特别愤怒的语气说.
# Default settings
python3 scripts/tts.py "你好!"
# Custom speaker
python3 scripts/tts.py "Hello!" --speaker Ryan --language English
# Custom emotion
python3 scripts/tts.py "其实我真的有发现..." --instruct 冷静分析
# Full customization
python3 scripts/tts.py "哥哥,你回来啦!" \
--speaker Serena \
--instruct 撒娇语气 \
--speed 1.0
# Custom output directory
python3 scripts/tts.py "测试" --output /tmp
# Skip auto-cleanup of old files
python3 scripts/tts.py "测试" --no-cleanup
~/tts-output/ (override with $QWEN_TTS_OUTPUT_DIR)tts_{timestamp}_{index}.wav--no-cleanup)tools
小红书搜索、发布、获取帖子详情。使用本地 MCP 服务器访问小红书内容,需要先登录。适用于搜索旅游攻略、美食推荐、获取帖子详情等场景。
tools
Manage Telegram sticker packs via tsticker CLI. Init, push, sync, download, and trace sticker packs. Use when user wants to create/update Telegram sticker packs, push stickers to Telegram, sync packs, or manage sticker collections. Integrates with create-sticker for end-to-end sticker generation → publish workflow.
tools
Reuse local Youtu-Tip GUI capabilities through a safe adapter CLI so OpenClaw/Codex-style agents can inspect desktop GUI state and perform guarded single-step actions on macOS.
development
Privacy-respecting web search powered by a local SearXNG instance. Use when searching the web, looking up information, researching topics, or needing quick answers. Supports multiple categories including general, images, videos, news, and more. Aggregates results from 70+ search engines without tracking.