279458179/openclaw-whisper-asr/SKILL.md
本地 Whisper 语音识别配置。自动将飞书/Telegram 等渠道的语音消息转成文字。 适用于需要离线、低延迟语音转文字的场景。
npx skillsauth add openclaw/skills whisper-asrInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
通过 whisper.cpp 在服务器上配置本地语音识别,用于:
sudo apt-get update
sudo apt-get install -y ffmpeg
cd /home/brew/.openclaw/workspace
git clone https://github.com/ggml-org/whisper.cpp.git
cd whisper.cpp
sh ./models/download-ggml-model.sh base
模型选择建议:
| 模型 | 大小 | 内存 | 推荐场景 | |------|------|------|---------| | tiny | 75 MB | ~273 MB | 快速测试 | | base | 142 MB | ~388 MB | 平衡推荐 | | small | 466 MB | ~852 MB | 更高精度 |
cd whisper.cpp
cmake -B build
cmake --build build -j --config Release
飞书语音通常是 ogg 格式,需要转换为 whisper 需要的格式:
ffmpeg -i input.ogg -ar 16000 -ac 1 -c:a pcm_s16le output.wav
./build/bin/whisper-cli \
-m models/ggml-base.bin \
-f output.wav \
--language zh \
--no-timestamps
常用参数:
-m: 模型路径-f: 输入音频文件--language zh: 指定中文--no-timestamps: 不输出时间戳-t 4: 线程数(默认自动)ffmpeg -i input.ogg -ar 16000 -ac 1 -c:a pcm_s16le /tmp/audio.wav && \
./build/bin/whisper-cli -m models/ggml-base.bin -f /tmp/audio.wav --language zh --no-timestamps
| 项目 | 路径 |
|------|------|
| whisper.cpp 目录 | /home/brew/.openclaw/workspace/whisper.cpp |
| 可执行文件 | /home/brew/.openclaw/workspace/whisper.cpp/build/bin/whisper-cli |
| 模型目录 | /home/brew/.openclaw/workspace/whisper.cpp/models/ |
| base 模型 | /home/brew/.openclaw/workspace/whisper.cpp/models/ggml-base.bin |
A: 尝试使用更大的模型(small/medium),或在安静环境下录音。
A: 增加线程数:./whisper-cli -t 8 ...
A: 不指定 --language 会自动检测。也可指定 --language en 等。
# 量化(减少模型大小)
./build/bin/quantize models/ggml-base.bin models/ggml-base-q5.bin q5_0
# 使用量化模型
./build/bin/whisper-cli -m models/ggml-base-q5.bin -f audio.wav --language zh
本技能参考 whisper.cpp 官方文档
tools
Use when the user wants to connect to, test, or use the McDonalds service at mcp.mcd.cn, including checking authentication, probing MCP endpoints, listing tools, or calling McDonalds MCP tools through a reusable local CLI.
development
Web scraping platform — Twitter/X data, Vinted marketplace, and general web scraping API
development
SlowMist AI Agent Security Review — comprehensive security framework for skills, repositories, URLs, on-chain addresses, and products (Claude Code version)
data-ai
去除中文文本中的 AI 写作痕迹,使其读起来自然。基于维基百科 AI 写作特征指南,检测 24 种 AI 模式。触发词:humanizer-cn、去除 AI 痕迹、去除 AI 写作痕迹、中文文本人性化。