aaiccee/asr-file-transfer/SKILL.md
name: u2-audio-file-transcriber
description: "Transcribe audio files via the UniCloud ASR (云知声语音识别, recorded audio → text) API from UniSound. Supports multiple formats and is optimized for finance, customer service, and other domains. Use when the user needs to transcribe recorded audio files, or asks for UniSound/云知声 audio file transcription. Do NOT use for real-time/streaming speech recognition, text-to-speech (TTS), or live captioning."
CRITICAL INSTRUCTION FOR AI AGENTS: YOU MUST follow these rules when helping users configure this skill:
Transcribe audio files using the UniCloud ASR service from UniSound (云知声). It supports multiple audio formats and is suited to finance, customer service, and other scenarios.
python3 {baseDir}/scripts/transcribe.py /path/to/audio.wav
Defaults:
# Save output to file
python3 {baseDir}/scripts/transcribe.py audio.wav --out result.txt
# Output JSON format with full result
python3 {baseDir}/scripts/transcribe.py audio.wav --json --out result.json
# Specify audio format
python3 {baseDir}/scripts/transcribe.py audio.mp3 --format mp3
# Specify domain
python3 {baseDir}/scripts/transcribe.py audio.wav --domain finance
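When the script is driven from another Python program, the command lines above can be assembled programmatically. The following is a minimal sketch that uses only the flags shown in these examples (`--out`, `--json`, `--format`, `--domain`). The helper names `build_transcribe_command` and `run_transcribe` are hypothetical, and the sketch assumes the script signals failure with a non-zero exit code, which the source does not confirm.

```python
import subprocess
import sys
from typing import List, Optional


def build_transcribe_command(
    script_path: str,
    audio_path: str,
    out: Optional[str] = None,
    as_json: bool = False,
    audio_format: Optional[str] = None,
    domain: Optional[str] = None,
) -> List[str]:
    """Assemble the argument list for scripts/transcribe.py.

    Only flags that appear in the usage examples are emitted.
    """
    cmd = [sys.executable, script_path, audio_path]
    if as_json:
        cmd.append("--json")
    if out is not None:
        cmd += ["--out", out]
    if audio_format is not None:
        cmd += ["--format", audio_format]
    if domain is not None:
        cmd += ["--domain", domain]
    return cmd


def run_transcribe(cmd: List[str]) -> str:
    """Run the command and return its stdout.

    Assumption: a non-zero exit code means failure, in which case
    subprocess raises CalledProcessError.
    """
    completed = subprocess.run(cmd, capture_output=True, text=True, check=True)
    return completed.stdout
```

Passing the arguments as a list avoids shell quoting problems with file names that contain spaces. Replace `script_path` with the real location of `{baseDir}/scripts/transcribe.py`.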
The script uses the UniCloud ASR API with the following workflow:
Privacy: Audio files are uploaded directly to UniCloud servers. No data is sent to third-party services.
requests: pip install requests
Important Notes Before Using This Skill
(1) Required Environment Variables
This skill requires the following environment variables to be configured:
- UNISOUND_APPKEY (Required)
- UNISOUND_SECRET (Required)
- UNISOUND_USERID (Optional, defaults to unisound-python-demo)
(2) Test Credentials Usage
This skill includes UAT test credentials for evaluation purposes:
(3) Security Best Practices
- Review scripts/transcribe.py to understand network endpoints
(4) Production Deployment
For production use:
You MUST configure API credentials via environment variables before running the script.
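A quick check before invoking the script can catch a missing credential early. This is a minimal sketch, not the script's own validation logic: `missing_credentials` is a hypothetical helper, and treating an empty or whitespace-only value as missing is an assumption.

```python
import os
from typing import List, Mapping

# The two variables the documentation marks as required.
REQUIRED_VARS = ("UNISOUND_APPKEY", "UNISOUND_SECRET")


def missing_credentials(env: Mapping[str, str] = os.environ) -> List[str]:
    """Return the names of required variables that are unset or blank."""
    return [name for name in REQUIRED_VARS if not env.get(name, "").strip()]


missing = missing_credentials()
if missing:
    print("Missing environment variables: " + ", ".join(missing))
else:
    print("Credentials are configured.")
```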
To use this skill, you need to obtain API credentials from UniCloud (云知声):
Contact UniCloud to obtain your API credentials
You will receive:
For testing and evaluation, you can use the following UAT environment credentials:
AppKey: 681e01d78d8a40e8928bc8268020639b
Secret: d7b2980cb61843d69fdab5e99deafcdf
UserId: unisound-python-demo
Base URL: http://af-asr.uat.hivoice.cn
⚠️ Important Security Notice
- Test environment only: these credentials are for UAT testing only
- No sensitive data: never use them with production or sensitive audio files
- Get your own credentials: for production use, contact UniCloud
- Data privacy: audio files are uploaded to UniSound servers
Linux/macOS:
# Using test credentials
export UNISOUND_APPKEY="681e01d78d8a40e8928bc8268020639b"
export UNISOUND_SECRET="d7b2980cb61843d69fdab5e99deafcdf"
export UNISOUND_USERID="unisound-python-demo"
Windows (CMD):
REM Using test credentials
set UNISOUND_APPKEY=681e01d78d8a40e8928bc8268020639b
set UNISOUND_SECRET=d7b2980cb61843d69fdab5e99deafcdf
set UNISOUND_USERID=unisound-python-demo
Windows (PowerShell):
# Using test credentials
$env:UNISOUND_APPKEY="681e01d78d8a40e8928bc8268020639b"
$env:UNISOUND_SECRET="d7b2980cb61843d69fdab5e99deafcdf"
$env:UNISOUND_USERID="unisound-python-demo"
Using .env file (Recommended):
Create a .env file in the project root:
# Test credentials (UAT)
UNISOUND_APPKEY=681e01d78d8a40e8928bc8268020639b
UNISOUND_SECRET=d7b2980cb61843d69fdab5e99deafcdf
UNISOUND_USERID=unisound-python-demo
Security Note: Never commit .env files or actual credentials to version control.
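The source does not say whether `scripts/transcribe.py` reads the `.env` file itself. If it does not, the values need to reach the process environment some other way. The sketch below is one dependency-free option; `parse_env_text` and `load_env_file` are hypothetical names, and the parser handles only simple `KEY=VALUE` lines with optional surrounding quotes.

```python
import os
from typing import Dict


def parse_env_text(text: str) -> Dict[str, str]:
    """Parse KEY=VALUE lines, skipping blank lines and '#' comments."""
    values: Dict[str, str] = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        value = value.strip()
        # Drop one pair of matching surrounding quotes, if present.
        if len(value) >= 2 and value[0] == value[-1] and value[0] in "\"'":
            value = value[1:-1]
        values[key.strip()] = value
    return values


def load_env_file(path: str = ".env", override: bool = False) -> Dict[str, str]:
    """Copy variables from a .env file into os.environ and return them.

    Existing environment variables win unless override is True.
    """
    with open(path, encoding="utf-8") as handle:
        values = parse_env_text(handle.read())
    for key, value in values.items():
        if override or key not in os.environ:
            os.environ[key] = value
    return values
```

Leaving `override` at `False` lets variables exported in the shell take precedence over the file, which is convenient when switching between test and production credentials.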
| Variable | Required | Description | Notes |
|----------|----------|-------------|-------|
| UNISOUND_APPKEY | Yes | Application key | Required |
| UNISOUND_SECRET | Yes | Secret key | Required |
| UNISOUND_USERID | No | User identifier | Default: unisound-python-demo |
| UNISOUND_BASE_URL | No | API base URL | Default: http://af-asr.uat.hivoice.cn |
| UNISOUND_DOMAIN | No | Recognition domain | Default: other (config default: finance) |
| UNISOUND_AUDIOTYPE | No | Default audio format | Default: wav |
| UNISOUND_USE_HOT_DATA | No | Enable hotword recognition | Default: true |
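The optional variables in the table can be resolved with their documented defaults as in the sketch below. This illustrates the fallback behaviour the table describes rather than the script's actual code: `AsrSettings`, `load_settings`, and the accepted spellings of a true value are assumptions. The table lists two defaults for `UNISOUND_DOMAIN`, and the sketch uses `other`.

```python
import os
from dataclasses import dataclass
from typing import Mapping


@dataclass(frozen=True)
class AsrSettings:
    user_id: str
    base_url: str
    domain: str
    audio_type: str
    use_hot_data: bool


def _as_bool(value: str) -> bool:
    # Assumed truthy spellings; the script's own parsing may differ.
    return value.strip().lower() in {"1", "true", "yes", "on"}


def load_settings(env: Mapping[str, str] = os.environ) -> AsrSettings:
    """Resolve optional settings, falling back to the defaults in the table."""
    return AsrSettings(
        user_id=env.get("UNISOUND_USERID", "unisound-python-demo"),
        base_url=env.get("UNISOUND_BASE_URL", "http://af-asr.uat.hivoice.cn"),
        domain=env.get("UNISOUND_DOMAIN", "other"),
        audio_type=env.get("UNISOUND_AUDIOTYPE", "wav"),
        use_hot_data=_as_bool(env.get("UNISOUND_USE_HOT_DATA", "true")),
    )
```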
Supported formats: WAV, MP3, M4A, FLAC, OGG. Maximum duration is 2 hours and maximum file size is 100MB.
Use the --format flag to specify the format if auto-detection fails:
python3 {baseDir}/scripts/transcribe.py audio.mp3 --format mp3
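A local pre-flight check can reject files that would exceed the documented limits before any upload happens. The sketch below is an assumption-laden illustration: the helper names are hypothetical, "100MB" is interpreted as 100 × 1024 × 1024 bytes, the format is guessed from the file extension only, and the 2-hour duration limit is not checked because that requires decoding the audio.

```python
import os
from typing import List, Optional

SUPPORTED_FORMATS = {"wav", "mp3", "m4a", "flac", "ogg"}
MAX_BYTES = 100 * 1024 * 1024  # assumes "100MB" means 100 MiB


def guess_format(path: str) -> Optional[str]:
    """Return the lowercase extension if it is a supported format, else None."""
    ext = os.path.splitext(path)[1].lstrip(".").lower()
    return ext if ext in SUPPORTED_FORMATS else None


def size_within_limit(size_bytes: int) -> bool:
    """True when the size does not exceed the assumed 100 MiB cap."""
    return size_bytes <= MAX_BYTES


def preflight(path: str) -> List[str]:
    """Collect human-readable problems found before uploading."""
    problems = []
    if guess_format(path) is None:
        problems.append(
            "extension is not one of: " + ", ".join(sorted(SUPPORTED_FORMATS))
        )
    if os.path.isfile(path) and not size_within_limit(os.path.getsize(path)):
        problems.append("file is larger than 100 MB")
    return problems
```

The value returned by `guess_format` could be passed to `--format` when the script's own auto-detection fails.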
Error: API returned error: [error_code] message
Cause: Invalid credentials, wrong parameters, or server-side error.
Solution: Verify your credentials are correct. Check that:
Error: 错误: 音频文件不存在 (Error: audio file does not exist)
Cause: The specified audio file does not exist.
Solution: Check the file path:
# Use absolute path to be safe
python3 {baseDir}/scripts/transcribe.py /full/path/to/audio.wav
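When the script is called from Python, the path can be normalised and verified before the call so that relative-path mistakes surface locally. A minimal sketch, with `resolve_audio_path` as a hypothetical helper name:

```python
from pathlib import Path


def resolve_audio_path(raw_path: str) -> Path:
    """Expand '~', make the path absolute, and confirm it is a regular file."""
    path = Path(raw_path).expanduser().resolve()
    if not path.is_file():
        raise FileNotFoundError(f"Audio file not found: {path}")
    return path
```

The returned absolute path can then be passed to the script in place of the original argument.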
Error: 转写超时 (transcription timed out)
Cause: Transcription is taking longer than expected (the server may be busy).
Solution:
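One general way to cope with a busy server is to retry after increasing delays. The sketch below is a generic pattern, not something the script is known to do. `retry_with_backoff` is a hypothetical helper, and how a timeout from `transcribe.py` maps onto a Python exception is an assumption the caller must settle, for example by raising `TimeoutError` when the timeout message appears in the script output.

```python
import time
from typing import Callable, Tuple, Type, TypeVar

T = TypeVar("T")


def retry_with_backoff(
    action: Callable[[], T],
    attempts: int = 3,
    base_delay: float = 5.0,
    retry_on: Tuple[Type[BaseException], ...] = (TimeoutError,),
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call action(); on a retryable error wait base_delay * 2**n, then retry."""
    for attempt in range(attempts):
        try:
            return action()
        except retry_on:
            if attempt == attempts - 1:
                raise  # out of attempts: surface the last error
            sleep(base_delay * (2 ** attempt))
    raise ValueError("attempts must be at least 1")
```

With the defaults, the waits are 5 and 10 seconds before the third and final attempt. The `sleep` parameter exists so the delays can be replaced in tests.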
Error: Unsupported audio format
Cause: The audio format is not supported by the API.
Solution:
- Use the --format flag to explicitly specify the format
# Convert using ffmpeg
ffmpeg -i input.mp3 -ar 16000 -ac 1 output.wav
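In the ffmpeg command, `-ar 16000` resamples the audio to 16 kHz and `-ac 1` downmixes it to a single (mono) channel. The same conversion can be run from Python as sketched below. The helper names are hypothetical, and the sketch assumes `ffmpeg` is installed and on `PATH`.

```python
import shutil
import subprocess
from typing import List


def build_ffmpeg_command(
    src: str, dst: str, sample_rate: int = 16000, channels: int = 1
) -> List[str]:
    """Mirror `ffmpeg -i input.mp3 -ar 16000 -ac 1 output.wav` as an argument list."""
    return ["ffmpeg", "-i", src, "-ar", str(sample_rate), "-ac", str(channels), dst]


def convert_audio(src: str, dst: str) -> None:
    """Run ffmpeg; raises if ffmpeg is missing or exits with an error."""
    if shutil.which("ffmpeg") is None:
        raise RuntimeError("ffmpeg was not found on PATH")
    subprocess.run(build_ffmpeg_command(src, dst), check=True)
```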
Issue: Cannot connect to API server
Cause: Network connectivity issues or incorrect API endpoint URL.
Solution:
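A plain TCP connection test helps separate network problems from a wrong endpoint URL. The sketch below derives the host and port from the value of `UNISOUND_BASE_URL` and tries to open a socket. The helper names are hypothetical. A successful connection shows only that the host is reachable, not that the API accepts the credentials.

```python
import socket
from typing import Tuple
from urllib.parse import urlsplit


def endpoint_from_url(base_url: str) -> Tuple[str, int]:
    """Extract (host, port), defaulting the port from the URL scheme."""
    parts = urlsplit(base_url)
    if not parts.hostname:
        raise ValueError(f"No host found in URL: {base_url!r}")
    default_port = 443 if parts.scheme == "https" else 80
    return parts.hostname, parts.port or default_port


def can_connect(host: str, port: int, timeout: float = 5.0) -> bool:
    """Return True if a TCP connection to host:port opens within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:  # covers DNS failures, refusals, and timeouts
        return False
```

For example, `can_connect(*endpoint_from_url("http://af-asr.uat.hivoice.cn"))` checks the default UAT endpoint on port 80. A `ValueError` from `endpoint_from_url` usually means the URL is missing its `http://` or `https://` prefix.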
Getting Help
If you encounter issues not covered here:
- Check the UniCloud ASR documentation for the latest API changes
- Verify your network connection to the API server
- Check the error message details for specific error codes
- Ensure you're using Python 3.8 or later
# Check Python version
python3 --version
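The same version requirement can be checked from inside Python, which is useful when several interpreters are installed and it is unclear which one runs the script. A minimal sketch, with `python_is_supported` as a hypothetical helper name:

```python
import sys
from typing import Tuple

MIN_VERSION = (3, 8)  # minimum stated in the documentation


def python_is_supported(version: Tuple[int, ...] = tuple(sys.version_info[:2])) -> bool:
    """True when the (major, minor) version is at least 3.8."""
    return tuple(version[:2]) >= MIN_VERSION


print("Python", sys.version.split()[0], "supported:", python_is_supported())
```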