content-pipeline/tts-skill/SKILL.md
MiniMax TTS API - text-to-speech, voice cloning, voice design
npx skillsauth add Jst-Well-Dan/Skill-Box tts-skill
Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
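This skill provides a complete wrapper around the MiniMax TTS API, supporting text-to-speech, voice cloning, and voice design.
Make sure the environment variable is set:
export MINIMAX_API_KEY="your-api-key"
See setup.md for detailed configuration instructions.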
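Because every call depends on `MINIMAX_API_KEY`, it can help to fail early with a clear message when the variable is missing. The following is a minimal sketch, not part of the skill itself: `require_api_key` is a hypothetical helper name, and only the standard library is used.

```python
import os

def require_api_key(env=os.environ):
    """Return the MiniMax API key, or raise a clear error if it is missing.

    Hypothetical helper for illustration; the skill does not ship this function.
    """
    key = env.get("MINIMAX_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "MINIMAX_API_KEY is not set; export it before calling the skill"
        )
    return key
```

Calling `require_api_key()` once at startup surfaces a configuration problem before any request is attempted.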
import sys
import os
# Get the skill directory path
skill_dir = os.path.dirname(os.path.abspath(__file__))
sys.path.insert(0, os.path.join(skill_dir, "assets"))
from minimax_tts import text_to_audio, list_voices, voice_clone, voice_design, play_audio
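| Feature | Function | Description |
|------|------|------|
| Text-to-speech | text_to_audio() | Converts text into an audio file |
| List voices | list_voices() | Returns the list of available voices |
| Voice cloning | voice_clone() | Clones a voice from an audio file |
| Voice design | voice_design() | Generates a voice from a text description |
| Play audio | play_audio() | Plays an audio file |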
text_to_audio(
    text="你好,欢迎使用 MiniMax TTS 服务!",  # "Hello, welcome to the MiniMax TTS service!"
    voice_id="female-shaonv",
    output_path="./hello.mp3"
)
voices = list_voices(voice_type="system")
for voice in voices:
    print(f"{voice['voice_id']}: {voice['name']}")
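When the voice list is long, filtering it by ID prefix can narrow the choice. The sketch below assumes only what the loop above shows, namely that each entry is a dict with `voice_id` and `name` keys. `find_voices` is a hypothetical helper, and the sample entries are made up for illustration rather than taken from real API output.

```python
def find_voices(voices, prefix):
    """Return (voice_id, name) pairs whose voice_id starts with `prefix`."""
    return [
        (v["voice_id"], v["name"])
        for v in voices
        if v["voice_id"].startswith(prefix)
    ]

# Hypothetical sample shaped like the entries printed above;
# real data would come from list_voices().
sample = [
    {"voice_id": "female-shaonv", "name": "Young girl"},
    {"voice_id": "male-qingnian", "name": "Young man"},
]
female = find_voices(sample, "female-")
```

In real use, `sample` would be replaced by the result of `list_voices(voice_type="system")`.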
voice_clone(
    voice_id="my-custom-voice",
    audio_file="./sample.mp3",
    voice_name="我的声音"  # "My voice"
)
voice_design(
    prompt="一个温柔的年轻女性声音,带有轻微的南方口音",  # "A gentle young female voice with a slight southern accent"
    preview_text="你好,这是我的声音"  # "Hello, this is my voice"
)
| Model | Description |
|------|------|
| speech-02-hd | HD version, best audio quality |
| speech-02-turbo | Fast version, low latency |
| speech-01-hd | Legacy HD |
| speech-01-turbo | Legacy fast |
| speech-2.6-hd | Version 2.6 HD |
| speech-2.6-turbo | Version 2.6 fast |
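The trade-off in the table (audio quality versus latency) can be captured in a small lookup. This is a sketch under stated assumptions: the model names are copied from the table, but `pick_model` is a hypothetical helper, its selection rule is only one reasonable reading of the descriptions, and how a model name is passed to `text_to_audio()` is not shown in this document.

```python
# Model names copied from the table above; descriptions are paraphrased.
MODELS = {
    "speech-02-hd": "HD version, best audio quality",
    "speech-02-turbo": "fast version, low latency",
    "speech-01-hd": "legacy HD",
    "speech-01-turbo": "legacy fast",
    "speech-2.6-hd": "version 2.6 HD",
    "speech-2.6-turbo": "version 2.6 fast",
}

def pick_model(low_latency=False):
    """Pick a model by priority (assumed rule, not the skill's own logic).

    Returns the low-latency model when latency matters,
    otherwise the model the table describes as best quality.
    """
    return "speech-02-turbo" if low_latency else "speech-02-hd"
```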
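- female-shaonv - young girl voice
- female-yujie - confident, sophisticated female voice (yujie)
- female-chengshu - mature female voice
- male-qingnian - young male voice
- male-chengshu - mature male voice

For more voices, query list_voices().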