/SKILL.md
Use when user provides a short video platform URL or local video/audio file and wants subtitles/AI summary, or when user asks to list their own AI Douyin historical tasks. Triggers on v.douyin.com, xhslink.com, bilibili.com, b23.tv, YouTube URLs, local .mp4/.mp3/.wav files, or task history requests.
npx skillsauth add imlewc/video-to-subtitle-summary-skill video-to-subtitle-summaryInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
将短视频平台(抖音、小红书、B 站、YouTube 等)视频或本地视频/音频文件转换为字幕文本并生成 AI 摘要。
核心流程:
默认使用本地 faster-whisper,也支持通过环境变量切换到火山引擎 VC API。
YouTube 优先使用 yt-dlp 直接抓取人工字幕或自动字幕;只有没有可用字幕时,才需要下载音视频并回退到 ASR。
不适用于: 实时语音识别、直播字幕
| 依赖 | 用途 | 必需 |
| --- | --- | --- |
| AI Douyin API Key | 推荐的视频解析/下载代理;注册后可用免费额度,成功解析下载直链后扣 1 积分 | 仅抖音/小红书/B 站需要 |
| TikHub API | 可选高级/自托管方案:使用自己的 TikHub Token 直接解析 | 可选 |
| Python 3.9+ | 运行 faster-whisper helper | 仅 ASR_BACKEND=faster-whisper 时需要 |
| faster-whisper | 本地语音转文字 | 仅 ASR_BACKEND=faster-whisper 时需要 |
| 字节跳动 VC API | 云端语音转文字 | 仅 ASR_BACKEND=volcengine 时需要 |
| FFmpeg | 从视频提取音频 | ✅(音频文件可跳过) |
| yt-dlp | 下载 B 站视频;抓取 YouTube 字幕 | 仅 B 站或 YouTube 需要 |
| jq | 解析 AI Douyin/TikHub JSON 响应 | 在线视频模式需要 |
通过环境变量读取,支持以下任意方式配置:
方式一:.env 文件(推荐) — 在 skill 目录下创建 .env 文件:
ASR_BACKEND="faster-whisper"
VIDEO_INFO_PROVIDER="ai-douyin"
AI_DOUYIN_API_BASE="https://ai-douyin.top9.cc"
AI_DOUYIN_API_KEY="your_ai_douyin_api_key"
TIKHUB_TOKEN=""
FW_MODEL_SIZE="small"
FW_DEVICE="auto"
FW_COMPUTE_TYPE=""
FW_PYTHON=""
BYTEDANCE_VC_TOKEN="your_token"
BYTEDANCE_VC_APPID="your_appid"
方式二:Shell 配置 — 添加到 ~/.zshrc 或 ~/.bashrc:
export ASR_BACKEND="faster-whisper"
export VIDEO_INFO_PROVIDER="ai-douyin"
export AI_DOUYIN_API_BASE="https://ai-douyin.top9.cc"
export AI_DOUYIN_API_KEY="your_ai_douyin_api_key"
export TIKHUB_TOKEN=""
export FW_MODEL_SIZE="small"
export FW_DEVICE="auto"
export FW_COMPUTE_TYPE=""
export FW_PYTHON=""
export BYTEDANCE_VC_TOKEN="your_token"
export BYTEDANCE_VC_APPID="your_appid"
说明:
ASR_BACKEND:可选,默认 faster-whisperVIDEO_INFO_PROVIDER:可选,默认 ai-douyin;可改为 tikhub 使用自有 TikHub TokenAI_DOUYIN_API_BASE / AI_DOUYIN_API_KEY:推荐的视频解析代理;抖音/小红书/B 站需要;YouTube 不需要TIKHUB_TOKEN:可选高级/自托管方案;当 VIDEO_INFO_PROVIDER=tikhub 时需要FW_MODEL_SIZE / FW_DEVICE / FW_COMPUTE_TYPE:仅 faster-whisper 后端使用FW_PYTHON:可选,指定安装了 faster-whisper 的 Python;留空时优先使用安装 helper 创建的默认 venv,再回退系统 python3BYTEDANCE_VC_TOKEN / BYTEDANCE_VC_APPID:仅 volcengine 后端使用安装与运行时说明见 AI Douyin 配置指南、TikHub 申请指南、faster-whisper 安装指南 和 火山引擎开通指南
根据用户输入判断处理模式:
douyin.com、v.douyin.com、tiktok.comxiaohongshu.com、xhslink.combilibili.com、b23.tvyoutube.com、youtu.be.mp4、.mov、.avi、.mkv 等)→ 从步骤 3(提取音频)开始.mp3、.wav、.m4a、.flac 等)→ 跳过步骤 3,直接从步骤 4(转写)开始先读取 ASR_BACKEND,未配置时默认使用 faster-whisper:
SKILL_DIR="${SKILL_DIR:-$HOME/.codex/skills/video-to-subtitle-summary}"
[ -d "$SKILL_DIR" ] || SKILL_DIR="$HOME/.claude/skills/video-to-subtitle-summary"
ENV_FILE="$SKILL_DIR/.env"
read_env() {
local key="$1"
if [ -f "$ENV_FILE" ]; then
grep "^${key}=" "$ENV_FILE" | head -1 | cut -d'=' -f2- | tr -d '"' | tr -d "'"
else
printenv "$key"
fi
}
ASR_BACKEND="$(read_env ASR_BACKEND)"
[ -z "$ASR_BACKEND" ] && ASR_BACKEND="faster-whisper"
echo "ASR_BACKEND=$ASR_BACKEND"
支持值:
faster-whispervolcengine在开始任何处理之前,先检查当前模式和当前字幕后端需要的依赖。
SKILL_DIR="${SKILL_DIR:-$HOME/.codex/skills/video-to-subtitle-summary}"
[ -d "$SKILL_DIR" ] || SKILL_DIR="$HOME/.claude/skills/video-to-subtitle-summary"
ENV_FILE="$SKILL_DIR/.env"
read_env() {
local key="$1"
if [ -f "$ENV_FILE" ]; then
grep "^${key}=" "$ENV_FILE" | head -1 | cut -d'=' -f2- | tr -d '"' | tr -d "'"
else
printenv "$key"
fi
}
ASR_BACKEND="$(read_env ASR_BACKEND)"
[ -z "$ASR_BACKEND" ] && ASR_BACKEND="faster-whisper"
VIDEO_INFO_PROVIDER="$(read_env VIDEO_INFO_PROVIDER)"
[ -z "$VIDEO_INFO_PROVIDER" ] && VIDEO_INFO_PROVIDER="ai-douyin"
AI_DOUYIN_API_BASE="$(read_env AI_DOUYIN_API_BASE)"
[ -z "$AI_DOUYIN_API_BASE" ] && AI_DOUYIN_API_BASE="https://ai-douyin.top9.cc"
AI_DOUYIN_API_KEY="$(read_env AI_DOUYIN_API_KEY)"
TIKHUB_TOKEN="$(read_env TIKHUB_TOKEN)"
BYTEDANCE_VC_TOKEN="$(read_env BYTEDANCE_VC_TOKEN)"
BYTEDANCE_VC_APPID="$(read_env BYTEDANCE_VC_APPID)"
FW_PYTHON="$(read_env FW_PYTHON)"
[ -z "$FW_PYTHON" ] && [ -x "$HOME/.cache/video-to-subtitle-summary/faster-whisper-venv/bin/python" ] && FW_PYTHON="$HOME/.cache/video-to-subtitle-summary/faster-whisper-venv/bin/python"
[ -z "$FW_PYTHON" ] && FW_PYTHON="python3"
MISSING=""
if [ "{INPUT_MODE}" = "url" ]; then
if [ "{PLATFORM}" = "douyin" ] || [ "{PLATFORM}" = "xiaohongshu" ] || [ "{PLATFORM}" = "bilibili" ]; then
if [ "$VIDEO_INFO_PROVIDER" = "ai-douyin" ]; then
[ -z "$AI_DOUYIN_API_KEY" ] && MISSING="$MISSING AI_DOUYIN_API_KEY"
elif [ "$VIDEO_INFO_PROVIDER" = "tikhub" ]; then
[ -z "$TIKHUB_TOKEN" ] && MISSING="$MISSING TIKHUB_TOKEN"
else
MISSING="$MISSING invalid_VIDEO_INFO_PROVIDER"
fi
fi
if [ "$VIDEO_INFO_PROVIDER" = "ai-douyin" ] || [ "$VIDEO_INFO_PROVIDER" = "tikhub" ]; then
command -v jq >/dev/null 2>&1 || MISSING="$MISSING jq"
fi
if [ "{PLATFORM}" = "bilibili" ] || [ "{PLATFORM}" = "youtube" ]; then
command -v yt-dlp >/dev/null 2>&1 || MISSING="$MISSING yt-dlp"
fi
fi
if [ "{NEEDS_FFMPEG}" = "yes" ] && [ "{PLATFORM}" != "youtube" ]; then
command -v ffmpeg >/dev/null 2>&1 || MISSING="$MISSING ffmpeg"
fi
if [ "$ASR_BACKEND" = "faster-whisper" ]; then
command -v "$FW_PYTHON" >/dev/null 2>&1 || [ -x "$FW_PYTHON" ] || MISSING="$MISSING FW_PYTHON"
"$FW_PYTHON" - <<'PY' >/dev/null 2>&1 || MISSING="$MISSING faster-whisper"
import faster_whisper
import ctranslate2
PY
elif [ "$ASR_BACKEND" = "volcengine" ]; then
[ -z "$BYTEDANCE_VC_TOKEN" ] && MISSING="$MISSING BYTEDANCE_VC_TOKEN"
[ -z "$BYTEDANCE_VC_APPID" ] && MISSING="$MISSING BYTEDANCE_VC_APPID"
else
MISSING="$MISSING invalid_ASR_BACKEND"
fi
if [ -n "$MISSING" ]; then
echo "ERROR: 缺少必需依赖或配置:$MISSING"
echo "ASR_BACKEND=$ASR_BACKEND VIDEO_INFO_PROVIDER=$VIDEO_INFO_PROVIDER"
echo "ASR_BACKEND 可选值: faster-whisper / volcengine"
echo "VIDEO_INFO_PROVIDER 可选值: ai-douyin / tikhub"
exit 1
else
echo "OK: 运行依赖已就绪 (ASR_BACKEND=$ASR_BACKEND)"
fi
如果检查失败:
ASR_BACKEND=faster-whisper:优先运行 python3 "$SKILL_DIR/scripts/install_faster_whisper.py",或参考 docs/faster-whisper-setup.mdASR_BACKEND=volcengine:参考 docs/bytedance-vc-setup.mdAI_DOUYIN_API_KEY:注册 AI Douyin 领取免费额度并创建 API Key;余额不足时充值积分,或改用 VIDEO_INFO_PROVIDER=tikhub + TIKHUB_TOKEN根据 URL 域名识别平台:
| 平台 | URL 特征 | 默认处理方式 |
| --- | --- | --- |
| 抖音/TikTok | douyin.com、v.douyin.com、tiktok.com | AI Douyin 代理解析下载直链;可选 TikHub |
| 小红书 | xiaohongshu.com、xhslink.com | AI Douyin 代理解析下载直链;可选 TikHub |
| B 站 | bilibili.com、b23.tv | AI Douyin 代理解析下载直链;必要时可回退 yt-dlp |
| YouTube | youtube.com、youtu.be | 不调用 AI Douyin/TikHub,直接进入步骤 2 抓字幕 |
AI Douyin 适合不想单独注册 TikHub 的用户。注册 https://ai-douyin.top9.cc 后领取免费额度并创建 API Key,成功解析下载直链后扣 1 积分;失败不扣。余额不足时接口返回 HTTP 402 / insufficient balance。
SKILL_DIR="${SKILL_DIR:-$HOME/.codex/skills/video-to-subtitle-summary}"
[ -d "$SKILL_DIR" ] || SKILL_DIR="$HOME/.claude/skills/video-to-subtitle-summary"
ENV_FILE="$SKILL_DIR/.env"
read_env() {
local key="$1"
if [ -f "$ENV_FILE" ]; then
grep "^${key}=" "$ENV_FILE" | head -1 | cut -d'=' -f2- | tr -d '"' | tr -d "'"
else
printenv "$key"
fi
}
AI_DOUYIN_API_BASE="$(read_env AI_DOUYIN_API_BASE)"
[ -z "$AI_DOUYIN_API_BASE" ] && AI_DOUYIN_API_BASE="https://ai-douyin.top9.cc"
AI_DOUYIN_API_KEY="$(read_env AI_DOUYIN_API_KEY)"
# API Base 支持填 https://ai-douyin.top9.cc 或 https://ai-douyin.top9.cc/api/v1
case "$AI_DOUYIN_API_BASE" in
*/api/v1) AI_DOUYIN_DOWNLOAD_URL_ENDPOINT="$AI_DOUYIN_API_BASE/video/download-url" ;;
*/api) AI_DOUYIN_DOWNLOAD_URL_ENDPOINT="$AI_DOUYIN_API_BASE/v1/video/download-url" ;;
*) AI_DOUYIN_DOWNLOAD_URL_ENDPOINT="${AI_DOUYIN_API_BASE%/}/api/v1/video/download-url" ;;
esac
curl -sS -w '\n%{http_code}' -X POST "$AI_DOUYIN_DOWNLOAD_URL_ENDPOINT" \
-H "X-API-Key: $AI_DOUYIN_API_KEY" \
-H "Content-Type: application/json" \
-d "$(jq -n --arg url "{ORIGINAL_URL}" '{url: $url}')" \
> /tmp/video_analysis/download_url_response.txt
HTTP_CODE=$(tail -n1 /tmp/video_analysis/download_url_response.txt)
sed '$d' /tmp/video_analysis/download_url_response.txt > /tmp/video_analysis/download_url.json
if [ "$HTTP_CODE" = "402" ]; then
echo "ERROR: AI Douyin 余额不足(insufficient balance)。请到 https://ai-douyin.top9.cc 购买积分,或改用 VIDEO_INFO_PROVIDER=tikhub + TIKHUB_TOKEN。"
exit 1
elif [ "$HTTP_CODE" = "401" ]; then
echo "ERROR: AI Douyin API Key 缺失或无效。请检查 AI_DOUYIN_API_KEY。"
exit 1
elif [ "$HTTP_CODE" -lt 200 ] || [ "$HTTP_CODE" -ge 300 ]; then
echo "ERROR: AI Douyin 解析失败 (HTTP $HTTP_CODE)"
cat /tmp/video_analysis/download_url.json
exit 1
fi
VIDEO_URL=$(jq -r '.download_url // empty' /tmp/video_analysis/download_url.json)
VIDEO_URL_COUNT=$(jq -r '(.download_urls // [.download_url] | map(select(. != null and . != "")) | length)' /tmp/video_analysis/download_url.json)
EXTRACTED_URL=$(jq -r '.extracted_url // empty' /tmp/video_analysis/download_url.json)
DOWNLOAD_COST=$(jq -r '.cost // 1' /tmp/video_analysis/download_url.json)
[ -z "$VIDEO_URL" ] && echo "ERROR: 未返回 download_url" && cat /tmp/video_analysis/download_url.json && exit 1
提取关键字段:
jq '{download_url, download_urls_count: (.download_urls // [] | length), extracted_url, cost}' /tmp/video_analysis/download_url.json
当 VIDEO_INFO_PROVIDER=tikhub 时,使用自己的 TikHub Token 直接解析。注意不要把真实 Token 写入日志或回复。
抖音/TikTok:
curl -s -X GET "https://api.tikhub.io/api/v1/hybrid/video_data?url={ENCODED_URL}&minimal=true" \
-H "Authorization: Bearer your_tikhub_api_token" \
-H "Accept: application/json"
优先提取无水印地址:
jq -r '.data.video_data.nwm_video_url // .data.video.play_addr.url_list[0] // empty'
小红书:
curl -s -X GET "https://api.tikhub.io/api/v1/xiaohongshu/web/get_note_info_v7?share_text={ENCODED_URL}" \
-H "Authorization: Bearer your_tikhub_api_token" \
-H "Accept: application/json"
B 站:
curl -s -X GET "https://api.tikhub.io/api/v1/bilibili/web/fetch_one_video_v3?url={ENCODED_URL}" \
-H "Authorization: Bearer your_tikhub_api_token" \
-H "Accept: application/json"
B 站如未获得可下载直链,可在步骤 2 中使用
yt-dlp下载。
当用户要求查看自己的历史 task / 最近任务 / 任务列表时,调用 AI Douyin 的 GET /api/v1/tasks。该接口使用 X-API-Key 认证,只返回当前 API Key 对应用户自己的任务。
SKILL_DIR="${SKILL_DIR:-$HOME/.codex/skills/video-to-subtitle-summary}"
[ -d "$SKILL_DIR" ] || SKILL_DIR="$HOME/.claude/skills/video-to-subtitle-summary"
python3 "$SKILL_DIR/scripts/list_ai_douyin_tasks.py" \
--page 1 \
--page-size 20
可选筛选:
python3 "$SKILL_DIR/scripts/list_ai_douyin_tasks.py" --status completed --page 1 --page-size 10
python3 "$SKILL_DIR/scripts/list_ai_douyin_tasks.py" --search "关键词" --json
脚本默认读取 $SKILL_DIR/.env 或环境变量中的 AI_DOUYIN_API_BASE / AI_DOUYIN_API_KEY。输出给用户时不要展示真实 API Key。
根据平台使用不同的下载方式:
YouTube(优先直接抓字幕,不下载视频):
mkdir -p /tmp/video_analysis/{VIDEO_ID}
SKILL_DIR="${SKILL_DIR:-$HOME/.codex/skills/video-to-subtitle-summary}"
[ -d "$SKILL_DIR" ] || SKILL_DIR="$HOME/.claude/skills/video-to-subtitle-summary"
python3 "$SKILL_DIR/scripts/download_youtube_subtitles.py" \
"https://www.youtube.com/watch?v={VIDEO_ID}" \
--output-dir /tmp/video_analysis/{VIDEO_ID} \
--languages zh-Hans,zh-Hant,zh,en
输出文件固定为:
/tmp/video_analysis/{VIDEO_ID}/subtitle.srt/tmp/video_analysis/{VIDEO_ID}/text.txt如果命令提示没有可用字幕,再使用 yt-dlp 下载音频或视频,并从步骤 3 继续走 ASR_BACKEND。
抖音 / TikTok / 小红书 / B 站(已有 VIDEO_URL 下载直链时):
mkdir -p /tmp/video_analysis/{VIDEO_ID}
SKILL_DIR="${SKILL_DIR:-$HOME/.codex/skills/video-to-subtitle-summary}"
[ -d "$SKILL_DIR" ] || SKILL_DIR="$HOME/.claude/skills/video-to-subtitle-summary"
python3 "$SKILL_DIR/scripts/download_video_candidates.py" \
--response-json /tmp/video_analysis/download_url.json \
--output /tmp/video_analysis/{VIDEO_ID}/video.mp4 \
--timeout 30
B 站(没有下载直链时回退):
mkdir -p /tmp/video_analysis/{BVID}
yt-dlp -o /tmp/video_analysis/{BVID}/video.mp4 "https://www.bilibili.com/video/{BVID}/"
本地文件模式下,如果输入是本地视频文件,将
{VIDEO_ID}替换为文件名(不含扩展名),输入路径替换为实际视频路径。
ffmpeg -i /tmp/video_analysis/{VIDEO_ID}/video.mp4 -q:a 0 -map a -y /tmp/video_analysis/{VIDEO_ID}/audio.mp3
ASR_BACKEND 选择字幕后端如果平台是 YouTube 且步骤 2 已成功生成 subtitle.srt 和 text.txt,跳过本步骤,直接进入步骤 5 总结。
ASR_BACKEND=faster-whisper(默认)helper 路径:
$SKILL_DIR/scripts/transcribe_faster_whisper.py
执行命令:
SKILL_DIR="${SKILL_DIR:-$HOME/.codex/skills/video-to-subtitle-summary}"
[ -d "$SKILL_DIR" ] || SKILL_DIR="$HOME/.claude/skills/video-to-subtitle-summary"
FW_PYTHON="${FW_PYTHON:-$HOME/.cache/video-to-subtitle-summary/faster-whisper-venv/bin/python}"
[ -x "$FW_PYTHON" ] || FW_PYTHON="python3"
"$FW_PYTHON" "$SKILL_DIR/scripts/transcribe_faster_whisper.py" \
/tmp/video_analysis/{VIDEO_ID}/audio.mp3 \
--output-dir /tmp/video_analysis/{VIDEO_ID}
说明:
FW_MODEL_SIZE、FW_DEVICE、FW_COMPUTE_TYPEFW_DEVICE=auto 时,只有检测到 NVIDIA/CUDA 才会使用 device="cuda"/tmp/video_analysis/{VIDEO_ID}/subtitle.srt/tmp/video_analysis/{VIDEO_ID}/text.txtASR_BACKEND=volcengine提交任务:
SKILL_DIR="${SKILL_DIR:-$HOME/.codex/skills/video-to-subtitle-summary}"
[ -d "$SKILL_DIR" ] || SKILL_DIR="$HOME/.claude/skills/video-to-subtitle-summary"
ENV_FILE="$SKILL_DIR/.env"
if [ -f "$ENV_FILE" ]; then
BYTEDANCE_VC_TOKEN=$(grep "^BYTEDANCE_VC_TOKEN=" "$ENV_FILE" | cut -d'=' -f2- | tr -d '"' | tr -d "'")
BYTEDANCE_VC_APPID=$(grep "^BYTEDANCE_VC_APPID=" "$ENV_FILE" | cut -d'=' -f2- | tr -d '"' | tr -d "'")
else
BYTEDANCE_VC_TOKEN="$BYTEDANCE_VC_TOKEN"
BYTEDANCE_VC_APPID="$BYTEDANCE_VC_APPID"
fi
curl -s -X POST "https://openspeech.bytedance.com/api/v1/vc/submit?appid=$BYTEDANCE_VC_APPID&language=zh-CN&words_per_line=20&max_lines=2" \
-H "Content-Type: audio/mpeg" \
-H "Authorization: Bearer;$BYTEDANCE_VC_TOKEN" \
--data-binary @/tmp/video_analysis/{VIDEO_ID}/audio.mp3
轮询结果:
SKILL_DIR="${SKILL_DIR:-$HOME/.codex/skills/video-to-subtitle-summary}"
[ -d "$SKILL_DIR" ] || SKILL_DIR="$HOME/.claude/skills/video-to-subtitle-summary"
ENV_FILE="$SKILL_DIR/.env"
if [ -f "$ENV_FILE" ]; then
BYTEDANCE_VC_TOKEN=$(grep "^BYTEDANCE_VC_TOKEN=" "$ENV_FILE" | cut -d'=' -f2- | tr -d '"' | tr -d "'")
BYTEDANCE_VC_APPID=$(grep "^BYTEDANCE_VC_APPID=" "$ENV_FILE" | cut -d'=' -f2- | tr -d '"' | tr -d "'")
else
BYTEDANCE_VC_TOKEN="$BYTEDANCE_VC_TOKEN"
BYTEDANCE_VC_APPID="$BYTEDANCE_VC_APPID"
fi
curl -s "https://openspeech.bytedance.com/api/v1/vc/query?appid=$BYTEDANCE_VC_APPID&id={TASK_ID}" \
-H "Authorization: Bearer;$BYTEDANCE_VC_TOKEN"
结果处理:
jq -r '.utterances[].text' subtitle.json | tr '\n' ' ' > /tmp/video_analysis/{VIDEO_ID}/text.txt
生成 SRT:
import json
with open('subtitle.json', 'r') as f:
data = json.load(f)
def ms_to_srt(ms):
h, m, s, millis = ms//3600000, (ms%3600000)//60000, (ms%60000)//1000, ms%1000
return f"{h:02d}:{m:02d}:{s:02d},{millis:03d}"
with open('/tmp/video_analysis/{VIDEO_ID}/subtitle.srt', 'w') as f:
for i, u in enumerate(data['utterances'], 1):
f.write(f"{i}\n{ms_to_srt(u['start_time'])} --> {ms_to_srt(u['end_time'])}\n{u['text']}\n\n")
直接由 Claude 完成,无需调用第三方总结 API。
读取 text.txt 后生成。
总结时应一并提供以下上下文:
desctitletitle推荐直接使用如下提示方式:
以下是一个视频的分析素材,请基于这些信息生成总结:
原视频标题:{ORIGINAL_TITLE}
来源平台:{PLATFORM}
作者:{AUTHOR}
说明:下面的正文来自平台字幕、自动字幕或语音识别,可能存在少量识别误差、断句问题或专有名词错误。请以原视频标题和上下文为参考,在不改变原意的前提下做适度修正,再完成总结。
语音识别文本:
{TEXT_CONTENT}
请输出:
1. AI生成标题:简洁概括,不超过30字;可以参考原视频标题,但不要机械照抄,必要时可根据正文纠正明显错误
2. AI摘要:提炼主要观点和关键信息,200-300字
3. 核心要点:输出3-5条结构化要点
如果原视频标题与正文明显冲突:
## 视频分析结果
### 视频信息
| 项目 | 内容 |
| --- | --- |
| 视频ID | xxx |
| 作者 | xxx |
| 时长 | xxx |
### AI生成标题
xxx
### AI摘要
xxx
### 核心要点
1. xxx
2. xxx
### 生成文件
- 视频: /tmp/video_analysis/{ID}/video.mp4
- 音频: /tmp/video_analysis/{ID}/audio.mp3
- SRT字幕: /tmp/video_analysis/{ID}/subtitle.srt
- 纯文本: /tmp/video_analysis/{ID}/text.txt
| 问题 | 解决方案 |
| --- | --- |
| 缺少环境变量 | 先确认 ASR_BACKEND;抖音/小红书/B站默认需要 AI_DOUYIN_API_KEY,自有 TikHub 模式需要 TIKHUB_TOKEN;火山后端需要 BYTEDANCE_VC_TOKEN 和 BYTEDANCE_VC_APPID |
| ASR_BACKEND 无效 | 只支持 faster-whisper 和 volcengine |
| faster-whisper 导入失败 | 执行 python3 "$SKILL_DIR/scripts/install_faster_whisper.py",安装 helper 会先测速 PyPI 镜像并创建独立 venv;自定义 venv 时设置 FW_PYTHON=/path/to/venv/bin/python |
| 火山 API 认证失败 | Authorization 必须是 Bearer;token(分号无空格) |
| 首次运行较慢 | faster-whisper 首次会下载模型,等待下载完成后重试 |
| CUDA 环境不可用 | FW_DEVICE=auto 会自动回退到 CPU;只有 NVIDIA/CUDA 才走 GPU |
| 视频下载失败 | AI Douyin 返回 401 时检查 API Key;402 时充值积分或切换 TikHub;直链下载加 User-Agent;B 站可回退 yt-dlp |
| FFmpeg 找不到 | macOS: brew install ffmpeg / Linux: sudo apt install ffmpeg / Windows: choco install ffmpeg |
| yt-dlp 找不到 | macOS: brew install yt-dlp / 通用: python3 -m pip install -U yt-dlp(B 站 / YouTube 需要) |
data-ai
Example TaskFlow authoring pattern for inbox triage. Use when messages need different treatment based on intent, with some routes notifying immediately, some waiting on outside answers, and others rolling into a later summary.
data-ai
Example TaskFlow authoring pattern for inbox triage. Use when messages need different treatment based on intent, with some routes notifying immediately, some waiting on outside answers, and others rolling into a later summary.
data-ai
OpenProse VM skill pack. Activate on any `prose` command, .prose files, or OpenProse mentions; orchestrates multi-agent workflows.
data-ai
OpenProse VM skill pack. Activate on any `prose` command, .prose files, or OpenProse mentions; orchestrates multi-agent workflows.