skills/gemini-video-analyzer/SKILL.md
Native video analysis using Google Gemini API. Upload and analyze video files — describe scenes, extract text/UI, answer questions about content, transcribe speech, identify objects and actions. Use when: (1) User sends a video file and wants it analyzed, (2) Video summarization or description needed, (3) Extracting text, UI elements, or information from screen recordings, (4) Answering questions about video content, (5) Comparing multiple videos, (6) Analyzing tutorials, demos, or walkthroughs.
npx skillsauth add genesis-plan/hongchen-lingjing gemini-video-analyzerInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Analyze videos natively using Google Gemini's multimodal API. No frame extraction needed — Gemini processes video at 1 FPS with full motion, audio, and visual understanding.
# Analyze a video with default prompt (full description)
GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/analyze.py /path/to/video.mp4
# Ask a specific question
GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/analyze.py /path/to/video.mp4 "What text is visible on screen?"
# Manage uploaded files
GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/manage_files.py list
GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/manage_files.py cleanup
MP4, AVI, MOV, MKV, WebM, FLV, MPEG, MPG, WMV, 3GP — up to 2GB per file.
| Task | Example Prompt |
|------|---------------|
| General description | (default — no prompt needed) |
| UI/text extraction | "What text and UI elements are visible?" |
| Tutorial summary | "Summarize the steps shown in this tutorial" |
| Bug report from video | "Describe what went wrong in this screen recording" |
| Meeting notes | "Summarize the key points discussed" |
| Content comparison | Upload 2 videos, ask for differences |
Set GOOGLE_AI_API_KEY in your environment or .env file. Get a free key at aistudio.google.com.
Default model: gemini-2.5-flash (fast, cheap, excellent vision). Override with --model gemini-2.5-pro for complex analysis.
See references/gemini-files-api.md for file upload limits, processing details, and advanced options.
Built by M. Abidi · LinkedIn · YouTube · GitHub · Book a Call
testing
Generate viral Xiaohongshu (Little Red Book) copy with proven templates. Use when user asks for: (1) Xiaohongshu post copy, (2) Social media viral content, (3) Chinese social media marketing copy, (4) 爆款文案, (5) 小红书文案, (6) 种草文案
content-media
小红书视频发布。使用浏览器自动化在网页版小红书创作者服务平台发布视频笔记。当用户说"发布视频到小红书"、"发小红书视频"时使用此技能。
development
小红书全能助手 — 文案生成、封面制作、内容发布与管理。当用户要求写小红书笔记、生成小红书文案/标题/封面、发小红书、搜索小红书、评论点赞收藏等任何小红书相关操作时使用。支持一站式从文案创作到自动发布的完整流程。封面AI生图需配置可选环境变量(GEMINI_API_KEY 或 IMG_API_KEY 或 HUNYUAN_SECRET_ID+KEY)。
business
Auto-generate structured weekly business reports covering KPIs, accomplishments, blockers, and plans. Save hours of reporting time every week.