skills/video/SKILL.md
Generate videos using OpenAI Sora, Minimax Hailuo, Gemini Veo, or Seedance through the hosted Skills runtime with provider-cost pricing.
npx skillsauth add hasna/skills videoInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Generate short videos from text prompts or image references using provider-backed video models. This skill routes through the hosted Skills runtime so billing can reserve the selected provider/model cost before execution.
openai: sora-2, sora-2-prominimax: MiniMax-Hailuo-2.3-Fast, MiniMax-Hailuo-2.3gemini: veo-3.1-fast-generate-preview, veo-3.1-generate-previewseedance: dreamina-seedance-2.0, dreamina-seedance-2.0-fastvideo generate --provider seedance --prompt "cinematic product reveal" --duration 6 --output ./video.mp4
video generate --provider openai --model sora-2 --prompt "aerial city sunrise" --duration 10 --output ./video.mp4
video generate --provider gemini --model veo-3.1-fast-generate-preview --prompt "chef plating dessert" --output ./video.mp4
--provider: openai, minimax, gemini, or seedance--model: provider model override--prompt: text prompt--image: optional reference image path or URL--duration: requested duration in seconds--size: provider-specific size or aspect ratio--output: output file pathSKILL_API_KEY: required for hosted runtime executiontools
Generate hosted voiceover variants and short jingles
tools
Generate premium video highlight packages with clip plans, captions, thumbnails, chapter markers, social copy, edit decisions, and manifest metadata.
testing
Generate high-quality articles using parallel AI agents. Supports research, writing, and optional cover image generation. Write single articles or batch process multiple topics with configurable parallelism.
development
Transcribe audio and video files using ElevenLabs Scribe, OpenAI Whisper, or Google Gemini. Supports automatic chunking for large files, speaker diarization, timestamps, and multiple output formats (text, SRT, VTT, JSON).