tools/video/happyhorse/SKILL.md
Generate and edit videos with Alibaba HappyHorse 1.0 models via inference.sh CLI. Models: HappyHorse T2V, I2V, R2V, Video Edit. Capabilities: text-to-video, image-to-video, reference-to-video, video editing with natural language, character preservation, 720P/1080P, up to 15 seconds. Use for: physically realistic video, video editing, character-consistent content, product demos, social media. Triggers: happyhorse, happy horse, alibaba video, happyhorse 1.0, dashscope video, alibaba happyhorse, video editing ai, ai video editor
npx skillsauth add inference-sh-8/skills happyhorseInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Install the belt CLI skill:
npx skills add belt-sh/cli
Generate and edit physically realistic videos with Alibaba's HappyHorse 1.0 models via inference.sh CLI.
Requires inference.sh CLI (
belt). Install instructions
belt login
belt app run alibaba/happyhorse-1-0-t2v --input '{"prompt": "a horse galloping across a sunlit meadow"}'
| Model | App ID | Best For |
|-------|--------|----------|
| T2V | alibaba/happyhorse-1-0-t2v | Text-to-video, physically realistic motion |
| I2V | alibaba/happyhorse-1-0-i2v | Animate a single image |
| R2V | alibaba/happyhorse-1-0-r2v | Preserve characters from up to 9 reference images |
| Video Edit | alibaba/happyhorse-1-0-video-edit | Edit existing videos with natural language |
All models support 720P/1080P resolution, up to 15 seconds duration.
belt app run alibaba/happyhorse-1-0-t2v --input '{
"prompt": "a golden retriever running through autumn leaves in a park, slow motion",
"duration": 10,
"resolution": "1080P",
"ratio": "16:9"
}'
Animate a still image:
belt app run alibaba/happyhorse-1-0-i2v --input '{
"first_frame": "https://your-image.jpg",
"prompt": "gentle camera zoom, clouds moving in the sky",
"duration": 8,
"resolution": "720P"
}'
Generate videos that preserve characters from reference images (up to 9):
belt app run alibaba/happyhorse-1-0-r2v --input '{
"prompt": "a woman walking through a busy market street",
"reference_images": ["https://portrait.jpg"],
"duration": 10,
"resolution": "720P"
}'
belt app run alibaba/happyhorse-1-0-r2v --input '{
"prompt": "two friends sitting at a cafe having coffee",
"reference_images": ["https://person1.jpg", "https://person2.jpg"],
"ratio": "16:9"
}'
Edit existing videos using natural language instructions:
belt app run alibaba/happyhorse-1-0-video-edit --input '{
"video": "https://your-video.mp4",
"prompt": "change the background to a snowy mountain landscape"
}'
belt app run alibaba/happyhorse-1-0-video-edit --input '{
"video": "https://your-video.mp4",
"prompt": "replace the person with the character from the reference image",
"reference_images": ["https://character.jpg"]
}'
belt app run alibaba/happyhorse-1-0-video-edit --input '{
"video": "https://your-video.mp4",
"prompt": "make the scene look like a rainy day",
"audio_setting": "generate"
}'
| Resolution | Price | |------------|-------| | 720P | $0.14 per second | | 1080P | $0.24 per second |
Video Edit is billed on input + output duration.
| Parameter | Type | Default | Description |
|-----------|------|---------|-------------|
| prompt | string | required | Text description of the video |
| duration | integer | 5 | Duration in seconds (3–15) |
| resolution | enum | 720P | 720P or 1080P |
| ratio | enum | 16:9 | 16:9, 9:16, 1:1, 4:3, 3:4, 21:9 |
| seed | integer | random | Reproducible generation |
| watermark | boolean | false | Add HappyHorse watermark |
| Parameter | Type | Default | Description |
|-----------|------|---------|-------------|
| first_frame | file | required | First frame image (JPEG, PNG, WebP) |
| prompt | string | - | Optional text description |
| duration | integer | 5 | Duration in seconds (3–15) |
| resolution | enum | 720P | 720P or 1080P |
| seed | integer | random | Reproducible generation |
| Parameter | Type | Default | Description |
|-----------|------|---------|-------------|
| prompt | string | required | Text description of the scene |
| reference_images | array | required | Up to 9 character reference images |
| duration | integer | 5 | Duration in seconds (3–15) |
| resolution | enum | 720P | 720P or 1080P |
| ratio | enum | 16:9 | 16:9, 9:16, 1:1, 4:3, 3:4, 21:9 |
| seed | integer | random | Reproducible generation |
| Parameter | Type | Default | Description |
|-----------|------|---------|-------------|
| video | file | required | Video to edit (MP4/MOV, H.264) |
| prompt | string | required | Editing instruction |
| reference_images | array | - | Up to 5 reference images |
| audio_setting | enum | auto | auto, generate, or keep_original |
| resolution | enum | 720P | 720P or 1080P |
| seed | integer | random | Reproducible generation |
belt app store search "happyhorse"
# Full platform skill (all 250+ apps)
npx skills add inference-sh/skills@infsh-cli
# All video generation models
npx skills add inference-sh/skills@ai-video-generation
# Seedance 2.0
npx skills add inference-sh/skills@seedance
# Google Veo
npx skills add inference-sh/skills@google-veo
# Image generation (for image-to-video)
npx skills add inference-sh/skills@ai-image-generation
Browse all video apps: belt app store --category video
data-ai
Generate multi-person talking head podcast videos from scratch using AI — character creation, TTS, avatar animation, and video stitching. Use when the user wants to create a podcast, talking head video, or multi-speaker conversation video.
development
Declarative UI widgets from JSON for React/Next.js from ui.inference.sh. Render rich interactive UIs from structured agent responses. Capabilities: forms, buttons, cards, layouts, inputs, selects, checkboxes. Use for: agent-generated UIs, dynamic forms, data display, interactive cards. Triggers: widgets, declarative ui, json ui, widget renderer, agent widgets, dynamic ui, form widgets, card widgets, shadcn widgets, structured output ui
tools
Tool lifecycle UI components for React/Next.js from ui.inference.sh. Display tool calls: pending, progress, approval required, results. Capabilities: tool status, progress indicators, approval flows, results display. Use for: showing agent tool calls, human-in-the-loop approvals, tool output. Triggers: tool ui, tool calls, tool status, tool approval, tool results, agent tools, mcp tools ui, function calling ui, tool lifecycle, tool pending
development
Chat UI building blocks for React/Next.js from ui.inference.sh. Components: container, messages, input, typing indicators, avatars. Capabilities: chat interfaces, message lists, input handling, streaming. Use for: building custom chat UIs, messaging interfaces, AI assistants. Triggers: chat ui, chat component, message list, chat input, shadcn chat, react chat, chat interface, messaging ui, conversation ui, chat building blocks