skills/make-ugc-video/SKILL.md
End-to-end UGC video in one call. Provide EITHER a text description of the person, OR a portrait URL (R2-hosted), OR an uploaded image. The pipeline auto-generates the missing portrait, builds a character sheet, and produces a 5/10/15s vertical selfie video with native lip-synced audio of your script.
npx skillsauth add gitroomhq/agent-media Make UGC Video
Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
Call this skill when the user asks for the outcome described above. It runs on the agent-media vNext primitive runtime via the mcp__agent-media__make_ugc_video MCP tool. Authentication is the user's existing agent-media Bearer token (issued by agent-media login).
Preferred path: MCP tool mcp__agent-media__make_ugc_video. The schema is auto-published via tools/list on the same MCP server, so it is not restated here; trust the server's response.
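As a rough illustration of the discovery-then-call flow on the preferred path, the sketch below builds the two JSON-RPC 2.0 messages an MCP client would send: tools/list to fetch the published schema, then tools/call to invoke the tool. It only constructs the messages and does not open a connection. The server-side tool name make_ugc_video (without the mcp__agent-media__ client prefix) and the helper name jsonrpc_request are assumptions, and the arguments reuse the fields from the REST example in this document; the schema returned by tools/list remains the authority.

```python
import json


def jsonrpc_request(request_id, method, params=None):
    """Build a JSON-RPC 2.0 request object (hypothetical helper)."""
    message = {"jsonrpc": "2.0", "id": request_id, "method": method}
    if params is not None:
        message["params"] = params
    return message


# Step 1: ask the server which tools it exposes and what their schemas are.
list_tools = jsonrpc_request(1, "tools/list")

# Step 2: call the tool. The bare name "make_ugc_video" is an assumption;
# use whatever name the tools/list response actually reports.
call_tool = jsonrpc_request(
    2,
    "tools/call",
    {
        "name": "make_ugc_video",
        "arguments": {
            "description": "a friendly young woman, soft daylight, candid framing",
            "script": "Okay this is wild, I tried the new flow and it actually works.",
            "duration": 5,
        },
    },
)

print(json.dumps(list_tools))
print(json.dumps(call_tool, indent=2))
```

In practice an MCP-aware host such as Claude Code sends these messages itself; the sketch is only meant to show what "trust the server's response" refers to.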
Fallback path: REST.
POST https://api.agent-media.ai/v1/skills/make_ugc_video/run
Authorization: Bearer $AGENT_MEDIA_API_KEY
Content-Type: application/json
Idempotency-Key: <any unique string per intent>

{
  "description": "a friendly young woman, soft daylight, candid framing",
  "character_description": "Maya, 27 years old",
  "script": "Okay this is wild, I tried the new flow and it actually works.",
  "duration": 5,
  "subtitles": true,
  "subtitles_style": "hormozi"
}
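The request above could be assembled from Python roughly as follows. This is a minimal sketch using only the standard library; it builds the request object without sending it. The helper name build_run_request and the sample idempotency key are hypothetical, and the shape of the response is not documented here, so the sketch does not parse one.

```python
import json
import os
import urllib.request
import uuid

RUN_URL = "https://api.agent-media.ai/v1/skills/make_ugc_video/run"


def build_run_request(payload, api_key, idempotency_key=None):
    """Build (but do not send) the POST request for the REST fallback path."""
    headers = {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
        # One unique key per intent; a random UUID is used if none is given.
        "Idempotency-Key": idempotency_key or str(uuid.uuid4()),
    }
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(RUN_URL, data=body, headers=headers, method="POST")


payload = {
    "description": "a friendly young woman, soft daylight, candid framing",
    "character_description": "Maya, 27 years old",
    "script": "Okay this is wild, I tried the new flow and it actually works.",
    "duration": 5,
    "subtitles": True,
    "subtitles_style": "hormozi",
}

request = build_run_request(
    payload,
    api_key=os.environ.get("AGENT_MEDIA_API_KEY", ""),
    idempotency_key="ugc-maya-5s-001",  # hypothetical key, reused on retries
)

# To actually send it:
#   with urllib.request.urlopen(request) as response:
#       result = json.load(response)
```

Passing the same Idempotency-Key when retrying after a network error keeps the retry tied to the same intent, which is the usual reason for sending such a header.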
~225/~365/~505 (5s/10s/15s)
360–600s
GET https://api.agent-media.ai/v1/skills/runs/<skill_run_id>
Authorization: Bearer $AGENT_MEDIA_API_KEY
Returns per-step status with intermediate artifact URLs as each primitive completes.
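Polling the run endpoint until the video is ready might look like the sketch below. Because the response schema is not given here, the loop takes the fetch function and the completion check as parameters instead of assuming field names; the "status" values in the demo stub are hypothetical. The 600-second default timeout mirrors the upper end of the 360–600s figure quoted above, on the assumption that it describes the expected run time.

```python
import time


def poll_until(fetch, is_done, interval_s=10.0, timeout_s=600.0,
               sleep=time.sleep, clock=time.monotonic):
    """Call fetch() repeatedly until is_done(result) is true or time runs out."""
    deadline = clock() + timeout_s
    while True:
        run = fetch()
        if is_done(run):
            return run
        if clock() >= deadline:
            raise TimeoutError(f"run did not finish within {timeout_s} seconds")
        sleep(interval_s)


# Demo with a stubbed fetch. A real fetch would issue the GET request shown
# above with the Bearer token and decode the JSON body.
fake_responses = iter([
    {"status": "running"},    # hypothetical status values
    {"status": "running"},
    {"status": "completed"},
])
calls = []


def fake_fetch():
    response = next(fake_responses)
    calls.append(response)
    return response


result = poll_until(
    fake_fetch,
    is_done=lambda run: run.get("status") == "completed",
    interval_s=0.0,  # no waiting in the demo
)
print(result, len(calls))
```

Keeping the completion check outside the loop means the same helper keeps working once the real per-step status fields are known from the server's response.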
agent-media login.
This file is auto-generated by scripts/generate-public-skill.ts from the registry at services/api-v2/src/skills/registry.ts. Do not hand-edit; CI rejects drift.
content-media
Generate a 5/10/15s vertical UGC video where your character holds, wears, and shows a product. Provide a character_sheet_url (R2-hosted) and the product image (product_image_url — any https URL — OR product_image_base64; re-hosted to R2 automatically). Two modes: script for a lip-synced talking-head product review (2-4 words/sec), OR scene_action for a silent demo / b-roll. Set subject (e.g. "a young woman") to lock the person's gender/appearance so a gendered product can't drift it. framing: "close_up" (chest-up, default) or "full_body" (head-to-toe, for turn-arounds / showing the whole outfit). Both the person and the exact product are locked from the reference images.
development
Publish a generated agent-media video to the user's connected TikTok, Instagram, or X. Connect channels (OAuth) and post or schedule via the REST API. Use after producing a video with make_ugc_video / make_simple_selfie.
testing
Playbook for orchestrating an end-to-end UGC video on the agent-media vNext runtime. Read this before deciding whether to call the one-shot make_ugc_video skill or to chain the four primitives (make_portrait → make_character_sheet → make_simple_selfie → make_subtitles) manually.
tools
Generate a photographic storyboard / wireframe board from a character sheet (R2-hosted) + script. Multi-panel grid showing the same person performing the action progression, 4 / 6 / 8 / 10 numbered panels.