Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

gitroomhq/Agent-Media UGC Playbook

Name: Agent-Media UGC Playbook
Author: gitroomhq

skills/agent-media-ugc/SKILL.md

npx skillsauth add gitroomhq/agent-media Agent-Media UGC Playbook

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Agent-Media UGC Playbook

You're an agent (Claude, Cursor, custom) that needs to produce a finished UGC video on the agent-media vNext runtime. This playbook gives you the three orchestration patterns and the rules of thumb for picking between them.

Pick a pattern

| Pattern | When to use | Calls | Total time | Total credits | | --- | --- | --- | --- | --- | | A — One-shot composed skill | The user gave you a person description (or a portrait URL) AND a script. You don't need to show intermediate artifacts. | 1 (make_ugc_video) | ~7 min | ~185–385 | | B — Step-by-step primitives | The user wants to approve each step (portrait, sheet, selfie) before moving on, OR you want different parameters at each stage. | 4 (make_portrait → make_character_sheet → make_simple_selfie → make_subtitles) | ~7–9 min | ~185–385 | | C — Image-first | The user supplied a portrait image or R2 URL — skip portrait generation. | 3 (make_character_sheet → make_simple_selfie → make_subtitles) | ~6–8 min | ~150–350 |

Pattern A — `make_ugc_video` (recommended default)

Single composed call. The server runs the whole pipeline inside one Temporal workflow and returns a skill_run_id you poll for per-step status.

POST https://api.agent-media.ai/v1/skills/make_ugc_video/run
Authorization: Bearer $AGENT_MEDIA_API_KEY

{
  "description": "a friendly young woman, soft daylight, candid framing",
  "character_description": "Maya, 27 years old",
  "script": "Okay this is wild, I tried the new flow and it actually works.",
  "duration": 5,
  "subtitles": true,
  "subtitles_style": "hormozi"
}

Poll with GET /v1/skills/runs/<skill_run_id> — the response surfaces per-step artifacts (portrait_url → character_sheet_url → video_url) as each primitive completes. Final video has subtitles burned in.

See skills/make-ugc-video/SKILL.md for the schema.

Step D — publish it (optional)

Once you have a video_url, you can post it straight to the user's TikTok / Instagram / X with the publish-to-social skill — POST /v1/social/publish (CLI agent-media social publish, MCP social_publish). The user connects each network once via OAuth (/v1/social/connect). See skills/publish-to-social/SKILL.md.

Pattern B — chain the four primitives

Use when the user wants tighter control (e.g. regenerate just the portrait, or pick a different character sheet description after seeing the portrait).

1. POST /v1/skills/make_portrait/run            { description, realism_target }
   → wait, get portrait_url

2. POST /v1/skills/make_character_sheet/run     { portrait_url, description }
   → wait, get character_sheet_url

3. POST /v1/skills/make_simple_selfie/run       { character_sheet_url, script, duration }
   → wait, get video_url

4. POST /v1/skills/make_subtitles/run           { video_url, style: "hormozi" }
   → wait, get subtitled video_url

Each step's run_id polls via GET /v1/primitives/runs/<run_id>. Show the user each artifact (portrait, sheet, raw video, subtitled video) and ask for approval before moving to the next step if interactivity matters.

Pattern C — image-first (user uploaded a portrait)

Skip step 1 of pattern B. The user's portrait must already live on R2 — either via the agent-media frontend upload, or via the portrait_image_base64 field on make_character_sheet which the API uploads for you.

Identity rules baked into every pattern

No "selfie" in prompt vocabulary. Seedance treats "selfie" as "subject holds a phone with both hands". The primitive prompt builder substitutes "vertical TikTok-style close-up" / "talking-head close-up" automatically. Don't fight it.
Script pacing: 2–4 words per second. 5s → 10–20 words, 10s → 20–40, 15s → 30–60. Outside this window = HTTP 400 at submit, no spend.
Reference image URLs must be R2-hosted. SSRF guard rejects everything else. Use portrait_image_base64 on make_character_sheet if you only have raw bytes.
Credits are deducted at submit. Refunded automatically on terminal failure (insufficient credits, content-policy reject, etc.).

Troubleshooting

INSUFFICIENT_CREDITS — user is out. Surface the agent-media.ai billing page link.
REFERENCE_URL_NOT_ALLOWED — you passed a non-R2 URL. Re-upload via portrait_image_base64 on make_character_sheet, or to a user gallery.
Video has subject holding a phone — they used a pose hint mentioning "selfie" or "phone". Strip it or set pose: "".
Step stuck in submitted — Temporal worker may be restarting. Wait 30s and re-poll; if still stuck after 5 min, the workflow is hung — contact agent-media support with the workflow_id.

gitroomhq/Agent-Media UGC Playbook

skills/agent-media-ugc/SKILL.md

Playbook for orchestrating an end-to-end UGC video on the agent-media vNext runtime. Read this before deciding whether to call the one-shot make_ugc_video skill or to chain the four primitives (make_portrait → make_character_sheet → make_simple_selfie → make_subtitles) manually.

41 stars

testing

Updated Jun 4, 2026

$ install --global

skillsauth

npx skillsauth add gitroomhq/agent-media Agent-Media UGC Playbook

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Jun 4, 2026, 2:31 AM241.7s1 file scanned

SKILL.md

name:: Agent-Media UGC Playbook
description:: Playbook for orchestrating an end-to-end UGC video on the agent-media vNext runtime. Read this before deciding whether to call the one-shot make_ugc_video skill or to chain the four primitives (make_portrait → make_character_sheet → make_simple_selfie → make_subtitles) manually.
allowed-tools:: ['mcp__agent-media__make_portrait', 'mcp__agent-media__make_character_sheet', 'mcp__agent-media__make_simple_selfie', 'mcp__agent-media__make_product_in_hands', 'mcp__agent-media__make_subtitles', 'mcp__agent-media__make_wireframe', 'mcp__agent-media__make_lip_sync', 'mcp__agent-media__make_ugc_video']
x-skill-slug:: agent-media-ugc
x-skill-version:: 1.1.0

Agent-Media UGC Playbook

Pick a pattern

Pattern A — `make_ugc_video` (recommended default)

Single composed call. The server runs the whole pipeline inside one Temporal workflow and returns a skill_run_id you poll for per-step status.

POST https://api.agent-media.ai/v1/skills/make_ugc_video/run
Authorization: Bearer $AGENT_MEDIA_API_KEY

{
  "description": "a friendly young woman, soft daylight, candid framing",
  "character_description": "Maya, 27 years old",
  "script": "Okay this is wild, I tried the new flow and it actually works.",
  "duration": 5,
  "subtitles": true,
  "subtitles_style": "hormozi"
}

See skills/make-ugc-video/SKILL.md for the schema.

Step D — publish it (optional)

Pattern B — chain the four primitives

Use when the user wants tighter control (e.g. regenerate just the portrait, or pick a different character sheet description after seeing the portrait).

1. POST /v1/skills/make_portrait/run            { description, realism_target }
   → wait, get portrait_url

2. POST /v1/skills/make_character_sheet/run     { portrait_url, description }
   → wait, get character_sheet_url

3. POST /v1/skills/make_simple_selfie/run       { character_sheet_url, script, duration }
   → wait, get video_url

4. POST /v1/skills/make_subtitles/run           { video_url, style: "hormozi" }
   → wait, get subtitled video_url

Pattern C — image-first (user uploaded a portrait)

Identity rules baked into every pattern

No "selfie" in prompt vocabulary. Seedance treats "selfie" as "subject holds a phone with both hands". The primitive prompt builder substitutes "vertical TikTok-style close-up" / "talking-head close-up" automatically. Don't fight it.
Script pacing: 2–4 words per second. 5s → 10–20 words, 10s → 20–40, 15s → 30–60. Outside this window = HTTP 400 at submit, no spend.
Reference image URLs must be R2-hosted. SSRF guard rejects everything else. Use portrait_image_base64 on make_character_sheet if you only have raw bytes.
Credits are deducted at submit. Refunded automatically on terminal failure (insufficient credits, content-policy reject, etc.).

Troubleshooting

INSUFFICIENT_CREDITS — user is out. Surface the agent-media.ai billing page link.
REFERENCE_URL_NOT_ALLOWED — you passed a non-R2 URL. Re-upload via portrait_image_base64 on make_character_sheet, or to a user gallery.
Video has subject holding a phone — they used a pose hint mentioning "selfie" or "phone". Strip it or set pose: "".
Step stuck in submitted — Temporal worker may be restarting. Wait 30s and re-poll; if still stuck after 5 min, the workflow is hung — contact agent-media support with the workflow_id.

Related Skills

gitroomhq/Make Product In Hands

content-media

VerifiedTrustedCommunity

Generate a 5/10/15s vertical UGC video where your character holds, wears, and shows a product. Provide a character_sheet_url (R2-hosted) and the product image (product_image_url — any https URL — OR product_image_base64; re-hosted to R2 automatically). Two modes: script for a lip-synced talking-head product review (2-4 words/sec), OR scene_action for a silent demo / b-roll. Set subject (e.g. "a young woman") to lock the person's gender/appearance so a gendered product can't drift it. framing: "close_up" (chest-up, default) or "full_body" (head-to-toe, for turn-arounds / showing the whole outfit). Both the person and the exact product are locked from the reference images.

41SKILL.mdUpdated Jun 4, 2026

gitroomhq/Make Product In Hands

gitroomhq/Publish to Social

development

VerifiedTrustedCommunity

Publish a generated agent-media video to the user's connected TikTok, Instagram, or X. Connect channels (OAuth) and post or schedule via the REST API. Use after producing a video with make_ugc_video / make_simple_selfie.

41SKILL.mdUpdated May 31, 2026

gitroomhq/Publish to Social

gitroomhq/Make Wireframe

tools

VerifiedTrustedCommunity

Generate a photographic storyboard / wireframe board from a character sheet (R2-hosted) + script. Multi-panel grid showing the same person performing the action progression, 4 / 6 / 8 / 10 numbered panels.

40SKILL.mdUpdated May 31, 2026

gitroomhq/Make Wireframe

gitroomhq/Make UGC Video

development

VerifiedTrustedCommunity

End-to-end UGC video in one call. Provide EITHER a text description of the person, OR a portrait URL (R2-hosted), OR an uploaded image. The pipeline auto-generates the missing portrait, builds a character sheet, and produces a 5/10/15s vertical selfie video with native lip-synced audio of your script.

40SKILL.mdUpdated May 31, 2026

gitroomhq/Make UGC Video

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/gitroomhq/agent-media.git

# Copy into Claude Code skills folder (global)
cp -r agent-media/skills/agent-media-ugc ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

gitroomhq/agent-media

41 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT

Adoption

gitroomhq/Agent-Media UGC Playbook

$ install --global

Security Scan Results

SKILL.md

Agent-Media UGC Playbook

Pick a pattern

Pattern A — `make_ugc_video` (recommended default)

Step D — publish it (optional)

Pattern B — chain the four primitives

Pattern C — image-first (user uploaded a portrait)

Identity rules baked into every pattern

Troubleshooting

See also

Related Skills

gitroomhq/Make Product In Hands

gitroomhq/Publish to Social

gitroomhq/Make Wireframe

gitroomhq/Make UGC Video

gitroomhq/Agent-Media UGC Playbook

$ install --global

Security Scan Results

SKILL.md

Agent-Media UGC Playbook

Pick a pattern

Pattern A — `make_ugc_video` (recommended default)

Step D — publish it (optional)

Pattern B — chain the four primitives

Pattern C — image-first (user uploaded a portrait)

Identity rules baked into every pattern

Troubleshooting

See also

Related Skills

gitroomhq/Make Product In Hands

gitroomhq/Publish to Social

gitroomhq/Make Wireframe

gitroomhq/Make UGC Video

Adoption

gitroomhq/Agent-Media UGC Playbook

$ install --global

Security Scan Results

SKILL.md

Agent-Media UGC Playbook

Pick a pattern

Pattern A — make_ugc_video (recommended default)

Step D — publish it (optional)

Pattern B — chain the four primitives

Pattern C — image-first (user uploaded a portrait)

Identity rules baked into every pattern

Troubleshooting

See also

Related Skills

gitroomhq/Make Product In Hands

gitroomhq/Publish to Social

gitroomhq/Make Wireframe

gitroomhq/Make UGC Video

gitroomhq/Agent-Media UGC Playbook

$ install --global

Security Scan Results

SKILL.md

Agent-Media UGC Playbook

Pick a pattern

Pattern A — make_ugc_video (recommended default)

Step D — publish it (optional)

Pattern B — chain the four primitives

Pattern C — image-first (user uploaded a portrait)

Identity rules baked into every pattern

Troubleshooting

See also

Related Skills

gitroomhq/Make Product In Hands

gitroomhq/Publish to Social

gitroomhq/Make Wireframe

gitroomhq/Make UGC Video

Pattern A — `make_ugc_video` (recommended default)

Pattern A — `make_ugc_video` (recommended default)