samuraigpt/muapi-ugc-video-factory

Name: muapi-ugc-video-factory
Author: samuraigpt

library/motion/ugc-video-factory/SKILL.md

npx skillsauth add samuraigpt/embedai muapi-ugc-video-factory

UGC Video Factory

Turn a person photo + product photo (+ optional script & environment) into a vertical 9:16 UGC-style video ad with native dialogue audio.

A three-stage pipeline:

1. GPT writes a director-grade ultra-realistic lifestyle photography prompt from your inputs.
2. Nano-Banana Pro Edit fuses the person + product into a single hero photo (1K, 9:16).
3. Seedance 2.0 VIP Image-to-Video animates the hero photo into a 10s vertical UGC clip with synced spoken audio.

Inputs

| Name | Type | Required | Default | Description |
|:---|:---|:---|:---|:---|
| person | image_url | yes | (none) | Photo of the person who will appear in the ad (face + upper body works best). |
| product | image_url | yes | (none) | Clear photo of the product (preferably on a neutral background, logo/text legible). |
| script | text | no | Okay… first of all, ship happens. And this hat is honestly my favorite. It also comes in navy and black, so you can pick your vibe. | The exact line the on-screen person will say (keep it short: 1–2 sentences fit 10s comfortably). |
| environment | text | no | study room, laptop in front of it | Scene / context where the person is using the product (e.g. "bathroom mirror, morning routine", "coffee shop window seat"). |

If person or product is missing, ask the user to upload them (muapi upload file <path>) or offer to generate placeholders before continuing.
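
The input rules above (two required image URLs, two optional text fields with defaults) can be sketched as a small validation helper. This is only an illustration: the `UgcInputs` class and both function names are hypothetical and not part of the skill or the muapi CLI; the default strings are copied from the Inputs table.

```python
from dataclasses import dataclass

# Defaults copied from the Inputs table.
DEFAULT_SCRIPT = (
    "Okay… first of all, ship happens. And this hat is honestly my favorite. "
    "It also comes in navy and black, so you can pick your vibe."
)
DEFAULT_ENVIRONMENT = "study room, laptop in front of it"


@dataclass
class UgcInputs:
    """Hypothetical container for the four skill inputs."""
    person: str
    product: str
    script: str = DEFAULT_SCRIPT
    environment: str = DEFAULT_ENVIRONMENT


def missing_required(raw: dict) -> list:
    """Return the names of required inputs that are absent or blank."""
    return [
        name for name in ("person", "product")
        if not str(raw.get(name) or "").strip()
    ]


def build_inputs(raw: dict) -> UgcInputs:
    """Validate raw inputs and fill in defaults for the optional ones."""
    missing = missing_required(raw)
    if missing:
        # The agent would ask the user to upload these before continuing.
        raise ValueError("missing required inputs: " + ", ".join(missing))
    optional = {k: raw[k] for k in ("script", "environment") if raw.get(k)}
    return UgcInputs(person=raw["person"], product=raw["product"], **optional)
```

An agent could call `missing_required` first and use the returned names to decide what to ask the user for.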

Steps

Run the three steps sequentially — each step's output feeds the next.

Step 1 — Director Prompt (GPT)

Use a GPT model (gpt-5.1 or whichever chat model is available to the executing agent) with temperature 0 and max ~200 tokens to produce the hero-image prompt.

System prompt: You are a helpful assistant.

User prompt (substitute {{person}}, {{product}}, {{environment}}):

Uploaded images are being analyzed. Ultra-realistic lifestyle photography with {{person}} and {{product}} and {{environment}}.

If the product is wearable (e.g., hat, glasses, hooded sweatshirt), the person wears the product naturally.

If the product is carried in the hand (e.g., cream, bottle, thermos), the person holds the product naturally.

The product is clearly visible and is the main focus of the image. The logo or text on the product must be legible.

The person has a natural and modern look with a minimalist style.

The scene is consistent with the context of the product's use: {{environment}}.

Lighting: soft natural daylight.
Background: clean, aesthetic, slightly blurred (shallow depth of field).
Style: high-end commercial lifestyle photography, realistic textures, 4K quality, vertical 9:16 composition, social-media advertising style. The background and environment should be appropriate to the product (e.g. a woman with a serum could be at home). The person's facial details and the product must remain unchanged.

Capture the GPT response as {{step1_prompt}}.
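
A minimal sketch of the `{{name}}` substitution used by the prompt templates, assuming the executing agent fills placeholders in code rather than by hand. The `render` helper is hypothetical; it fails loudly when a placeholder has no value, so that an unfilled `{{environment}}` never reaches the model.

```python
import re

# Matches {{name}} with optional whitespace inside the braces.
PLACEHOLDER = re.compile(r"\{\{\s*(\w+)\s*\}\}")


def render(template: str, values: dict) -> str:
    """Replace every {{name}} in template with values[name]."""
    def replace(match):
        name = match.group(1)
        if name not in values:
            raise KeyError(f"no value supplied for placeholder {name!r}")
        return str(values[name])

    return PLACEHOLDER.sub(replace, template)
```

For example, `render("with {{person}} and {{product}}", {"person": "A", "product": "B"})` yields `"with A and B"`.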

Step 2 — Hero Image (Nano-Banana Pro Edit)

Submit a muapi image edit call against the nano-banana-pro-edit model:

Reference images (image_urls): [ {{person}}, {{product}} ] — order matters; person first.
Prompt: {{step1_prompt}} from Step 1.
Aspect ratio: 9:16
Num images: 1
Resolution: 1K
Output format: jpeg

Capture the resulting image URL as {{hero_image}}. Briefly show it to the user for approval before kicking off the video step.
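
The Step 2 parameters could be collected into a request body along these lines. Only `image_urls` is named in the text above; every other key (`prompt`, `aspect_ratio`, `num_images`, `resolution`, `output_format`) is an assumed spelling and should be checked against the actual muapi schema before use.

```python
import json


def hero_image_payload(step1_prompt: str, person_url: str, product_url: str) -> dict:
    """Build a request body for the nano-banana-pro-edit call (key names assumed)."""
    return {
        "prompt": step1_prompt,
        # Order matters: person first, then product.
        "image_urls": [person_url, product_url],
        "aspect_ratio": "9:16",   # assumed key name
        "num_images": 1,          # assumed key name
        "resolution": "1K",       # assumed key name
        "output_format": "jpeg",  # assumed key name
    }


if __name__ == "__main__":
    body = hero_image_payload("<step1_prompt>", "<person url>", "<product url>")
    print(json.dumps(body, indent=2))
```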

Step 3 — UGC Video (Seedance 2.0 VIP Image-to-Video)

Submit a muapi video from-image call against seedance-2-vip-image-to-video (or the -fast variant if the executing agent wants lower latency).

Start image: {{hero_image}} from Step 2.
Aspect ratio: 9:16
Duration: 10 seconds.
Generate audio: true (native dialogue).
CFG scale: 0.5
Negative prompt: blur, distort, low quality
Prompt (substitute {{script}}):

Create a 10-second vertical UGC-style video (9:16).

A person is interacting naturally with their setting and product.

The product is used naturally:
- If wearable → the person is wearing it.
- If handheld → the person is holding or applying it.

The video is a single, uninterrupted shot. No cuts. No color changes. No text on screen.

The person looks directly at the camera with a relaxed and natural expression.
They interact comfortably with the product using their hands (adjusting, holding, pointing).

They say in a natural, conversational tone:

"{{script}}"

Subtle hand gestures while speaking.
End with a small smile or nod.

Style: authentic UGC, handheld phone feel, light natural movement, soft daylight, shallow depth of field, TikTok/Reels aesthetic.

Poll the result with muapi predict wait <request_id> and download to the user's outputs directory.
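
As a rough sketch, the Step 3 settings and the choice between the standard and `-fast` model could be expressed as below. The two model names come from this document; the payload key names are assumptions, not a confirmed muapi schema.

```python
BASE_MODEL = "seedance-2-vip-image-to-video"


def video_request(hero_image: str, prompt: str, fast: bool = False):
    """Return (model name, request body) for the image-to-video call."""
    model = BASE_MODEL + ("-fast" if fast else "")
    payload = {
        "image_url": hero_image,       # assumed key name for the start image
        "prompt": prompt,              # the Step 3 prompt with {{script}} filled in
        "aspect_ratio": "9:16",        # assumed key name
        "duration": 10,                # seconds; assumed key name
        "generate_audio": True,        # native dialogue; assumed key name
        "cfg_scale": 0.5,              # assumed key name
        "negative_prompt": "blur, distort, low quality",  # assumed key name
    }
    return model, payload
```

Keeping the model name separate from the body makes it easy to retry the same request on the `-fast` variant.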

Notes

VIP tier supports 9:16 and durations of 4–15s; 10s is the sweet spot for a 1–2 sentence script.
Keep the script short: Seedance 2.0 will compress longer scripts and clip words.
Seedance VIP tolerates realistic human faces in references (unlike the Chinese tier), making it the right choice for UGC.
If you want lower latency at the same quality, swap to seedance-2-vip-image-to-video-fast.
For multi-shot ads, generate several {{hero_image}} variations in Step 2 and animate each independently; Seedance VIP does not support multi-image i2v at 9:16 with audio.
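
A pre-flight check for the duration and script-length notes above might look like this. The 4–15s range is from the notes; the speaking rate of 2.5 words per second is purely an assumed rule of thumb, not something Seedance documents, so treat the estimate as a rough guide only.

```python
MIN_SECONDS, MAX_SECONDS = 4, 15  # VIP tier range stated in the notes


def check_duration(seconds: float) -> float:
    """Raise ValueError if the duration is outside the 4-15s range."""
    if not MIN_SECONDS <= seconds <= MAX_SECONDS:
        raise ValueError(
            f"duration must be between {MIN_SECONDS}s and {MAX_SECONDS}s, got {seconds}s"
        )
    return seconds


def estimated_speech_seconds(script: str, words_per_second: float = 2.5) -> float:
    """Estimate speaking time from the word count (rate is an assumption)."""
    return len(script.split()) / words_per_second


def script_fits(script: str, seconds: float = 10, words_per_second: float = 2.5) -> bool:
    """True if the estimated speaking time fits inside the clip duration."""
    check_duration(seconds)
    return estimated_speech_seconds(script, words_per_second) <= seconds
```

If `script_fits` returns False, the agent could suggest trimming the script or choosing a longer duration within the supported range.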

Trigger Keywords

ugc video factory, ugc video ad, person plus product video, talking product ad, ugc reel, lifestyle product video, vertical ugc video

Notes for the Executing Agent

This recipe is LLM-orchestrated: read each phase, gather any missing inputs from the user, then call muapi CLI commands. Run muapi auth configure first if MUAPI_API_KEY is unset.
For local files supplied by the user, upload them first: muapi upload file <path> --output-json --jq '.url'.
Substitute {{input_name}} placeholders with the user's actual inputs before issuing each call.
If the muapi CLI does not yet alias nano-banana-pro-edit or seedance-2-vip-image-to-video, fall back to the raw API: curl -X POST https://api.muapi.ai/api/v1/<endpoint> -H "x-api-key: $MUAPI_API_KEY" -H 'content-type: application/json' -d '{...}', then poll with muapi predict wait <request_id>.
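
The raw-API fallback can also be prepared from Python with the standard library. This sketch only builds the request; it mirrors the curl command above (POST, `x-api-key` header, JSON body). The endpoint name is not given in this document, so it stays a caller-supplied argument, and `build_request` itself is a hypothetical helper.

```python
import json
import urllib.request

API_BASE = "https://api.muapi.ai/api/v1/"


def build_request(endpoint: str, payload: dict, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a POST request equivalent to the curl fallback."""
    return urllib.request.Request(
        API_BASE + endpoint.lstrip("/"),
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "x-api-key": api_key,  # value of MUAPI_API_KEY
            "content-type": "application/json",
        },
        method="POST",
    )
```

Passing the result to `urllib.request.urlopen` would send it; the response format is not described here, so parsing is left to the caller, followed by `muapi predict wait <request_id>` as above.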

Turn a person photo + a product photo + an optional script into a vertical 9:16 UGC-style video ad. Generates a lifestyle hero image (Nano-Banana Pro Edit), then animates it with native audio using Seedance 2.0 VIP image-to-video.

3,298 stars

development

Updated May 20, 2026

Install this skill globally with one skillsauth command:

npx skillsauth add samuraigpt/embedai muapi-ugc-video-factory

Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Trivy (container and dependency vulnerability scanner): Clean, 95%
Semgrep (static code analysis for vulnerabilities): Clean, 95%
mcp-scan (Snyk) (Model Context Protocol security validation): Clean, 95%
Snyk (dep) (open source security scanning): Skipped, 50%
Socket.dev (supply chain security analysis): Skipped, 50%
VirusTotal (multi-engine malware detection): Skipped, 50%
CrowdStrike (advanced threat intelligence): Skipped, 50%
OSV-Scanner (Open Source Vulnerability database check): Skipped, 50%
OWASP Dep-Check: Skipped, 50%

Last scanned: May 20, 2026, 5:58 AM (228.5s, 1 file scanned)

SKILL.md

slug: muapi-ugc-video-factory
name: muapi-ugc-video-factory
version: 1.0.0
description: Turn a person photo + a product photo + an optional script into a vertical 9:16 UGC-style video ad. Generates a lifestyle hero image (Nano-Banana Pro Edit), then animates it with native audio using Seedance 2.0 VIP image-to-video.
acceptLicenseTerms: true

Related Skills

samuraigpt/muapi-color-analysis-board

development | Verified, Trusted, Community

Turn a portrait photo into a high-end editorial "Color Analysis Board" in a luxury fashion-magazine style (Dior / Ralph Lauren aesthetic): best colors, undertone, makeup guide, capsule wardrobe, hair & jewelry recommendations, all laid out on a clean beige/ivory grid.

3,298 | SKILL.md | Updated May 20, 2026

samuraigpt/muapi-freeze-effect-video

development | Verified, Trusted, Community

Generate a cinematic "freeze effect" video where time stops mid-scene, the subject walks through the frozen world, then time resumes with a snap.

3,298 | SKILL.md | Updated May 17, 2026

samuraigpt/muapi-youtube-thumbnail

development | Verified, Trusted, Community

Design a high-CTR YouTube thumbnail: striking imagery, bold text placement, and emotional face/subject if needed.

3,298 | SKILL.md | Updated May 16, 2026

samuraigpt/muapi-url-to-design

development | Verified, Trusted, Community

Analyze a website URL and generate a redesigned, improved UI: recreate the visual design with modern aesthetics, better hierarchy, and fresh brand direction.

3,298 | SKILL.md | Updated May 16, 2026

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Install manually

# Clone the repo
git clone https://github.com/samuraigpt/embedai.git

# Copy into the Claude Code skills folder (global), creating it first if needed
mkdir -p ~/.claude/skills
cp -r embedai/library/motion/ugc-video-factory ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

samuraigpt/embedai

3,298 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT