Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

etanhey/content-demo-creation

Name: content-demo-creation
Author: etanhey

skills/golem-powers/content-demo-creation/SKILL.md

npx skillsauth add etanhey/golems content-demo-creation

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Content Demo Creation

Produce a genuinely good product demo video. Two modes. Pick by what exists: a running app (CU-demo) or only docs/a target UX (mimic-demo). The output is always a video an absent stakeholder can watch remotely — delivered to iCloud/Obsidian, not left on a local desktop.

Decision: which mode?

| You have… | Mode | Output | |---|---|---| | A real, runnable app | CU-demo | Screen-recorded narrated walkthrough of the live app | | Only docs / a UX to recreate | mimic-demo | Deterministic Remotion render that recreates the UX | | A running app but want a polished (not raw) result | Both | CU-demo for truth + mimic-demo polish; or CU footage as reference for the render |

If unsure, default to mimic-demo — it's deterministic, re-renderable, and doesn't depend on a fragile live app state.

Gate 0 — Reference reality first, invent nothing visual (MANDATORY, both modes)

Before you recreate ANY pixel, ground the demo in the REAL product UI. This gate is non-negotiable and applies to both modes — it is the #1 way a demo fails.

The authoritative reference is, in priority order:

A screenshot/screen-recording of the running app (Computer Use on the actual app), AND/OR
A user-provided screen recording of the real setup, processed into frames — a first-class reference when the app can't be driven directly (see the allowlist gap below). Frames of the live UI in each state (e.g. the VoiceBar pill recording / transcribing / speaking) ARE ground truth. AND/OR
The real UI source — the actual implementation components (e.g. flow-bar/Sources/VoiceBar/*.swift, the real React/SwiftUI views that ship).

Allowlist gap (real, 2026-05-29): some apps aren't in the Computer-Use allowlist, so you literally cannot screenshot them live (voice-LEAD hit this with the VoiceBar). When that happens, do NOT fall back to inventing — ask the user for a screen recording of the real setup, run it through the qa-video frame pipeline, and use those frames as the Gate 0 reference. A recording the user already made beats a live screenshot you can't take.

NEVER mimic from:

❌ the marketing site / landing-page components (e.g. a pipeline.tsx on the product website) — those are idealized inventions, not the product.
❌ a design blueprint / spec / Figma — aspirational, not what ships.
❌ your own imagination of "what it probably looks like."

Why (real failure, 2026-05-28): a VoiceLayer demo was built off the marketing site + blueprint. Etan: "looks nothing like my setup, nor the VoiceBar." The truth was in flow-bar/Sources/VoiceBar/*.swift. A beautiful render of the wrong UI is a failed demo.

Checklist before rendering/filming:

[ ] I have ONE of: a running-app screenshot, frames from a user-provided recording of the real setup, OR the real shipping UI source — open as my reference.
[ ] I can name the exact source files/paths/frames my visuals are derived from.
[ ] I am NOT deriving any visual from a marketing site or a blueprint.
[ ] If the app isn't Computer-Use-allowlisted, I asked for a screen recording instead of inventing.
[ ] If I can't reach any real reference, I STOP and ask — I do not invent.

Mode A — CU-demo (drive the real app)

Hard-won rules (each is a real correction from a prior demo run — see codex-019e6d0f BrainBar demo):

Computer Use is the driver, NOT bash screenshots. The polished cursor and real interaction only come from Computer Use. screencapture/bash screenshots look dead and are disqualified. ("bash screenshots aren't the same.")
Screen RECORDING, not a few screenshots. A demo is a continuous video. ("do a video or a screen recording and not just a few screenshots.")
Pre-flight a known launch state. Don't film a cold/ambiguous app. Use the app's reset/toggle hook (e.g. a /tmp/.<app>-toggle file, a fresh launch, a seeded fixture) so the walkthrough is reproducible.
Exercise features at decision points — open the thing, click the feature, show the result. Narrate what's happening and why it matters.
When you hit a bug, FIX it (or dispatch a subagent to), don't just film it. A demo that shows a known broken state is a failed demo. ("When you find a bug... why don't you fix this?")
Deliver where the stakeholder can watch. Save the final MP4 into iCloud (~/Library/Mobile Documents/com~apple~CloudDocs/...) or the synced Obsidian folder, then report the exact path. ("put it inside of iCloud or the Obsidian folders so they sync... and tell me where it is.")
Parallelize with a read-only monitor subagent if a PR/CI is in flight while you film — fork it to watch, don't context-switch. (read-only monitor pattern, codex-019e6d89.)

Pipeline:

pre-flight state  →  start screen recording  →  CU drives the app feature-by-feature
   →  narrate (live or scripted)  →  stop recording  →  trim/assemble
   →  fix any bug surfaced, re-take the affected segment  →  deliver to iCloud/Obsidian  →  report path

Mode B — mimic-demo (recreate UX from docs)

Working hypothesis (to validate in eval, grounded in contentClaude's live run): for UI-accurate demos, a deterministic Remotion + @remotion/three render beats prompting an AI video model. The render is pixel-controlled, re-renderable, and never hallucinates the UI. contentClaude (surface:14) independently chose this stack for the VoiceLayer demo and hit the version-mismatch trap below — eval round 1 will confirm whether the render quality justifies the approach.

Pipeline:

PASS GATE 0 (reference the REAL shipping UI — screenshot of running app and/or real UI source; NOT marketing site, NOT blueprint)
  →  read how-it-works docs (README, feature pages) for FLOW/narrative only — never for visuals
  →  build a Remotion + @remotion/three composition recreating the REAL UX
  →  PIN all remotion package versions to one number (see scripts/check-remotion-versions.sh)
  →  render deterministic MP4
  →  [optional] AI video model (LTX local when RAM allows, else cloud) for B-roll / ambient ONLY
  →  [optional] voiceover (VoiceLayer TTS to dogfood, or silent + on-screen captions)
  →  deliver to iCloud/Obsidian  →  report path

Rules:

Pin Remotion versions. A version mismatch (observed in contentClaude's run: 4.0.422 core vs 4.0.421 @remotion/google-fonts/@remotion/paths) breaks React context, hooks, and renders. Pin every @remotion/* + remotion to the SAME exact version (drop the ^). Run scripts/check-remotion-versions.sh before rendering.
The AI video model is a B-roll stage, not the UI stage. Never ask LTX/cloud to render the actual product UI — it will hallucinate. Use it only for ambient/atmospheric shots that frame the deterministic UI render.
Voiceover is optional for round 1. Ship silent + captions to get a reviewable artifact fast; add VoiceLayer TTS narration in a later round (dogfooding VoiceLayer is on-brand when the demo is of VoiceLayer).

AI video stage (LTX) — RAM-gated, switchable

The video-gen stage MUST be cloud-or-local switchable. LTX-2.3 Q4 is ~19.4GB into unified memory.

Before any local LTX run: check free RAM (scripts/ram-gate.sh). If free+inactive is not comfortably above the model size while the agent fleet runs, do NOT run locally — route to cloud (Replicate/fal). Do not OOM the running ecosystem.
Download is disk-only and safe anytime. Running/loading is the gated step.
Image gen (Draw Things) is already local and light — no gating needed there.

Definition of "good" (what eval scores)

A demo is shippable when:

[ ] Gate 0 passed: every visual traces to the real running app or real shipping UI source — nothing derived from a marketing site or blueprint. (Hard fail if violated, no matter how polished.)
[ ] It is a video, watchable end-to-end, delivered to a synced location with the path reported.
[ ] The UI shown is real or pixel-accurate (no hallucinated UI, no known-broken states).
[ ] It narrates the value of each feature, not just the clicks.
[ ] It is reproducible (pre-flight state for CU-demo; pinned deps for mimic-demo).
[ ] Any bug surfaced during filming was fixed and re-taken, not shipped.

Few-shot loop (how this skill improves)

This skill is built the skillCreator way: produce a demo → review quality → feed specific feedback → improve the skill → re-render. Not one-shot. See EVAL.md for the running rounds + deltas.

etanhey/content-demo-creation

skills/golem-powers/content-demo-creation/SKILL.md

Create polished product demo videos by recording a real running app with Computer Use or recreating a product UX as deterministic Remotion/Three output. Use for demo videos, walkthroughs, feature showcases, and product UX mimics. NOT for static screenshots, slide decks, or QA bug-hunting.

3 stars

testing

Updated Jun 5, 2026

$ install --global

skillsauth

npx skillsauth add etanhey/golems content-demo-creation

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Jun 5, 2026, 2:59 AM85.6s5 files scanned

SKILL.md

name:: content-demo-creation
description:: Create polished product demo videos by recording a real running app with Computer Use or recreating a product UX as deterministic Remotion/Three output. Use for demo videos, walkthroughs, feature showcases, and product UX mimics. NOT for static screenshots, slide decks, or QA bug-hunting.
home:: TBD (Content Golem — repo to confirm; ~/Gits/contentGolem is a slide-deck repo, not the demo home)
author:: skillCreatorClaude (gen-9 s:4)
status:: v1-draft (eval pending)
seeded_from:: codex-019e6d0f BrainBar CU-demo precedent (verified) + contentClaude live Remotion run on surface:14 (observed)

Content Demo Creation

Produce a genuinely good product demo video. Two modes. Pick by what exists: a running app (CU-demo) or only docs/a target UX (mimic-demo). The output is always a video an absent stakeholder can watch remotely — delivered to iCloud/Obsidian, not left on a local desktop.

Decision: which mode?

If unsure, default to mimic-demo — it's deterministic, re-renderable, and doesn't depend on a fragile live app state.

Gate 0 — Reference reality first, invent nothing visual (MANDATORY, both modes)

Before you recreate ANY pixel, ground the demo in the REAL product UI. This gate is non-negotiable and applies to both modes — it is the #1 way a demo fails.

The authoritative reference is, in priority order:

A screenshot/screen-recording of the running app (Computer Use on the actual app), AND/OR
A user-provided screen recording of the real setup, processed into frames — a first-class reference when the app can't be driven directly (see the allowlist gap below). Frames of the live UI in each state (e.g. the VoiceBar pill recording / transcribing / speaking) ARE ground truth. AND/OR
The real UI source — the actual implementation components (e.g. flow-bar/Sources/VoiceBar/*.swift, the real React/SwiftUI views that ship).

NEVER mimic from:

❌ the marketing site / landing-page components (e.g. a pipeline.tsx on the product website) — those are idealized inventions, not the product.
❌ a design blueprint / spec / Figma — aspirational, not what ships.
❌ your own imagination of "what it probably looks like."

Checklist before rendering/filming:

[ ] I have ONE of: a running-app screenshot, frames from a user-provided recording of the real setup, OR the real shipping UI source — open as my reference.
[ ] I can name the exact source files/paths/frames my visuals are derived from.
[ ] I am NOT deriving any visual from a marketing site or a blueprint.
[ ] If the app isn't Computer-Use-allowlisted, I asked for a screen recording instead of inventing.
[ ] If I can't reach any real reference, I STOP and ask — I do not invent.

Mode A — CU-demo (drive the real app)

Hard-won rules (each is a real correction from a prior demo run — see codex-019e6d0f BrainBar demo):

Computer Use is the driver, NOT bash screenshots. The polished cursor and real interaction only come from Computer Use. screencapture/bash screenshots look dead and are disqualified. ("bash screenshots aren't the same.")
Screen RECORDING, not a few screenshots. A demo is a continuous video. ("do a video or a screen recording and not just a few screenshots.")
Pre-flight a known launch state. Don't film a cold/ambiguous app. Use the app's reset/toggle hook (e.g. a /tmp/.<app>-toggle file, a fresh launch, a seeded fixture) so the walkthrough is reproducible.
Exercise features at decision points — open the thing, click the feature, show the result. Narrate what's happening and why it matters.
When you hit a bug, FIX it (or dispatch a subagent to), don't just film it. A demo that shows a known broken state is a failed demo. ("When you find a bug... why don't you fix this?")
Deliver where the stakeholder can watch. Save the final MP4 into iCloud (~/Library/Mobile Documents/com~apple~CloudDocs/...) or the synced Obsidian folder, then report the exact path. ("put it inside of iCloud or the Obsidian folders so they sync... and tell me where it is.")
Parallelize with a read-only monitor subagent if a PR/CI is in flight while you film — fork it to watch, don't context-switch. (read-only monitor pattern, codex-019e6d89.)

Pipeline:

pre-flight state  →  start screen recording  →  CU drives the app feature-by-feature
   →  narrate (live or scripted)  →  stop recording  →  trim/assemble
   →  fix any bug surfaced, re-take the affected segment  →  deliver to iCloud/Obsidian  →  report path

Mode B — mimic-demo (recreate UX from docs)

Pipeline:

PASS GATE 0 (reference the REAL shipping UI — screenshot of running app and/or real UI source; NOT marketing site, NOT blueprint)
  →  read how-it-works docs (README, feature pages) for FLOW/narrative only — never for visuals
  →  build a Remotion + @remotion/three composition recreating the REAL UX
  →  PIN all remotion package versions to one number (see scripts/check-remotion-versions.sh)
  →  render deterministic MP4
  →  [optional] AI video model (LTX local when RAM allows, else cloud) for B-roll / ambient ONLY
  →  [optional] voiceover (VoiceLayer TTS to dogfood, or silent + on-screen captions)
  →  deliver to iCloud/Obsidian  →  report path

Rules:

Pin Remotion versions. A version mismatch (observed in contentClaude's run: 4.0.422 core vs 4.0.421 @remotion/google-fonts/@remotion/paths) breaks React context, hooks, and renders. Pin every @remotion/* + remotion to the SAME exact version (drop the ^). Run scripts/check-remotion-versions.sh before rendering.
The AI video model is a B-roll stage, not the UI stage. Never ask LTX/cloud to render the actual product UI — it will hallucinate. Use it only for ambient/atmospheric shots that frame the deterministic UI render.
Voiceover is optional for round 1. Ship silent + captions to get a reviewable artifact fast; add VoiceLayer TTS narration in a later round (dogfooding VoiceLayer is on-brand when the demo is of VoiceLayer).

AI video stage (LTX) — RAM-gated, switchable

The video-gen stage MUST be cloud-or-local switchable. LTX-2.3 Q4 is ~19.4GB into unified memory.

Before any local LTX run: check free RAM (scripts/ram-gate.sh). If free+inactive is not comfortably above the model size while the agent fleet runs, do NOT run locally — route to cloud (Replicate/fal). Do not OOM the running ecosystem.
Download is disk-only and safe anytime. Running/loading is the gated step.
Image gen (Draw Things) is already local and light — no gating needed there.

Definition of "good" (what eval scores)

A demo is shippable when:

[ ] Gate 0 passed: every visual traces to the real running app or real shipping UI source — nothing derived from a marketing site or blueprint. (Hard fail if violated, no matter how polished.)
[ ] It is a video, watchable end-to-end, delivered to a synced location with the path reported.
[ ] The UI shown is real or pixel-accurate (no hallucinated UI, no known-broken states).
[ ] It narrates the value of each feature, not just the clicks.
[ ] It is reproducible (pre-flight state for CU-demo; pinned deps for mimic-demo).
[ ] Any bug surfaced during filming was fixed and re-taken, not shipped.

Few-shot loop (how this skill improves)

Related Skills

etanhey/phoenix-human-view

tools

VerifiedTrustedCommunity

The human-eval UX contract for Phoenix views: turn-by-turn scrollable replay (not a scorecard), hide-but-copyable IDs, collapsed thinking, identity chips, tool filters, tiny frozen starter datasets, mark-wrong-in-thread, mobile-first. Use when: building or reviewing ANY Phoenix/eval view, annotation UI, session replay, or human-grading surface. Triggers: phoenix view, eval UI, annotation view, session replay, human eval UX, grading interface. NOT for: Phoenix data pipelines/ingest (capture scripts have their own specs).

3SKILL.mdUpdated Jun 7, 2026

etanhey/phoenix-human-view

etanhey/mac-systems

tools

VerifiedTrustedCommunity

macOS systems specialist — AppKit NSPanel architecture, launchd services, socket activation, MCP bridge resilience, syspolicyd, and high-frequency SwiftUI dashboards. Use when building menu-bar apps, LaunchAgents, debugging syspolicyd/Gatekeeper/TCC, resilient UDS/MCP bridges, or SwiftUI dashboards at 10Hz+.

3SKILL.mdUpdated Jun 7, 2026

etanhey/judge-fleet

development

VerifiedTrustedCommunity

Bulk LLM-judging protocol for fleet-dispatched verdict runs (KG cluster, eval harness). Use when: dispatching or running judge workers (J1/J2/RT), planning bulk-apply from verdict JSONL, or triaging evidence_degraded outputs. Triggers: judge fleet, bulk judge, R3 verdicts, kg-judge, RT gate, evidence_degraded. NOT for: single-item code review, Phoenix view UX (use phoenix-human-view), or non-judge eval pipelines.

3SKILL.mdUpdated Jun 7, 2026

etanhey/fleet-wrap

development

VerifiedTrustedCommunity

Quiet-down protocol for sprint close: when the fleet wraps, delete ALL polling crons and monitors, send ONE final dashboard + ONE message, then go SILENT. Use when: fleet wraps, all workers done, overnight queue exhausted, sprint close, Etan asleep/away with nothing approved left. Triggers: fleet wrap, wrap the fleet, stand down, going quiet, sprint close. NOT for: mid-sprint monitoring (keep your loops), spawning a successor (use /session-handoff first).

3SKILL.mdUpdated Jun 7, 2026

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/etanhey/golems.git

# Copy into Claude Code skills folder (global)
cp -r golems/skills/golem-powers/content-demo-creation ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

etanhey/golems

3 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT