Adoption

Agent Skills are supported by leading AI development tools.

VS Code, Gemini CLI, GitHub, Goose, Amp, Cursor, Claude Code, Letta, OpenCode, Claude, OpenAI Codex, Factory

cuozg/skill-creator

Name: skill-creator
Author: cuozg

skills/skill-creator/SKILL.md

npx skillsauth add cuozg/oh-my-skills skill-creator

Skill Creator

Create and iteratively improve skills. Core loop: Draft → Test → Evaluate → Improve → Repeat.

Skill Structure

skill-name/
├── SKILL.md          (required: YAML frontmatter + markdown instructions)
└── Bundled Resources (optional)
    ├── scripts/      Executable code for deterministic/repetitive tasks
    ├── references/   Docs loaded into context as needed
    └── assets/       Templates, icons, fonts

SKILL.md targets: ideally under 500 lines. Put detail in references/. The YAML frontmatter requires name and description.
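The targets above can be checked mechanically. The following is a minimal sketch, not part of the skill itself: it assumes the frontmatter is delimited by `---` lines and holds flat, single-line `key: value` pairs. The names `parse_frontmatter` and `check_skill` are hypothetical.

```python
from pathlib import Path

MAX_LINES = 500  # "under 500 lines" target from the text above
REQUIRED_KEYS = ("name", "description")


def parse_frontmatter(text: str) -> dict[str, str]:
    """Parse flat `key: value` pairs between the leading '---' lines.

    Assumption: single-line values only; real YAML needs a YAML parser.
    """
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}
    fields: dict[str, str] = {}
    for line in lines[1:]:
        if line.strip() == "---":
            return fields
        key, sep, value = line.partition(":")
        if sep and key.strip():
            fields[key.strip()] = value.strip()
    return {}  # closing '---' never found


def check_skill(skill_dir: Path) -> list[str]:
    """Return a list of problems; an empty list means the checks passed."""
    skill_md = skill_dir / "SKILL.md"
    if not skill_md.is_file():
        return ["SKILL.md is missing"]
    text = skill_md.read_text(encoding="utf-8")
    problems = []
    fields = parse_frontmatter(text)
    for key in REQUIRED_KEYS:
        if not fields.get(key):
            problems.append(f"frontmatter is missing '{key}'")
    line_count = len(text.splitlines())
    if line_count >= MAX_LINES:
        problems.append(f"SKILL.md has {line_count} lines; move detail to references/")
    return problems
```

Calling `check_skill(Path("skill-name"))` on a skill folder returns an empty list when SKILL.md exists, carries both required keys, and stays under the line target.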

Description field: Primary trigger mechanism. Make it "pushy" — list specific contexts and phrases that should trigger the skill. Agents tend to under-trigger; compensate with explicit use-case coverage.

Workflow

1. Capture Intent

Extract from conversation: tools used, steps, corrections, input/output formats. Ask only what's missing:

What should this skill enable?
When should it trigger? (phrases, contexts)
What's the expected output format?
Are test cases needed? (yes for verifiable outputs; often no for subjective tasks)

2. Interview & Research

Ask about edge cases, input/output formats, success criteria, dependencies. Check available MCPs. Research in parallel via subagents when useful.

3. Write SKILL.md

Write draft, then review with fresh eyes. Prefer imperative form. Explain the why — smart agents follow reasoning over mandates. Avoid ALL-CAPS MUST whenever possible. Keep lean.

4. Create Test Cases

2–3 realistic prompts a real user would type. Save to evals/evals.json:

{"skill_name": "example", "evals": [{"id": 1, "prompt": "...", "expected_output": "..."}]}
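The file above can also be generated from a short list of cases. This is a sketch under assumptions: the field names come from the example line, while the helper names and the choice to place evals/ inside the skill folder are hypothetical. references/schemas.md remains the authority on the schema.

```python
import json
from pathlib import Path


def build_evals(skill_name: str, cases: list[tuple[str, str]]) -> dict:
    """Build the evals.json payload from (prompt, expected_output) pairs."""
    return {
        "skill_name": skill_name,
        "evals": [
            {"id": i, "prompt": prompt, "expected_output": expected}
            for i, (prompt, expected) in enumerate(cases, start=1)
        ],
    }


def save_evals(skill_dir: Path, payload: dict) -> Path:
    """Write the payload to <skill_dir>/evals/evals.json and return the path.

    Assumption: evals/ lives inside the skill folder.
    """
    path = skill_dir / "evals" / "evals.json"
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(payload, indent=2) + "\n", encoding="utf-8")
    return path
```

Ids are assigned in order starting at 1, so reordering the case list changes them. Keep the list stable between iterations if results are compared by id.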

5. Run & Evaluate

See agents/ for spawning test runs, grading, benchmarking, and launching the viewer:

Spawn runs — one with-skill + one baseline per test case, all in same turn (with-skill AND baseline, never sequential)
Draft assertions while runs are in progress
Grade via agents/grader.md; aggregate via scripts/aggregate_benchmark
Launch viewer and record its PID: nohup python eval-viewer/generate_review.py <workspace>/iteration-N --skill-name "name" --benchmark <workspace>/iteration-N/benchmark.json > /dev/null 2>&1 & VIEWER_PID=$!
Read feedback from feedback.json after the user reviews. Kill the server afterwards: kill $VIEWER_PID
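The "same turn, never sequential" rule for runs can be illustrated with a small dispatcher. This is a sketch only: the variant labels, `plan_runs`, `dispatch`, and the `run_one` callback are hypothetical stand-ins for however the agent actually spawns subagents, and threads are used here purely to show the batching idea.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable

VARIANTS = ("with_skill", "baseline")  # labels are assumptions for this sketch


def plan_runs(evals: list[dict]) -> list[tuple[int, str]]:
    """Pair every eval id with both variants so neither runs alone."""
    return [(case["id"], variant) for case in evals for variant in VARIANTS]


def dispatch(
    evals: list[dict], run_one: Callable[[dict, str], str]
) -> dict[tuple[int, str], str]:
    """Submit all runs in one batch and collect results keyed by (id, variant)."""
    by_id = {case["id"]: case for case in evals}
    plan = plan_runs(evals)
    with ThreadPoolExecutor(max_workers=max(1, len(plan))) as pool:
        # Every run is submitted before any result is awaited.
        futures = {key: pool.submit(run_one, by_id[key[0]], key[1]) for key in plan}
        return {key: future.result() for key, future in futures.items()}
```

Submitting the whole plan before waiting keeps the with-skill and baseline runs under the same conditions, which is the point of launching them together.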

6. Improve

Generalize from feedback → lean prompt → explain why → bundle repeated work (if all 3 test cases wrote create_chart.py, put it in scripts/). Iterate: improve → rerun in iteration-N+1/ → viewer with --previous-workspace → repeat.
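The "bundle repeated work" check can be approximated by comparing the files each test run produced. The sketch below assumes the file names written by each run have already been collected into a mapping. The function name and the `min_share` threshold are hypothetical.

```python
from collections import Counter


def scripts_to_bundle(
    files_by_run: dict[str, list[str]], min_share: float = 1.0
) -> list[str]:
    """Return file names written by at least `min_share` of the runs."""
    if not files_by_run:
        return []
    # Count each name once per run, even if a run wrote it several times.
    counts = Counter(name for files in files_by_run.values() for name in set(files))
    needed = min_share * len(files_by_run)
    return sorted(name for name, count in counts.items() if count >= needed)


runs = {
    "eval-1": ["create_chart.py", "notes.txt"],
    "eval-2": ["create_chart.py"],
    "eval-3": ["create_chart.py", "tmp.py"],
}
print(scripts_to_bundle(runs))  # ['create_chart.py']
```

With the default threshold, only files that every run wrote independently are reported, matching the create_chart.py case in the text.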

7. Optimize Description (Optional)

After skill is done:

python -m scripts.run_loop --eval-set <path> --skill-path <path> --model <model-id> --max-iterations 5 --verbose

Generates 20 eval queries (10 should-trigger, 10 should-not-trigger). Review via assets/eval_review.html. Apply best_description to SKILL.md frontmatter.
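To make the should-trigger / should-not-trigger split concrete, here is a sketch of how triggering results could be scored. It is not how scripts.run_loop works internally. The `query` and `should_trigger` field names and the `trigger_report` function are assumptions for illustration.

```python
def trigger_report(queries: list[dict], triggered: dict[str, bool]) -> dict[str, float]:
    """Score observed triggering against the should / should-not labels."""
    hits = missed = false_alarms = correct_skips = 0
    for item in queries:
        fired = triggered[item["query"]]
        if item["should_trigger"]:
            hits += int(fired)
            missed += int(not fired)
        else:
            false_alarms += int(fired)
            correct_skips += int(not fired)
    total = hits + missed + false_alarms + correct_skips
    return {
        "accuracy": (hits + correct_skips) / total if total else 0.0,
        "missed_triggers": missed,  # under-triggering, the common failure mode
        "false_triggers": false_alarms,
    }
```

Reporting missed and false triggers separately shows whether a description needs to be pushier or narrower, which a single accuracy number hides.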

Writing Principles

Generalize — rules preventing a class of problem, not just this example
Keep lean — remove instructions not pulling their weight
Explain why — "Read X before Y — prevents overwriting state" beats "ALWAYS read X"
Bundle repeated work — if multiple test runs independently wrote the same script, bundle it

Environment Notes

Claude Code: Full workflow — subagents, viewer server, description optimization
Claude.ai: Run tests inline, present results in conversation, skip benchmarks and description optimization
Cowork: Use --static <output_path> for viewer (no display), feedback via downloaded feedback.json
Updating existing skill: Preserve original directory name + name frontmatter. Copy to writable location before editing if installed path is read-only.
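The read-only case in the last note can be handled with a small helper. This is a sketch under assumptions: `copy_skill` and `editable_path` are hypothetical names, and the workspace is any writable folder chosen by the caller.

```python
import os
import shutil
import stat
from pathlib import Path


def copy_skill(installed: Path, workspace: Path) -> Path:
    """Copy the skill into `workspace`, keeping the original directory name."""
    target = workspace / installed.name
    shutil.copytree(installed, target)  # raises FileExistsError if target exists
    # copytree preserves permission bits, so restore owner write access.
    for path in [target, *target.rglob("*")]:
        path.chmod(path.stat().st_mode | stat.S_IWUSR)
    return target


def editable_path(installed: Path, workspace: Path) -> Path:
    """Return the installed path if writable, otherwise a writable copy."""
    if os.access(installed, os.W_OK):
        return installed
    return copy_skill(installed, workspace)
```

Because the copy keeps `installed.name`, the directory name stays the same. The name field in the frontmatter still has to be left unchanged by hand when editing.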

Reference Files

agents/grader.md — grading assertions against outputs
agents/comparator.md — blind A/B comparison
agents/analyzer.md — analyzing why one version won + benchmark patterns
references/schemas.md — JSON schemas for evals.json, grading.json, benchmark.json

Create new skills, modify and improve existing skills, and measure skill performance. Use when users want to create a skill from scratch, edit, or optimize an existing skill, run evals to test a skill, benchmark skill performance with variance analysis, or optimize a skill's description for better triggering accuracy.

2 stars

testing

Updated May 13, 2026

npx skillsauth add cuozg/oh-my-skills skill-creator

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

4 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Trivy: Clean (95%). Container and dependency vulnerability scanner.
Semgrep: Clean (95%). Static code analysis for vulnerabilities.
mcp-scan (Snyk): Clean (95%). Model Context Protocol security validation.
VirusTotal: Clean (95%). Multi-engine malware detection.
Snyk (dep): Skipped (50%). Open source security scanning.
Socket.dev: Skipped (50%). Supply chain security analysis.
CrowdStrike: Skipped (50%). Advanced threat intelligence.
OSV-Scanner: Skipped (50%). Open Source Vulnerability database check.
OWASP Dep-Check: Skipped (50%).

Last scanned: May 13, 2026, 2:47 AM · 179.5s · 15 files scanned

SKILL.md

name: skill-creator
description: Create new skills, modify and improve existing skills, and measure skill performance. Use when users want to create a skill from scratch, edit, or optimize an existing skill, run evals to test a skill, benchmark skill performance with variance analysis, or optimize a skill's description for better triggering accuracy.

Related Skills

cuozg/unity-image-gen

tools

Verified, Trusted, Community

Generate Unity raster image assets through Unity MCP: game sprites, item art, backgrounds, UI icons, portraits, concept images, transparent cutouts, image edits, upscales, background removal, and Unity scene or Game View screenshots. Use when a Unity project needs image files imported under Assets or screenshots captured from the editor. Do not use for meshes, audio, animation, materials, gameplay code, UI Toolkit layout, or generic non-Unity image generation.

4 · SKILL.md · Updated May 29, 2026

cuozg/unity-technical

tools

Verified, Trusted, Community

Create Unity technical solution documents from user requirements, feature ideas, bug goals, specs, or codebase problems. Use when the user asks for a technical approach, architecture, implementation strategy, solution options, feasibility analysis, system design, or "how should we build/fix this" for Unity runtime, Editor, tools, assets, data, UI, WebGL, SDKs, or production pipelines.

4 · SKILL.md · Updated May 26, 2026

cuozg/unity-mcp-orchestrator

tools

Verified, Trusted, Community

Orchestrate Unity Editor via MCP (Model Context Protocol) tools and resources. Use when working with Unity projects through MCP for Unity: creating/modifying GameObjects, editing scripts, managing scenes, running tests, or any Unity Editor automation. Provides best practices, tool schemas, and workflow patterns for effective Unity-MCP integration.

4 · SKILL.md · Updated May 21, 2026

cuozg/goal-todo

development

Verified, Trusted, Community

Convert a spec document into an implementation TODO list in the same spec folder. Use when the user says goal-todo, todo from spec, generate tasks from spec, turn this spec into todos, create implementation checklist, extract tasks, or asks to read a Docs/Specs design doc and produce what must be implemented. Includes UI/UX review and codebase investigation before writing the checklist. Do not use for implementing the tasks, creating new goal files, writing test cases, or verifying completed work.

4 · SKILL.md · Updated May 21, 2026

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Install manually

# Clone the repo
git clone https://github.com/cuozg/oh-my-skills.git

# Copy into Claude Code skills folder (global)
cp -r oh-my-skills/skills/skill-creator ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

cuozg/oh-my-skills

2 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT