skills/skill-creator/SKILL.md
Create new skills, modify and improve existing skills, and measure skill performance. Use when users want to create a skill from scratch, edit, or optimize an existing skill, run evals to test a skill, benchmark skill performance with variance analysis, or optimize a skill's description for better triggering accuracy.
npx skillsauth add cuozg/oh-my-skills skill-creatorInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
4 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Create and iteratively improve skills. Core loop: Draft → Test → Evaluate → Improve → Repeat.
skill-name/
├── SKILL.md (required: YAML frontmatter + markdown instructions)
└── Bundled Resources (optional)
├── scripts/ Executable code for deterministic/repetitive tasks
├── references/ Docs loaded into context as needed
└── assets/ Templates, icons, fonts
SKILL.md targets: Under 500 lines (ideal). Put detail in references/. YAML requires name + description.
Description field: Primary trigger mechanism. Make it "pushy" — list specific contexts and phrases that should trigger the skill. Agents tend to under-trigger; compensate with explicit use-case coverage.
Extract from conversation: tools used, steps, corrections, input/output formats. Ask only what's missing:
Ask about edge cases, input/output formats, success criteria, dependencies. Check available MCPs. Research in parallel via subagents when useful.
Write draft, then review with fresh eyes. Prefer imperative form. Explain the why — smart agents follow reasoning over mandates. Avoid ALL-CAPS MUST whenever possible. Keep lean.
2–3 realistic prompts a real user would type. Save to evals/evals.json:
{"skill_name": "example", "evals": [{"id": 1, "prompt": "...", "expected_output": "..."}]}
See agents/ for spawning test runs, grading, benchmarking, and launching the viewer:
agents/grader.md; aggregate via scripts/aggregate_benchmarknohup python eval-viewer/generate_review.py <workspace>/iteration-N --skill-name "name" --benchmark <workspace>/iteration-N/benchmark.json > /dev/null 2>&1 &feedback.json after user reviews. Kill server after: kill $VIEWER_PIDGeneralize from feedback → lean prompt → explain why → bundle repeated work (if all 3 test cases wrote create_chart.py, put it in scripts/). Iterate: improve → rerun in iteration-N+1/ → viewer with --previous-workspace → repeat.
After skill is done:
python -m scripts.run_loop --eval-set <path> --skill-path <path> --model <model-id> --max-iterations 5 --verbose
Generates 20 eval queries (10 should-trigger, 10 should-not-trigger). Review via assets/eval_review.html. Apply best_description to SKILL.md frontmatter.
"Read X before Y — prevents overwriting state" beats "ALWAYS read X"--static <output_path> for viewer (no display), feedback via downloaded feedback.jsonname frontmatter. Copy to writable location before editing if installed path is read-only.agents/grader.md — grading assertions against outputsagents/comparator.md — blind A/B comparisonagents/analyzer.md — analyzing why one version won + benchmark patternsreferences/schemas.md — JSON schemas for evals.json, grading.json, benchmark.jsontools
Generate Unity raster image assets through Unity MCP: game sprites, item art, backgrounds, UI icons, portraits, concept images, transparent cutouts, image edits, upscales, background removal, and Unity scene or Game View screenshots. Use when a Unity project needs image files imported under Assets or screenshots captured from the editor. Do not use for meshes, audio, animation, materials, gameplay code, UI Toolkit layout, or generic non-Unity image generation.
tools
Create Unity technical solution documents from user requirements, feature ideas, bug goals, specs, or codebase problems. Use when the user asks for a technical approach, architecture, implementation strategy, solution options, feasibility analysis, system design, or "how should we build/fix this" for Unity runtime, Editor, tools, assets, data, UI, WebGL, SDKs, or production pipelines.
tools
Orchestrate Unity Editor via MCP (Model Context Protocol) tools and resources. Use when working with Unity projects through MCP for Unity - creating/modifying GameObjects, editing scripts, managing scenes, running tests, or any Unity Editor automation. Provides best practices, tool schemas, and workflow patterns for effective Unity-MCP integration.
development
Convert a spec document into an implementation TODO list in the same spec folder. U se when the user says goal-todo, todo from spec, generate tasks from spec, turn this spec into todos, create implementation checklist, extract tasks, or asks to read a Docs/Specs design doc and produce what must be implemented. Includes UI/UX review and codebase investigation before writing the checklist. Do not use for implementing the tasks, creating new goal files, writing test cases, or verifying completed work.