plugins/plugin-eval/skills/plugin-eval/SKILL.md
Help engineers evaluate a local skill or plugin, explain why it scored that way, show what to fix first, measure real token usage, benchmark starter scenarios, or decide what to run next. Use when the user says things like "evaluate this skill", "give me an analysis of the game dev skill", "why did this score that way", "what should I fix first", "measure the real token usage of this skill", or "what should I run next?".
npx skillsauth add openai/plugins plugin-evalInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Use this as the beginner-friendly umbrella entrypoint for local Codex skill and plugin evaluation.
plugin-eval start <path> --request "<user request>" --format markdown
plugin-eval analyze <path> --format markdown, then initialize a benchmark and show the setup questions needed to tailor benchmark.jsonplugin-eval analyze <path> --format markdownplugin-eval analyze <path> --format markdownplugin-eval analyze <path> --format markdownplugin-eval explain-budget <path> --format markdownplugin-eval measurement-planplugin-eval start <path> --request "What should I run next?" --format markdown../improve-skill/SKILL.md.../metric-pack-designer/SKILL.md.~/.codex/skills/<skill-name> firstskills/<skill-name> directory.plugin-eval/benchmark.jsonGive me an analysis of the game dev skill.Evaluate this skill.Evaluate this plugin.Why did this score that way?What should I fix first?Explain the token budget for this skill.Measure the real token usage of this skill.Help me benchmark this plugin.What should I run next?plugin-eval start <path> --request "Evaluate this skill." --format markdown
plugin-eval start <path> --request "Give me a full analysis of this skill, including benchmark setup." --format markdown
plugin-eval analyze <path> --format markdown
plugin-eval explain-budget <path> --format markdown
plugin-eval measurement-plan <path> --format markdown
plugin-eval init-benchmark <path>
plugin-eval benchmark <path> --dry-run
plugin-eval benchmark <path>
At a Glance, Why It Matters, Fix First, and Recommended Next Step.why content terse and easy to skim.plugin-eval start command that routes it, and the first local workflow command behind it.../evaluate-skill/SKILL.md.../evaluate-plugin/SKILL.md.../../references/chat-first-workflows.md../../references/technical-design.md../../references/evaluation-result-schema.mdtools
Top-level workflow skill for USD performance diagnosis and optimization. Use for slow loading, high memory, low FPS, or 'optimize my scene' requests; delegates auth/runtime setup to Phase 0 owners.
data-ai
Use when the user mentions MagicPath, designs, UI components, themes, canvas selections, or repo-to-canvas UI work; run magicpath-ai to search, inspect, install, or author components.
documentation
Use as the top-level router for Omniverse Realtime Viewer USD app requests and focused viewer reference documents.
tools
Turn Notion specs into implementation plans, tasks, and progress tracking; use when implementing PRDs/feature specs and creating Notion plans + tasks from them.