codex-skills/productionos-auto-optimize/SKILL.md
Self-improving agent optimization — generates challenger variants of any agent/command, benchmarks against baseline, promotes winners, logs learnings to instincts. Inspired by Karpathy's autoresearch pattern.
npx skillsauth add ShaheerKhawaja/ProductionOS productionos-auto-optimizeInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Use this alias when you want the same workflow through a top-level Codex-safe name without the productionos: namespace.
This is the Codex-native workflow wrapper for .claude/commands/auto-optimize.md.
Use it when the user wants this exact ProductionOS workflow, not just the umbrella productionos router.
target — Agent or command to optimize (e.g., 'code-reviewer', 'security-hardener', '/production-upgrade') Required.challengers — Number of challenger variants to generate (default: 3) Default: 3 Optional.benchmark — Benchmark to evaluate against: 'self-eval' (default) | 'test-suite' | 'llm-judge' | path to custom benchmark Default: self-eval Optional.hypothesis — Specific hypothesis to test (e.g., 'add chain-of-thought to security-hardener'). If omitted, auto-generates hypotheses. Optional.max_cost — Maximum cost in USD for the optimization run (default: 5) Default: 5 Optional.mode — Optimization mode: prompt (modify agent instructions) | model (test different models) | layers (test prompt composition layers) | params (test convergence parameters) Default: prompt Optional.metaclaw-learner, prompt-optimizer, rubric-evolverPREAMBLE.md, PROMPT-COMPOSITION.md.productionos/AUTO-OPTIMIZE-BASELINE.md, .productionos/AUTO-OPTIMIZE-HARVEST.md, .productionos/AUTO-OPTIMIZE-HYPOTHESES.md, .productionos/AUTO-OPTIMIZE-REPORT.md, .productionos/AUTO-OPTIMIZE-RESULTS.md, .productionos/analytics/skill-usage.jsonl, .productionos/calibration/, .productionos/challengers/challenger-{N}.md, .productionos/instincts/, .productionos/instincts/project/tools
Implementation planning workflow that turns approved ideas into dependency-aware execution plans.
development
Local RAG and Graph RAG over the SecondBrain wiki vault. Progressive context loading (hot cache -> index -> domain -> entity). Graph traversal via wikilink resolution. Use when agents need cross-project context, when answering questions that span multiple domains, or when building context for planning tasks. Triggers on: "wiki context", "cross-project context", "what do we know about", "check the wiki", "graph context", "/wiki-rag".
devops
UX improvement pipeline — creates user stories from UI guidelines, maps user journeys, identifies friction, dispatches fix agents. The user-experience equivalent of /production-upgrade.
development
Test-driven development workflow that writes failing tests first, implements minimally, and refactors safely.