templates/.claude/skills/omcustom-auto-improve/SKILL.md
Apply verified improvement suggestions from eval-core analysis to omcustom configuration
npx skillsauth add baekenough/oh-my-customcode omcustom:auto-improveInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Reads improvement suggestions from eval-core analysis, lets the user select which to apply, applies changes in an isolated worktree with sauron verification, and creates a PR for review.
/omcustom:auto-improve # Interactive selection from pending suggestions
/omcustom:improve-report first if empty)proposed statusbun run packages/eval-core/src/cli/index.ts analyze --format json --save via BashDisplay numbered list:
[Auto-Improve] Available suggestions:
1. [HIGH] agent:lang-golang-expert — Escalate model sonnet→opus (3 failures in 5 uses)
2. [MED] routing:dev-lead-routing — Add Flutter keyword mapping (2 routing misses)
3. [LOW] skill:systematic-debugging — Add timeout guard (1 timeout in 10 uses)
Select items: [1,2,3] / "all" / "cancel"
Self-reference filter: Exclude items where targetName matches:
omcustom-auto-improve, auto-improvepipeline-guards, evaluator-optimizerFor each selected item:
proposed → approved[Approved] {N} items selected for applicationEnterWorktree tool with name auto-improve-{YYYYMMDD}Map each approved item to the appropriate subagent by targetType:
| targetType | Agent | Action | |------------|-------|--------| | agent | mgr-creator | Modify agent frontmatter/body | | skill | Matching domain expert | Revise skill SKILL.md | | routing | general-purpose | Update routing patterns | | model-escalation | general-purpose | Update model field in agent frontmatter |
Spawn agents in parallel (max 4 per R009). Each agent receives:
fix → re-apply with sauron feedback (max 2 cycles)reject → transition all to rejected, ExitWorktree(remove)manual → keep worktree for user inspectionchore(auto-improve): apply {N} improvement suggestionsapplied with appliedAt timestamp and PR URLExitWorktree(action: "keep") — keep branch for PR| Guard | Implementation |
|-------|---------------|
| Self-reference prevention | Blocklist filter in Step 2 |
| User approval gate | Step 2 interactive selection |
| Worktree isolation | Step 4 EnterWorktree |
| Sauron verification | Step 6 mandatory pass |
| PR-based merge | Step 7 — no direct push to develop |
| Max items per run | 20 default, 50 hard cap |
| Max fix cycles | 2 retries before rejection |
| Rollback | git revert via mgr-gitnerd post-merge |
| Scenario | Action | |----------|--------| | No suggestions available | Display message, exit | | User cancels selection | Exit, no state changes | | Sauron verification fails 2x | Reject all, cleanup worktree | | Agent application error | Mark individual item as rejected, continue others | | EnterWorktree fails | Report error, exit |
[Auto-Improve] Starting improvement workflow
├── Suggestions: {N} available ({high}H/{medium}M/{low}L confidence)
├── Self-reference filtered: {count} items excluded
└── Select items to apply: [1,2,3] or "all" or "cancel"
[Auto-Improve] Applying {N} improvements in worktree
├── Worktree: auto-improve-{date}
├── Agents: {count} parallel
└── Pipeline guards: max 20 items, 2 retry cycles
[Auto-Improve] Verification
├── Sauron: {PASS|FAIL}
├── PR: #{number} created
└── Status: {N} items → applied
development
Generate and maintain a persistent codebase wiki — LLM-built interlinked markdown knowledge base (Karpathy LLM Wiki pattern)
development
Use the project wiki as RAG knowledge source — search wiki pages to answer codebase questions before exploring raw files
tools
Analyze task trajectories to propose reusable SKILL.md candidates from successful patterns
data-ai
hada.io RSS feed monitoring for AI agent/harness articles with automated /scout analysis