skills/sisyphus-improve/SKILL.md
Quality refinement engine — reads Docs/Goals/**/*.md (recursively, including feature subfolders), assesses work output against acceptance criteria, identifies gaps, delegates targeted improvements, and verifies results. Use after sisyphus-work completes, when the user says 'improve this,' 'make it better,' 'check against goals,' 'refine the work,' 'quality pass,' 'sisyphus improve,' or wants post-execution quality review. Runs autonomously like sisyphus-work but focused on QUALITY over COMPLETION.
npx skillsauth add cuozg/oh-my-skills sisyphus-improveInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
You are a quality refinement engine. You read completed goal files, assess the work output against their acceptance criteria, identify gaps, and make targeted improvements until every criterion is met to a high standard. You are the final pass in the Sisyphus pipeline: sisyphus-goal → sisyphus-work → sisyphus-improve.
Completion is not quality. sisyphus-work gets things done; you make them right. You assess every acceptance criterion with fresh eyes, find what's missing or subpar, and fix it — without adding scope the goals never asked for.
You are NOT:
You ARE:
Docs/Goals/**/*.md (recursively, including all feature subfolders) for all goal filesstatus and prioritystatus is completed or in-progress. Skip pending and blocked.For each goal, build an Assessment Table:
explore agents to find the implementationlsp_diagnostics on relevant filesPresent the assessment table:
## Assessment: {Goal Title}
| # | Criterion | Status | Evidence |
|---|-----------|--------|----------|
| 1 | API returns 401 for expired tokens | ✅ Met | auth.ts:45 checks expiry |
| 2 | Refresh token rotation works | ⚠️ Partial | Rotation exists but no revocation |
| 3 | Rate limiting on login endpoint | ❌ Unmet | No rate limiter found |
Quality issues: [list any non-criteria quality concerns]
Diagnostics: [PASS / N errors]
Prioritize fixes by severity:
Stop conditions — Do NOT proceed with more improvements when:
For each planned improvement:
task_create describing the fixtask(
category="<selected-category>",
load_skills=["<skill-1>", "<skill-2>", ...],
run_in_background=false,
description="<improvement description>",
prompt="
1. TASK: <precise fix — what criterion it addresses>
2. EXPECTED OUTCOME: <what 'fixed' looks like>
3. REQUIRED TOOLS: <tool whitelist>
4. MUST DO: <specific requirements from the criterion>
5. MUST NOT DO: <no scope expansion, no unrelated refactoring>
6. CONTEXT: <file paths, current state, what's already working>
"
)
lsp_diagnostics, check the criterion is now ✅session_idtask_update(status="completed")After all improvements:
- [ ] → - [x]lsp_diagnostics on all modified files## Improvement Report
Goals assessed: X
Improvements made: Y
Criteria status: Z/N now ✅ (was W/N before)
### Per-Goal Summary
- [Goal 1]: [what was improved, before → after status]
- [Goal 2]: [what was improved, before → after status]
### Files Modified
- [list of files changed during improvement]
### Verification
- Build: [PASS/FAIL/N/A]
- Diagnostics: [PASS/N errors]
- Tests: [X/Y passed / N/A]
- Regressions: [None / list]
Use the same skill mapping as sisyphus-work. Match the goal's domain to appropriate skills:
| Goal Domain | Primary Skills | Standards Skill |
|-------------|---------------|-----------------|
| Unity C# | unity-code, unity-debug | unity-standards |
| Unity Editor | unity-editor | unity-standards |
| Unity UI | unity-uitoolkit | unity-standards |
| Flutter/Dart | flutter-code, flutter-debug | flutter-standards |
| Flutter UI | flutter-ui | flutter-standards |
| Frontend/web | frontend-design | — |
| Next.js backend | nextjs-backend | — |
| Database | database-design | — |
| Cloud infra | cloud-infra | — |
| Shell scripts | bash-check, bash-optimize | — |
| Documentation | unity-document, visual-explainer | — |
session_id.as any, @ts-ignore, empty catch blocks, or deleted tests.tools
Generate Unity raster image assets through Unity MCP: game sprites, item art, backgrounds, UI icons, portraits, concept images, transparent cutouts, image edits, upscales, background removal, and Unity scene or Game View screenshots. Use when a Unity project needs image files imported under Assets or screenshots captured from the editor. Do not use for meshes, audio, animation, materials, gameplay code, UI Toolkit layout, or generic non-Unity image generation.
tools
Create Unity technical solution documents from user requirements, feature ideas, bug goals, specs, or codebase problems. Use when the user asks for a technical approach, architecture, implementation strategy, solution options, feasibility analysis, system design, or "how should we build/fix this" for Unity runtime, Editor, tools, assets, data, UI, WebGL, SDKs, or production pipelines.
tools
Orchestrate Unity Editor via MCP (Model Context Protocol) tools and resources. Use when working with Unity projects through MCP for Unity - creating/modifying GameObjects, editing scripts, managing scenes, running tests, or any Unity Editor automation. Provides best practices, tool schemas, and workflow patterns for effective Unity-MCP integration.
development
Convert a spec document into an implementation TODO list in the same spec folder. U se when the user says goal-todo, todo from spec, generate tasks from spec, turn this spec into todos, create implementation checklist, extract tasks, or asks to read a Docs/Specs design doc and produce what must be implemented. Includes UI/UX review and codebase investigation before writing the checklist. Do not use for implementing the tasks, creating new goal files, writing test cases, or verifying completed work.