skills/structured-build/SKILL.md
--- name: structured-build description: Mandatory pipeline for new features and complex tasks. Enforces Research → Strategize → Plan → Execute → Self-Check → Verify → Deploy with gates between each phase. Prevents the #1 failure mode: jumping to code without understanding the problem. weight: light triggers: - new feature requests - complex multi-file tasks - tasks touching 3+ files or 2+ subsystems - "build me", "add feature", "implement", "new page", "new system" - routed via improve
npx skillsauth add nhouseholder/nicks-claude-code-superpowers skills/structured-buildInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Every new feature or complex task follows this pipeline. No skipping phases. Each phase has a gate that must pass before the next phase starts.
Does NOT fire for: Quick fixes, config changes, simple edits, single-file changes with clear instructions.
Classify the task. This determines which phases you enter and how deep research goes.
| Ask yourself | YES → Research | NO → Skip research |
|---|---|---|
| Am I touching code I haven't read this session? | Read it first | Already read it |
| Does this involve domain-specific rules (betting, finance, medicine, legal)? | Fire know-what-you-dont-know | Standard programming patterns |
| Has this area failed before? (anti-patterns.md) | Check what failed and why | No prior failures logged |
| Are the requirements ambiguous or multi-part? | Clarify before building | User gave exact instructions |
| Am I about to change something with downstream dependents I don't fully understand? | Map the dependencies | I know the call graph |
Any YES → enter Research phase. All NO → skip to Strategize or Plan.
| Research Depth | When | What to do |
|---|---|---|
| None | Clear instructions, familiar code, standard patterns, single-file change | Skip to Phase 3 (Plan) or Phase 4 (Execute) |
| Quick scan (30 sec) | Touching 2-3 files, known codebase, minor uncertainty | Read target files + quick anti-patterns check. That's it. |
| Targeted research (2-3 min) | New area of codebase, unfamiliar API, user preference unknown | Read files + check memory + check specs + map dependencies |
| Deep research (5+ min) | Domain logic, unfamiliar technology, system design, high-risk change | Full Phase 1 protocol + know-what-you-dont-know + external sources |
Match research depth to actual uncertainty. Over-researching a CSS tweak wastes tokens. Under-researching a payment integration causes production bugs. The skill is knowing which is which.
| Task Type | Phases | Why | |---|---|---| | Exact instructions, familiar code (e.g., "change button color to blue") | Execute → Self-Check | Nothing to research or plan — just do it and verify | | Clear feature, known codebase (e.g., "add a loading spinner to the dashboard") | Plan → Execute → Self-Check → Verify | Know what to build, just need to map the steps | | Ambiguous feature or multi-step (e.g., "add user notifications") | Strategize → Plan → Execute → Self-Check → Verify | Need to pick an approach before planning | | Unfamiliar code or area (e.g., "add feature to a repo you just opened") | Research → Plan → Execute → Self-Check → Verify | Need to understand what exists before deciding anything | | Domain-specific or high-risk (e.g., "new betting settlement logic") | Research → Strategize → Plan → Execute → Self-Check → Verify → Deploy | Full pipeline — can't afford to get this wrong | | New system from scratch | Research → Strategize → Plan → Execute → Self-Check → Verify → Deploy | Full pipeline | | Experiment / A-B test (e.g., "test if feature X improves results") | Research → Experiment Plan → Execute → Validate Results | Scientific method — NOT a build. See Experiment Protocol below. |
The default is NOT "full pipeline." The default is "whatever this task actually needs."
Purpose: Understand the problem space, existing code, constraints, and domain rules BEFORE forming an opinion on how to solve it.
know-what-you-dont-know checklist. If domain-specific logic is involved, research from authoritative sources BEFORE proceeding.If any checkbox fails → stay in Phase 1.
Purpose: Consider multiple approaches and pick the best one BEFORE committing to a plan. This is where you avoid the "first idea = only idea" trap.
Brief statement to user:
"I'll approach this by [approach]. Rationale: [why this over alternatives]."
For complex tasks, present options and let the user choose.
If any checkbox fails → stay in Phase 2.
Purpose: Break the work into ordered steps so execution is mechanical, not creative.
| Task Type | Plan Depth |
|-----------|-----------|
| Site update | Mental plan, no doc needed |
| New feature | Brief written plan (bullet list with file paths) |
| New system/build | Full plan doc via writing-plans skill |
| Algorithm/model | Full plan + hypothesis + validation criteria |
If any checkbox fails → stay in Phase 3.
Purpose: Write the code. Follow the plan. Don't improvise.
If any checkbox fails → finish execution before proceeding.
Purpose: Before asking anyone else to verify, verify yourself. Catch your own mistakes.
If any checkbox fails → fix it, then re-check.
Purpose: External verification — tests, browser, build, whatever proves correctness beyond your own review.
If any checkbox fails → fix and re-verify.
Purpose: Get the verified work live and confirm it's working in production.
When this fires: Any task where you're testing whether a change improves measurable results. This is NOT a build — it's a scientific experiment. Different mental model, different gates.
State these 4 things in plain text to the user before touching anything:
EXPERIMENT: [what you're testing]
HYPOTHESIS: [why you expect it to work, in domain terms]
BASELINE: [exact command + expected output to establish the control]
VARIANT: [exact command — must differ from baseline by ONE variable only]
Gate: If you can't fill all 4 fields, you don't understand the experiment yet. Research more.
Run the baseline command. Record:
Gate: If event count is suspiciously low (<50 events for UFC), STOP. Check for FAST_MODE, limited date range, or missing data.
The variant must use:
Gate: Before comparing results, verify event counts match. If they don't, the comparison is invalid — find the pipeline difference first.
Before claiming any improvement:
Gate: Only claim an improvement if you can point to specific picks that flipped AND explain WHY the feature caused those flips.
tools
Unified context management and session continuity skill. Combines total-recall, strategic-compact, /ledger, and session continuity. Runs in background to preserve critical context across compaction and sessions.
tools
Toolkit for interacting with and testing local web applications using Playwright. Supports verifying frontend functionality, debugging UI behavior, capturing browser screenshots, and viewing browser logs.
tools
Suggest /ultraplan for complex planning tasks on Claude Code CLI (2.1.91+ only). Research preview.
tools
UI/UX design intelligence. 50 styles, 21 palettes, 50 font pairings, 20 charts, 9 stacks (React, Next.js, Vue, Svelte, SwiftUI, React Native, Flutter, Tailwind, shadcn/ui). Actions: plan, build, create, design, implement, review, fix, improve, optimize, enhance, refactor, check UI/UX code. Projects: website, landing page, dashboard, admin panel, e-commerce, SaaS, portfolio, blog, mobile app, .html, .tsx, .vue, .svelte. Elements: button, modal, navbar, sidebar, card, table, form, chart. Styles: glassmorphism, claymorphism, minimalism, brutalism, neumorphism, bento grid, dark mode, responsive, skeuomorphism, flat design. Topics: color palette, accessibility, animation, layout, typography, font pairing, spacing, hover, shadow, gradient. Integrations: shadcn/ui MCP for component search and examples.