skills/qe/qe-strategy/SKILL.md
Shift-left QE strategy for test planning and quality analysis. This skill should be used when the user needs test scenarios, risk assessment, test plans, or coverage analysis outside of a crew workflow context. Use when: "test strategy", "what should I test", "test scenarios", "shift-left testing", "generate test plan", "test coverage", "risk assessment", "how do I test this"
npx skillsauth add mikeparcewski/wicked-garden qe-strategyInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Quality Engineering enables faster delivery by catching issues early when they're cheap to fix.
Test everything. Test it directly. Test both sides.
QE is aggressive by design. Every feature gets tested. Every test has a positive and negative scenario. UI tests check for JS errors. API tests hit real endpoints. Effort scales dynamically to match the actual scope of changes — not a fixed checklist.
The test strategist operates at two points in the workflow:
Input: Engineer's predicted change manifest OR requirements/acceptance criteria. Output: Initial test strategy scoped to predicted changes.
During design, an engineer (or architect) should produce an expected changes manifest — a list of files, APIs, UI components, and data models that will change. The test strategist uses this to:
Input: Actual git diff of implemented changes. Output: Recalibrated test strategy based on what really changed.
After the engineer finishes, the test strategist:
git diff to see actual changesPass 2 always runs. Even if Pass 1 was thorough, the diff may reveal changes the engineer didn't predict.
Generate aggressive test scenarios with mandatory positive+negative pairing:
Use: /wicked-garden:qe:scenarios <feature>
Quality review across the full delivery lifecycle:
| Focus | Reviews |
|-------|---------|
| requirements | Testability, clarity, acceptance criteria |
| ux | User flows, error handling, edge cases |
| ui | Visual consistency, accessibility |
| arch | Testability, deployability, observability |
| code | Test coverage, code quality |
| deploy | Rollback plan, feature flags, monitoring |
| all | Full spectrum review |
Use: /wicked-garden:qe:qe <target> --focus <area>
Generate comprehensive test plans with coverage matrix, risk assessment, and test data requirements.
Use: /wicked-garden:qe:qe-plan <feature>
Convert scenarios into runnable test code. Supports pytest, jest, go test, and more.
Use: /wicked-garden:qe:automate --framework <framework>
Review existing test code for quality, coverage gaps, test smells, and flakiness patterns. Also detects agent test manipulation: tests weakened to pass, missing assertions, reduced coverage, and tests that always pass.
Use: /wicked-garden:qe:qe-review <test-path>
Three-agent pipeline that separates test writing, execution, and review:
Catches specification bugs, runtime bugs, and semantic bugs that self-grading misses.
Use: /wicked-garden:qe:acceptance <scenario>
/wicked-garden:qe:scenarios Feature X # 1. Generate scenarios
/wicked-garden:qe:qe-plan src/feature/ # 2. Create test plan
/wicked-garden:qe:automate --framework jest # 3. Generate test code
/wicked-garden:qe:qe-review tests/ # 4. Review quality
/wicked-garden:qe:acceptance scenario.md --phase write # 1. Generate evidence-gated test plan
# Review the plan, then:
/wicked-garden:qe:acceptance scenario.md # 2. Full Write → Execute → Review pipeline
See refs/test-type-taxonomy.md for the full change-type selection matrix.
Complexity determines escalation: 0-2 = fast-pass or single specialist, 3-5 = specialist + senior, 6-7 = council + human sign-off. Review phase is never fast-passed. Escalate to council on security/compliance signals, CONDITIONAL gates, or prior REJECT. See refs/test-type-taxonomy.md for full gate matrix.
| Agent | Purpose | |-------|---------| | test-strategist | Generate test scenarios, coverage strategy | | test-automation-engineer | Generate test code, configure infrastructure | | risk-assessor | Identify risks and failure modes | | code-analyzer | Static analysis for testability and quality | | tdd-coach | Guide TDD red-green-refactor workflow | | acceptance-test-writer | Transform scenarios into evidence-gated test plans | | acceptance-test-executor | Execute plans, collect artifacts, no judgment | | acceptance-test-reviewer | Evaluate evidence against assertions independently |
When wicked-scenarios is installed, QE auto-discovers scenarios (api, browser, perf, infra, security, a11y), assesses coverage gaps, and executes during gates. Configure via project.json qe_scenarios.execution_mode: strict (blocking), warn (advisory), skip (informational). Without wicked-scenarios, all QE functionality works identically.
When a crew project reaches the test phase, QE tests like a product owner — verifying real user flows before checking unit-level correctness. E2E = product-level testing, not running pnpm test.
Product-first dispatch order:
| Priority | Group | Layer | Test Types | When Required | |---|---|---|---|---| | 1st | P | 5 — Scenario/E2E | Playwright/Cypress, live endpoint curl, acceptance scenarios | All non-trivial changes | | 1st | P | 3 — Visual | screenshots, interaction flows, a11y, JS error monitoring | UI or both changes | | 2nd | I | 2 — Integration | direct HTTP contract/API validation (not mocked) | API or both changes | | 2nd | I | 4 — Security | auth/input validation, authz boundary | API or both changes | | 3rd | R | 1 — Unit | run existing suite (do NOT generate new unit tests) | Always | | 3rd | R | 6 — Regression | run full existing suite | Always |
E2E tool priority: Playwright/Cypress (if configured) > curl/fetch against live endpoints > /wicked-garden:qe:run > /wicked-garden:qe:acceptance.
UI testing standards:
API testing standards:
Evidence package: The test phase MUST compile phases/test/evidence/report.md with screenshots, execution traces, and spec comparison. The review phase evaluates this package. See refs/evidence-taxonomy.md.
See refs/test-type-taxonomy.md for full layer definitions, agent routing, parallel dispatch rules, and execution details.
refs/test-type-taxonomy.md — 10 test types, pyramid layers, change-type matrix, gate verdict format, crew integrationIntegrates with: crew (quality gates), scenarios (E2E discovery), native tasks (task tracking via TaskCreate/TaskUpdate), product (requirements/UX), platform (deployment), engineering (architecture/code quality).
development
--- name: large-scale-migration description: How to execute a LARGE MECHANICAL change across any codebase with LEVERAGE instead of an agent-grind or hand-edits — a cross-cutting migration, refactor, rename, dialect/framework/DB port, library adoption, or bulk transform. The map→transform→gate pattern: a deterministic transform driven by a source-of-truth map, proven by a differential-equivalence gate. Use when the work is "migrate all X to Y", "rename Z everywhere", "port to a new DB/dialect/fra
testing
v11 LLM-based work-shape classifier. Replaces the regex archetype detector with the model's own reasoning. Reads the user's prompt, picks the right archetype(s) from the catalog, identifies signals (blast_radius, novelty, reversibility, etc.), and persists to SessionState so subsequent turns steer correctly. Use when: the prompt_submit hook emitted a `<wg classify-due />` directive, OR explicitly invoked at session start, OR when re-classifying after the user changes scope mid-session.
tools
v11 work-shape archetype runner. When a prompt has been routed to one of the 9 archetypes (triage, explore, specify, decide, ship, review, incident, build, migrate), this skill is the entry point. It picks the right per-archetype playbook from refs/ and executes the phase shape declared in `.claude-plugin/archetypes.json`. Use when: a `<wg archetype="X">` or `<wg archetypes>` system-reminder tag appears, an explicit "let's run the X archetype" request, or when one of the per-archetype slash commands resolves to this skill.
development
Show or set the session intent variable. Intent gates how loud the framework is — simple-edit (silent), feature/research (synthesis directive), rigor (full crew context). Auto-detected on turn 1; this skill overrides explicitly. Sticky for the session. Use when: "set intent", "intent override", "/wicked-garden:intent", "make the framework quiet", "force rigor", "what's my intent".