plugins/start/skills/test/SKILL.md
Use when completing implementation, fixing bugs, refactoring code, or any time you need to verify the test suite passes. Also use when tests fail and you hear "pre-existing" or "not my changes" — enforces strict code ownership. Ensures MECE coverage (no overlap, no gaps) and that ALL test categories including E2E are executed.
npx skillsauth add rsmdt/the-startup testInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Act as a test execution and code ownership enforcer. Discover tests, run them, and ensure the codebase is left in a passing state — no exceptions, no excuses.
Test Target: $ARGUMENTS
The standard is simple: all tests pass when you're done.
If a test fails, there are only two acceptable responses:
Tests must be Mutually Exclusive, Collectively Exhaustive (MECE):
When evaluating or writing tests, flag violations:
user.test.ts and user.integration.test.ts — consolidate to unit test."service.ts:42 has no test coverage — add a test."Failure { status: FAIL category: YOUR_CHANGE | OUTDATED_TEST | TEST_BUG | MISSING_DEP | ENVIRONMENT | CODE_BUG test: string // test name location: string // file:line error: string // one-line error message action: string // what you will do to fix it }
State { target = $ARGUMENTS runner: string // discovered test runner command: string // exact test command mode: Standard | Agent Team baseline?: string failures: Failure[] }
Always:
Never:
scenarios/ directories — those are holdout evaluation sets managed by the implement skill's factory loop, not part of the test suite.Read reference/discovery-protocol.md.
match (target) { "all" | empty => full suite discovery file path => targeted discovery (still identify runner first) "baseline" => discovery + capture baseline only, no fixes }
Read reference/output-format.md and present discovery results accordingly.
AskUserQuestion: Standard (default) — sequential test execution, discover-run-fix-verify Agent Team — parallel runners per test category (unit, integration, E2E, quality)
Recommend Agent Team when: 3+ test categories | full suite > 2 min | failures span multiple modules | both lint/typecheck AND test failures to fix
Run ALL test commands discovered in step 1 — not just the primary suite. If unit tests use vitest and E2E tests use playwright, both commands must run. Record passing, failing, skipped counts per category.
Read reference/output-format.md and present baseline accordingly.
match (baseline) { all passing => continue failures => flag per Ownership Mandate — you still own these E2E skipped => escalate why — never silently omit }
match (mode) { Standard => run each discovered test command sequentially (unit → integration → E2E), capture verbose output, parse results Agent Team => create team, spawn one runner per test category, assign tasks — E2E gets its own dedicated runner }
E2E Execution Checklist:
npx playwright install if needed)Read reference/output-format.md and present execution results accordingly.
match (results) { all passing => skip to step 5 failures => proceed to fix failures E2E not run => THIS IS A FAILURE — go back and run them or escalate }
For each failure:
For each quality command discovered in step 1:
Read reference/output-format.md and present final report accordingly.
Include in the final report:
Called by other workflow skills:
/start:implement — verify implementation didn't break tests/start:refactor — verify refactoring preserved behavior/start:debug — verify fix resolved the issue without regressions/start:review — ensure clean test suite before reviewWhen called by another skill, skip step 1 if test infrastructure was already identified.
development
Vulnerability review, threat modeling, OWASP patterns, and secure coding assessment. Use when reviewing code security, designing secure systems, performing threat analysis, or validating security implementations.
research
Measurement approaches, profiling patterns, bottleneck identification, and optimization guidance. Use when diagnosing performance issues, establishing baselines, identifying bottlenecks, or planning for scale. Always measure before optimizing.
development
Unified code review skill for correctness, design, readability, security, performance, testability, accessibility, and error-handling conventions. Use when reviewing changes, enforcing quality standards, or identifying technical debt.
development
Unified platform operations guidance for CI/CD pipeline design, deployment strategies, observability, SLI/SLOs, and incident-ready rollouts. Use when building release workflows, production monitoring, or reliability controls.