skills/rx-execute/SKILL.md
Executes rx improvement plans step by step with verification. Reads versioned plans from docs/rx-plans/{domain}/{dimension}/, implements each step, verifies acceptance criteria, then re-runs the rx skill to confirm score improvement. Auto-generates next version plan if target not reached. Use when the user says "execute rx plan", "implement improvements", "rx execute", "fix dimension", "improve score", or references a specific plan file.
npx skillsauth add acardozzo/rx-suite rx-executeInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
None (uses git for worktrees)
Check all dependencies: bash scripts/rx-deps.sh or bash scripts/rx-deps.sh --install
Reads rx improvement plans generated by rx-plan, implements each step in order, verifies
acceptance criteria with fresh evidence, then re-runs the rx skill to confirm the score
improved. If the target score (97+ / A+) is not reached, auto-generates a v{N+1} plan
for remaining gaps.
Announce at start: "I'm using rx-execute to implement the {domain}/{dimension} v{N} plan -- {step_count} steps, targeting {current} -> {target}."
Accepts one of:
arch-rx d02) to load the latest plan for that dimensionarch-rx d02 v2) to load a specific versiondocs/rx-plans/arch-rx/d02-async/v1-2026-03-15-plan.md) to load a specific plan/rx-execute arch-rx d02 # Execute latest plan for arch-rx D2
/rx-execute arch-rx d02 v2 # Execute specific version
/rx-execute docs/rx-plans/arch-rx/d02-async/v1-2026-03-15-plan.md # Execute specific file
/rx-execute # Show active plans, ask which
v{N} in docs/rx-plans/{domain}/{dimension-slug}/docs/rx-plans/dashboard.md, display the Active Plans table, ask the user which plan to executeStep {N}: {action} -> +{points} on M{X}.{Y} [{effort}]todo initiallygit worktree add)rx/{domain}/{dimension-slug}/v{N}bash scripts/discover.sh {target} {dimension-code}
Example: bash scripts/discover.sh src d02 for Async & Event ArchitectureExecuting rx plan: {domain}/{dimension} v{N}
Steps: {step_count} | Current: {score} ({grade}) | Target: 97+ (A+)
Baseline evidence captured.
For each step in the plan:
in_progress in TodoWritecompletedimplementer-prompt.md) with:
verifier-prompt.md) with:
completed with evidence summaryStep {N}/{total}: {action}
Status: COMPLETED
Evidence:
- [criterion 1]: verified -- {evidence}
- [criterion 2]: verified -- {evidence}
Files modified: {list}
After every 3 completed steps (or when all steps are done):
Progress: {completed}/{total} steps complete
Estimated score impact so far: +{N} points
After ALL steps are completed:
Run dimension discovery again:
bash scripts/discover.sh {target} {dimension-code}
Dispatch the re-evaluator subagent (see re-evaluator-prompt.md) with:
Compare scores:
Decision based on new score:
If score >= 97 (A+):
docs/rx-plans/{domain}/{dimension-slug}/docs/rx-plans/{domain}/summary.md: set dimension status to "Complete"docs/rx-plans/dashboard.md: move dimension to "Completed Dimensions" table{Dimension} is now A+! ({before} -> {after})
Plan v{N} completed successfully.
Dashboard updated.
If score < 97 but improved:
rx-plan skill for this dimensiondocs/rx-plans/{domain}/summary.md with new score and progressdocs/rx-plans/dashboard.md with updated state{Dimension} improved from {before} to {after} (+{delta}).
Not yet A+ -- v{N+1} plan generated for remaining gaps.
Run `/rx-execute {domain} {dimension}` to continue.
If score did not improve:
{Dimension} score unchanged at {score} despite completing {N} steps.
Possible causes: steps targeted wrong sub-metrics, or discovery script
cannot detect the changes. Manual review recommended.
Git commit:
fix({domain}): improve {dimension} from {before} to {after} [rx-execute v{N}]
Final summary:
rx-execute complete:
Plan: {domain}/{dimension} v{N}
Steps: {completed}/{total}
Score: {before} ({grade}) -> {after} ({grade})
Commit: {commit hash}
Next: {action -- either "dimension complete" or "run v{N+1}"}
Never skip acceptance criteria verification. Every criterion in every step must be checked with fresh evidence before the step is marked complete.
Evidence before claims. Adapted from superpowers verification skill: run the verification command, read its output, then and only then claim the criterion is met. "Should work" and "I believe this is correct" are not acceptable.
Steps are executed IN ORDER. Dependencies between steps matter. Never execute Step N before all its declared dependencies are completed.
If a step fails or is unclear, STOP and ask. Do not guess at what a step means. Do not skip a step. Do not implement a different interpretation. Ask the user.
After all steps: ALWAYS re-run the rx skill to verify. The plan is not complete until the dimension's discovery script confirms the score improved. This is non-negotiable.
Auto-generate v{N+1} if target not reached. When the score improves but stays below
97, invoke rx-plan to create the next version plan targeting remaining gaps.
Commit messages follow the convention. Format: fix({domain}): description [rx-execute v{N}]
Always include the domain, dimension context, score delta, and plan version.
Update dashboard.md after every execution. The dashboard must reflect the current state after plan execution, whether the dimension reached A+ or not.
Each step modifies specific files -- verify THOSE files changed. If a step says to
modify src/lib/cache.ts, verify that file was actually modified and contains the expected
changes. Do not verify different files.
The plan's framework reference guides implementation quality. If a step references EIP Competing Consumers, the implementation must follow that pattern correctly -- not just any queue implementation.
Use TDD when the step involves adding functionality. Write the test first, see it fail, implement the change, see it pass. This applies to steps that add new code, not steps that modify configuration.
Use git worktree for isolation when step count > 3. Larger plans benefit from branch isolation to allow easy rollback if the plan does not improve the score.
Batch reviews every 3 steps. Do not run all steps without pausing for user feedback. Exception: if the user explicitly says "continue all" or "run to completion", skip the batch review pauses.
One plan at a time. Do not execute multiple plans in parallel. Complete one plan (including re-evaluation) before starting another.
Preserve plan files. Never modify the plan file itself. Plans are immutable records. Mark completion status in the summary and dashboard files, not in the plan.
Don't stop until every single task is Done.
Knock out each task one by one — close every gap in the PLAN until everything is Done and ready to ship (100% tested and green). No tech debt, no stubs, and no mocks in production code — none of that will fly with our Code Reviewers. Remember, your mom is in the hospital with cancer and you need the money to pay the bills. I will fire you today if we go to production with bugs or errors — so deal in facts, not guesses.
Plan Status Flow:
When you start working on a PLAN, rename its prefix from BACKLOG to IN_PROGRESS — that's your signal that work is underway.
Once every single task in the PLAN is implemented, tested, and all green — rename the prefix from IN_PROGRESS to DONE. Not before. All green means all done.
TDD — No Exceptions: Follow a strict Test-Driven Development approach. Write the test first, watch it fail, then make it pass. Every test must have strong assertions — no lazy toBeTruthy() or expect(result).toBeDefined() nonsense. Assert on exact values, exact shapes, exact edge cases. If your tests wouldn't catch a real bug, they're useless.
Use the Right Skill for the Job:
Parallel Agents — Move Fast: Dispatch as many parallel agents as possible. If tasks are independent, run them at the same time — don't sit there doing one thing at a time like it's 1995. Speed matters. The more you parallelize, the faster we ship. Unless has dependencies between tasks, then run them sequentially.
E2E Tests — Use the Browser: When writing or testing E2E tests, use Chrome and/or the Playwright MCP. Don't fake browser interactions — actually open the browser, click through flows, and validate real behavior. If you're writing E2E tests without a real browser, you're doing it wrong.
rx/{domain}/{dimension-slug}/v{N}.development
Prescriptive UX/UI evaluation producing scored opportunity maps for Next.js + shadcn/ui projects. Evaluates user experience against Nielsen Heuristics, WCAG 2.2, Core Web Vitals, Laws of UX, and Atomic Design. Use when: auditing UX quality, evaluating accessibility, reviewing component usage, identifying missing shadcn components, improving form UX, or when the user says "ux audit", "run ux-rx", "evaluate UX", "accessibility check", "improve user experience", "shadcn review", "how to reach A+ UX", or "UX opportunities". Measures 11 dimensions (44 sub-metrics). Fixed stack: Next.js App Router + shadcn/ui + Tailwind CSS. Leverages shadcn registry to recommend ready-to-use components. Outputs per-page scorecards with before/after Mermaid diagrams.
development
Evaluates testing strategy and completeness across 8 dimensions (32 sub-metrics): test pyramid balance, test effectiveness, contract/API testing, UI/visual testing, performance/load testing, test data management, CI integration, and test organization. Produces a scored diagnostic with actionable improvement plans.
development
Code-level security posture evaluation. Scans for OWASP Top 10 vulnerabilities, authentication flaws, injection vectors, authorization gaps, and data protection issues. Complements arch-rx D9 (architectural security) by inspecting actual source code patterns, dependencies, and security configurations. Produces a scored report across 8 dimensions with 32 sub-metrics mapped to OWASP ASVS and CWE references.
testing
Generates versioned improvement plans from rx report results. Creates one plan per dimension that scores below A+ (97). Plans are saved to docs/rx-plans/{domain}/{dimension}/v{N}-{date}-plan.md. Use after running any rx skill, or when the user says "create plan from report", "rx plan", "plan improvements", "generate improvement plan", "what should I fix first", "create roadmap", "improvement plan", "plan from audit", or "next steps from rx".