skills/qa-systematic/SKILL.md
Systematic web application QA testing with issue taxonomy, health scoring, and regression tracking. Triggers on: "QA this", "test the app", "smoke test", "run QA", "systematic test", "regression test", "full QA", "/qa-systematic".
npx skillsauth add mathews-tom/armory qa-systematicInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Systematic page-by-page testing, 8-category health score, full issue documentation.
30-second smoke test of critical paths only: login, main nav, primary action.
Diff current state against saved baseline, report new/resolved issues.
Detect available automation in priority order:
agent-browser skill — check if the agent-browser skill is loadedplaywright, puppeteer, or cypress binaries on PATHUse the highest-priority method available. State which method is in use at the start of the report.
references/project-detection.md for framework port conventions (e.g., Next.js → 3000, Vite → 5173, Django → 8000)..qa-reports/.For each page, run the per-page checklist below. In quick mode, run only the items marked with (Q).
<img> tagsFor each issue found, classify using references/issue-taxonomy.md:
Assign a unique ID: QA-001, QA-002, etc.
Compute health score using the weights defined below and detailed in references/report-template.md.
references/report-template.md..qa-reports/<YYYY-MM-DD>-<mode>.json.Weighted average across 8 categories, scored 0-100.
| Category | Weight | | -------------- | ------ | | Console errors | 15% | | Broken links | 10% | | Functional | 20% | | UX/Usability | 15% | | Accessibility | 15% | | Visual | 10% | | Performance | 10% | | Content | 5% |
Scoring per category: start at 100, deduct per issue by severity:
Floor at 0. Final health score = weighted sum of category scores.
Run only items marked (Q) in the Phase 4 checklist. Skip health score computation — report pass/fail per critical path. Target completion: 30 seconds of actual testing time.
.qa-reports/.testing
Create, review, and restyle data visualizations using Edward Tufte principles: high data-ink ratio, direct labels, range-frame axes, small multiples, accessible color, responsive charts, and honest comparisons. Triggers on: "create a chart", "style this chart", "review this graph", "Tufte chart", "data visualization", "Recharts", "Plotly", "matplotlib", "Chart.js", "ECharts", "D3". Use when generating or critiquing charts, dashboards, sparklines, and data tables.
testing
Manages dependent branch stacks and stacked pull requests using safe Git topology rules. Triggers on: "create stacked PRs", "publish this stack", "sync my PR stack", "rebase this stack", "merge the stack", "retarget child PRs", "split this branch into stacked PRs", "validate this stack", "cleanup stacked branches". Use when local branches or one source branch need to become a dependency-ordered PR stack with correct parent bases, validation, synchronization, merge order, and cleanup.
development
Scaffolds per-repository agent context so coding agents share the same issue tracker rules, triage label vocabulary, domain glossary, ADR layout, and handoff conventions. Triggers on: "set up project context", "configure agent docs", "create CONTEXT.md", "setup agent workflow", "agent issue tracker setup", "triage labels", "domain glossary for agents". Use when a repo needs durable context files before planning, triage, debugging, TDD, architecture review, or multi-agent implementation.
testing
Produces phased task boards from feature requests: dependency-mapped work items, parallelization flags, risk flags, edge cases, test matrices. Triggers on: "decompose this feature", "task breakdown with dependencies", "phased implementation plan", "work breakdown structure". NOT for effort estimates, use estimate-calibrator.