mathews-tom/qa-systematic

Name: qa-systematic
Author: mathews-tom

skills/qa-systematic/SKILL.md

npx skillsauth add mathews-tom/armory qa-systematic

Systematic QA Testing

Modes

Full (default)

Systematic page-by-page testing, 8-category health score, full issue documentation.

Quick

30-second smoke test of critical paths only: login, main nav, primary action.

Regression

Diff current state against saved baseline, report new/resolved issues.

Browser Automation Detection

Detect available automation in priority order:

Playwright MCP server — check if Playwright tools are available in the current tool list
agent-browser skill — check if the agent-browser skill is loaded
Direct CLI tools — check for playwright, puppeteer, or cypress binaries on PATH
Manual fallback — instruct the user to navigate and report observations

Use the highest-priority method available. State which method is in use at the start of the report.
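The priority order above can be sketched as a small selection function. This is only an illustration under stated assumptions, not the skill's actual implementation: the name `detect_automation`, the returned labels, and the idea that Playwright MCP tools can be recognized by "playwright" in the tool name are all hypothetical.

```python
import shutil
from typing import Callable, Iterable, Optional

# CLI binaries named in the list above, checked in the order given.
CLI_BINARIES = ("playwright", "puppeteer", "cypress")


def detect_automation(
    tool_names: Iterable[str],
    loaded_skills: Iterable[str],
    which: Callable[[str], Optional[str]] = shutil.which,
) -> str:
    """Return a label for the highest-priority automation method available."""
    # 1. Playwright MCP server (assumption: its tools mention "playwright").
    if any("playwright" in name.lower() for name in tool_names):
        return "playwright-mcp"
    # 2. agent-browser skill.
    if "agent-browser" in set(loaded_skills):
        return "agent-browser"
    # 3. Direct CLI tools found on PATH.
    for binary in CLI_BINARIES:
        if which(binary):
            return f"cli:{binary}"
    # 4. Manual fallback: the user navigates and reports observations.
    return "manual"
```

Passing `which` as a parameter keeps the PATH lookup replaceable, which makes the function easy to exercise without any of the tools installed.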

Workflow

Phase 1: Initialize

Detect mode from user prompt. Default to full if unspecified.
Detect application URL:
- Check references/project-detection.md for framework port conventions (e.g., Next.js → 3000, Vite → 5173, Django → 8000).
- If not detectable, ask the user.
Detect available browser automation method (priority list above).
If regression mode: load previous baseline from .qa-reports/.
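As a rough sketch of the URL detection step, the function below maps a detected framework to its conventional port. The ports come from the examples above; the marker files, the `localhost` host, and the name `guess_app_url` are assumptions for illustration and do not reflect the contents of references/project-detection.md.

```python
from pathlib import Path
from typing import Optional

# (marker files, port) per framework. Marker files are an assumption about
# how each framework might be recognized in a project directory.
FRAMEWORK_MARKERS = {
    "Next.js": (("next.config.js", "next.config.mjs", "next.config.ts"), 3000),
    "Vite": (("vite.config.js", "vite.config.ts"), 5173),
    "Django": (("manage.py",), 8000),
}


def guess_app_url(project_root: str) -> Optional[str]:
    """Return a local URL guess, or None when no framework is recognized."""
    root = Path(project_root)
    for markers, port in FRAMEWORK_MARKERS.values():
        if any((root / marker).exists() for marker in markers):
            return f"http://localhost:{port}"
    return None  # not detectable: ask the user
```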

Phase 2: Authenticate (if needed)

Navigate to root URL and check if a login wall is present.
If credentials were provided: authenticate and store session.
If not: ask the user for test credentials, or skip auth-gated pages and note the gap in the report.

Phase 3: Orient

Navigate to root URL.
Map the primary navigation structure — collect all top-level nav links.
Classify each page: static, form, list, detail, dashboard.
Build a test plan ordered by page category (forms and dashboards first — highest defect density).
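The ordering rule in the last step could look like the sketch below. Only "forms and dashboards first" comes from the text; the relative order of the remaining categories and the name `build_test_plan` are assumptions.

```python
# Lower number = tested earlier. Forms and dashboards share the top tier.
CATEGORY_PRIORITY = {"form": 0, "dashboard": 0, "list": 1, "detail": 2, "static": 3}


def build_test_plan(pages):
    """Order (url, category) pairs by category priority.

    sorted() is stable, so pages in the same tier keep their discovery order.
    """
    return sorted(pages, key=lambda page: CATEGORY_PRIORITY[page[1]])


pages = [("/about", "static"), ("/login", "form"), ("/items", "list"), ("/home", "dashboard")]
plan = build_test_plan(pages)
# plan starts with ("/login", "form") and ("/home", "dashboard")
```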

Phase 4: Explore (Full mode)

For each page, run the per-page checklist below. In quick mode, run only the items marked with (Q).

Visual Scan

(Q) Layout renders correctly — no overlap, no overflow
Images load — no broken <img> tags
Typography consistent — no visible font fallbacks
Responsive: check at desktop (1280px) and mobile (375px) widths

Interactive Elements

(Q) All buttons and links are clickable and responsive
Hover states present where expected
Focus indicators visible for keyboard navigation
Disabled states visually distinct

Forms

(Q) Required field validation fires on empty submit
Error messages display on invalid input
Success feedback on valid submission
Form resubmission handled — no duplicate submissions on double-click

Navigation

(Q) All nav links resolve — no 404s
Back button works as expected
Deep links work — direct URL access returns correct page
Breadcrumbs accurate (if present)

State Management

Loading states displayed during async operations
Empty states handled — no blank pages when data is absent
Error states recoverable — retry or back options present
Data persists across navigation — no lost form data on back/forward

Console

(Q) No JavaScript errors in console
No failed network requests (4xx/5xx)
No mixed content warnings
No deprecation warnings in hot paths

Responsiveness

Mobile layout usable — no horizontal scroll at 375px
Touch targets >= 44px
Text readable without zoom (>= 16px body text)
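One way to represent the checklist so that quick mode can filter it is a flat list of items carrying a `quick` flag. The sketch below includes only a subset of the items above, and the `Check` type and `checks_for_mode` helper are hypothetical names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Check:
    section: str
    text: str
    quick: bool = False  # True for items marked (Q)


# Abbreviated: the five (Q) items plus two non-quick examples.
CHECKLIST = [
    Check("Visual Scan", "Layout renders correctly", quick=True),
    Check("Visual Scan", "Images load"),
    Check("Interactive Elements", "All buttons and links are clickable and responsive", quick=True),
    Check("Forms", "Required field validation fires on empty submit", quick=True),
    Check("Navigation", "All nav links resolve", quick=True),
    Check("Console", "No JavaScript errors in console", quick=True),
    Check("Responsiveness", "Touch targets >= 44px"),
]


def checks_for_mode(mode: str):
    """Quick mode runs only (Q) items; every other mode runs the full list."""
    if mode == "quick":
        return [check for check in CHECKLIST if check.quick]
    return list(CHECKLIST)
```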

Phase 5: Document

For each issue found, classify using references/issue-taxonomy.md:

Severity: critical (blocks usage), major (degrades experience), minor (cosmetic/polish)
Category: functional, visual, accessibility, performance, content, navigation, security, console
Evidence: screenshot description or reproduction steps

Assign a unique ID: QA-001, QA-002, etc.

Compute health score using the weights defined below and detailed in references/report-template.md.
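A minimal sketch of an issue record with sequential IDs follows. The severity and category values are the ones listed above; the `Issue` and `IssueLog` names and the field layout are assumptions, since the actual schema lives in references/issue-taxonomy.md.

```python
from dataclasses import dataclass, field
from typing import List

SEVERITIES = ("critical", "major", "minor")
CATEGORIES = (
    "functional", "visual", "accessibility", "performance",
    "content", "navigation", "security", "console",
)


@dataclass
class Issue:
    id: str
    severity: str
    category: str
    description: str
    evidence: str  # screenshot description or reproduction steps


@dataclass
class IssueLog:
    issues: List[Issue] = field(default_factory=list)

    def add(self, severity: str, category: str, description: str, evidence: str) -> Issue:
        """Validate the classification and assign the next QA-NNN identifier."""
        if severity not in SEVERITIES:
            raise ValueError(f"unknown severity: {severity}")
        if category not in CATEGORIES:
            raise ValueError(f"unknown category: {category}")
        issue = Issue(f"QA-{len(self.issues) + 1:03d}", severity, category, description, evidence)
        self.issues.append(issue)
        return issue
```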

Phase 6: Wrap Up

Generate structured report following references/report-template.md.
Save to .qa-reports/<YYYY-MM-DD>-<mode>.json.
If full mode: save baseline for future regression comparison.
Present summary: health score, critical/major/minor counts, top 3 priority fixes.
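The file naming and summary counts can be sketched as below. The path pattern is the one given above; the function names and the shape of the issue dictionaries are assumptions, because the real report structure is defined in references/report-template.md.

```python
import json
from collections import Counter
from datetime import date
from pathlib import Path


def report_path(mode: str, day: date, root: str = ".qa-reports") -> Path:
    """Build .qa-reports/<YYYY-MM-DD>-<mode>.json for the given day and mode."""
    return Path(root) / f"{day.isoformat()}-{mode}.json"


def severity_counts(issues):
    """Count issues per severity, always reporting all three levels."""
    counts = Counter(issue["severity"] for issue in issues)
    return {level: counts.get(level, 0) for level in ("critical", "major", "minor")}


def save_report(report: dict, mode: str, day: date, root: str = ".qa-reports") -> Path:
    """Write the report as JSON, creating the report directory if needed."""
    path = report_path(mode, day, root)
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(report, indent=2))
    return path
```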

Health Score

Weighted average across 8 categories, scored 0-100.

| Category       | Weight |
| -------------- | ------ |
| Console errors | 15%    |
| Broken links   | 10%    |
| Functional     | 20%    |
| UX/Usability   | 15%    |
| Accessibility  | 15%    |
| Visual         | 10%    |
| Performance    | 10%    |
| Content        | 5%     |

Scoring per category: start at 100, deduct per issue by severity:

Critical: -30
Major: -15
Minor: -5

Floor each category score at 0. Final health score = weighted sum of category scores. Because the weights total 100%, the result also falls between 0 and 100.
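The scoring rules can be checked with a short worked example. The weights and deductions are taken from this section; the function names and the input shape (a mapping from category name to a list of issue severities) are assumptions.

```python
# Weights in percent, as listed in the Health Score table.
WEIGHTS = {
    "Console errors": 15, "Broken links": 10, "Functional": 20, "UX/Usability": 15,
    "Accessibility": 15, "Visual": 10, "Performance": 10, "Content": 5,
}
DEDUCTIONS = {"critical": 30, "major": 15, "minor": 5}


def category_score(severities):
    """Start at 100, deduct per issue by severity, floor at 0."""
    return max(0, 100 - sum(DEDUCTIONS[s] for s in severities))


def health_score(issues_by_category):
    """Weighted sum of category scores; categories with no issues score 100."""
    total = sum(
        weight * category_score(issues_by_category.get(category, []))
        for category, weight in WEIGHTS.items()
    )
    return total / 100


example = {"Functional": ["critical", "major"], "Console errors": ["minor", "minor"]}
score = health_score(example)
# Functional: 100 - 30 - 15 = 55; Console errors: 100 - 5 - 5 = 90
# (15*90 + 20*55 + 65*100) / 100 = 89.5
```

Keeping the weights as integer percentages and dividing once at the end avoids accumulating floating-point error across the eight terms.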

Quick Mode Behavior

Run only items marked (Q) in the Phase 4 checklist. Skip health score computation — report pass/fail per critical path. Target completion: 30 seconds of actual testing time.

Regression Mode Behavior

Load the most recent baseline from .qa-reports/.
Run full mode.
Diff issues by ID and description similarity.
Report: new issues, resolved issues, persistent issues.
Save updated baseline.
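The diff step might be sketched as follows, using `difflib.SequenceMatcher` from the Python standard library for description similarity. This is one interpretation of "by ID and description similarity": a baseline issue with the same ID and a similar description wins outright, otherwise the most similar description is used. The 0.8 threshold and the function names are assumptions.

```python
from difflib import SequenceMatcher


def _similarity(a: str, b: str) -> float:
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def _find_match(issue, candidates, threshold):
    """Return the best matching baseline issue, or None if nothing is similar enough."""
    best, best_score = None, threshold
    for candidate in candidates:
        score = _similarity(issue["description"], candidate["description"])
        if score >= threshold and candidate["id"] == issue["id"]:
            return candidate  # same ID and similar description
        if score >= best_score:
            best, best_score = candidate, score
    return best


def diff_issues(baseline, current, threshold=0.8):
    """Split issues into new, resolved, and persistent relative to the baseline."""
    unmatched = list(baseline)
    new, persistent = [], []
    for issue in current:
        match = _find_match(issue, unmatched, threshold)
        if match is None:
            new.append(issue)
        else:
            unmatched.remove(match)
            persistent.append(issue)
    return {"new": new, "resolved": unmatched, "persistent": persistent}
```

Matching on ID alone would be unreliable here, since IDs are assigned sequentially within each run; that is why the sketch always requires a similar description as well.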

Systematic web application QA testing with issue taxonomy, health scoring, and regression tracking. Triggers on: "QA this", "test the app", "smoke test", "run QA", "systematic test", "regression test", "full QA", "/qa-systematic".

229 stars

development

Updated May 9, 2026

npx skillsauth add mathews-tom/armory qa-systematic

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Trivy: Container and dependency vulnerability scanner. Clean, 95%.
Semgrep: Static code analysis for vulnerabilities. Clean, 95%.
mcp-scan (Snyk): Model Context Protocol security validation. Clean, 95%.
Snyk (dep): Open source security scanning. Skipped, 50%.
Socket.dev: Supply chain security analysis. Skipped, 50%.
VirusTotal: Multi-engine malware detection. Skipped, 50%.
CrowdStrike: Advanced threat intelligence. Skipped, 50%.
OSV-Scanner: Open Source Vulnerability database check. Skipped, 50%.
OWASP Dep-Check: Skipped, 50%.

Last scanned: May 9, 2026, 8:06 AM · 142.9s · 5 files scanned

SKILL.md

name: qa-systematic
description: Systematic web application QA testing with issue taxonomy, health scoring, and regression tracking. Triggers on: "QA this", "test the app", "smoke test", "run QA", "systematic test", "regression test", "full QA", "/qa-systematic".
version: 1.0.1
category: development
tags: [qa, testing, browser, regression]
difficulty: advanced
phase: verify

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Install manually

# Clone the repo
git clone https://github.com/mathews-tom/armory.git

# Copy into Claude Code skills folder (global)
cp -r armory/skills/qa-systematic ~/.claude/skills/

Repository

mathews-tom/armory

229 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT