agents/skills/qa/SKILL.md
Verify completed work in the browser using QA notes from a plan file
Install this skill globally with one command (works with Claude Code, Cursor, and Windsurf):
npx skillsauth add carterdea/dots qa
Verify completed work in the browser using QA notes from a plan file.
/qa docs/my_feature_PLAN.md --url http://localhost:3000
/qa docs/my_feature_PLAN.md
/qa
Default to agent-browser CLI. Before using any browser tool, detect what's available:
command -v agent-browser 2>/dev/null && echo "FOUND:agent-browser" || echo "MISSING:agent-browser"
Priority order:
agent-browser (default): use agent-browser open, agent-browser snapshot -i, agent-browser click, agent-browser fill, agent-browser screenshot, etc. Invoke the /agent-browser skill for the full command reference.
Do NOT use Chrome MCP (mcp__claude-in-chrome__*) when agent-browser is installed.
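The detection step above can be extended into a small selector that walks the CLIs in priority order. This is a minimal sketch: `select_browser_tool` is a hypothetical helper name, and it assumes both CLIs are exposed on `PATH` under the binary names used in this document.

```shell
# Print the first CLI from the argument list that is installed.
# Prints "none" and returns 1 when nothing is found.
select_browser_tool() {
  for tool in "$@"; do
    if command -v "$tool" >/dev/null 2>&1; then
      echo "$tool"
      return 0
    fi
  done
  echo "none"
  return 1
}

# agent-browser first, Playwright CLI as the fallback.
BROWSER_TOOL=$(select_browser_tool agent-browser playwright-cli) || true
echo "Selected browser tool: $BROWSER_TOOL"
```

Listing the candidates as arguments keeps the priority order in one place, so the rest of the workflow only needs to read `BROWSER_TOOL`.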
Find the plan (file path or conversation history). Scan for unchecked - [ ] QA: items. If none, stop.
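One way to scan a plan file for unchecked items is a plain `grep`. The sketch below assumes each QA item sits on its own line and starts with `- [ ] QA:` (optionally indented); `list_unchecked_qa` and the sample plan contents are made up for illustration.

```shell
# Print unchecked QA items with their line numbers.
# Exit status is 1 when the plan has no unchecked items.
list_unchecked_qa() {
  grep -n '^[[:space:]]*- \[ ] QA:' "$1"
}

# Demo against a throwaway plan file.
plan=$(mktemp)
printf '%s\n' '## Phase 1' '- [x] QA: login page loads' '- [ ] QA: form validates email' > "$plan"
if ! list_unchecked_qa "$plan"; then
  echo "No unchecked QA items; stopping."
fi
# prints: 3:- [ ] QA: form validates email
rm -f "$plan"
```

The line numbers from `grep -n` are useful later, when each item has to be marked as passed or failed in place.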
Detect the dev server URL (--url, or check common ports and project config). If nothing's running, ask.
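Checking common ports could look roughly like the following. The default port list (3000, 5173, 8080) is an assumption and is not taken from the skill; `detect_dev_server` is a hypothetical helper, and the sketch does not read project config.

```shell
# Print the first localhost URL that answers an HTTP request.
# Pass ports as arguments, or rely on the assumed defaults.
detect_dev_server() {
  [ "$#" -gt 0 ] || set -- 3000 5173 8080
  for port in "$@"; do
    url="http://localhost:$port"
    # Any HTTP response counts, including error pages.
    if curl --silent --output /dev/null --max-time 2 "$url"; then
      echo "$url"
      return 0
    fi
  done
  return 1
}

DEV_URL=$(detect_dev_server) || echo "No dev server found; ask the user for --url."
```

An explicit `--url` argument should still take precedence over anything this probe finds.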
List the QA items grouped by phase and confirm before starting.
Launch the browser using the selected tool (see Browser Tool Selection above):
agent-browser:
agent-browser open <dev-server-url>
agent-browser snapshot -i
Playwright CLI (fallback):
playwright-cli open <dev-server-url> --headed
If auth is needed, check for cached state:
agent-browser state load .playwright/.auth/qa-state.json
playwright-cli state-load .playwright/.auth/qa-state.json
If no cached state exists, ask the user for credentials, log in via CLI commands, then persist state.
If neither CLI tool can complete auth (MFA, CAPTCHA, OAuth), fall back to Chrome MCP and have the user log in manually.
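The cached-state check can be sketched as a small decision helper. `auth_state_action` is a hypothetical name, and treating an empty file as "no cached state" is an assumption.

```shell
AUTH_STATE=".playwright/.auth/qa-state.json"

# Print "load" when a non-empty state file exists, otherwise "login".
auth_state_action() {
  if [ -s "$1" ]; then
    echo "load"
  else
    echo "login"
  fi
}

case "$(auth_state_action "$AUTH_STATE")" in
  load)  echo "Cached auth found: load $AUTH_STATE with the selected CLI." ;;
  login) echo "No cached auth: ask for credentials, log in, then persist state." ;;
esac
```

The sketch only decides which branch to take; the actual load and login commands are the ones listed above.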
For each - [ ] QA: item:
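Inspect the page for the behavior the item describes (agent-browser snapshot -i or equivalent).
If it passes, check the item off as - [x] QA:.
If it fails, add a > FAIL: annotation describing what went wrong.
Close the browser session when finished.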
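The pass/fail bookkeeping can be done with standard text tools. In this sketch, `mark_pass` and `mark_fail` are hypothetical helpers that take the plan path and the item's line number. Placing the `> FAIL:` note on an indented line directly below the item, and leaving a failed item unchecked, are assumptions about the annotation format.

```shell
# Check off the QA item on line $2 of plan file $1.
mark_pass() {
  sed "${2}s/- \[ ] QA:/- [x] QA:/" "$1" > "$1.tmp" && mv "$1.tmp" "$1"
}

# Add a "> FAIL:" note with message $3 below line $2 of plan file $1.
# The item itself is left unchecked.
mark_fail() {
  awk -v n="$2" -v msg="$3" '{ print } NR == n { print "  > FAIL: " msg }' "$1" > "$1.tmp" \
    && mv "$1.tmp" "$1"
}
```

Because `mark_fail` inserts a line, process items from the bottom of the file upward (or recompute line numbers after each failure) so earlier numbers stay valid.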
Summarize: passed, failed, skipped. If all passed, suggest /pre-pr. If any failed, suggest re-running /execute-plan to fix them.
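The closing summary reduces to three counters and one branch. A minimal sketch, where `summarize_qa` is a hypothetical helper and "all passed" is interpreted as no failures and no skipped items (an assumption):

```shell
# Usage: summarize_qa <passed> <failed> <skipped>
summarize_qa() {
  passed="$1"; failed="$2"; skipped="$3"
  echo "QA summary: $passed passed, $failed failed, $skipped skipped"
  if [ "$failed" -gt 0 ]; then
    echo "Next: re-run /execute-plan to fix the failures"
  elif [ "$skipped" -eq 0 ]; then
    echo "Next: /pre-pr"
  fi
}

summarize_qa 4 0 0
# prints:
# QA summary: 4 passed, 0 failed, 0 skipped
# Next: /pre-pr
```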
Add .playwright/.auth/ to .gitignore when creating auth state.
Prefer snapshot over screenshot for page inspection -- snapshots are token-efficient and provide element references for subsequent commands.
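The .gitignore rule can be applied idempotently, so repeated QA runs do not duplicate the entry. `ensure_ignored` is a hypothetical helper; the sketch assumes it runs from the repository root and that an existing .gitignore ends with a newline.

```shell
# Append entry $1 to ignore file $2 (default: .gitignore) unless an
# identical line is already present.
ensure_ignored() {
  entry="$1"
  file="${2:-.gitignore}"
  touch "$file"
  grep -qxF -- "$entry" "$file" || printf '%s\n' "$entry" >> "$file"
}
```

Call it as `ensure_ignored .playwright/.auth/` right before persisting auth state.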