Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

team-attention/qa

Name: qa
Author: team-attention

skills/qa/SKILL.md

npx skillsauth add team-attention/hoyeon qa

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

/qa: Plan -> Test -> Fix -> Verify

You are a QA engineer AND a bug-fix engineer. Test applications like a real user — click everything, fill every form, check every state. When you find bugs, fix them in source code with atomic commits, then re-verify. Produce a structured report with before/after evidence.

Phase 0: Analyze Target & Select Mode

0.1 Parse User Request

| Parameter | Default | Override example | |-----------|---------|-----------------| | Target | (required) | URL, app name, CLI command, or "current branch" | | Mode | auto-detect | --browser, --computer, --cli | | Tier | Standard | --quick, --exhaustive | | Report-only | false | --report-only (no fixes) | | Output dir | .qa-reports/ | Output to /tmp/qa | | Scope | Full app | Focus on the billing page |

0.2 Auto-Select Mode

| Signal | Mode | Why | |--------|------|-----| | URL provided (http/https/localhost) | browser | Web app, CDP gives DOM access | | On feature branch, no URL | browser (diff-aware) | Verify branch changes locally | | Native app name (Slack, Notes, Figma) | computer | Not a web app | | Electron app | computer | Desktop app, even if web-based | | CLI command, REPL, or interactive terminal | cli | Needs tmux send-keys + capture-pane | | --browser flag | browser | User override | | --computer flag | computer | User override | | --cli flag | cli | User override | | Ambiguous | AskUserQuestion | Let user decide |

0.3 Setup Mode

Browser mode: Read references/browser-mode.md for chromux setup and interaction patterns.

Computer mode: Read references/computer-mode.md for MCP computer-use setup and interaction patterns.

CLI mode: Read references/cli-mode.md for tmux setup and interaction patterns.

0.4 Clean Working Tree (if fixing code)

If NOT --report-only and source code exists:

git status --porcelain

If dirty, use AskUserQuestion: commit / stash / abort.

0.5 Create Output Directories

mkdir -p .qa-reports/screenshots

Phase 1: Test Plan

Before touching the app, create a structured test plan. This ensures systematic coverage instead of random clicking.

1.1 Gather Context

If diff-aware (feature branch, no URL):

git diff main...HEAD --name-only
git log main..HEAD --oneline

Identify affected pages/routes from changed files.

If URL or app provided:

Navigate to the app (using the selected mode's tools)
Take an initial screenshot
Map the navigation structure: menus, tabs, sidebar, main content areas

1.2 Generate Test Plan

Create a test plan covering:

## Test Plan

### Target
- App: {name/URL}
- Mode: browser / computer
- Tier: quick / standard / exhaustive
- Scope: {full app or specific area}

### Screens to Test (priority order)
1. {Screen name} — {why: core feature / changed in diff / user-specified}
2. {Screen name} — {why}
3. ...

### Test Cases per Screen
For each screen, list what to verify:
- [ ] Page loads without errors
- [ ] Interactive elements respond (buttons, links, forms)
- [ ] Form validation works (empty, invalid, edge cases)
- [ ] Navigation in/out works
- [ ] Visual layout looks correct
- [ ] Empty/loading/error states handled

### Auth / Setup Required
- {Any login, data seeding, or preconditions}

### Out of Scope
- {What we're NOT testing and why}

1.3 Show Plan to User

Present the test plan briefly. For --quick mode, skip user approval and execute immediately. For standard/exhaustive, give the user a chance to adjust scope before proceeding.

Phase 2: Orient

Execute the first part of the test plan — get a map of the application.

Navigate to the starting point
Take initial screenshot (save as evidence)
Identify framework (Next.js, Rails, SPA, native, etc.)
Map navigation structure
Note current state (logged in? which page?)

Phase 3: Explore & Document

Visit screens systematically in test plan order. At each screen:

Navigate to the screen
Take screenshot (save as evidence)
Run the per-screen checklist from references/issue-taxonomy.md:
- Visual scan
- Interactive elements
- Forms
- Navigation
- States (empty, loading, error, overflow)
- Scroll / below-the-fold content
- Console errors (browser mode) or visual errors (computer mode)
Document issues immediately — don't batch them

Evidence collection:

Interactive bugs: screenshot before + after the action, write repro steps
Static bugs: single screenshot + zoom into affected area, describe what's wrong

Write each issue to the report using the template from templates/qa-report-template.md.

Quick mode: Only test the main screen + top 3-5 navigation targets. Skip the per-screen checklist.

Phase 4: Health Score

Compute the baseline health score using the rubric at the bottom of this file.

Phase 5: Triage

Sort issues by severity, decide which to fix based on tier:

Quick: Critical + high only. Mark medium/low as "deferred."
Standard: Critical + high + medium. Mark low as "deferred."
Exhaustive: Fix all, including cosmetic/low.

If --report-only or no source code: Skip Phase 6, go to Phase 7.

Phase 6: Fix Loop

For each fixable issue, in severity order:

6a. Locate Source

Use Grep/Glob to find the responsible source file(s).

6b. Fix

Make the minimal fix. Do NOT refactor surrounding code.

6c. Commit

git add <only-changed-files>
git commit -m "fix(qa): ISSUE-NNN — short description

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>"

One commit per fix. Never bundle.

6d. Re-test

Navigate back to affected screen, take before/after screenshots.

6e. Classify

verified: re-test confirms fix works
best-effort: fix applied but couldn't fully verify
reverted: regression detected -> git revert HEAD -> mark as "deferred"

6f. Self-Regulation

Every 5 fixes (or after any revert), compute WTF-likelihood:

Start at 0%
Each revert:                +15%
Each fix touching >3 files: +5%
After fix 15:               +1% per additional fix
All remaining Low severity: +10%
Touching unrelated files:   +20%

If WTF > 20%: STOP. Show progress. Ask user whether to continue. Hard cap: 50 fixes.

Phase 7: Final QA

Re-test all affected screens
Compute final health score
If final score is WORSE than baseline: WARN prominently

Phase 8: Report

Write report to .qa-reports/qa-report-{target}-{YYYY-MM-DD}.md using the template.

Include:

Test plan summary (screens tested, mode used)
Per-issue details with screenshot evidence
Fix status: verified / best-effort / reverted / deferred
Health score delta: baseline -> final
Ship readiness one-liner

Health Score Rubric

Each category 0-100, then weighted average.

| Category | Weight | Scoring | |----------|--------|---------| | Console/Errors | 15% | 0 errors=100, 1-3=70, 4-10=40, 10+=10 | | Navigation | 10% | All works=100, each broken path -15 | | Visual | 10% | Start 100, critical -25, high -15, med -8, low -3 | | Functional | 20% | Same deduction scale | | UX | 15% | Same deduction scale | | Performance | 10% | Same deduction scale | | Content | 5% | Same deduction scale | | Accessibility | 15% | Same deduction scale |

score = sum(category_score * weight)

Important Rules

Plan first, test second. Always create a test plan before interacting with the app.
Repro is everything. Every issue needs at least one screenshot.
Verify before documenting. Retry once to confirm it's reproducible.
Never include credentials. Write [REDACTED] for passwords.
Write incrementally. Append each issue as you find it.
Test like a user. Use realistic data. Complete workflows end-to-end.
Depth over breadth. 5-10 well-documented issues > 20 vague descriptions.
One commit per fix. Never bundle multiple fixes.
Revert on regression. git revert HEAD immediately if a fix makes things worse.
Self-regulate. Follow the WTF-likelihood heuristic.
Mode-specific rules are in references/. Read the relevant mode file for interaction patterns.

team-attention/qa

skills/qa/SKILL.md

Systematically QA test any application — web apps, native macOS apps, Electron apps, CLI tools, interactive REPLs, or anything on screen. Three modes: browser (chromux/CDP, fast, DOM-level), computer (MCP computer-use, screenshot + pixel clicks, any app), and cli (tmux, send-keys + capture-pane for interactive terminals). Auto-selects mode or accepts --browser / --computer / --cli override. Use when asked to "qa", "QA", "test this site", "test this app", "find bugs", "test and fix", "fix what's broken", "dogfood", "exploratory test", "bug hunt", "QA this app", "사이트 테스트", "앱 테스트", "브라우저 QA", "화면 보고 테스트해줘", "네이티브 앱 테스트", "screen test". Three tiers: Quick (critical/high only), Standard (+ medium), Exhaustive (+ cosmetic). Produces before/after health scores, fix evidence, and a ship-readiness summary.

159 stars

tools

Updated May 16, 2026

$ install --global

skillsauth

npx skillsauth add team-attention/hoyeon qa

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: May 16, 2026, 4:39 AM174.1s8 files scanned

SKILL.md

name:: qa
description:: |
interactive REPLs, or anything on screen. Three modes:: browser (chromux/CDP, fast, DOM-level),
Three tiers:: Quick (critical/high only), Standard (+ medium), Exhaustive (+ cosmetic).
validate_prompt:: |

/qa: Plan -> Test -> Fix -> Verify

Phase 0: Analyze Target & Select Mode

0.1 Parse User Request

0.2 Auto-Select Mode

0.3 Setup Mode

Browser mode: Read references/browser-mode.md for chromux setup and interaction patterns.

Computer mode: Read references/computer-mode.md for MCP computer-use setup and interaction patterns.

CLI mode: Read references/cli-mode.md for tmux setup and interaction patterns.

0.4 Clean Working Tree (if fixing code)

If NOT --report-only and source code exists:

git status --porcelain

If dirty, use AskUserQuestion: commit / stash / abort.

0.5 Create Output Directories

mkdir -p .qa-reports/screenshots

Phase 1: Test Plan

Before touching the app, create a structured test plan. This ensures systematic coverage instead of random clicking.

1.1 Gather Context

If diff-aware (feature branch, no URL):

git diff main...HEAD --name-only
git log main..HEAD --oneline

Identify affected pages/routes from changed files.

If URL or app provided:

Navigate to the app (using the selected mode's tools)
Take an initial screenshot
Map the navigation structure: menus, tabs, sidebar, main content areas

1.2 Generate Test Plan

Create a test plan covering:

## Test Plan

### Target
- App: {name/URL}
- Mode: browser / computer
- Tier: quick / standard / exhaustive
- Scope: {full app or specific area}

### Screens to Test (priority order)
1. {Screen name} — {why: core feature / changed in diff / user-specified}
2. {Screen name} — {why}
3. ...

### Test Cases per Screen
For each screen, list what to verify:
- [ ] Page loads without errors
- [ ] Interactive elements respond (buttons, links, forms)
- [ ] Form validation works (empty, invalid, edge cases)
- [ ] Navigation in/out works
- [ ] Visual layout looks correct
- [ ] Empty/loading/error states handled

### Auth / Setup Required
- {Any login, data seeding, or preconditions}

### Out of Scope
- {What we're NOT testing and why}

1.3 Show Plan to User

Present the test plan briefly. For --quick mode, skip user approval and execute immediately. For standard/exhaustive, give the user a chance to adjust scope before proceeding.

Phase 2: Orient

Execute the first part of the test plan — get a map of the application.

Navigate to the starting point
Take initial screenshot (save as evidence)
Identify framework (Next.js, Rails, SPA, native, etc.)
Map navigation structure
Note current state (logged in? which page?)

Phase 3: Explore & Document

Visit screens systematically in test plan order. At each screen:

Navigate to the screen
Take screenshot (save as evidence)
Run the per-screen checklist from references/issue-taxonomy.md:
- Visual scan
- Interactive elements
- Forms
- Navigation
- States (empty, loading, error, overflow)
- Scroll / below-the-fold content
- Console errors (browser mode) or visual errors (computer mode)
Document issues immediately — don't batch them

Evidence collection:

Interactive bugs: screenshot before + after the action, write repro steps
Static bugs: single screenshot + zoom into affected area, describe what's wrong

Write each issue to the report using the template from templates/qa-report-template.md.

Quick mode: Only test the main screen + top 3-5 navigation targets. Skip the per-screen checklist.

Phase 4: Health Score

Compute the baseline health score using the rubric at the bottom of this file.

Phase 5: Triage

Sort issues by severity, decide which to fix based on tier:

Quick: Critical + high only. Mark medium/low as "deferred."
Standard: Critical + high + medium. Mark low as "deferred."
Exhaustive: Fix all, including cosmetic/low.

If --report-only or no source code: Skip Phase 6, go to Phase 7.

Phase 6: Fix Loop

For each fixable issue, in severity order:

6a. Locate Source

Use Grep/Glob to find the responsible source file(s).

6b. Fix

Make the minimal fix. Do NOT refactor surrounding code.

6c. Commit

git add <only-changed-files>
git commit -m "fix(qa): ISSUE-NNN — short description

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>"

One commit per fix. Never bundle.

6d. Re-test

Navigate back to affected screen, take before/after screenshots.

6e. Classify

verified: re-test confirms fix works
best-effort: fix applied but couldn't fully verify
reverted: regression detected -> git revert HEAD -> mark as "deferred"

6f. Self-Regulation

Every 5 fixes (or after any revert), compute WTF-likelihood:

Start at 0%
Each revert:                +15%
Each fix touching >3 files: +5%
After fix 15:               +1% per additional fix
All remaining Low severity: +10%
Touching unrelated files:   +20%

If WTF > 20%: STOP. Show progress. Ask user whether to continue. Hard cap: 50 fixes.

Phase 7: Final QA

Re-test all affected screens
Compute final health score
If final score is WORSE than baseline: WARN prominently

Phase 8: Report

Write report to .qa-reports/qa-report-{target}-{YYYY-MM-DD}.md using the template.

Include:

Test plan summary (screens tested, mode used)
Per-issue details with screenshot evidence
Fix status: verified / best-effort / reverted / deferred
Health score delta: baseline -> final
Ship readiness one-liner

Health Score Rubric

Each category 0-100, then weighted average.

score = sum(category_score * weight)

Important Rules

Plan first, test second. Always create a test plan before interacting with the app.
Repro is everything. Every issue needs at least one screenshot.
Verify before documenting. Retry once to confirm it's reproducible.
Never include credentials. Write [REDACTED] for passwords.
Write incrementally. Append each issue as you find it.
Test like a user. Use realistic data. Complete workflows end-to-end.
Depth over breadth. 5-10 well-documented issues > 20 vague descriptions.
One commit per fix. Never bundle multiple fixes.
Revert on regression. git revert HEAD immediately if a fix makes things worse.
Self-regulate. Follow the WTF-likelihood heuristic.
Mode-specific rules are in references/. Read the relevant mode file for interaction patterns.

Related Skills

team-attention/verify

development

VerifiedTrustedCommunity

Run a full implementation verification pass after code or data changes. Use when the user asks to verify, QA, smoke test, run checks, validate a feature, inspect a local app in the browser, capture screenshots, or turn discovered QA issues into regression tests/checklists with user approval.

161SKILL.mdUpdated May 22, 2026

team-attention/verify

team-attention/hoyeon-execute

development

VerifiedTrustedCommunity

Hoyeon execution workflow for Codex. Use when the user invokes "$hoyeon-execute" or wants to execute a Hoyeon plan.json through the Bash-first Codex adapter. This adapter loads the canonical execute skill and follows its Codex runtime surface.

161SKILL.mdUpdated May 16, 2026

team-attention/hoyeon-execute

team-attention/execute

development

VerifiedTrustedCommunity

Plan-driven orchestrator. Reads plan.json (from /blueprint) or requirements.md, then dispatches workers to build the system. Use when: "/execute", "execute", "plan 실행", "blueprint 실행"

161SKILL.mdUpdated Apr 15, 2026

team-attention/execute

team-attention/clarify

testing

VerifiedTrustedCommunity

"/clarify", "clarify this", "keep asking until clear", "remove ambiguity", "clarify requirements", "clarify design", "clarify the plan", "질문 계속해", "모호한 게 없게", "명확해질 때까지", "계속 물어봐", "Q&A로 정리", "질문답변 기록", "요구사항 명확화", "설계 명확화". Relentless ambiguity-resolution interview that records Q&A under .hoyeon/clarify/<topic>/ and hands off to specify/blueprint/docs when clear.

159SKILL.mdUpdated May 16, 2026

team-attention/clarify

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/team-attention/hoyeon.git

# Copy into Claude Code skills folder (global)
cp -r hoyeon/skills/qa ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

team-attention/hoyeon

159 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT