petekp/manual-testing

Name: manual-testing
Author: petekp

skills/manual-testing/SKILL.md

npx skillsauth add petekp/claude-skills manual-testing

Manual Testing

Finish work with a tight verification loop: prove everything possible with tools first, then ask the user to verify only what requires human eyes, hands, devices, or judgment.

Do not make the user invent the test plan. Lead them through it.

Quality Bar

Before asking the user to do anything, know:

What changed
What the expected behavior is
What can be verified automatically
What still needs a human check
What nearby behavior could have regressed

Never ask the user to "poke around" or "let me know if it works." Give concrete actions, a specific screen or command, the expected result, and a short set of likely outcomes to reply with.

Workflow

1. Build a Verification Matrix

Translate the change into a short verification plan before running anything.

For each changed behavior, capture:

Primary success path
Most likely failure or edge case
One nearby regression check
Verification owner: tool or user

Use a simple internal checklist like:

| Behavior | Happy path | Edge/regression | Verified by |
|----------|------------|-----------------|-------------|
| Save settings | Form saves | Validation error still works | Tool + user |

Keep the matrix small and focused on the current change.
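
As a rough sketch, the matrix can be held as a small data structure so the user-owned rows are easy to pull out later. The `MatrixRow` class, both helper functions, and the second example row are illustrative assumptions, not part of the skill itself.

```python
from dataclasses import dataclass


@dataclass
class MatrixRow:
    behavior: str
    happy_path: str
    edge_or_regression: str
    verified_by: str  # "Tool", "User", or "Tool + user"


def render_matrix(rows: list[MatrixRow]) -> str:
    """Render the rows as a Markdown table with the checklist's four columns."""
    lines = [
        "| Behavior | Happy path | Edge/regression | Verified by |",
        "|----------|------------|-----------------|-------------|",
    ]
    for row in rows:
        lines.append(
            f"| {row.behavior} | {row.happy_path} | "
            f"{row.edge_or_regression} | {row.verified_by} |"
        )
    return "\n".join(lines)


def needs_user(rows: list[MatrixRow]) -> list[MatrixRow]:
    """Return only the rows that still require a human check."""
    return [row for row in rows if "user" in row.verified_by.lower()]


rows = [
    MatrixRow("Save settings", "Form saves", "Validation error still works", "Tool + user"),
    # Hypothetical second row, added only to show a tool-only check.
    MatrixRow("Settings API", "Returns 200", "Rejects invalid payload", "Tool"),
]
print(render_matrix(rows))
print([row.behavior for row in needs_user(rows)])
```

Filtering on the owner column keeps the later manual steps limited to rows a tool cannot prove.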

2. Run Tool-Verifiable Checks First

Exhaust automated verification before involving the user.

Prefer to verify these yourself:

Build, compile, typecheck, lint, test
API responses and status codes
File output, database state, logs, and side effects
CLI behavior, exit codes, and generated artifacts
Browser automation, screenshots, DOM text, or network behavior when tools can prove it

Only hand work to the user when the result depends on:

Visual correctness
Motion, timing, or feel
Real-device behavior
Cross-browser differences
Screen reader behavior
Third-party flows that require human interaction

If an automated check fails, stop and address it before asking for manual verification.
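
One way to enforce the stop-on-failure rule is a small runner that executes checks in order and halts at the first non-zero exit code. This is a minimal sketch: `run_checks` is a hypothetical helper, and the three commands are placeholders standing in for a project's real build, lint, and test commands.

```python
import subprocess
import sys


def run_checks(checks: list[tuple[str, list[str]]]) -> list[tuple[str, bool]]:
    """Run each check in order and stop at the first failure."""
    results = []
    for name, command in checks:
        completed = subprocess.run(command, capture_output=True, text=True)
        passed = completed.returncode == 0
        results.append((name, passed))
        if not passed:
            break  # Address the failure before any manual verification.
    return results


# Placeholder commands: swap in the project's real build, lint, and test steps.
checks = [
    ("passing check", [sys.executable, "-c", "print('ok')"]),
    ("failing check", [sys.executable, "-c", "raise SystemExit(1)"]),
    ("never reached", [sys.executable, "-c", "print('skipped')"]),
]
results = run_checks(checks)

# Manual verification starts only when every check ran and passed.
ready_for_user = len(results) == len(checks) and all(passed for _, passed in results)
print(results, ready_for_user)
```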

3. Prepare the User Path

Set the user up so they can perform the check with minimal effort.

Provide:

Exact route, URL, screen, or command
Any required setup state
The single action to take
The expected result
What to reply with

If the user needs an already-running app, point them to the exact place to open. If you can safely prepare state, data, or fixtures first, do that yourself.

4. Lead the User Through Atomic Steps

Run manual verification as a guided sequence, not a dump of vague instructions.

Prefer one atomic step at a time. For a tiny smoke test, bundle at most 2-3 closely related checks.

Use this structure:

Testing: [feature or fix]
Progress: Step N of M

Action: [exact thing to click, type, or inspect]
Expected: [what should happen]
Reply with one:
1. [expected outcome]
2. [common failure mode]
3. [second common failure mode]
4. Other

If structured question tools are available, convert those reply options into a structured prompt. Otherwise ask the question in plain text with the options inline.

Example:

Testing: profile photo upload
Progress: Step 2 of 3

Action: Open `/settings/profile`, upload a PNG under 2 MB, and wait for the save state to finish.
Expected: The new avatar appears in the header and no error message is shown.
Reply with one:
1. Upload worked and the new avatar is visible
2. Upload finished but the avatar did not update
3. I saw an error message or spinner got stuck
4. Other
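
The template can also be generated rather than typed by hand. The sketch below assumes a hypothetical `format_step` helper; it reproduces the structure above and always appends "Other" as the final reply option.

```python
def format_step(
    feature: str,
    step: int,
    total: int,
    action: str,
    expected: str,
    outcomes: list[str],
) -> str:
    """Build one guided manual-test step in the Testing/Progress/Action layout."""
    lines = [
        f"Testing: {feature}",
        f"Progress: Step {step} of {total}",
        "",
        f"Action: {action}",
        f"Expected: {expected}",
        "Reply with one:",
    ]
    # "Other" is always offered last so unexpected results can be reported.
    for number, outcome in enumerate([*outcomes, "Other"], start=1):
        lines.append(f"{number}. {outcome}")
    return "\n".join(lines)


prompt = format_step(
    feature="profile photo upload",
    step=2,
    total=3,
    action="Open `/settings/profile`, upload a PNG under 2 MB, and wait for the save state to finish.",
    expected="The new avatar appears in the header and no error message is shown.",
    outcomes=[
        "Upload worked and the new avatar is visible",
        "Upload finished but the avatar did not update",
        "I saw an error message or spinner got stuck",
    ],
)
print(prompt)
```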

5. Cover the Right Surface Area

Always test the changed path first, then cover the most likely place it could fail.

Use these prompts as a calibration checklist.

For UI changes:

Check initial render
Check loading, empty, error, disabled, and success states when relevant
Check keyboard/focus path for interactive controls
Check mobile or narrow-width layout if the change is layout-sensitive
Check copy, spacing, and obvious visual regressions

For bug fixes:

Reproduce the original bug path
Verify the bug no longer occurs
Verify a nearby path still behaves correctly

For API or backend changes:

Verify happy-path response
Verify invalid-input or failure-path behavior
Verify the persisted side effect or downstream state change
Verify logs or errors do not show new breakage

For CLI or local-tool changes:

Verify the success path
Verify a common failure path and exit code
Verify output files, stdout/stderr, and help text when relevant
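
For the CLI checklist, the success path, a failure path, and the exit code can usually all be proven by a tool. The following is a sketch only: `FAKE_CLI` is an invented stand-in script, used so the example runs anywhere, and `run_cli` is a hypothetical wrapper around the real command under test.

```python
import subprocess
import sys

# Stand-in for the CLI under test (an assumption for this sketch): it greets
# the given name, or exits with code 2 and an error on stderr when none is given.
FAKE_CLI = (
    "import sys\n"
    "if len(sys.argv) < 2:\n"
    "    print('error: name required', file=sys.stderr)\n"
    "    sys.exit(2)\n"
    "print('hello ' + sys.argv[1])\n"
)


def run_cli(*args: str) -> subprocess.CompletedProcess:
    """Run the stand-in CLI and capture its exit code, stdout, and stderr."""
    return subprocess.run(
        [sys.executable, "-c", FAKE_CLI, *args],
        capture_output=True,
        text=True,
    )


success = run_cli("world")  # success path
failure = run_cli()         # common failure path: missing argument

print(success.returncode, success.stdout.strip())
print(failure.returncode, failure.stderr.strip())
```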

6. Handle Failures Like a Debugger

When the user reports a problem:

Capture the exact step that failed
Record expected versus actual behavior
Note any visible error text, logs, or screenshots available
Decide whether to stop and investigate immediately, or to finish the remaining checks if doing so still adds value

Before changing code, generate 2-3 plausible hypotheses for the failure so the next debugging step is deliberate instead of guess-driven.

If the failure blocks confidence in the change, stop the manual test and switch into diagnosis.
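
A failure report can be captured in a small record so that no debugging starts before the hypotheses exist. This is an illustrative sketch: `FailureReport`, its fields, and the two hypotheses in the example are assumptions made up for the avatar scenario, not findings from a real run.

```python
from dataclasses import dataclass, field


@dataclass
class FailureReport:
    step: str                 # exact step that failed
    expected: str
    actual: str
    evidence: list[str] = field(default_factory=list)    # error text, logs, screenshots
    hypotheses: list[str] = field(default_factory=list)
    blocks_confidence: bool = False

    def ready_to_debug(self) -> bool:
        """Require at least two plausible hypotheses before changing code."""
        return len(self.hypotheses) >= 2

    def next_step(self) -> str:
        """Stop and diagnose when the failure blocks confidence in the change."""
        if self.blocks_confidence:
            return "stop manual test and diagnose"
        return "finish remaining checks"


report = FailureReport(
    step="Step 2 of 3: upload a PNG on /settings/profile",
    expected="The new avatar appears in the header",
    actual="Upload finished but the avatar did not update",
    # Hypothetical causes, listed only to illustrate the shape of the record.
    hypotheses=[
        "Header still renders a cached avatar URL",
        "Upload succeeded but the profile record was not updated",
    ],
    blocks_confidence=True,
)
print(report.ready_to_debug(), report.next_step())
```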

7. Summarize With Confidence and Gaps

Close with a short verification summary that separates what is proven from what is still assumed.

Include:

Automated checks run and their results
Manual steps completed and their results
Bugs or regressions found
Remaining unverified areas
Recommended next action
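
The closing summary can be rendered from the collected results so that an empty section is stated explicitly instead of silently dropped. `render_summary` is a hypothetical helper, and the example values are invented for the avatar scenario.

```python
def render_summary(
    automated: list[str],
    manual: list[str],
    bugs: list[str],
    unverified: list[str],
    next_action: str,
) -> str:
    """Render the five summary parts, separating proven results from gaps."""
    sections = [
        ("Automated checks", automated),
        ("Manual steps", manual),
        ("Bugs or regressions found", bugs),
        ("Remaining unverified areas", unverified),
    ]
    lines = []
    for title, items in sections:
        lines.append(f"{title}:")
        # An explicit "None" keeps an empty section from reading as an omission.
        shown = items if items else ["None"]
        lines.extend(f"- {item}" for item in shown)
    lines.append(f"Recommended next action: {next_action}")
    return "\n".join(lines)


# Invented example values for illustration.
summary = render_summary(
    automated=["Unit tests: passed", "Typecheck: passed"],
    manual=["Step 1 of 3: passed", "Step 2 of 3: avatar did not update"],
    bugs=["Header avatar does not refresh after upload"],
    unverified=[],
    next_action="Diagnose the avatar refresh before retesting",
)
print(summary)
```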

Guidelines

Minimize user effort; maximize agent effort
Keep the user in one context at a time
Prefer concrete reply options over open-ended questions
Check one happy path, one failure path, and one nearby regression when practical
Use tools aggressively before asking for human verification
Keep the running narrative clear so the user remembers what is being tested and what has already passed

Guide users through targeted manual verification after code changes. Use when asked to "test this", "verify it works", "QA this", "walk me through testing", "smoke test", "sanity check", "regression test", "acceptance test", or after implementing a feature or bug fix that still needs human validation. Favor this skill for focused verification of the current change; use a broader exploratory-testing skill for open-ended bug hunting across an entire app.

35 stars

development

Updated Apr 29, 2026

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

npx skillsauth add petekp/claude-skills manual-testing

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

| Status | Scanner | Description | Percentage |
|--------|---------|-------------|------------|
| Clean | Trivy | Container and dependency vulnerability scanner | 95% |
| Clean | Semgrep | Static code analysis for vulnerabilities | 95% |
| Clean | mcp-scan (Snyk) | Model Context Protocol security validation | 95% |
| Skipped | Snyk (dep) | Open source security scanning | 50% |
| Skipped | Socket.dev | Supply chain security analysis | 50% |
| Skipped | VirusTotal | Multi-engine malware detection | 50% |
| Skipped | CrowdStrike | Advanced threat intelligence | 50% |
| Skipped | OSV-Scanner | Open Source Vulnerability database check | 50% |
| Skipped | OWASP Dep-Check | | 50% |

Last scanned: Apr 29, 2026, 9:44 AM (44.4s, 1 file scanned)

Install manually

# Clone the repo
git clone https://github.com/petekp/claude-skills.git

# Copy into Claude Code skills folder (global)
cp -r claude-skills/skills/manual-testing ~/.claude/skills/

Repository

petekp/claude-skills

35 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT