Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

petekp/tdd

Name: tdd
Author: petekp

skills/tdd/SKILL.md

npx skillsauth add petekp/claude-code-setup tdd

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Test-Driven Development

Treat executable evidence as the source of truth. Use this workflow to prevent five common agent failures:

guessing instead of proving
writing many tests before learning anything
testing shapes or internals instead of behavior
over-mocking code under your control
letting the feedback loop get so slow that TDD stops working

Start the Session

Before editing code:

Read repo-local instructions such as AGENTS.md, CLAUDE.md, and package or test scripts.
Inspect nearby production code and nearby tests to infer naming, seams, fixtures, and the fastest targeted command.
Identify two commands up front:
- the fastest command that runs the single target test
- the broader command that validates the changed boundary before finishing
Choose the smallest public or near-public seam that can prove the behavior. See seams.md.
Ask the user only when the public contract, intended behavior, or required coverage scope is materially ambiguous.

Follow repo-local instructions if they are stricter than this skill.

Work Vertically

Do not batch all tests first and all implementation later.

Wrong:
  RED:   test1, test2, test3
  GREEN: impl1, impl2, impl3

Right:
  RED -> GREEN -> REFACTOR: test1 -> impl1
  RED -> GREEN -> REFACTOR: test2 -> impl2
  RED -> GREEN -> REFACTOR: test3 -> impl3

Write one failing test. Make it pass with the smallest sensible change. Refactor only on green. Repeat.

Choose the Entry Path

New feature

Start with the smallest user-visible or caller-visible behavior worth shipping.
Write one tracer-bullet test at the chosen seam.
Make it fail for the right reason.
Implement the thinnest slice that turns the test green.
Add the next behavior only after the current one is proven.

Bug fix or regression

List 2-3 plausible hypotheses before changing code.
Reproduce the bug with the smallest failing regression test that would have caught it.
Narrow the reproduction with logs, assertions, or a lower seam if the first repro is noisy.
Fix the code under that failing test.
Add one neighboring test only if it proves the fix is specific rather than accidental.

See bugfixes.md for the detailed mini-loop.

Legacy code or refactor

Freeze current behavior with a characterization test before reshaping internals.
Refactor behind that safety rail until the design exposes a better seam.
Replace overly broad characterization coverage with tighter behavioral tests when the seam improves.
Use the green state to deepen modules and simplify interfaces.

See seams.md, deep-modules.md, and refactoring.md.

Run the Core Loop

For each cycle:

RED: Write or tighten one test that proves one behavior. Confirm it fails.
GREEN: Write the minimum production change that makes only that behavior pass.
REFACTOR: Clean up duplication, naming, and structure while staying green.
Re-run the smallest relevant command first, then widen verification as confidence grows.

If the test cannot fail, the loop is invalid. Break it on purpose, lower the seam, or add the missing observability before trusting it.

Keep Feedback Fast

Use this ladder:

Run the single target test while iterating.
Run the surrounding file, package, or focused suite after a small cluster of green cycles.
Run broader verification before finishing: the relevant integration suite, typecheck, lint, or full tests for the touched boundary.
Call out any verification gap explicitly if time, tooling, or environment prevents broader checks.

If the loop feels slow, the seam is probably too high. Move down a level unless the behavior truly lives in the browser or across system boundaries.

Choose Assertions That Survive Refactors

Assert observable outcomes, not helper calls.
Prefer public interfaces over internal collaborators.
Mock only system boundaries you do not control. See mocking.md.
Treat tests as specifications for behavior, not snapshots of implementation shape.
Keep each test about one behavior, even if that behavior needs more than one assertion.

See tests.md for examples and rewrites.

Use Subagents Carefully

When other agents help:

Keep ownership of the failing test, the red/green loop, and final verification in the main thread.
Let workers explore implementation ideas or refactors under the existing failing test.
Reject fixes that only pass by weakening the test unless the test was proving the wrong behavior.

Avoid These Anti-Patterns

Editing production code for the target behavior before seeing a meaningful failing test
Writing the whole test plan up front
Solving multiple behaviors in one red/green cycle
Using browser or end-to-end tests for logic that could be proven faster elsewhere
Mocking modules you own just to make the test convenient
Leaving debug scaffolding, speculative branches, or unverified refactors behind

Finish with Proof

Consider the task done only when:

the changed behavior is covered by a test that failed before the change
the targeted tests pass
the relevant broader verification command has run, or the gap is documented clearly
refactors happened only while green
the final summary states what behavior is now proven and what still relies on manual verification

petekp/tdd

skills/tdd/SKILL.md

Test-driven development for features, bug fixes, regressions, and safe refactors using a failing-test-first workflow. Use when Codex needs to add or change behavior with proof, reproduce a bug in a test, write regression or characterization tests, make a refactor safer, or respond to prompts like "use TDD", "red-green-refactor", "write the test first", "add a regression test", "reproduce this in a test", "prove the fix", "cover this change with tests", or "make this safe to refactor". Prefer this skill when confidence should come from executable evidence instead of reasoning alone.

35 stars

development

Updated Apr 29, 2026

$ install --global

skillsauth

npx skillsauth add petekp/claude-code-setup tdd

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 29, 2026, 9:45 AM34.0s9 files scanned

SKILL.md

name:: tdd
description:: >

Test-Driven Development

Treat executable evidence as the source of truth. Use this workflow to prevent five common agent failures:

guessing instead of proving
writing many tests before learning anything
testing shapes or internals instead of behavior
over-mocking code under your control
letting the feedback loop get so slow that TDD stops working

Start the Session

Before editing code:

Read repo-local instructions such as AGENTS.md, CLAUDE.md, and package or test scripts.
Inspect nearby production code and nearby tests to infer naming, seams, fixtures, and the fastest targeted command.
Identify two commands up front:
- the fastest command that runs the single target test
- the broader command that validates the changed boundary before finishing
Choose the smallest public or near-public seam that can prove the behavior. See seams.md.
Ask the user only when the public contract, intended behavior, or required coverage scope is materially ambiguous.

Follow repo-local instructions if they are stricter than this skill.

Work Vertically

Do not batch all tests first and all implementation later.

Wrong:
  RED:   test1, test2, test3
  GREEN: impl1, impl2, impl3

Right:
  RED -> GREEN -> REFACTOR: test1 -> impl1
  RED -> GREEN -> REFACTOR: test2 -> impl2
  RED -> GREEN -> REFACTOR: test3 -> impl3

Write one failing test. Make it pass with the smallest sensible change. Refactor only on green. Repeat.

Choose the Entry Path

New feature

Start with the smallest user-visible or caller-visible behavior worth shipping.
Write one tracer-bullet test at the chosen seam.
Make it fail for the right reason.
Implement the thinnest slice that turns the test green.
Add the next behavior only after the current one is proven.

Bug fix or regression

List 2-3 plausible hypotheses before changing code.
Reproduce the bug with the smallest failing regression test that would have caught it.
Narrow the reproduction with logs, assertions, or a lower seam if the first repro is noisy.
Fix the code under that failing test.
Add one neighboring test only if it proves the fix is specific rather than accidental.

See bugfixes.md for the detailed mini-loop.

Legacy code or refactor

Freeze current behavior with a characterization test before reshaping internals.
Refactor behind that safety rail until the design exposes a better seam.
Replace overly broad characterization coverage with tighter behavioral tests when the seam improves.
Use the green state to deepen modules and simplify interfaces.

See seams.md, deep-modules.md, and refactoring.md.

Run the Core Loop

For each cycle:

RED: Write or tighten one test that proves one behavior. Confirm it fails.
GREEN: Write the minimum production change that makes only that behavior pass.
REFACTOR: Clean up duplication, naming, and structure while staying green.
Re-run the smallest relevant command first, then widen verification as confidence grows.

If the test cannot fail, the loop is invalid. Break it on purpose, lower the seam, or add the missing observability before trusting it.

Keep Feedback Fast

Use this ladder:

Run the single target test while iterating.
Run the surrounding file, package, or focused suite after a small cluster of green cycles.
Run broader verification before finishing: the relevant integration suite, typecheck, lint, or full tests for the touched boundary.
Call out any verification gap explicitly if time, tooling, or environment prevents broader checks.

If the loop feels slow, the seam is probably too high. Move down a level unless the behavior truly lives in the browser or across system boundaries.

Choose Assertions That Survive Refactors

Assert observable outcomes, not helper calls.
Prefer public interfaces over internal collaborators.
Mock only system boundaries you do not control. See mocking.md.
Treat tests as specifications for behavior, not snapshots of implementation shape.
Keep each test about one behavior, even if that behavior needs more than one assertion.

See tests.md for examples and rewrites.

Use Subagents Carefully

When other agents help:

Keep ownership of the failing test, the red/green loop, and final verification in the main thread.
Let workers explore implementation ideas or refactors under the existing failing test.
Reject fixes that only pass by weakening the test unless the test was proving the wrong behavior.

Avoid These Anti-Patterns

Editing production code for the target behavior before seeing a meaningful failing test
Writing the whole test plan up front
Solving multiple behaviors in one red/green cycle
Using browser or end-to-end tests for logic that could be proven faster elsewhere
Mocking modules you own just to make the test convenient
Leaving debug scaffolding, speculative branches, or unverified refactors behind

Finish with Proof

Consider the task done only when:

the changed behavior is covered by a test that failed before the change
the targeted tests pass
the relevant broader verification command has run, or the gap is documented clearly
refactors happened only while green
the final summary states what behavior is now proven and what still relies on manual verification

Related Skills

petekp/pr-self-review

development

VerifiedTrustedCommunity

Draft short, plainspoken notes in the author's voice that help reviewers understand non-obvious choices, boundaries, and preserved behavior in the author's own pull request or local diff. Use when the user asks to self-review, annotate, or add reviewer context to their PR or changes. Draft locally when no PR exists, and post approved notes as one GitHub review when a PR does exist. Do not use for reviewing someone else's PR, writing code comments, explaining code generally, or drafting a PR description. Never post without explicit approval.

40SKILL.mdUpdated Jul 21, 2026

petekp/pr-self-review

petekp/tailwind-plugin-craft

tools

VerifiedTrustedCommunity

Design and build pure-CSS (zero-JavaScript) Tailwind CSS v4 plugins of unusual depth and craft. Use when the user wants to create, architect, or refine a Tailwind utility plugin or CSS effect — e.g. "make a tailwind plugin", "build a tw-* plugin", "a CSS-only shimmer/fade/glow/grain/noise utility", "tailwind v4 @utility", "package this effect as a plugin", or wants an effect with surprising visual depth (gradients, masks, filters, SVG filter tricks, scroll-driven animation). Pairs deep CSS/SVG technique research with a bespoke tuning workbench for dialing the effect in. Inspired by tw-fade and tw-shimmer.

40SKILL.mdUpdated Jul 15, 2026

petekp/tailwind-plugin-craft

petekp/pr-screenshot-comparison

content-media

VerifiedTrustedCommunity

Create clear, polished before-and-after screenshots for a GitHub pull request. Use when a UI change needs visual proof: capture matching states, crop to the relevant UI, stitch and caption one comparison image, attach it natively to the PR, and keep the image out of the repository.

40SKILL.mdUpdated Jul 15, 2026

petekp/pr-screenshot-comparison

petekp/skills/latent-potential

testing

VerifiedTrustedCommunity

--- name: latent-potential description: First-principles, team-of-experts assessment of a software project that surfaces latent potential; underexploited assets, a sharper north star, missing high-leverage capabilities, better framing and messaging. Produces a prioritized, evidence-grounded report with cheap probes, a reframe candidate, a stop-doing list, and an honest skeptic's case. Use whenever the user wants fresh eyes on a project they have built: "what am I sitting on", "what could this be

40SKILL.mdUpdated Jul 15, 2026

petekp/skills/latent-potential

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/petekp/claude-code-setup.git

# Copy into Claude Code skills folder (global)
cp -r claude-code-setup/skills/tdd ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

petekp/claude-code-setup

35 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT

Adoption

petekp/tdd

$ install --global

Security Scan Results

SKILL.md

Test-Driven Development

Start the Session

Work Vertically

Choose the Entry Path

New feature

Bug fix or regression

Legacy code or refactor

Run the Core Loop

Keep Feedback Fast

Choose Assertions That Survive Refactors

Use Subagents Carefully

Avoid These Anti-Patterns

Finish with Proof

Read More Only As Needed

Related Skills

petekp/pr-self-review

petekp/tailwind-plugin-craft

petekp/pr-screenshot-comparison

petekp/skills/latent-potential

petekp/tdd

$ install --global

Security Scan Results

SKILL.md

Test-Driven Development

Start the Session

Work Vertically

Choose the Entry Path

New feature

Bug fix or regression

Legacy code or refactor

Run the Core Loop

Keep Feedback Fast

Choose Assertions That Survive Refactors

Use Subagents Carefully

Avoid These Anti-Patterns

Finish with Proof

Read More Only As Needed

Related Skills

petekp/pr-self-review

petekp/tailwind-plugin-craft

petekp/pr-screenshot-comparison

petekp/skills/latent-potential