Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

szymonkaliski/tdd

Name: tdd
Author: szymonkaliski

dotfiles/agents/skills/tdd/SKILL.md

npx skillsauth add szymonkaliski/home-configuration tdd

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Test-Driven Development

Choosing a Mode

First, determine which mode fits the project:

Test suite mode: The project has an existing test runner (vitest, pytest, cargo test, go test, etc.). Write tests using the project's framework.
Ad-hoc mode: No test suite, or the bug is easier to reproduce outside one. Write a standalone repro script in ./tmp/repro/. This directory is gitignored scratch space — create it if it doesn't exist.

Both modes follow the same red-green discipline. The only difference is where the "test" lives.

Bug Fix Workflow

This is the primary workflow. Do NOT attempt a fix before you have a failing repro.

1. Isolate

Understand the bug. Read the relevant code. Identify the minimal conditions that trigger it.
Narrow down: which input, which code path, which state.

2. RED — Reproduce

Test suite mode: Write a failing test case in the project's test framework that captures the broken behavior. Run the suite — confirm it fails for the right reason.

Ad-hoc mode: Write a self-contained repro script in ./tmp/repro/:

./tmp/repro/
├── repro.sh          # or repro.js, repro.py, etc.
├── input.txt         # test fixtures if needed
└── expected-output   # what correct behavior looks like

The repro script should:

Exit 0 on correct behavior, exit 1 on bug
Print a clear message: what was expected vs what happened
Be runnable in one command (e.g. bash ./tmp/repro/repro.sh)
Reference project files by relative path from project root

Run it. Confirm it fails (exit 1). This is your proof the bug exists.

3. GREEN — Fix

Fix the bug with minimal changes.
Run the repro again. Confirm it passes (exit 0).

4. Verify

Test suite mode: Run the full test suite to check for regressions.
Ad-hoc mode: Run the repro script, plus manually verify the fix makes sense in context. If there are any existing tests in the project, run those too.

Feature Workflow

1. Planning

Before writing any code:

[ ] Confirm with user what interface changes are needed
[ ] Confirm with user which behaviors to test (prioritize)
[ ] Design interfaces for testability
[ ] List the behaviors to test (not implementation steps)
[ ] Get user approval on the plan

You can't test everything. Focus on critical paths and complex logic, not every possible edge case.

2. Vertical Slices

One test → one implementation → repeat.

WRONG (horizontal):
  RED:   test1, test2, test3, test4, test5
  GREEN: impl1, impl2, impl3, impl4, impl5

RIGHT (vertical):
  RED→GREEN: test1→impl1
  RED→GREEN: test2→impl2
  RED→GREEN: test3→impl3
  ...

DO NOT write all tests first. Tests written in bulk test imagined behavior, not actual behavior. Each test should respond to what you learned from the previous cycle.

Rules:

One test at a time
Only enough code to pass current test
Don't anticipate future tests
Keep tests focused on observable behavior

3. Refactor

After all tests pass, look for refactor candidates. Never refactor while RED. Get to GREEN first.

Test Quality

See tests.md for examples and mocking.md for mocking guidelines.

Good tests verify behavior through public interfaces. They describe what the system does, survive internal refactors, and read like specifications.

Bad tests are coupled to implementation: mocking internal collaborators, testing private methods, asserting on call counts. Warning sign: test breaks when you refactor but behavior hasn't changed.

Mock only at system boundaries (external APIs, databases, time/randomness). Don't mock your own code.

Checklist Per Cycle

[ ] Test/repro describes behavior, not implementation
[ ] Test/repro uses public interface only
[ ] Test/repro would survive internal refactor
[ ] Code is minimal for this test
[ ] No speculative features added

szymonkaliski/tdd

dotfiles/agents/skills/tdd/SKILL.md

Test-driven bug fixing and feature development. Use when fixing bugs or building features — works with test suites or ad-hoc repro scripts. Enforces red-green-refactor vertical slices.

8 stars

development

Updated Apr 15, 2026

$ install --global

skillsauth

npx skillsauth add szymonkaliski/home-configuration tdd

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 15, 2026, 2:24 AM4.2s5 files scanned

SKILL.md

name:: tdd
description:: Test-driven bug fixing and feature development. Use when fixing bugs or building features — works with test suites or ad-hoc repro scripts. Enforces red-green-refactor vertical slices.

Test-Driven Development

Choosing a Mode

First, determine which mode fits the project:

Test suite mode: The project has an existing test runner (vitest, pytest, cargo test, go test, etc.). Write tests using the project's framework.
Ad-hoc mode: No test suite, or the bug is easier to reproduce outside one. Write a standalone repro script in ./tmp/repro/. This directory is gitignored scratch space — create it if it doesn't exist.

Both modes follow the same red-green discipline. The only difference is where the "test" lives.

Bug Fix Workflow

This is the primary workflow. Do NOT attempt a fix before you have a failing repro.

1. Isolate

Understand the bug. Read the relevant code. Identify the minimal conditions that trigger it.
Narrow down: which input, which code path, which state.

2. RED — Reproduce

Test suite mode: Write a failing test case in the project's test framework that captures the broken behavior. Run the suite — confirm it fails for the right reason.

Ad-hoc mode: Write a self-contained repro script in ./tmp/repro/:

./tmp/repro/
├── repro.sh          # or repro.js, repro.py, etc.
├── input.txt         # test fixtures if needed
└── expected-output   # what correct behavior looks like

The repro script should:

Exit 0 on correct behavior, exit 1 on bug
Print a clear message: what was expected vs what happened
Be runnable in one command (e.g. bash ./tmp/repro/repro.sh)
Reference project files by relative path from project root

Run it. Confirm it fails (exit 1). This is your proof the bug exists.

3. GREEN — Fix

Fix the bug with minimal changes.
Run the repro again. Confirm it passes (exit 0).

4. Verify

Test suite mode: Run the full test suite to check for regressions.
Ad-hoc mode: Run the repro script, plus manually verify the fix makes sense in context. If there are any existing tests in the project, run those too.

Feature Workflow

1. Planning

Before writing any code:

[ ] Confirm with user what interface changes are needed
[ ] Confirm with user which behaviors to test (prioritize)
[ ] Design interfaces for testability
[ ] List the behaviors to test (not implementation steps)
[ ] Get user approval on the plan

You can't test everything. Focus on critical paths and complex logic, not every possible edge case.

2. Vertical Slices

One test → one implementation → repeat.

WRONG (horizontal):
  RED:   test1, test2, test3, test4, test5
  GREEN: impl1, impl2, impl3, impl4, impl5

RIGHT (vertical):
  RED→GREEN: test1→impl1
  RED→GREEN: test2→impl2
  RED→GREEN: test3→impl3
  ...

DO NOT write all tests first. Tests written in bulk test imagined behavior, not actual behavior. Each test should respond to what you learned from the previous cycle.

Rules:

One test at a time
Only enough code to pass current test
Don't anticipate future tests
Keep tests focused on observable behavior

3. Refactor

After all tests pass, look for refactor candidates. Never refactor while RED. Get to GREEN first.

Test Quality

See tests.md for examples and mocking.md for mocking guidelines.

Good tests verify behavior through public interfaces. They describe what the system does, survive internal refactors, and read like specifications.

Mock only at system boundaries (external APIs, databases, time/randomness). Don't mock your own code.

Checklist Per Cycle

[ ] Test/repro describes behavior, not implementation
[ ] Test/repro uses public interface only
[ ] Test/repro would survive internal refactor
[ ] Code is minimal for this test
[ ] No speculative features added

Related Skills

szymonkaliski/plan-review

tools

VerifiedTrustedCommunity

Open the current plan file in nvim for user review

8SKILL.mdUpdated Apr 15, 2026

szymonkaliski/plan-review

szymonkaliski/git-review

testing

VerifiedTrustedCommunity

Review uncommitted changes for issues, missed items, and improvements. Use when reviewing before commit or when user asks to check their work.

8SKILL.mdUpdated Apr 15, 2026

szymonkaliski/git-review

szymonkaliski/git-commit

tools

VerifiedTrustedCommunity

Format, lint, test, and commit changes. Detects repo tooling automatically.

8SKILL.mdUpdated Apr 15, 2026

szymonkaliski/git-commit

szymonkaliski/git-catchup

tools

VerifiedTrustedCommunity

Understand current work context from recent changes. Use at session start or when resuming work.

8SKILL.mdUpdated Apr 15, 2026

szymonkaliski/git-catchup

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/szymonkaliski/home-configuration.git

# Copy into Claude Code skills folder (global)
cp -r home-configuration/dotfiles/agents/skills/tdd ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

szymonkaliski/home-configuration

8 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT