Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

bostonaholic/test-driven-bug-fix

Name: test-driven-bug-fix
Author: bostonaholic

skills/test-driven-bug-fix/SKILL.md

npx skillsauth add bostonaholic/team test-driven-bug-fix

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Test-Driven Bug Fix

A bug without a failing test is an unverified assumption. A bug fixed without a failing test may be fixed correctly this time, but has no protection against regression. The test-driven bug fix discipline ensures that every fix is:

Reproduced — the bug is confirmed to exist before any code changes
Pinned — a failing test locks in the expected correct behavior
Fixed minimally — the smallest change that makes the test pass
Verified — no regression in existing behavior

Triage Before Reproducing

Before reproducing, classify the failure into one of four buckets:

| Bucket | Symptom | Action | |--------|---------|--------| | Product | Real defect in the code under test | Continue with the four-step discipline below | | Test impl | Test wrong; behavior correct | File a separate test-fix; do NOT change production code to satisfy a bad test | | Infra | CI environment, DB, network, container | Fix the env; do not encode the env-fix as a test | | Tooling | Test runner / build system | Fix the tool; the bug is not in the product |

Intermittent failures are not a fifth bucket — they belong in one of the four above. Quarantining a test as "flaky" without classifying the failure hides the very intermittent product bug that the test surfaced. The conditions that make a test flaky are frequently the conditions that trigger the bug. Reproduce deterministically before fixing — see skills/systematic-debugging/SKILL.md.

The Four-Step Discipline

Follow skills/progress-tracking/SKILL.md: when this procedure has two or more steps, seed one todo item per step before starting and mark each complete as you go.

Step 1: Reproduce

Before writing any code, reproduce the bug. Understanding exactly when and why the bug occurs is the prerequisite for everything that follows.

Run the failing scenario manually or via existing tests
Identify the exact inputs that trigger the bug
Understand what the system does (actual behavior) versus what it should do (expected behavior)
Identify which file(s) and function(s) are involved

Do not hypothesize a fix during this step. Observe first.

Reproduction is complete when you can reliably trigger the bug on demand.

Step 2: Write a Failing Test

Write a test that:

Reproduces the bug — the test exercises the exact scenario that triggers the bug
Asserts the correct behavior — the assertion captures what should happen, not what currently happens
Fails for the right reason — the test fails with an assertion failure (wrong behavior), not an error (broken test infrastructure)

Name the test to document the bug scenario: test_returns_error_when_token_is_expired, not test_bug_123 or test_fix.

Run the test suite and confirm:

The new test FAILS (the bug exists)
The new test fails with an assertion failure, not a crash or error
All existing tests still pass (the test itself is not broken)

This is the "Red" state. Do not proceed until the test fails correctly.

Step 3: Fix Minimally

Apply the smallest change that makes the failing test pass.

Minimal means minimal. Change only the code that produces the wrong behavior. Do not refactor, improve, or extend.
Do not change the test. The test defines correct behavior. If the test is wrong, that is a separate problem — do not fix the code to match wrong tests.
Do not fix other bugs found along the way. If you discover a related bug, note it. File it for later. Fix only the targeted bug.
After each change, run the tests. The failing test should move from failing to passing. No existing test should start failing.

This is the "Green" state. The targeted test passes, all other tests pass.

Step 4: Verify

After the fix:

Run the full test suite. Every existing test must pass. If any test now fails that passed before, the fix introduced a regression — undo and investigate.
Re-run the reproduction case. Confirm that the original bug no longer occurs with the original inputs.
Check for related instances. If the root cause is a pattern (e.g., missing null check), search the codebase for the same pattern. File issues for related instances — do not fix them in this commit.
Review the minimal fix with a mutation check. Temporarily revert one line of the fix and re-run the new test. It must go red again. If it still passes, the test does not exercise the fix — strengthen the assertion or the reproduction inputs. This guards against fixes that hide the symptom without addressing the root cause, and against tests that drift away from the bug.

Commit Structure

Each step produces a commit:

test: reproduce <bug description> with failing test

Adds a test that fails due to the bug described in <issue reference>.
The test will pass once the fix is applied.

fix: <minimal description of the fix>

Fixes the root cause identified in the preceding test commit.
All tests now pass including the new reproduction test.

Closes #<issue>

Keeping the test commit and fix commit separate makes the intention clear: the test proves the bug existed, the fix makes it go away.

What This Is NOT

Not a refactoring opportunity. Bug fixes are not the time to improve the surrounding code. The scope is: broken test passes, no regressions.
Not a feature addition. If the correct behavior requires new functionality beyond restoring the previous intent, that is a feature, not a bug fix.
Not a workaround. A workaround avoids the buggy code path. A fix corrects the buggy code. When in doubt, fix the root cause.

bostonaholic/test-driven-bug-fix

skills/test-driven-bug-fix/SKILL.md

Test-driven bug fix methodology — loaded by the bug-fix pipeline to enforce reproduce-first, red-green discipline when fixing defects

3 stars

testing

Updated Jun 2, 2026

$ install --global

skillsauth

npx skillsauth add bostonaholic/team test-driven-bug-fix

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Jun 2, 2026, 3:13 AM110.7s1 file scanned

SKILL.md

name:: test-driven-bug-fix
description:: Test-driven bug fix methodology — loaded by the bug-fix pipeline to enforce reproduce-first, red-green discipline when fixing defects
user-invocable:: false

Test-Driven Bug Fix

Reproduced — the bug is confirmed to exist before any code changes
Pinned — a failing test locks in the expected correct behavior
Fixed minimally — the smallest change that makes the test pass
Verified — no regression in existing behavior

Triage Before Reproducing

Before reproducing, classify the failure into one of four buckets:

The Four-Step Discipline

Follow skills/progress-tracking/SKILL.md: when this procedure has two or more steps, seed one todo item per step before starting and mark each complete as you go.

Step 1: Reproduce

Before writing any code, reproduce the bug. Understanding exactly when and why the bug occurs is the prerequisite for everything that follows.

Run the failing scenario manually or via existing tests
Identify the exact inputs that trigger the bug
Understand what the system does (actual behavior) versus what it should do (expected behavior)
Identify which file(s) and function(s) are involved

Do not hypothesize a fix during this step. Observe first.

Reproduction is complete when you can reliably trigger the bug on demand.

Step 2: Write a Failing Test

Write a test that:

Reproduces the bug — the test exercises the exact scenario that triggers the bug
Asserts the correct behavior — the assertion captures what should happen, not what currently happens
Fails for the right reason — the test fails with an assertion failure (wrong behavior), not an error (broken test infrastructure)

Name the test to document the bug scenario: test_returns_error_when_token_is_expired, not test_bug_123 or test_fix.

Run the test suite and confirm:

The new test FAILS (the bug exists)
The new test fails with an assertion failure, not a crash or error
All existing tests still pass (the test itself is not broken)

This is the "Red" state. Do not proceed until the test fails correctly.

Step 3: Fix Minimally

Apply the smallest change that makes the failing test pass.

Minimal means minimal. Change only the code that produces the wrong behavior. Do not refactor, improve, or extend.
Do not change the test. The test defines correct behavior. If the test is wrong, that is a separate problem — do not fix the code to match wrong tests.
Do not fix other bugs found along the way. If you discover a related bug, note it. File it for later. Fix only the targeted bug.
After each change, run the tests. The failing test should move from failing to passing. No existing test should start failing.

This is the "Green" state. The targeted test passes, all other tests pass.

Step 4: Verify

After the fix:

Run the full test suite. Every existing test must pass. If any test now fails that passed before, the fix introduced a regression — undo and investigate.
Re-run the reproduction case. Confirm that the original bug no longer occurs with the original inputs.
Check for related instances. If the root cause is a pattern (e.g., missing null check), search the codebase for the same pattern. File issues for related instances — do not fix them in this commit.
Review the minimal fix with a mutation check. Temporarily revert one line of the fix and re-run the new test. It must go red again. If it still passes, the test does not exercise the fix — strengthen the assertion or the reproduction inputs. This guards against fixes that hide the symptom without addressing the root cause, and against tests that drift away from the bug.

Commit Structure

Each step produces a commit:

test: reproduce <bug description> with failing test

Adds a test that fails due to the bug described in <issue reference>.
The test will pass once the fix is applied.

fix: <minimal description of the fix>

Fixes the root cause identified in the preceding test commit.
All tests now pass including the new reproduction test.

Closes #<issue>

Keeping the test commit and fix commit separate makes the intention clear: the test proves the bug existed, the fix makes it go away.

What This Is NOT

Not a refactoring opportunity. Bug fixes are not the time to improve the surrounding code. The scope is: broken test passes, no regressions.
Not a feature addition. If the correct behavior requires new functionality beyond restoring the previous intent, that is a feature, not a bug fix.
Not a workaround. A workaround avoids the buggy code path. A fix corrects the buggy code. When in doubt, fix the root cause.

Related Skills

bostonaholic/progress-tracking

data-ai

VerifiedTrustedCommunity

Todo-first progress convention for multi-step procedures — loaded by every multi-step agent to track its own steps without drift

3SKILL.mdUpdated Jun 2, 2026

bostonaholic/progress-tracking

bostonaholic/eng-design-doc-review

testing

VerifiedTrustedCommunity

Adversarially review a technical design document with fresh context before the human gate. Dispatches the built-in `general-purpose` subagent (clean context, no shared history with the design-author) against `docs/plans/<id>/design.md` and presents its verdict — APPROVE, REQUEST CHANGES, or COMMENT. Optional, not part of the QRSPI pipeline. Trigger on "review the design doc", "audit design.md", "is this design ready", or `/eng-design-doc-review`.

3SKILL.mdUpdated May 23, 2026

bostonaholic/eng-design-doc-review

bostonaholic/code-review

development

VerifiedTrustedCommunity

Generator-evaluator separation and review methodology — loaded by review agents to enforce fresh-context review discipline, Conventional Comments format, and gate verdicts

3SKILL.mdUpdated May 2, 2026

bostonaholic/code-review

bostonaholic/team-worktree

data-ai

VerifiedTrustedCommunity

Prepare one or more isolated git worktrees — one per repository the topic touches. Router action — no agent. Trigger on "set up the worktree", "isolate this work", or "/team-worktree".

3SKILL.mdUpdated Apr 23, 2026

bostonaholic/team-worktree

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/bostonaholic/team.git

# Copy into Claude Code skills folder (global)
cp -r team/skills/test-driven-bug-fix ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

bostonaholic/team

3 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT