Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

shawn-sandy/tdd-fix

Name: tdd-fix
Author: shawn-sandy

kit/plugins/code-testing-agent/skills/tdd-fix/SKILL.md

npx skillsauth add shawn-sandy/agentics tdd-fix

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Given a bug description, write a failing test that reproduces it, then enter an autonomous loop — run tests, analyze failures, edit code, re-run — until green or 10 iterations. Log each iteration's hypothesis. After passing, run the full suite, commit with a fix: prefix, and open a PR.

Freedom level: Strict — Follow these steps in order. Do not skip or combine steps. Stop at each hard-stop marker.

When not to use

Does not design tests from scratch — use code-testing-agent. Does not review test quality — use reviewing-tests.

Step 0: Create Progress Todos
Step 1: Parse Bug Description
Step 2: Write the Failing Test (Red Phase)
Step 3: Autonomous Fix Loop (max 10 iterations)
Step 4: Hard Cap — Loop Exhausted
Step 5: Regression Sweep
Step 6: Summarize the Fix
Step 7: Commit via commit-agent
Step 8: Open PR via pr-agent
Step 9: Stop

Step 0: Create Progress Todos

Use TodoWrite to create todos for Steps 1–9, all status: "pending". Mark each status: "completed" as you finish it.

Step 1: Parse Bug Description

Extract from the invocation message:

| Field | What to extract | |-------|----------------| | Symptom | What the code currently does wrong | | Expected behavior | What it should do instead | | Affected file(s) | Source file(s) containing the bug — explicit path, or inferred from symptom | | Test file | Corresponding test file (infer from naming convention if not given) |

If any field is missing and cannot be inferred, use AskUserQuestion with a focused question for the missing field only. Do not ask for everything at once.

Do not open any source file yet. Only parse the message.

Step 2: Write the Failing Test (Red Phase)

Locate the test file. Use Glob to find it:
- *.test.ts, *.test.js, *.spec.ts, *.spec.js (JS/TS)
- test_*.py, *_test.py (Python)
- *_test.go (Go)
- *_test.rs (Rust)
- *.test.sh, *.bats (shell)
If multiple candidates exist, prefer the one closest in the directory tree to the affected source file.
Read the test file (Read) to understand its structure, assertion style, and import pattern.
Append a new test case that will fail because of the bug. Use Edit (not Write) to add to the existing file. The test must:
- Target exactly the behavior described in Step 1
- Use the project's existing assertion style
- Include a comment # tdd-fix: reproducing <symptom> (or language equivalent) to identify it later
Do NOT edit any production code in this step.
Run the test once (Bash) to confirm it fails. If it unexpectedly passes, stop and use AskUserQuestion:

"The new test passed without any code changes — the bug may already be fixed, or the test may not be reproducing it correctly. How do you want to proceed?"

Step 3: Autonomous Fix Loop (max 10 iterations)

Initialize an iteration log. Render it as a markdown table and update it live after each iteration:

| # | Hypothesis | Change Made | Result |
|---|------------|-------------|--------|

For each iteration i from 1 to 10:

3a — Form a hypothesis

Read the failure output from the previous run (or from Step 2 on iteration 1). In one sentence, state why the test is failing and what in the production code is responsible. Write the hypothesis to the iteration log.

Examples of well-formed hypotheses:

"Operator in add() is subtraction, not addition."
"parseDate does not handle the Z timezone suffix."
"Off-by-one: loop ends at < n but should be <= n."

3b — Edit the production file

Use Edit (not Write) to apply the minimal change implied by the hypothesis. Record a one-line diff summary in the log.

Do not refactor unrelated code. Do not add unrelated tests. Change only what the hypothesis requires.

3c — Run the scoped test

Run only the test written in Step 2 via Bash. Record the result (PASS or FAIL + excerpt) in the iteration log.

If PASS: exit the loop and proceed to Step 5.
If FAIL: if i < 10, increment and go to 3a. If i == 10, proceed to Step 4.

Show the updated iteration log after every iteration.

Step 4: Hard Cap — Loop Exhausted

If the loop reaches 10 iterations without a passing test:

Print the full iteration log.
Surface the last three hypotheses and why each failed.
Output:

tdd-fix stopped after 10 iterations. The test is still failing.
No commit or PR will be created.

Suggestions for next steps:
- Review the iteration log above for patterns.
- Consider whether the bug is in a different file than expected.
- The test file and any partial edits remain on disk for manual inspection.

STOP. Do not commit, do not open a PR.

Step 5: Regression Sweep

Once the scoped test passes, run the full test suite with no scope filter. Use Bash with the appropriate full-suite command for the project:

Node/JS: npm test, yarn test, pnpm test, or npx vitest run
Python: pytest, python -m pytest
Go: go test ./...
Shell: run the top-level test runner script if one exists

If any previously-passing test now fails:

Report the regressions — test names and failure excerpts.
Do not commit.
Output:

Regression detected. The fix broke existing tests (listed above).
No commit or PR will be created. The changes remain on disk.

STOP.

If all tests pass, continue to Step 6.

Step 6: Summarize the Fix

Print a summary block before committing:

## tdd-fix Summary

Bug:        <symptom from Step 1>
Fix:        <final hypothesis from Step 3>
Iterations: <i of 10>
Files changed:
  - <production file(s) edited>
  - <test file appended>
Full suite: PASS

Step 7: Commit via commit-agent

Invoke the commit-agent skill. When it drafts the commit message, ensure:

Type is fix
Scope is the most-changed top-level directory
Description summarizes the symptom in imperative mood

Example: fix(tests/demo): correct add() operator from subtraction to addition

The commit-agent skill handles staging, pre-commit hooks, and conventional format — do not duplicate that logic here.

Step 8: Open PR via pr-agent

Invoke the pr-agent skill. When it drafts the PR body, include the iteration log from Step 3 under a ## How it was found (tdd-fix) section.

The pr-agent skill handles push, platform detection (GitHub/GitLab), and branch checks — do not duplicate that logic here.

Step 9: Stop

STOP here. Do not analyze code further, do not re-run tests, do not suggest refactors or cleanup, do not open additional issues. The fix is complete when the PR URL is returned by pr-agent.

shawn-sandy/tdd-fix

kit/plugins/code-testing-agent/skills/tdd-fix/SKILL.md

Fixes bugs via TDD with up to 10 red-green iterations. Writes a failing test then autonomously iterates until the bug is resolved. Use when the user asks to TDD-fix a bug or run a red-green cycle.

2 stars

testing

Updated May 28, 2026

$ install --global

skillsauth

npx skillsauth add shawn-sandy/agentics tdd-fix

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: May 28, 2026, 2:02 AM138.7s1 file scanned

SKILL.md

name:: tdd-fix
description:: Fixes bugs via TDD with up to 10 red-green iterations. Writes a failing test then autonomously iterates until the bug is resolved. Use when the user asks to TDD-fix a bug or run a red-green cycle.
allowed-tools:: Bash, Read, Write, Edit, Glob, Grep, TodoWrite, AskUserQuestion
disable-model-invocation:: true

Freedom level: Strict — Follow these steps in order. Do not skip or combine steps. Stop at each hard-stop marker.

When not to use

Does not design tests from scratch — use code-testing-agent. Does not review test quality — use reviewing-tests.

Step 0: Create Progress Todos
Step 1: Parse Bug Description
Step 2: Write the Failing Test (Red Phase)
Step 3: Autonomous Fix Loop (max 10 iterations)
Step 4: Hard Cap — Loop Exhausted
Step 5: Regression Sweep
Step 6: Summarize the Fix
Step 7: Commit via commit-agent
Step 8: Open PR via pr-agent
Step 9: Stop

Step 0: Create Progress Todos

Use TodoWrite to create todos for Steps 1–9, all status: "pending". Mark each status: "completed" as you finish it.

Step 1: Parse Bug Description

Extract from the invocation message:

If any field is missing and cannot be inferred, use AskUserQuestion with a focused question for the missing field only. Do not ask for everything at once.

Do not open any source file yet. Only parse the message.

Step 2: Write the Failing Test (Red Phase)

Locate the test file. Use Glob to find it:
- *.test.ts, *.test.js, *.spec.ts, *.spec.js (JS/TS)
- test_*.py, *_test.py (Python)
- *_test.go (Go)
- *_test.rs (Rust)
- *.test.sh, *.bats (shell)
If multiple candidates exist, prefer the one closest in the directory tree to the affected source file.
Read the test file (Read) to understand its structure, assertion style, and import pattern.
Append a new test case that will fail because of the bug. Use Edit (not Write) to add to the existing file. The test must:
- Target exactly the behavior described in Step 1
- Use the project's existing assertion style
- Include a comment # tdd-fix: reproducing <symptom> (or language equivalent) to identify it later
Do NOT edit any production code in this step.
Run the test once (Bash) to confirm it fails. If it unexpectedly passes, stop and use AskUserQuestion:

"The new test passed without any code changes — the bug may already be fixed, or the test may not be reproducing it correctly. How do you want to proceed?"

Step 3: Autonomous Fix Loop (max 10 iterations)

Initialize an iteration log. Render it as a markdown table and update it live after each iteration:

| # | Hypothesis | Change Made | Result |
|---|------------|-------------|--------|

For each iteration i from 1 to 10:

3a — Form a hypothesis

Examples of well-formed hypotheses:

"Operator in add() is subtraction, not addition."
"parseDate does not handle the Z timezone suffix."
"Off-by-one: loop ends at < n but should be <= n."

3b — Edit the production file

Use Edit (not Write) to apply the minimal change implied by the hypothesis. Record a one-line diff summary in the log.

Do not refactor unrelated code. Do not add unrelated tests. Change only what the hypothesis requires.

3c — Run the scoped test

Run only the test written in Step 2 via Bash. Record the result (PASS or FAIL + excerpt) in the iteration log.

If PASS: exit the loop and proceed to Step 5.
If FAIL: if i < 10, increment and go to 3a. If i == 10, proceed to Step 4.

Show the updated iteration log after every iteration.

Step 4: Hard Cap — Loop Exhausted

If the loop reaches 10 iterations without a passing test:

Print the full iteration log.
Surface the last three hypotheses and why each failed.
Output:

tdd-fix stopped after 10 iterations. The test is still failing.
No commit or PR will be created.

Suggestions for next steps:
- Review the iteration log above for patterns.
- Consider whether the bug is in a different file than expected.
- The test file and any partial edits remain on disk for manual inspection.

STOP. Do not commit, do not open a PR.

Step 5: Regression Sweep

Once the scoped test passes, run the full test suite with no scope filter. Use Bash with the appropriate full-suite command for the project:

Node/JS: npm test, yarn test, pnpm test, or npx vitest run
Python: pytest, python -m pytest
Go: go test ./...
Shell: run the top-level test runner script if one exists

If any previously-passing test now fails:

Report the regressions — test names and failure excerpts.
Do not commit.
Output:

Regression detected. The fix broke existing tests (listed above).
No commit or PR will be created. The changes remain on disk.

STOP.

If all tests pass, continue to Step 6.

Step 6: Summarize the Fix

Print a summary block before committing:

## tdd-fix Summary

Bug:        <symptom from Step 1>
Fix:        <final hypothesis from Step 3>
Iterations: <i of 10>
Files changed:
  - <production file(s) edited>
  - <test file appended>
Full suite: PASS

Step 7: Commit via commit-agent

Invoke the commit-agent skill. When it drafts the commit message, ensure:

Type is fix
Scope is the most-changed top-level directory
Description summarizes the symptom in imperative mood

Example: fix(tests/demo): correct add() operator from subtraction to addition

The commit-agent skill handles staging, pre-commit hooks, and conventional format — do not duplicate that logic here.

Step 8: Open PR via pr-agent

Invoke the pr-agent skill. When it drafts the PR body, include the iteration log from Step 3 under a ## How it was found (tdd-fix) section.

The pr-agent skill handles push, platform detection (GitHub/GitLab), and branch checks — do not duplicate that logic here.

Step 9: Stop

STOP here. Do not analyze code further, do not re-run tests, do not suggest refactors or cleanup, do not open additional issues. The fix is complete when the PR URL is returned by pr-agent.

Related Skills

shawn-sandy/merge

development

VerifiedTrustedCommunity

Checks whether the branch's PR is ready and merges it when green. Runs the readiness gate, lint, and an approval prompt. Use when the user asks "merge?" or if a PR is ready to merge.

2SKILL.mdUpdated Jul 22, 2026

shawn-sandy/build

development

VerifiedTrustedCommunity

Implements a plan file that already exists. Walks its steps, ticks the spec, re-renders, and runs the completion gates. Use when asked to implement an existing plan.

2SKILL.mdUpdated Jul 21, 2026

shawn-sandy/agentic-memory-management

development

VerifiedTrustedCommunity

Audits and optimizes CLAUDE.md project memory files. Checks adherence to Claude Code best practices and produces actionable fixes. Use when the user asks to audit, optimize, or diagnose a CLAUDE.md.

2SKILL.mdUpdated Jul 21, 2026

shawn-sandy/agentic-memory-management

shawn-sandy/artifact-to-post

development

VerifiedTrustedCommunity

Converts an HTML artifact or Markdown file into a draft post for a static site. Scopes CSS to keep interactive blocks alive and escapes prose for MDX. Use when asked to turn an artifact into a post.

2SKILL.mdUpdated Jul 21, 2026

shawn-sandy/artifact-to-post

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/shawn-sandy/agentics.git

# Copy into Claude Code skills folder (global)
cp -r agentics/kit/plugins/code-testing-agent/skills/tdd-fix ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

shawn-sandy/agentics

2 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT

Adoption

shawn-sandy/tdd-fix

$ install --global

Security Scan Results

SKILL.md

When not to use

Table of Contents

Step 0: Create Progress Todos

Step 1: Parse Bug Description

Step 2: Write the Failing Test (Red Phase)

Step 3: Autonomous Fix Loop (max 10 iterations)

3a — Form a hypothesis

3b — Edit the production file

3c — Run the scoped test

Step 4: Hard Cap — Loop Exhausted

Step 5: Regression Sweep

Step 6: Summarize the Fix

Step 7: Commit via commit-agent

Step 8: Open PR via pr-agent

Step 9: Stop

Related Skills

shawn-sandy/merge

shawn-sandy/build

shawn-sandy/agentic-memory-management

shawn-sandy/artifact-to-post

shawn-sandy/tdd-fix

$ install --global

Security Scan Results

SKILL.md

When not to use

Table of Contents

Step 0: Create Progress Todos

Step 1: Parse Bug Description

Step 2: Write the Failing Test (Red Phase)

Step 3: Autonomous Fix Loop (max 10 iterations)

3a — Form a hypothesis

3b — Edit the production file

3c — Run the scoped test

Step 4: Hard Cap — Loop Exhausted

Step 5: Regression Sweep

Step 6: Summarize the Fix

Step 7: Commit via commit-agent

Step 8: Open PR via pr-agent

Step 9: Stop

Related Skills

shawn-sandy/merge

shawn-sandy/build

shawn-sandy/agentic-memory-management

shawn-sandy/artifact-to-post