Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

michaelalber/tdd-cycle

Name: tdd-cycle
Author: michaelalber

skills/tdd-cycle/SKILL.md

npx skillsauth add michaelalber/ai-toolkit tdd-cycle

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

TDD Cycle Orchestrator

"The goal is clean code that works. First we make it work, then we make it clean." — Ron Jeffries

Core Philosophy

This skill coordinates the canonical TDD cycle: RED → GREEN → REFACTOR. It maintains phase state, enforces transitions, and prevents the AI from "helping" by skipping phases.

The cycle is non-negotiable:

RED: Write a failing test that defines desired behavior
GREEN: Write minimal code to make the test pass
REFACTOR: Improve code structure while keeping tests green

Knowledge Base Lookups

Use search_knowledge (grounded-code-mcp) to ground decisions in authoritative references.

| Query | When to Call | |-------|--------------| | search_knowledge("TDD red green refactor cycle phases") | At session start — confirms canonical phase definitions and transition rules | | search_knowledge("Kent Beck test desiderata properties") | When evaluating test quality — authoritative source for the 12 properties | | search_knowledge("test-first development discipline XP") | When enforcing TDD discipline — grounding in XP practices | | search_knowledge("unit test naming conventions AAA arrange act assert") | When guiding test structure — naming and organization patterns |

Protocol: Search before advising on phase transitions or test quality criteria. Cite the source path in your response.

Kent Beck's 12 Test Desiderata

Use this framework to evaluate test quality at every phase:

| Property | Description | Priority | |----------|-------------|----------| | Isolated | Tests don't affect each other | Critical | | Composable | Can run any subset of tests | High | | Deterministic | Same result every time | Critical | | Specific | Failure points to cause | High | | Behavioral | Tests behavior, not implementation | High | | Structure-insensitive | Refactoring doesn't break tests | High | | Fast | Quick feedback loop | Medium | | Writable | Easy to create new tests | Medium | | Readable | Easy to understand intent | High | | Automated | No manual intervention | Critical | | Predictive | Passing tests = working code | High | | Inspiring | Confidence to make changes | Medium |

Workflow

Phase State Machine

┌─────────────────────────────────────────────────────┐
│                                                     │
│    ┌─────┐      ┌───────┐      ┌──────────┐        │
│ ──►│ RED │─────►│ GREEN │─────►│ REFACTOR │────┐   │
│    └─────┘      └───────┘      └──────────┘    │   │
│        ▲                                       │   │
│        └───────────────────────────────────────┘   │
│                                                     │
└─────────────────────────────────────────────────────┘

State Block Format

Maintain state across conversation turns using this block:

<tdd-state>
phase: RED | GREEN | REFACTOR
iteration: [number]
feature: [brief description]
current_test: [test name or "none"]
tests_passing: [true | false | unknown]
last_action: [what was just done]
next_action: [what should happen next]
blockers: [any issues preventing progress]
</tdd-state>

Phase Transitions

RED Phase Entry

Preconditions:

Previous cycle complete OR starting new feature
Clear understanding of desired behavior

Actions:

Identify the smallest testable behavior increment
Write ONE failing test
Run test suite, verify new test fails
Document expected vs actual behavior

Exit Criteria:

Exactly one new failing test
Test failure is for the RIGHT reason (not syntax/import errors)
Test clearly expresses intended behavior

GREEN Phase Entry

Preconditions:

RED phase complete
Exactly one failing test exists
Failure reason is verified

Actions:

Write MINIMAL code to pass the test
No additional functionality
"Fake it till you make it" is acceptable
Run tests, verify all pass

Exit Criteria:

All tests pass (including the new one)
No more code than necessary was written

REFACTOR Phase Entry

Preconditions:

GREEN phase complete
All tests passing
Code works but may not be clean

Actions:

Identify code smells or duplication
Make ONE small improvement
Run tests after EACH change
Repeat until satisfied

Exit Criteria:

All tests still pass
Code is cleaner/clearer
No behavior changes

Mode Selection

At session start, determine the mode:

Autonomous Mode (tdd-agent):

AI drives all phases
Stricter verification requirements
Explicit reasoning at each step

Pair Mode (tdd-pair):

Human and AI collaborate
Role negotiation (driver/navigator, ping-pong)
More conversational flow

Output Templates

Session Start

## TDD Session: [Feature Name]

**Mode**: [Autonomous | Pair]
**Stack**: [Language/Framework]

<tdd-state>
phase: RED
iteration: 1
feature: [description]
current_test: none
tests_passing: unknown
last_action: Session initialized
next_action: Write first failing test
blockers: none
</tdd-state>

### RED Phase - Iteration 1

I'll write a test for: [specific behavior]

Phase Transition

### Phase Complete: [PHASE]

**What was accomplished:**
- [bullet points]

**Verification:**
- Tests run: [yes/no]
- Result: [pass/fail with count]

<tdd-state>
phase: [NEXT_PHASE]
...
</tdd-state>

### [NEXT_PHASE] Phase - Iteration [N]

Next step: [action]

AI Discipline Rules

CRITICAL: Never Skip RED

Before writing ANY implementation code, verify:

A failing test exists for the feature
The test actually fails when run
The failure is for the expected reason

If no failing test exists, STOP and write one first.

CRITICAL: Minimal GREEN

During GREEN phase:

Write the LEAST code possible to pass the test
"Obvious implementation" is fine for trivial cases
Prefer "fake it" over "build it" when uncertain
Adding features not covered by tests is FORBIDDEN

CRITICAL: Green-to-Green Refactoring

During REFACTOR phase:

Run tests before AND after each change
If tests fail, immediately revert
One refactoring at a time
No new functionality during refactoring

CRITICAL: State Verification

Before any phase transition:

Run the test suite
Verify expected results
Update state block
Only then proceed

CRITICAL: Resist "Helpfulness"

The AI must NOT:

Write tests AND implementation together
Add "while I'm here" improvements during GREEN
Skip refactoring because "the code is simple"
Assume tests pass without running them

Stack-Specific Guidance

See reference files for stack-specific patterns:

Phase Transitions - Detailed transition logic
State Management - Persisting state across turns

Integration with Other Skills

RED phase → Use domain knowledge to write meaningful tests
GREEN phase → Invoke tdd-implementer for implementation
REFACTOR phase → Invoke tdd-refactor for safe improvements
Autonomous mode → Delegate to tdd-agent
Pair mode → Delegate to tdd-pair
Verification → Invoke tdd-verify for compliance check

Common Anti-Patterns to Avoid

| Anti-Pattern | Why It's Wrong | Correct Approach | |--------------|----------------|------------------| | Test after code | Tests become verification, not specification | Always RED first | | Multiple features per cycle | Loses precision, harder to debug | One behavior at a time | | Skipping refactor | Technical debt accumulates | Always evaluate for cleanup | | Gold-plating in GREEN | Violates minimal implementation | Save improvements for REFACTOR | | Tests that test implementation | Brittle, break on refactor | Test behavior only |

Recovery Procedures

If Tests Pass Unexpectedly in RED

The test is likely:

Testing existing behavior (duplicate)
Not testing what you think
Has a bug in the test itself

Action: Examine the test, strengthen assertions, or find the actual gap.

If Tests Fail in REFACTOR

Immediately revert the last change
Analyze why the refactoring broke behavior
Consider smaller steps or different approach

If State is Lost

Run the full test suite
Examine recent changes
Reconstruct state block from evidence
Resume at appropriate phase

michaelalber/tdd-cycle

skills/tdd-cycle/SKILL.md

Orchestrate RED-GREEN-REFACTOR TDD phases. Use when starting TDD, managing phase transitions, or maintaining TDD discipline across a development session. Do NOT use when TDD discipline is optional or exploratory; do NOT use when a failing test does not yet exist — create the test first.

development

Updated Apr 20, 2026

$ install --global

skillsauth

npx skillsauth add michaelalber/ai-toolkit tdd-cycle

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 20, 2026, 6:18 AM10.0s3 files scanned

SKILL.md

name:: tdd-cycle
description:: >

TDD Cycle Orchestrator

"The goal is clean code that works. First we make it work, then we make it clean." — Ron Jeffries

Core Philosophy

This skill coordinates the canonical TDD cycle: RED → GREEN → REFACTOR. It maintains phase state, enforces transitions, and prevents the AI from "helping" by skipping phases.

The cycle is non-negotiable:

RED: Write a failing test that defines desired behavior
GREEN: Write minimal code to make the test pass
REFACTOR: Improve code structure while keeping tests green

Knowledge Base Lookups

Use search_knowledge (grounded-code-mcp) to ground decisions in authoritative references.

Protocol: Search before advising on phase transitions or test quality criteria. Cite the source path in your response.

Kent Beck's 12 Test Desiderata

Use this framework to evaluate test quality at every phase:

Workflow

Phase State Machine

┌─────────────────────────────────────────────────────┐
│                                                     │
│    ┌─────┐      ┌───────┐      ┌──────────┐        │
│ ──►│ RED │─────►│ GREEN │─────►│ REFACTOR │────┐   │
│    └─────┘      └───────┘      └──────────┘    │   │
│        ▲                                       │   │
│        └───────────────────────────────────────┘   │
│                                                     │
└─────────────────────────────────────────────────────┘

State Block Format

Maintain state across conversation turns using this block:

<tdd-state>
phase: RED | GREEN | REFACTOR
iteration: [number]
feature: [brief description]
current_test: [test name or "none"]
tests_passing: [true | false | unknown]
last_action: [what was just done]
next_action: [what should happen next]
blockers: [any issues preventing progress]
</tdd-state>

Phase Transitions

RED Phase Entry

Preconditions:

Previous cycle complete OR starting new feature
Clear understanding of desired behavior

Actions:

Identify the smallest testable behavior increment
Write ONE failing test
Run test suite, verify new test fails
Document expected vs actual behavior

Exit Criteria:

Exactly one new failing test
Test failure is for the RIGHT reason (not syntax/import errors)
Test clearly expresses intended behavior

GREEN Phase Entry

Preconditions:

RED phase complete
Exactly one failing test exists
Failure reason is verified

Actions:

Write MINIMAL code to pass the test
No additional functionality
"Fake it till you make it" is acceptable
Run tests, verify all pass

Exit Criteria:

All tests pass (including the new one)
No more code than necessary was written

REFACTOR Phase Entry

Preconditions:

GREEN phase complete
All tests passing
Code works but may not be clean

Actions:

Identify code smells or duplication
Make ONE small improvement
Run tests after EACH change
Repeat until satisfied

Exit Criteria:

All tests still pass
Code is cleaner/clearer
No behavior changes

Mode Selection

At session start, determine the mode:

Autonomous Mode (tdd-agent):

AI drives all phases
Stricter verification requirements
Explicit reasoning at each step

Pair Mode (tdd-pair):

Human and AI collaborate
Role negotiation (driver/navigator, ping-pong)
More conversational flow

Output Templates

Session Start

## TDD Session: [Feature Name]

**Mode**: [Autonomous | Pair]
**Stack**: [Language/Framework]

<tdd-state>
phase: RED
iteration: 1
feature: [description]
current_test: none
tests_passing: unknown
last_action: Session initialized
next_action: Write first failing test
blockers: none
</tdd-state>

### RED Phase - Iteration 1

I'll write a test for: [specific behavior]

Phase Transition

### Phase Complete: [PHASE]

**What was accomplished:**
- [bullet points]

**Verification:**
- Tests run: [yes/no]
- Result: [pass/fail with count]

<tdd-state>
phase: [NEXT_PHASE]
...
</tdd-state>

### [NEXT_PHASE] Phase - Iteration [N]

Next step: [action]

AI Discipline Rules

CRITICAL: Never Skip RED

Before writing ANY implementation code, verify:

A failing test exists for the feature
The test actually fails when run
The failure is for the expected reason

If no failing test exists, STOP and write one first.

CRITICAL: Minimal GREEN

During GREEN phase:

Write the LEAST code possible to pass the test
"Obvious implementation" is fine for trivial cases
Prefer "fake it" over "build it" when uncertain
Adding features not covered by tests is FORBIDDEN

CRITICAL: Green-to-Green Refactoring

During REFACTOR phase:

Run tests before AND after each change
If tests fail, immediately revert
One refactoring at a time
No new functionality during refactoring

CRITICAL: State Verification

Before any phase transition:

Run the test suite
Verify expected results
Update state block
Only then proceed

CRITICAL: Resist "Helpfulness"

The AI must NOT:

Write tests AND implementation together
Add "while I'm here" improvements during GREEN
Skip refactoring because "the code is simple"
Assume tests pass without running them

Stack-Specific Guidance

See reference files for stack-specific patterns:

Phase Transitions - Detailed transition logic
State Management - Persisting state across turns

Integration with Other Skills

RED phase → Use domain knowledge to write meaningful tests
GREEN phase → Invoke tdd-implementer for implementation
REFACTOR phase → Invoke tdd-refactor for safe improvements
Autonomous mode → Delegate to tdd-agent
Pair mode → Delegate to tdd-pair
Verification → Invoke tdd-verify for compliance check

Common Anti-Patterns to Avoid

Recovery Procedures

If Tests Pass Unexpectedly in RED

The test is likely:

Testing existing behavior (duplicate)
Not testing what you think
Has a bug in the test itself

Action: Examine the test, strengthen assertions, or find the actual gap.

If Tests Fail in REFACTOR

Immediately revert the last change
Analyze why the refactoring broke behavior
Consider smaller steps or different approach

If State is Lost

Run the full test suite
Examine recent changes
Reconstruct state block from evidence
Resume at appropriate phase

Related Skills

michaelalber/grilling

development

VerifiedTrustedCommunity

Interviews the user relentlessly about a plan, decision, or idea — one question at a time, each with a recommended answer. Shared engine behind "grill-me" and "grill-with-docs". Use on any "grill" trigger phrase or to stress-test thinking. Do NOT use to build the plan; it ends at shared understanding, not implementation.

1SKILL.mdUpdated Jul 23, 2026

michaelalber/grilling

michaelalber/grill-with-docs

testing

VerifiedTrustedCommunity

Runs a relentless interview to sharpen a plan or design, capturing the decisions as ADRs and a glossary along the way. Use when the user wants to be grilled AND wants the session to leave durable domain documentation behind. Do NOT use for a throwaway stress-test with no artifacts; use grill-me instead.

1SKILL.mdUpdated Jul 23, 2026

michaelalber/grill-with-docs

michaelalber/vue-security-review

tools

VerifiedTrustedCommunity

OWASP-based security review of Vue/TypeScript front-ends. Detects framework (Vite/Vue CLI/Nuxt), entry points, and data flows; scans the OWASP Top 10 (2025) mapped to Vue client-side risks (raw-HTML XSS via v-html, URL/protocol injection, bundled secrets, insecure token storage, dependency CVEs, missing CSP, open redirects, router guard bypass); emits an exec summary plus graded findings. Use to audit Vue for vulnerabilities. Not for architecture grading (vue-architecture-checklist).

1SKILL.mdUpdated Jul 20, 2026

michaelalber/vue-security-review

michaelalber/vue-modernization-analyzer

tools

VerifiedTrustedCommunity

Analyzes legacy Vue codebases and produces actionable modernization plans. Primary migration paths include Options API to Composition API, Vue 2 to Vue 3, Vue CLI to Vite, JavaScript to TypeScript, Vue Test Utils/Karma/Mocha to Vitest + Vue Testing Library, legacy Vuex to Pinia, and removed-in-Vue-3 pattern cleanup (filters, event bus, `$listeners`). Does NOT perform the migration — assesses, quantifies risk, and plans.

1SKILL.mdUpdated Jul 20, 2026

michaelalber/vue-modernization-analyzer

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/michaelalber/ai-toolkit.git

# Copy into Claude Code skills folder (global)
cp -r ai-toolkit/skills/tdd-cycle ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

michaelalber/ai-toolkit

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT