Adoption

Agent Skills are supported by leading AI development tools.

VS Code, Gemini CLI, GitHub, Goose, Amp, Cursor, Claude Code, Letta, OpenCode, Claude, OpenAI Codex, Factory

thkt/generating-tdd-tests

Name: generating-tdd-tests
Author: thkt

skills/generating-tdd-tests/SKILL.md

npx skillsauth add thkt/dotclaude generating-tdd-tests

TDD Test Generation

Test behavior, not implementation. Assert on what the code does (observable output, return values, side effects), not how it does it (internal calls, private state, execution order).

A test that fails for the wrong reason (syntax error, wrong import) is not a valid Red. Fix the test first, then verify the failure matches the intended behavior gap.

Test Philosophy

Default: Classical (Detroit) style. Use real objects. Mock only at system boundaries.

| Principle | Rule |
| --- | --- |
| Behavior over implementation | Test public API output, not internal calls |
| State verification preferred | Assert on result values, not "was X called" |
| Real objects first | Use real dependencies. Mock only external I/O |
| Black-box perspective | Treat the unit as a black box via its public interface |
| Sociable tests | Let collaborators participate. Isolate only at boundaries |

When to use mocks (London style exceptions):

- External APIs, databases, file system, network
- Non-deterministic dependencies (time, random)
- Slow dependencies that block the 2-min cycle
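A minimal sketch of the classical style described above, assuming a hypothetical `Checkout` that depends on a real `PriceList` and on a `Clock`. None of these names come from the skill; they only illustrate letting real collaborators participate while replacing the one non-deterministic dependency (time) with a fixed value.

```typescript
// Hypothetical domain code: every name here is illustrative, not from the skill.
type Clock = { now(): Date };

class PriceList {
  private prices: Record<string, number> = { apple: 120, pear: 200 };

  priceOf(item: string): number {
    const price = this.prices[item];
    if (price === undefined) throw new Error(`unknown item: ${item}`);
    return price;
  }
}

class Checkout {
  private priceList: PriceList;
  private clock: Clock;

  constructor(priceList: PriceList, clock: Clock) {
    this.priceList = priceList;
    this.clock = clock;
  }

  // Assumed rule for the sketch: weekend orders get 10% off.
  total(items: string[]): number {
    const sum = items.reduce((acc, item) => acc + this.priceList.priceOf(item), 0);
    const day = this.clock.now().getUTCDay(); // 0 = Sunday, 6 = Saturday
    const isWeekend = day === 0 || day === 6;
    return isWeekend ? Math.round(sum * 0.9) : sum;
  }
}

// The real PriceList participates; only time is replaced with a fixed clock.
const fixedClock = (iso: string): Clock => ({ now: () => new Date(iso) });

// 2024-01-03 is a Wednesday, 2024-01-06 is a Saturday.
const weekdayTotal = new Checkout(new PriceList(), fixedClock("2024-01-03T12:00:00Z")).total(["apple", "pear"]);
const weekendTotal = new Checkout(new PriceList(), fixedClock("2024-01-06T12:00:00Z")).total(["apple", "pear"]);
```

Both checks would assert on the returned totals (state verification), not on whether `priceOf` or `now` was called.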

RGRC Cycle

| Phase | Goal | Rule |
| --- | --- | --- |
| Red | Failing test | Verify failure reason |
| Green | Pass test | "You can sin" - dirty OK |
| Refactor | Clean code | Keep tests green |
| Commit | Save state | All checks pass |

Baby Steps (2-min cycle)

30s: Write failing test → 1min: Make pass → 10s: Run tests → 30s: Tiny refactor → 20s: Commit if green

Test Design

| Technique | Use For | Example |
| --- | --- | --- |
| Equivalence Partitioning | Group same behavior | Age: <18, 18-120, >120 |
| Boundary Value | Test edges | 17, 18, 120, 121 |
| Decision Table | Multi-condition logic | isLoggedIn × isPremium |
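The first two techniques in the table can be sketched as a small table of cases. `classifyAge` and its three groups are hypothetical; in particular, treating ages above 120 and negative or fractional ages as invalid is an assumption made for this sketch, suggested by the boundary values 120 and 121.

```typescript
// Hypothetical function under test; the partitions mirror the age example.
type AgeGroup = "minor" | "adult" | "invalid";

function classifyAge(age: number): AgeGroup {
  // Assumption: non-integers, negatives, and ages above 120 are invalid.
  if (Math.floor(age) !== age || age < 0 || age > 120) return "invalid";
  return age < 18 ? "minor" : "adult";
}

// One representative per partition, plus the value on each side of every edge.
const cases: Array<[number, AgeGroup]> = [
  [10, "minor"],    // partition: <18
  [17, "minor"],    // boundary: last minor
  [18, "adult"],    // boundary: first adult
  [50, "adult"],    // partition: 18-120
  [120, "adult"],   // boundary: last valid age
  [121, "invalid"], // boundary: first invalid age
];

// Collect a message for every case whose actual group differs from the expected one.
const failures = cases
  .filter(([age, expected]) => classifyAge(age) !== expected)
  .map(([age, expected]) => `classifyAge(${age}) should be ${expected}`);
```

In Jest or Vitest the same rows could be passed to `test.each`, so each case is reported as its own test.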

Assertion Quality

Every test must verify a specific outcome. Weak assertions alone are forbidden.

| Category | Matchers | When acceptable |
| --- | --- | --- |
| Weak (existence) | toBeTruthy, toBeDefined, toBeFalsy, toBeNull, toBeUndefined | Only with a meaningful assertion in the same test |
| Meaningful (value) | toBe, toEqual, toStrictEqual, toMatch, toContain, toThrow, toHaveLength | Always preferred |
| Meaningful (call) | toHaveBeenCalledWith, toHaveBeenCalledTimes, toHaveReturnedWith | When verifying side effects |

Bad: expect(result).toBeTruthy()
Good: expect(result).toEqual({ id: 1, name: "Alice" })

One test, one concept. If two tests exercise the same function with the same argument pattern, merge them or parameterize with test.each.
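To see why a weak assertion alone is not enough, here is a small sketch with a deliberately buggy, hypothetical `parseUser`. It uses plain comparisons instead of a test framework so that it runs without dependencies; the comments name the matcher each comparison stands in for.

```typescript
type ParsedUser = { id: number; name: string };

// Deliberately buggy hypothetical implementation: it drops the name.
function parseUser(raw: string): ParsedUser {
  const [id] = raw.split(":");
  return { id: Number(id), name: "" };
}

const result = parseUser("1:Alice");

// Weak check (what toBeTruthy verifies): any object passes, so the bug stays hidden.
const weakPasses = Boolean(result);

// Meaningful check (what toEqual verifies for this flat shape): compares actual values.
const expected: ParsedUser = { id: 1, name: "Alice" };
const meaningfulPasses = result.id === expected.id && result.name === expected.name;
```

Here the weak check passes while the meaningful check fails, so only the value comparison exposes the lost name.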

Mock Rules

| Rule | Threshold |
| --- | --- |
| Mock count per test | Must not exceed assertion count |
| Mock scope | External dependencies only (API, DB, file system) |
| Mock target | Never mock the module under test |

Mock Pitfalls

| Anti-Pattern | Problem | Instead |
| --- | --- | --- |
| Assert mock was called | Tests mock behavior, not component behavior | Assert on observable output or side effect |
| Test-only production method | Pollutes production API for test access | Extract to test utility or use public API |
| Mock before understanding | Hides real dependency behavior | Understand dependency first, then mock |
| Partial mock structure | Missing fields cause false passes | Mirror complete real API structure |
| Mock overuse | More mocks than assertions = testing wiring | Reduce mocks or add meaningful assertions |
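One way to follow these rules is to replace only the external boundary with an in-memory fake that implements the complete interface, then assert on results and stored state. The sketch below assumes a hypothetical `UserStore` (a database in production) and a hypothetical `register` function as the unit under test; neither comes from the skill.

```typescript
type StoredUser = { id: number; email: string };

// Hypothetical boundary: would be backed by a database in production.
interface UserStore {
  save(email: string): StoredUser;
  findByEmail(email: string): StoredUser | undefined;
}

// The fake mirrors the whole interface, so a missing method cannot cause a false pass.
class InMemoryUserStore implements UserStore {
  private users: StoredUser[] = [];

  save(email: string): StoredUser {
    const user = { id: this.users.length + 1, email };
    this.users.push(user);
    return user;
  }

  findByEmail(email: string): StoredUser | undefined {
    return this.users.filter((u) => u.email === email)[0];
  }
}

type RegisterResult = { ok: true; user: StoredUser } | { ok: false; reason: "duplicate" };

// Unit under test: real code, never mocked.
function register(store: UserStore, email: string): RegisterResult {
  const normalized = email.trim().toLowerCase();
  if (store.findByEmail(normalized)) return { ok: false, reason: "duplicate" };
  return { ok: true, user: store.save(normalized) };
}

const store = new InMemoryUserStore();
const first = register(store, "  Alice@Example.com ");
const second = register(store, "alice@example.com");
const stored = store.findByEmail("alice@example.com");
```

The checks would then look at `first`, `second`, and `stored` (observable output and side effect) rather than counting calls to `save`, which keeps one fake against three meaningful assertions.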

Coverage

See rules/development/THRESHOLDS.md for canonical values.

Naming

| Level | Pattern |
| --- | --- |
| Suite | describe("[Target]", ...) |
| Group | describe("[Method]", ...) |
| Test | it("when [condition], should [expected]", ...) |

Framework Detection

| Condition | Framework |
| --- | --- |
| vitest in deps | Vitest |
| jest in deps | Jest |
| bun as runtime | Bun test |
| No framework found | Vitest |
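The detection table can be read as a first-match-wins lookup. The sketch below is one possible reading, not the skill's actual implementation: `detectFramework`, the `PackageJson` shape, and the `runtime` parameter are hypothetical, and checking rows top to bottom is an assumption.

```typescript
type Framework = "Vitest" | "Jest" | "Bun test";

// Hypothetical shape: only the package.json fields this sketch reads.
type PackageJson = {
  dependencies?: Record<string, string>;
  devDependencies?: Record<string, string>;
};

// Assumption: rows are checked in table order and the first match wins.
// How "bun as runtime" is detected is left to the caller in this sketch.
function detectFramework(pkg: PackageJson, runtime: "node" | "bun" = "node"): Framework {
  const deps = { ...pkg.dependencies, ...pkg.devDependencies };
  if ("vitest" in deps) return "Vitest";
  if ("jest" in deps) return "Jest";
  if (runtime === "bun") return "Bun test";
  return "Vitest"; // fallback when no framework is found
}
```

For example, `detectFramework({ devDependencies: { jest: "^29.0.0" } })` returns `"Jest"`, and `detectFramework({})` falls back to `"Vitest"`.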

References

| Topic | File |
| --- | --- |
| Feature-driven | ${CLAUDE_SKILL_DIR}/references/feature-driven.md |
| Bug-driven | ${CLAUDE_SKILL_DIR}/references/bug-driven.md |
| Flaky tests | ${CLAUDE_SKILL_DIR}/references/flaky-test-management.md |

TDD with the RGRC (Red-Green-Refactor-Commit) cycle and Baby Steps. Use when: TDD, テスト駆動 (test-driven), Red-Green-Refactor, Baby Steps.

9 stars

development

Updated Apr 22, 2026

npx skillsauth add thkt/dotclaude generating-tdd-tests

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

| Scanner | Description | Status | Percentage |
| --- | --- | --- | --- |
| Trivy | Container and dependency vulnerability scanner | Clean | 95% |
| Semgrep | Static code analysis for vulnerabilities | Clean | 95% |
| mcp-scan (Snyk) | Model Context Protocol security validation | Clean | 95% |
| Snyk (dep) | Open source security scanning | Skipped | 50% |
| Socket.dev | Supply chain security analysis | Skipped | 50% |
| VirusTotal | Multi-engine malware detection | Skipped | 50% |
| CrowdStrike | Advanced threat intelligence | Skipped | 50% |
| OSV-Scanner | Open Source Vulnerability database check | Skipped | 50% |
| OWASP Dep-Check | | Skipped | 50% |

Last scanned: Apr 22, 2026, 7:16 AM | 145.1s | 4 files scanned

SKILL.md

name: generating-tdd-tests
description: >
  TDD with RGRC cycle and Baby Steps. Use when: TDD, テスト駆動,
  Red-Green-Refactor, Baby Steps.
allowed-tools: [Read, Write, Edit, Grep, Glob, Task]
context: fork
user-invocable: false

Install manually

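# Clone the repo
git clone https://github.com/thkt/dotclaude.git

# Create the global Claude Code skills folder if it does not exist yet
mkdir -p ~/.claude/skills

# Copy the skill into the Claude Code skills folder (global)
cp -r dotclaude/skills/generating-tdd-tests ~/.claude/skills/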

Repository

thkt/dotclaude

9 stars

Compatible with: Claude Code, OpenAI Codex CLI, ChatGPT