Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

maestria-co/writing-tests

Name: writing-tests
Author: maestria-co

skills/writing-tests/SKILL.md

npx skillsauth add maestria-co/ai-playbook writing-tests

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Skill: Writing Tests

Purpose

Validate that code behaves correctly across the full range of inputs and conditions. Tests verify the contract (observable behavior), not the internals (implementation details). Tests coupled to internals break on refactors and create false confidence.

Think before writing. Read the spec and implementation first. Identify all the ways the code can fail before writing a single test.

Pre-flight Checks

Check 1 — Read the implementation and spec

Read both the spec/acceptance criteria and the implementation code before writing tests.

If implementation does not exist → stop. Tests cannot be written for unimplemented code. Surface: "Implementation needed before tests can be written."
If implementation exists but spec/acceptance criteria are missing → infer from the implementation's behavior and state your assumptions in the test file.

Check 2 — Find the test framework and conventions

Locate 2–3 existing test files in the same area. Identify:

Test framework and runner (Jest, Vitest, pytest, etc.)
Folder structure and naming convention (*.test.ts, *.spec.js, tests/)
How mocks and fixtures are set up
How async code is tested
Write new tests that match these conventions exactly.
If no tests exist in the codebase → use the framework in the project manifest (package.json, etc.).

Execution Steps

Step 1 — List Test Cases Before Writing

Write a comment block listing every test case you plan to write, grouped by category:

// Happy path:     valid input → expected output
// Edge cases:     null, empty, boundary values, special chars
// Error cases:    invalid input, missing required fields, downstream failures
// State cases:    before/after state transitions, concurrent writes
// Security cases: [only if code handles auth, external input, or file I/O]
//   - SQL injection / NoSQL injection
//   - Malformed or oversized inputs
//   - Missing/expired/tampered tokens

Step 2 — Write Happy Path Tests

Verify that the code does what it's supposed to do for valid, expected input. At minimum: one test per distinct success path.

Step 3 — Write Edge Case Tests

Cover boundary conditions:

Empty inputs ("", [], {}, null, undefined)
Boundary values (0, -1, max int, empty string vs. whitespace-only)
Special characters in string inputs
Date/time edge cases (midnight, DST transitions, leap years) if applicable

Step 4 — Write Error Case Tests

Verify that failures are handled explicitly and produce correct error behavior:

Invalid input returns the correct error code and message
Missing required fields are rejected
Downstream service failures are handled (do not crash, return appropriate error)
Every error path is tested — no silent swallows

Step 5 — Write Security Tests (If Applicable)

For code that handles authentication, authorization, or external input:

SQL/NoSQL injection attempts produce errors, not data leaks
Malformed tokens return 401, not 500
Oversized inputs are rejected with 400/413, not crashing the service
Unauthorized access to another user's resources returns 403

Step 6 — Run the Suite

Run tests with the no-watch flag:

Jest/Vitest: --watchAll=false or --run
pytest: pytest (no watch by default)
Other runners: ensure the process exits after running

Coverage targets:

New code: ≥90%
Bug fixes: must include a test that reproduces the bug and passes after the fix
Refactors: coverage must not decrease from before the refactor
If all tests pass and coverage is met → proceed to output.
If tests fail → diagnose and fix tests (not the implementation, unless the implementation is wrong).
If code is untestable (global state, no dependency injection) → report it. Do not mock the thing under test.

Step 7 — Report

Produce a test report.

Output Formats

All tests passing:

Tests: ✅ All passing
Coverage: [X%] | Tests: N (unit: N, integration: N)
Files: [test files written or modified]

Suggested next: [next task in plan, or "done"]

Tests failing or bugs found:

Tests: ⚠️ N failing

Bugs:
- Bug: [description] | File: [path:line] | Input: [reproduction] | Expected: [value] | Actual: [value] | Severity: [low/medium/high]

Suggested next: implementing-features (fix bugs, then return to writing-tests)

Code untestable:

Tests: ❌ Blocked
Reason: [e.g., global state in src/services/auth.js:47, no dependency injection]

Suggested next: implementing-features (refactor for testability, then return to writing-tests)

Examples

<examples> <example> <scenario>Testing a new model with encrypted fields</scenario> <input> <task>Write tests for OAuthProvider model and migration.</task> <context> - src/models/OAuthProvider.js — just implemented (Sequelize + encrypted tokens) - db/migrations/20240315-create-oauth-provider.js — just implemented - Existing test style: tests/models/User.test.js (Jest, Sequelize mock) - Coverage target: ≥90% </context> </input> <output> Tests: ✅ All passing Coverage: 94% | Tests: 12 (unit: 12) Files: tests/models/OAuthProvider.test.js

Test cases covered: Happy path: create valid OAuthProvider, retrieve by userId+provider Edge cases: null userId, empty provider string, token at max length Error cases: duplicate (userId+provider) raises unique constraint, invalid provider value Security cases: accessToken encrypted at rest (stored value differs from plaintext input) </output> </example>

<example> <scenario>Writing a regression test for a bug fix</scenario> <input> <task>Add regression test for the case-sensitive email bug fix.</task> <context> - Bug: login failed for mixed-case emails (e.g., [email protected]) - Fix applied in src/services/auth.js:47 — .toLowerCase() added before DB lookup - Existing tests: tests/services/auth.test.js </context> </input> <output> Tests: ✅ All passing Coverage: 91% (unchanged) | Tests: 8 (7 existing + 1 new) Files: tests/services/auth.test.js (added 1 test)

Added test: "login succeeds for mixed-case email"

Input: { email: "[email protected]", password: "correct" }
Expected: 200 with user session
Validates: bug does not regress </output>

</example> <example> <scenario>Test blocked by untestable code</scenario> <input> <task>Write tests for the token refresh service.</task> <context> - src/services/tokenRefresh.js — directly imports and calls a global Redis client - No dependency injection; client is a module-level singleton </context> </input> <output> Tests: ❌ Blocked Reason: src/services/tokenRefresh.js imports Redis client as a module-level singleton with no injection point. Cannot mock the Redis client without mocking the implementation itself, which tests internals rather than behavior.

Suggested next: implementing-features — refactor tokenRefresh.js to accept a redis client as a constructor/function parameter, then return to writing-tests. </output> </example>

</examples>

Constraints

Do not write production code — route testability fixes to implementing-features
Do not skip edge or security cases to hit coverage targets faster
Do not mock the code under test or its internal utilities
Always use no-watch flags — never leave a hanging test runner
Always leave the test suite green before completing — do not hand off a broken suite
Test the contract (behavior), not the internals (implementation)
Scope: reads and writes test files; reads implementation files and spec; does not modify source code

maestria-co/writing-tests

skills/writing-tests/SKILL.md

Writes and runs a test suite for a piece of code, covering happy path, edge cases, error cases, and security cases. Use when: implementation is complete and needs test coverage, a bug needs a reproduction test and fix validation, or code needs coverage before a refactor. Do not use when: the code under test is not yet implemented, or the spec is still unclear.

development

Updated Apr 26, 2026

$ install --global

skillsauth

npx skillsauth add maestria-co/ai-playbook writing-tests

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 26, 2026, 6:44 AM57.5s1 file scanned

SKILL.md

name:: writing-tests
description:: >
error cases, and security cases. Use when:: implementation is complete and needs
coverage before a refactor. Do not use when:: the code under test is not yet
degree_of_freedom:: medium

Skill: Writing Tests

Purpose

Think before writing. Read the spec and implementation first. Identify all the ways the code can fail before writing a single test.

Pre-flight Checks

Check 1 — Read the implementation and spec

Read both the spec/acceptance criteria and the implementation code before writing tests.

If implementation does not exist → stop. Tests cannot be written for unimplemented code. Surface: "Implementation needed before tests can be written."
If implementation exists but spec/acceptance criteria are missing → infer from the implementation's behavior and state your assumptions in the test file.

Check 2 — Find the test framework and conventions

Locate 2–3 existing test files in the same area. Identify:

Test framework and runner (Jest, Vitest, pytest, etc.)
Folder structure and naming convention (*.test.ts, *.spec.js, tests/)
How mocks and fixtures are set up
How async code is tested
Write new tests that match these conventions exactly.
If no tests exist in the codebase → use the framework in the project manifest (package.json, etc.).

Execution Steps

Step 1 — List Test Cases Before Writing

Write a comment block listing every test case you plan to write, grouped by category:

// Happy path:     valid input → expected output
// Edge cases:     null, empty, boundary values, special chars
// Error cases:    invalid input, missing required fields, downstream failures
// State cases:    before/after state transitions, concurrent writes
// Security cases: [only if code handles auth, external input, or file I/O]
//   - SQL injection / NoSQL injection
//   - Malformed or oversized inputs
//   - Missing/expired/tampered tokens

Step 2 — Write Happy Path Tests

Verify that the code does what it's supposed to do for valid, expected input. At minimum: one test per distinct success path.

Step 3 — Write Edge Case Tests

Cover boundary conditions:

Empty inputs ("", [], {}, null, undefined)
Boundary values (0, -1, max int, empty string vs. whitespace-only)
Special characters in string inputs
Date/time edge cases (midnight, DST transitions, leap years) if applicable

Step 4 — Write Error Case Tests

Verify that failures are handled explicitly and produce correct error behavior:

Invalid input returns the correct error code and message
Missing required fields are rejected
Downstream service failures are handled (do not crash, return appropriate error)
Every error path is tested — no silent swallows

Step 5 — Write Security Tests (If Applicable)

For code that handles authentication, authorization, or external input:

SQL/NoSQL injection attempts produce errors, not data leaks
Malformed tokens return 401, not 500
Oversized inputs are rejected with 400/413, not crashing the service
Unauthorized access to another user's resources returns 403

Step 6 — Run the Suite

Run tests with the no-watch flag:

Jest/Vitest: --watchAll=false or --run
pytest: pytest (no watch by default)
Other runners: ensure the process exits after running

Coverage targets:

New code: ≥90%
Bug fixes: must include a test that reproduces the bug and passes after the fix
Refactors: coverage must not decrease from before the refactor
If all tests pass and coverage is met → proceed to output.
If tests fail → diagnose and fix tests (not the implementation, unless the implementation is wrong).
If code is untestable (global state, no dependency injection) → report it. Do not mock the thing under test.

Step 7 — Report

Produce a test report.

Output Formats

All tests passing:

Tests: ✅ All passing
Coverage: [X%] | Tests: N (unit: N, integration: N)
Files: [test files written or modified]

Suggested next: [next task in plan, or "done"]

Tests failing or bugs found:

Tests: ⚠️ N failing

Bugs:
- Bug: [description] | File: [path:line] | Input: [reproduction] | Expected: [value] | Actual: [value] | Severity: [low/medium/high]

Suggested next: implementing-features (fix bugs, then return to writing-tests)

Code untestable:

Tests: ❌ Blocked
Reason: [e.g., global state in src/services/auth.js:47, no dependency injection]

Suggested next: implementing-features (refactor for testability, then return to writing-tests)

Examples

Added test: "login succeeds for mixed-case email"

Input: { email: "[email protected]", password: "correct" }
Expected: 200 with user session
Validates: bug does not regress </output>

Suggested next: implementing-features — refactor tokenRefresh.js to accept a redis client as a constructor/function parameter, then return to writing-tests. </output> </example>

</examples>

Constraints

Do not write production code — route testability fixes to implementing-features
Do not skip edge or security cases to hit coverage targets faster
Do not mock the code under test or its internal utilities
Always use no-watch flags — never leave a hanging test runner
Always leave the test suite green before completing — do not hand off a broken suite
Test the contract (behavior), not the internals (implementation)
Scope: reads and writes test files; reads implementation files and spec; does not modify source code

Related Skills

maestria-co/writing-skills

testing

VerifiedTrustedCommunity

Use when creating a new skill, editing an existing skill, or helping a user author a skill for this system. Covers structure, discoverability, quality, and discipline hardening.

SKILL.mdUpdated Apr 26, 2026

maestria-co/writing-skills

maestria-co/verification-checklist

development

VerifiedTrustedCommunity

Evidence-based verification process to run before marking any task complete. Use this skill every time you're about to report that work is done — for features, bug fixes, refactoring, or any code change. This catches the most common failure mode: declaring "done" without proof. If you're finishing up and about to tell the user the task is complete, run this checklist first.

SKILL.mdUpdated Apr 26, 2026

maestria-co/verification-checklist

maestria-co/using-skills

development

VerifiedTrustedCommunity

Teaches agents how to discover, select, and invoke skills from the skill library. Use this skill whenever you're uncertain which skill applies to a task, when composing multiple skills for complex work, or when you need to understand what skills are available. This is your go-to when facing an ambiguous task and need to figure out the right approach before diving into implementation.

SKILL.mdUpdated Apr 26, 2026

maestria-co/using-skills

maestria-co/upgrade-repo

development

VerifiedTrustedCommunity

Safely upgrade a language version, framework, or major dependency. Use when someone says "upgrade Node to v22", "migrate to React 19", "we need to get off this old version", "update our framework", or "how do we upgrade X?". Also triggers when a security advisory requires a version bump. Scope is infrastructure only: manifests, lock files, build configuration, and source changes required by the new version. Content drift discovered during an upgrade (outdated docs, stale patterns) is out of scope — delegate those to `context-maintenance`. Prevents the most common upgrade pitfalls: skipping migration guides, multi-major jumps, and losing rollback options.

SKILL.mdUpdated Apr 26, 2026

maestria-co/upgrade-repo

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/maestria-co/ai-playbook.git

# Copy into Claude Code skills folder (global)
cp -r ai-playbook/skills/writing-tests ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

maestria-co/ai-playbook

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT