Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

lexler/approval-tests

Name: approval-tests
Author: lexler

output_skills/testing/approval-tests/SKILL.md

npx skillsauth add lexler/skill-factory approval-tests

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

STARTER_CHARACTER = 📸

Approval Tests

Philosophy

"A picture's worth 1000 assertions."

Approval tests verify complex output by comparing against a saved "golden master" file instead of writing individual assertions. You capture the output once, review it, approve it, and future runs compare against that approved snapshot.

You don't need to know the expected output upfront. Run the code, see what it produces, decide if it's correct. Approval is a judgment - you're confirming this is what the code should produce. Whoever writes the code reviews and approves.

Use approval tests when:

Output is complex - instead of 20 assertions, one approval captures everything
Characterizing legacy code - snapshot behavior, then refactor safely
Combinatorial testing - test all input combinations in one approval
Assertions would be tedious or brittle

Use assertions when:

Simple values or specific edge cases
Non-deterministic output that can't be scrubbed

Core Workflow

1. Write test with verify(result)
2. Run test → FAILS (no .approved file yet)
3. Creates: TestName.approved.txt (empty) + TestName.received.txt (actual output)
4. Review .received file - is this correct?
5. YES → rename/copy .received to .approved
6. Run test again → PASSES
7. Commit .approved file to version control

File naming convention:

{TestClass}.{test_method}.approved.txt   ← commit this
{TestClass}.{test_method}.received.txt   ← gitignore this

Critical rules:

.approved files ARE your test expectations - commit them
.received files are temporary - add *.received.* to .gitignore
Never edit .approved files by hand - always generate via test

When a test fails, a diff tool opens showing approved vs received. This is how you review changes. Reporters configure which diff tool to use.

Core API Pattern

All languages follow the same pattern:

verify(result)                    # Basic string/object verification
verify_as_json(object)            # Objects as formatted JSON
verify_all(header, items)         # Collections with labels
verify_all_combinations(fn, inputs)  # All input combinations

Non-deterministic data (timestamps, GUIDs) must be scrubbed before verification.

Key Techniques

Scrubbers - replace values that change between runs (timestamps, UUIDs, random numbers, ports, paths) with stable placeholders like [Date1] or guid_1. Without scrubbing, tests pass locally but fail in CI.
Inline approvals - expectations in source code instead of separate files. Avoids file proliferation for short output. Python uses docstrings, Java uses text blocks.
Storyboard - show an object at multiple points in time, like frames in a comic. Each step appears in the diff, making it easy to see how state changes. For workflows, state machines, animations. Python/Java have classes; Node.js uses string building.
Combinations - test all permutations of input parameters in one approval. Exhaustive coverage without writing separate tests for each case. For large sets, pairwise testing reduces millions of combinations to ~100.
Multiple approvals per test - calling verify() twice overwrites the same file, so only the last one is tested. Parameter-based naming creates separate files for each scenario.

See language references for implementation details.

Language References

Detect language from project files, then read the appropriate reference for installation, quick start, core patterns, and links to deeper reference files:

python.md - Python (pyproject.toml, setup.py, requirements.txt)
nodejs.md - JavaScript/TypeScript (package.json)
java.md - Java (pom.xml, build.gradle)

Anti-Patterns

Don't write assertions for complex objects - use verify_as_json() instead
Don't commit .received files - they're temporary
Don't forget scrubbers for timestamps, GUIDs, random values
Don't over-verify - one approval per logical behavior. Large approvals hide signal in noise; unrelated changes break tests.
Don't hand-edit .approved files - always generate via test. Hand-edited files may not match actual code output.
Don't use verify_all for structured data - use verify_as_json({"items": items})
Don't mix approvals with assertions - the approval captures everything
Don't call verify() multiple times without NamerFactory - each overwrites the same file

Flaky tests across environments usually means unscrubbed dynamic data (timestamps, UUIDs, ports, paths).

lexler/approval-tests

output_skills/testing/approval-tests/SKILL.md

Writes approval tests (snapshot/golden master testing) for Python, JavaScript/TypeScript, or Java. Use when verifying complex output, characterization testing legacy code, testing combinations, or working with .approved/.received files.

216 stars

development

Updated Jun 17, 2026

$ install --global

skillsauth

npx skillsauth add lexler/skill-factory approval-tests

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Jun 17, 2026, 6:42 AM142.5s25 files scanned

SKILL.md

name:: approval-tests
description:: Writes approval tests (snapshot/golden master testing) for Python, JavaScript/TypeScript, or Java. Use when verifying complex output, characterization testing legacy code, testing combinations, or working with .approved/.received files.

STARTER_CHARACTER = 📸

Approval Tests

Philosophy

"A picture's worth 1000 assertions."

Use approval tests when:

Output is complex - instead of 20 assertions, one approval captures everything
Characterizing legacy code - snapshot behavior, then refactor safely
Combinatorial testing - test all input combinations in one approval
Assertions would be tedious or brittle

Use assertions when:

Simple values or specific edge cases
Non-deterministic output that can't be scrubbed

Core Workflow

1. Write test with verify(result)
2. Run test → FAILS (no .approved file yet)
3. Creates: TestName.approved.txt (empty) + TestName.received.txt (actual output)
4. Review .received file - is this correct?
5. YES → rename/copy .received to .approved
6. Run test again → PASSES
7. Commit .approved file to version control

File naming convention:

{TestClass}.{test_method}.approved.txt   ← commit this
{TestClass}.{test_method}.received.txt   ← gitignore this

Critical rules:

.approved files ARE your test expectations - commit them
.received files are temporary - add *.received.* to .gitignore
Never edit .approved files by hand - always generate via test

When a test fails, a diff tool opens showing approved vs received. This is how you review changes. Reporters configure which diff tool to use.

Core API Pattern

All languages follow the same pattern:

verify(result)                    # Basic string/object verification
verify_as_json(object)            # Objects as formatted JSON
verify_all(header, items)         # Collections with labels
verify_all_combinations(fn, inputs)  # All input combinations

Non-deterministic data (timestamps, GUIDs) must be scrubbed before verification.

Key Techniques

Scrubbers - replace values that change between runs (timestamps, UUIDs, random numbers, ports, paths) with stable placeholders like [Date1] or guid_1. Without scrubbing, tests pass locally but fail in CI.
Inline approvals - expectations in source code instead of separate files. Avoids file proliferation for short output. Python uses docstrings, Java uses text blocks.
Storyboard - show an object at multiple points in time, like frames in a comic. Each step appears in the diff, making it easy to see how state changes. For workflows, state machines, animations. Python/Java have classes; Node.js uses string building.
Combinations - test all permutations of input parameters in one approval. Exhaustive coverage without writing separate tests for each case. For large sets, pairwise testing reduces millions of combinations to ~100.
Multiple approvals per test - calling verify() twice overwrites the same file, so only the last one is tested. Parameter-based naming creates separate files for each scenario.

See language references for implementation details.

Language References

Detect language from project files, then read the appropriate reference for installation, quick start, core patterns, and links to deeper reference files:

python.md - Python (pyproject.toml, setup.py, requirements.txt)
nodejs.md - JavaScript/TypeScript (package.json)
java.md - Java (pom.xml, build.gradle)

Anti-Patterns

Don't write assertions for complex objects - use verify_as_json() instead
Don't commit .received files - they're temporary
Don't forget scrubbers for timestamps, GUIDs, random values
Don't over-verify - one approval per logical behavior. Large approvals hide signal in noise; unrelated changes break tests.
Don't hand-edit .approved files - always generate via test. Hand-edited files may not match actual code output.
Don't use verify_all for structured data - use verify_as_json({"items": items})
Don't mix approvals with assertions - the approval captures everything
Don't call verify() multiple times without NamerFactory - each overwrites the same file

Flaky tests across environments usually means unscrubbed dynamic data (timestamps, UUIDs, ports, paths).

Related Skills

lexler/tdd

development

VerifiedTrustedCommunity

Test-driven development (TDD) process used when writing code. Use whenever you are adding any new code, unless the user explicitly asks to skip TDD or the code is exploratory/spike.

216SKILL.mdUpdated Jun 17, 2026

lexler/nullables

development

VerifiedTrustedCommunity

Writes tests without mocks using Nullables. Use when writing tests, especially testing code with external I/O (HTTP, files, databases, clocks, random numbers), designing infrastructure wrappers or replacing mocking libraries.

216SKILL.mdUpdated Jun 17, 2026

lexler/bdd-with-approvals

testing

VerifiedTrustedCommunity

Scannable BDD tests written in domain language. Use when doing BDD.

216SKILL.mdUpdated Jun 17, 2026

lexler/bdd-with-approvals

lexler/refinement-loop

tools

VerifiedTrustedCommunity

Iterative refinement through multiple passes. Use when the user asks to 'meditate on', 'distill', 'refine', or 'iterate on' something, or proactively when a problem benefits from multiple passes rather than a single attempt.

216SKILL.mdUpdated Jun 17, 2026

lexler/refinement-loop

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/lexler/skill-factory.git

# Copy into Claude Code skills folder (global)
cp -r skill-factory/output_skills/testing/approval-tests ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

lexler/skill-factory

216 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT