Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

policyengine/policyengine-test-writing

Name: policyengine-test-writing
Author: policyengine

skills/technical-patterns/policyengine-test-writing-skill/SKILL.md

npx skillsauth add policyengine/policyengine-claude policyengine-test-writing

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

PolicyEngine Test Writing

Standard conventions for writing tests in PolicyEngine frontend apps, APIs, SDKs, and standalone tools. These rules apply to every language and framework (Vitest, pytest, etc.) unless a project-specific override exists.

Country model packages — use different conventions

Do NOT apply this skill to country model packages (policyengine-us, policyengine-uk, policyengine-canada, etc.). Those repos use YAML-based tests with entirely different structure, naming, and tooling. For country packages, use these instead:

policyengine-testing-patterns-skill (skills/technical-patterns/policyengine-testing-patterns-skill/SKILL.md) — YAML test structure, naming conventions (variable_name.yaml, integration.yaml), period handling, error margins, and quality standards
test-creator agent (agents/country-models/test-creator.md) — Automated agent that creates comprehensive YAML integration tests for government benefit program implementations

Country model tests are .yaml files that live alongside the variables they test, not .test.ts or .test.py files in a separate tests/ directory.

Core Principles

1. Given-When-Then Naming

Every test name follows the pattern test__given_X_condition__then_Y_occurs:

// TypeScript / Vitest
test("test__given_valid_income__then_tax_is_calculated", () => { ... });
test("test__given_negative_income__then_error_is_thrown", () => { ... });
test("test__given_zero_children__then_ctc_is_zero", () => { ... });

# Python / pytest
def test__given_valid_income__then_tax_is_calculated():
    ...
def test__given_negative_income__then_error_is_thrown():
    ...

Inside the test body, organize code into three clearly commented sections:

test("test__given_user_clicks_submit__then_form_is_submitted", async () => {
  // Given
  const user = userEvent.setup();
  const onSubmit = vi.fn();
  render(<Form onSubmit={onSubmit} />);

  // When
  await user.click(screen.getByRole("button", { name: /submit/i }));

  // Then
  expect(onSubmit).toHaveBeenCalledOnce();
});

2. One Test File Per Source File

Each source file gets exactly one corresponding test file named test_FILENAME:

| Source file | Test file | |---|---| | utils/formatCurrency.ts | tests/unit/utils/test_formatCurrency.test.ts | | components/MetricCard.tsx | tests/unit/components/test_MetricCard.test.tsx | | lib/api/client.ts | tests/unit/lib/api/test_client.test.ts | | services/simulation.py | tests/unit/services/test_simulation.py |

The test file mirrors the source directory structure under a tests/ root.

3. Fixtures Live Separately

All mocks, setup code, patches, constants, and test data must be extracted to a fixture file with the same name in a fixtures/ directory:

tests/
├── fixtures/
│   ├── utils/
│   │   └── test_formatCurrency.ts    ← mocks, constants, helpers
│   ├── components/
│   │   └── test_MetricCard.ts
│   └── lib/
│       └── api/
│           └── test_client.ts
├── unit/
│   ├── utils/
│   │   └── test_formatCurrency.test.ts   ← imports from fixtures
│   ├── components/
│   │   └── test_MetricCard.test.tsx
│   └── lib/
│       └── api/
│           └── test_client.test.ts

What goes in fixtures:

Mock data objects and factory functions
Descriptive constants (no magic numbers in tests)
vi.fn() / MagicMock setup helpers
Patch targets and mock response builders
Shared beforeEach / afterEach setup functions

What stays in the test file:

describe / test blocks
The Given-When-Then logic
expect / assert statements

Import everything from the fixture:

import {
  VALID_HOUSEHOLD,
  EMPTY_HOUSEHOLD,
  mockApiSuccess,
  mockApiError,
  EXPECTED_TAX_AMOUNT,
} from "@/tests/fixtures/lib/api/test_client";

4. Test Edge Cases and Failure Paths

Every test file must cover, at minimum:

Happy path: Normal inputs produce expected outputs
Boundary values: Zero, empty string, empty array, min/max values
Error cases: Invalid inputs, network failures, missing data
Null/undefined: What happens with missing or nullable fields
Type coercion traps: String "0" vs number 0, empty object vs null

Structure the describe block to make coverage obvious:

describe("calculateTax", () => {
  // Happy path
  test("test__given_valid_income__then_correct_tax_returned", () => { ... });
  test("test__given_income_at_bracket_boundary__then_correct_bracket_applied", () => { ... });

  // Edge cases
  test("test__given_zero_income__then_zero_tax", () => { ... });
  test("test__given_negative_income__then_throws_error", () => { ... });

  // Error handling
  test("test__given_api_timeout__then_error_propagated", () => { ... });
  test("test__given_malformed_response__then_fallback_used", () => { ... });
});

5. Run Only What Changed

After writing or modifying test files, run only those specific tests — never the entire suite:

# TypeScript / Vitest — run specific test file(s)
bunx vitest run tests/unit/utils/test_formatCurrency.test.ts

# Python / pytest — run specific test file(s)
pytest tests/unit/variables/test_income.py -v

After tests pass, run formatters and typecheckers only on modified files:

# TypeScript — typecheck and lint only changed files
bunx tsc --noEmit
bunx eslint tests/unit/utils/test_formatCurrency.test.ts tests/fixtures/utils/test_formatCurrency.ts

# Python — format and lint only changed files
black tests/unit/variables/test_income.py tests/fixtures/variables/test_income.py
ruff check tests/unit/variables/test_income.py tests/fixtures/variables/test_income.py

Never run the full test suite or full linter unless explicitly asked. Large codebases take minutes to lint/test; running everything wastes time and produces noise unrelated to the changes.

Framework-Specific Notes

Vitest (TypeScript / React)

import { describe, test, expect, vi, beforeEach } from "vitest";

Use vi.fn() for mocks, vi.mock() for module mocks
Use vi.clearAllMocks() in beforeEach
For React components, prefer accessibility selectors (getByRole, getByLabelText) over test IDs
Use userEvent.setup() for user interactions (not fireEvent)
Use waitFor for async state updates

pytest (Python)

import pytest
from unittest.mock import MagicMock, patch

Use @pytest.fixture for setup, import from fixture files
Use @pytest.mark.parametrize for data-driven tests
Use pytest.raises(ExceptionType) for error assertions
Mark slow tests with @pytest.mark.slow

What to Test

Public API surface (exported functions, component props, class methods)
State transitions and side effects
Data transformations and calculations
Error handling and recovery paths
Boundary conditions and edge cases

What NOT to Test

Third-party library internals (Recharts rendering, Mantine components, pandas operations)
Private implementation details that may change
CSS/styling (unless testing conditional class application)
Simple pass-through getters with no logic

Detailed Reference

For fixture best practices, mock patterns, and accessibility selector priority, consult:

references/fixture-patterns.md — Comprehensive fixture organization and mock examples

policyengine/policyengine-test-writing

skills/technical-patterns/policyengine-test-writing-skill/SKILL.md

This skill should be used when writing unit tests, integration tests, or test fixtures for PolicyEngine frontend apps, APIs, SDKs, and standalone tools. NOT for country model packages (policyengine-us, policyengine-uk, etc.) — those use YAML-based tests with their own conventions. Covers the Given-When-Then naming convention, fixture extraction, edge case coverage, and the rule that only modified test files should be run. Triggers: "write tests", "add tests", "unit test", "test file", "test coverage", "write a test for", "test this function", "test this component", "given when then", "test fixtures", "mock setup", "edge cases", "test naming", "test convention"

28 stars

tools

Updated Apr 28, 2026

$ install --global

skillsauth

npx skillsauth add policyengine/policyengine-claude policyengine-test-writing

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 28, 2026, 11:12 AM210.9s2 files scanned

SKILL.md

name:: policyengine-test-writing
description:: |
Triggers:: write tests", "add tests", "unit test", "test file", "test coverage", "write a test for",

PolicyEngine Test Writing

Country model packages — use different conventions

policyengine-testing-patterns-skill (skills/technical-patterns/policyengine-testing-patterns-skill/SKILL.md) — YAML test structure, naming conventions (variable_name.yaml, integration.yaml), period handling, error margins, and quality standards
test-creator agent (agents/country-models/test-creator.md) — Automated agent that creates comprehensive YAML integration tests for government benefit program implementations

Country model tests are .yaml files that live alongside the variables they test, not .test.ts or .test.py files in a separate tests/ directory.

Core Principles

1. Given-When-Then Naming

Every test name follows the pattern test__given_X_condition__then_Y_occurs:

// TypeScript / Vitest
test("test__given_valid_income__then_tax_is_calculated", () => { ... });
test("test__given_negative_income__then_error_is_thrown", () => { ... });
test("test__given_zero_children__then_ctc_is_zero", () => { ... });

# Python / pytest
def test__given_valid_income__then_tax_is_calculated():
    ...
def test__given_negative_income__then_error_is_thrown():
    ...

Inside the test body, organize code into three clearly commented sections:

test("test__given_user_clicks_submit__then_form_is_submitted", async () => {
  // Given
  const user = userEvent.setup();
  const onSubmit = vi.fn();
  render(<Form onSubmit={onSubmit} />);

  // When
  await user.click(screen.getByRole("button", { name: /submit/i }));

  // Then
  expect(onSubmit).toHaveBeenCalledOnce();
});

2. One Test File Per Source File

Each source file gets exactly one corresponding test file named test_FILENAME:

The test file mirrors the source directory structure under a tests/ root.

3. Fixtures Live Separately

All mocks, setup code, patches, constants, and test data must be extracted to a fixture file with the same name in a fixtures/ directory:

tests/
├── fixtures/
│   ├── utils/
│   │   └── test_formatCurrency.ts    ← mocks, constants, helpers
│   ├── components/
│   │   └── test_MetricCard.ts
│   └── lib/
│       └── api/
│           └── test_client.ts
├── unit/
│   ├── utils/
│   │   └── test_formatCurrency.test.ts   ← imports from fixtures
│   ├── components/
│   │   └── test_MetricCard.test.tsx
│   └── lib/
│       └── api/
│           └── test_client.test.ts

What goes in fixtures:

Mock data objects and factory functions
Descriptive constants (no magic numbers in tests)
vi.fn() / MagicMock setup helpers
Patch targets and mock response builders
Shared beforeEach / afterEach setup functions

What stays in the test file:

describe / test blocks
The Given-When-Then logic
expect / assert statements

Import everything from the fixture:

import {
  VALID_HOUSEHOLD,
  EMPTY_HOUSEHOLD,
  mockApiSuccess,
  mockApiError,
  EXPECTED_TAX_AMOUNT,
} from "@/tests/fixtures/lib/api/test_client";

4. Test Edge Cases and Failure Paths

Every test file must cover, at minimum:

Happy path: Normal inputs produce expected outputs
Boundary values: Zero, empty string, empty array, min/max values
Error cases: Invalid inputs, network failures, missing data
Null/undefined: What happens with missing or nullable fields
Type coercion traps: String "0" vs number 0, empty object vs null

Structure the describe block to make coverage obvious:

describe("calculateTax", () => {
  // Happy path
  test("test__given_valid_income__then_correct_tax_returned", () => { ... });
  test("test__given_income_at_bracket_boundary__then_correct_bracket_applied", () => { ... });

  // Edge cases
  test("test__given_zero_income__then_zero_tax", () => { ... });
  test("test__given_negative_income__then_throws_error", () => { ... });

  // Error handling
  test("test__given_api_timeout__then_error_propagated", () => { ... });
  test("test__given_malformed_response__then_fallback_used", () => { ... });
});

5. Run Only What Changed

After writing or modifying test files, run only those specific tests — never the entire suite:

# TypeScript / Vitest — run specific test file(s)
bunx vitest run tests/unit/utils/test_formatCurrency.test.ts

# Python / pytest — run specific test file(s)
pytest tests/unit/variables/test_income.py -v

After tests pass, run formatters and typecheckers only on modified files:

# TypeScript — typecheck and lint only changed files
bunx tsc --noEmit
bunx eslint tests/unit/utils/test_formatCurrency.test.ts tests/fixtures/utils/test_formatCurrency.ts

# Python — format and lint only changed files
black tests/unit/variables/test_income.py tests/fixtures/variables/test_income.py
ruff check tests/unit/variables/test_income.py tests/fixtures/variables/test_income.py

Never run the full test suite or full linter unless explicitly asked. Large codebases take minutes to lint/test; running everything wastes time and produces noise unrelated to the changes.

Framework-Specific Notes

Vitest (TypeScript / React)

import { describe, test, expect, vi, beforeEach } from "vitest";

Use vi.fn() for mocks, vi.mock() for module mocks
Use vi.clearAllMocks() in beforeEach
For React components, prefer accessibility selectors (getByRole, getByLabelText) over test IDs
Use userEvent.setup() for user interactions (not fireEvent)
Use waitFor for async state updates

pytest (Python)

import pytest
from unittest.mock import MagicMock, patch

Use @pytest.fixture for setup, import from fixture files
Use @pytest.mark.parametrize for data-driven tests
Use pytest.raises(ExceptionType) for error assertions
Mark slow tests with @pytest.mark.slow

What to Test

Public API surface (exported functions, component props, class methods)
State transitions and side effects
Data transformations and calculations
Error handling and recovery paths
Boundary conditions and edge cases

What NOT to Test

Third-party library internals (Recharts rendering, Mantine components, pandas operations)
Private implementation details that may change
CSS/styling (unless testing conditional class application)
Simple pass-through getters with no logic

Detailed Reference

For fixture best practices, mock patterns, and accessibility selector priority, consult:

references/fixture-patterns.md — Comprehensive fixture organization and mock examples

Related Skills

policyengine/review-program

development

VerifiedTrustedCommunity

ALWAYS LOAD THIS SKILL for PolicyEngine PR reviews, including when the user invokes $review-program or Codex /review on a PolicyEngine PR. Performs read-only code validation, source-reference checks, regulatory review, optional PDF audit, summary reporting, and optional GitHub comment posting.

28SKILL.mdUpdated May 20, 2026

policyengine/review-program

policyengine/fix-pr

development

VerifiedTrustedCommunity

Use when the user invokes $fix-pr or asks Codex to apply fixes to a PolicyEngine PR based on $review-program findings, GitHub review comments, CI failures, or local review reports.

28SKILL.mdUpdated May 20, 2026

policyengine/encode-policy-v2

development

VerifiedTrustedCommunity

Use when the user invokes $encode-policy-v2 or asks Codex to implement a new PolicyEngine-US state benefit program from official rules. Covers research, source collection, requirement extraction, scoped implementation, tests, validation, and draft PR preparation.

28SKILL.mdUpdated May 20, 2026

policyengine/encode-policy-v2

policyengine/policyengine-vercel-deployment

development

VerifiedTrustedCommunity

Deploying PolicyEngine frontend apps to Vercel - naming, scope, team settings

28SKILL.mdUpdated Apr 28, 2026

policyengine/policyengine-vercel-deployment

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/policyengine/policyengine-claude.git

# Copy into Claude Code skills folder (global)
cp -r policyengine-claude/skills/technical-patterns/policyengine-test-writing-skill ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

policyengine/policyengine-claude

28 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT