thermiteau/skills/do-test

Name: skills/do-test
Author: thermiteau

skills/do-test/SKILL.md

npx skillsauth add thermiteau/maverick skills/do-test

Depends on: mav-bp-unit-testing, mav-bp-integration-testing, mav-local-verification, mav-scope-boundaries

Write or Update Tests

Wraps test-writing work in a Maverick skill so the workflow report records what was tested and the agent applies the project's testing standards before writing tests. Invoke this skill once per testable change from inside do-issue-solo / do-issue-guided / do-epic Phase 5 — typically as a sibling call to do-code, before the commit for that task.

This skill is a thin wrapper. The actual test files are written by the orchestrating Claude Code session using its built-in tools (Read, Edit, Write, Bash). The skill's job is to pin the right standards for the mode and keep scope tight.

Preflight (mandatory)

Run this first. If it exits non-zero, halt and report the stderr output to the user verbatim. Do not proceed.

uv run maverick preflight do-test

The check verifies the project is initialised and uv is on PATH.
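The halt-on-failure rule above can be sketched in Python. This is a minimal illustration, not part of Maverick: `run_preflight` and `PreflightFailed` are hypothetical names, and only the command itself comes from the text.

```python
import subprocess


class PreflightFailed(RuntimeError):
    """Raised when the preflight command exits non-zero (hypothetical name)."""


def run_preflight(command):
    """Run the preflight command; on failure, raise with its stderr unmodified."""
    result = subprocess.run(command, capture_output=True, text=True)
    if result.returncode != 0:
        # Surface stderr verbatim and stop; the skill must not proceed.
        raise PreflightFailed(result.stderr)
    return result.stdout
```

Under these assumptions, a caller would invoke `run_preflight(["uv", "run", "maverick", "preflight", "do-test"])` and let the exception end the run, printing its message unchanged.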

Mode Selection

The skill argument must specify a mode:

unit — module-scoped, fast, deterministic. No network, no real database, no real filesystem outside the test's own tmp_path. Use this for any change that can be exercised through its public surface in-process.
integration — crosses module / service / database / external API boundaries. Real I/O is allowed (and usually required). Slower; reserved for changes whose behaviour only emerges across that boundary.

If no mode is specified, halt and ask the caller to pick one. Do not guess — getting the mode wrong invites flaky tests or fast-but-untrue coverage.
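The "do not guess" rule can be sketched as a small validator. The names `parse_mode` and `ModeRequired` are hypothetical, and treating an unrecognised value the same as a missing one is an assumption made for this illustration.

```python
VALID_MODES = ("unit", "integration")


class ModeRequired(ValueError):
    """Raised when no recognised mode was supplied (hypothetical name)."""


def parse_mode(argument):
    """Return the requested mode, or raise instead of guessing one."""
    mode = (argument or "").strip().lower()
    if mode not in VALID_MODES:
        # Assumption: an unknown value is handled like a missing one.
        raise ModeRequired(
            f"mode is required: choose one of {', '.join(VALID_MODES)}"
        )
    return mode
```

Raising here gives the caller a single place to halt and ask, rather than silently defaulting to one mode.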

Unit Mode

Apply mav-bp-unit-testing for the project's unit-testing conventions (test framework, naming, fixture style, assertion patterns).

Locate the test file. Look for an existing test file that corresponds to the production code you just changed. If none exists, follow the project's convention for placement (tests/unit/test_<module>.py, <module>.test.ts, etc.) — check how nearby modules are tested.
Identify what changed in observable behaviour. Write tests against the public surface, not the internals. If the change is purely refactoring with no behaviour change, the existing tests should already cover it — re-running them is the validation.
Write one test per branch / case. Cover the happy path, relevant edge cases, and any error paths the change introduced. Don't write a single mega-test that exercises everything.
Verify by running the project's unit suite per mav-local-verification. The whole suite — not just the new tests — must stay green.
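As an illustration of the steps above, here is a minimal sketch in pytest style. `count_lines` is a hypothetical stand-in for the production code that changed; the project's real conventions come from mav-bp-unit-testing. Each case gets its own test, and the only filesystem touched is pytest's `tmp_path` fixture.

```python
from pathlib import Path


def count_lines(path: Path) -> int:
    """Hypothetical production function: count the non-blank lines in a file."""
    return sum(1 for line in path.read_text().splitlines() if line.strip())


def test_counts_non_blank_lines(tmp_path):
    # Happy path: blank lines are ignored.
    target = tmp_path / "notes.txt"
    target.write_text("alpha\n\nbeta\n")
    assert count_lines(target) == 2


def test_empty_file_has_zero_lines(tmp_path):
    # Edge case: an empty file.
    target = tmp_path / "empty.txt"
    target.write_text("")
    assert count_lines(target) == 0


def test_missing_file_raises(tmp_path):
    # Error path: the file does not exist.
    try:
        count_lines(tmp_path / "missing.txt")
    except FileNotFoundError:
        pass
    else:
        raise AssertionError("expected FileNotFoundError")
```

The error-path test uses plain `try`/`except` to keep the sketch free of imports; in a pytest project, `pytest.raises` would be the usual choice.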

Integration Mode

Apply mav-bp-integration-testing for the project's integration-testing conventions (which real services are stood up, fixture lifecycle, isolation strategy, slowness budget).

Locate or create the integration test file. Integration tests typically live separately from unit tests (tests/integration/, e2e/, __tests__/integration/, etc.) and have their own runner config.
Stand up real dependencies. Do not mock the boundary the test is supposed to exercise — if a test "covers" the database layer with a mocked database, it's a unit test, not an integration test.
Test the boundary, not the unit. Assert observable end-to-end behaviour, not implementation details on either side of the boundary.
Clean up. Integration tests must leave no state behind — transactions roll back, fixtures tear down, temporary resources are released. Failure to clean up makes the next test flaky.
Verify by running the project's integration suite. Note the wall-clock cost — if a new test pushes a green CI run past the project's slowness budget, refactor it or split it.
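The integration steps can be sketched with a real, file-backed SQLite database standing in for the project's actual dependency. Everything here is an assumption for illustration: `save_user`, `real_database`, and the `users` table are hypothetical, and a real project would use whatever services mav-bp-integration-testing prescribes.

```python
import sqlite3
import tempfile
from contextlib import contextmanager
from pathlib import Path


def save_user(conn, name):
    """Hypothetical code under test: insert a user and return its row id."""
    cursor = conn.execute("INSERT INTO users (name) VALUES (?)", (name,))
    return cursor.lastrowid


@contextmanager
def real_database():
    """Stand up a real SQLite database on disk and remove it afterwards."""
    with tempfile.TemporaryDirectory() as workdir:
        conn = sqlite3.connect(Path(workdir) / "app.db")
        conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT NOT NULL)")
        conn.commit()
        try:
            yield conn
        finally:
            conn.rollback()  # discard anything the test wrote
            conn.close()     # release the file before the directory is deleted


def test_saved_user_can_be_read_back():
    with real_database() as conn:
        user_id = save_user(conn, "ada")
        # Assert on what is observable through the database itself.
        row = conn.execute("SELECT name FROM users WHERE id = ?", (user_id,)).fetchone()
        assert row == ("ada",)
```

The rollback and temporary directory together ensure the test leaves no state behind, whether it passes or fails.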

Rules

No mocks at the unit's own boundary. If you're testing foo(), don't mock foo()'s direct collaborators with such fidelity that the test passes regardless of foo()'s behaviour. Mock at one hop out: the network, the database, the clock.
Tests describe behaviour, not implementation. A test that breaks on internal refactors without a behaviour change is a maintenance cost. Write assertions against return values, side effects, and invariants — not against private call sequences.
No assert True placeholders. If you don't know how to test the change, halt and ask rather than ship an empty test.
Stay inside the task envelope. Don't refactor adjacent unrelated tests in this skill. Follow mav-scope-boundaries.
Tests must be deterministic. No reliance on system time, random seeds, network availability outside the boundary under test, or ordering between independent tests.
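One way to satisfy the determinism rule while mocking "one hop out" is to pass the clock in as a parameter. This is a sketch under stated assumptions: `is_expired` is a hypothetical unit under test, and the assertions target its return value rather than any internal call sequence.

```python
from datetime import datetime, timedelta, timezone


def is_expired(issued_at, ttl, now=None):
    """Hypothetical unit under test: has a token outlived its time-to-live?"""
    current = now if now is not None else datetime.now(timezone.utc)
    return current - issued_at >= ttl


def test_token_expires_exactly_at_ttl():
    issued = datetime(2024, 1, 1, 12, 0, tzinfo=timezone.utc)
    ttl = timedelta(minutes=30)
    # The test supplies the clock, so the result never depends on system time.
    assert is_expired(issued, ttl, now=issued + ttl) is True


def test_token_is_valid_before_ttl():
    issued = datetime(2024, 1, 1, 12, 0, tzinfo=timezone.utc)
    ttl = timedelta(minutes=30)
    assert is_expired(issued, ttl, now=issued + timedelta(minutes=29)) is False
```

Because the tests fix `now` explicitly, they give the same result on every run and in any order.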

Return Protocol

do-test is a subroutine of the calling workflow's per-task loop — not a terminal action. Returning from this skill is a hand-back to the caller's next numbered step, not a completion event (#106).

When you return from this skill, do not post a closing summary, do not stop, do not treat green-tests as "task done." The calling workflow's loop still owns, in order:

Closing the skill-dispatch interval that wrapped this invocation (uv run maverick report end skill-dispatch … --outcome success).
Conventional commit referencing the issue number, then push (with stacked-PR retarget if applicable).
Logging the commit row to the workflow report (uv run maverick report commit …).
Updating the tasks comment / closing the sub-issue.
Heartbeat refresh.

If you find yourself drafting a final summary after returning here, that is the signal: scroll back to the calling workflow's per-task loop and resume from the step immediately after the /do-test invocation.
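The hand-back can be pictured as an ordered checklist the caller still has to work through. The sketch below is purely illustrative: `REMAINING_CALLER_STEPS` and `next_caller_step` are hypothetical names, and the step labels paraphrase the list above rather than any Maverick API.

```python
# Steps the calling workflow still owns after do-test returns, in order.
REMAINING_CALLER_STEPS = (
    "close the skill-dispatch interval",
    "conventional commit referencing the issue number, then push",
    "log the commit row to the workflow report",
    "update the tasks comment / close the sub-issue",
    "heartbeat refresh",
)


def next_caller_step(completed_count):
    """Return the next step the caller owes, or None once all are done."""
    if completed_count < 0:
        raise ValueError("completed_count cannot be negative")
    if completed_count >= len(REMAINING_CALLER_STEPS):
        return None
    return REMAINING_CALLER_STEPS[completed_count]
```

Immediately after do-test returns, `next_caller_step(0)` yields the first outstanding step, which is closing the skill-dispatch interval, not writing a summary.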

9 stars

development

Updated Jun 8, 2026

npx skillsauth add thermiteau/maverick skills/do-test

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Clean (95%): Trivy, container and dependency vulnerability scanner
Clean (95%): Semgrep, static code analysis for vulnerabilities
Clean (95%): mcp-scan (Snyk), Model Context Protocol security validation
Skipped (50%): Snyk (dep), open source security scanning
Skipped (50%): Socket.dev, supply chain security analysis
Skipped (50%): VirusTotal, multi-engine malware detection
Skipped (50%): CrowdStrike, advanced threat intelligence
Skipped (50%): OSV-Scanner, Open Source Vulnerability database check
Skipped (50%): OWASP Dep-Check

Last scanned: Jun 8, 2026, 2:25 AM (114.5s, 1 file scanned)

SKILL.md

name: do-test
description: Write or update tests for a code change. Operates in two modes: `unit` (module-scoped, fast, deterministic) and `integration` (crosses module / service / database boundaries). Intended to be invoked once per testable change from inside a do-issue-* or do-epic phase. Mode is required.
argument-hint: mode: unit or integration
user-invocable: true
disable-model-invocation: false

Related Skills

thermiteau/do-code

development

Verified · Trusted · Community

Implement a focused code change. Use this skill as the wrapper for any implementation work so the Maverick workflow report captures what was done and so the agent applies the project's coding standards before editing. Intended to be invoked once per task from inside a do-issue-* or do-epic phase, not standalone.

9 · SKILL.md · Updated May 21, 2026

thermiteau/mav-stacked-prs

testing

Verified · Trusted · Community

How to stack a PR on top of an unmerged sibling branch, and how to retarget it to the repo's default branch once the sibling merges. Prevents orphan-merge incidents when a dependent story is ready before its parent.

9 · SKILL.md · Updated Apr 28, 2026

thermiteau/mav-multi-instance-coordination

development

Verified · Trusted · Community

Claim, lease, heartbeat, and release protocols for when multiple Claude Code instances may act on the same issue or epic concurrently. GitHub labels and marker comments are the coordination surface; local state is a cache.

9 · SKILL.md · Updated Apr 28, 2026

thermiteau/mav-durability-on-gh

documentation

Verified · Trusted · Community

Durability conventions for multi-instance Maverick workflows. Covers cold-start hydration from GitHub, marker-write protocols, push-per-task cadence, and recreating worktrees from remote branches. GitHub is the source of truth; local files are a cache.

9 · SKILL.md · Updated Apr 28, 2026

Install manually

# Clone the repo
git clone https://github.com/thermiteau/maverick.git

# Copy into Claude Code skills folder (global), creating it first if needed
mkdir -p ~/.claude/skills
cp -r maverick/skills/do-test ~/.claude/skills/

Repository

thermiteau/maverick

9 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT