Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

mryll/dual-testing

Name: dual-testing
Author: mryll

skills/dual-testing/SKILL.md

npx skillsauth add mryll/skills dual-testing

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Dual Testing

A language-agnostic strategy for code at the boundary between your application and external infrastructure: integration tests prove the full chain works for happy paths; unit/slice tests prove the error-handling and mapping logic. It aligns with the Testing Honeycomb / Test Diamond philosophy — boundary code is dominated by interaction complexity, so lean on integration for wiring and on fast mock-based tests for the branchy error logic.

The strategy is the same in every language. Only the tooling changes. Concrete per-language references are listed at the bottom; if none matches your language, the Adapt When No Reference Exists algorithm tells you how to apply it anyway.

Definitions

Integration test — exercises the real application boundary plus a real external dependency (real database, broker, etc., typically via Testcontainers). Proves the wiring and the happy-path behavior end to end.
Unit/slice test — the smallest test that still includes the mapping/adapter code (input validation, domain-error → outcome mapping), with the semantic dependencies mocked or faked. In some frameworks the mapping lives outside the handler class (an exception filter, @ControllerAdvice, error middleware), so the right test is a framework slice (in-process pipeline with the service mocked), not a pure class unit test.
The rule: never duplicate happy paths in both layers. Integration proves the chain; unit/slice proves the mapping. Some overlap on boundary scenarios (e.g. not-found) is fine because they exercise different concerns.

Decision List

Where a scenario belongs. The "outcome" is described generically — map it to your transport: HTTP status, gRPC status code, CLI exit code + stderr, or queue ack/nack/retry/dead-letter.

Happy path (successful create/read/update/delete) — Integration: proves real wiring. Outcome: success (e.g. HTTP 2xx).
Handler input validation (bad/missing field) — Unit/slice: pure mapping logic, no infra. Outcome: client error (e.g. HTTP 400).
Framework/middleware/filter validation (idempotency key, body binding, content negotiation) — Integration or slice that includes that pipeline: the check runs before your handler.
Auth/authz policy (middleware-owned) — Integration/slice including the middleware.
Resource not found — Both acceptable: integration proves the chain, unit/slice proves the mapping. Outcome: not-found (e.g. HTTP 404).
Infrastructure/DB error — Unit/slice: cannot force reliably with real infra. Outcome: server error (e.g. HTTP 500, or queue nack→retry/DLQ).
Circuit breaker open / resilience pattern — Unit/slice: deterministic only via a mock. Outcome: unavailable (e.g. HTTP 503).
Timeout / cancellation — Unit/slice: drive via a cancelled context/token.
Idempotency / deduplication — Integration: requires real state.
Notifications / events emitted — Integration: side effect of real infra.
Side effects in another store — Integration: verifies the real effect.
Empty-collection serialization ([] not null) — Integration: contract verified with real data.
Pure business/domain logic (defaults, calculations, optimistic-lock version handling) — Ordinary unit tests with real objects, no mocks or infra needed.

Do not bake specific status numbers into the strategy. HTTP 499 in particular is client-closed-request (nginx-specific), not a portable code — keep such codes in a transport-specific note, never in the portable list.

Testability Prerequisites (Principles)

Make the boundary mockable without coupling tests to infrastructure:

Depend on a narrow semantic port. Each handler needs a small abstraction for the data operations it uses (a use-case / port / repository interface), not the DB driver. If the project already has service interfaces, ports, mediators or handlers, mock those — do not add a redundant per-controller interface.
No external I/O at construction. A factory/constructor that wraps a connection must not ping or validate at construction time — unless the framework explicitly owns startup validation (e.g. a DI-container health check). This lets a test inject a broken or mock dependency and exercise the error path at request time.
Inject the port at the wiring point. The setup/registration code takes the abstraction; inject the real implementation in production and integration tests, a mock in unit/slice tests.
Mock the semantic port, not the driver. Mock domain methods (ListItems, DeleteItem); never mock Query/Rows/Tx. Driver-level mocks test SQL strings, not behavior, and are fragile.
Encapsulate a transaction in one method. When a handler orchestrates begin → queries → commit, put the whole transaction in one port method returning a domain result or a domain error. The handler only maps result → outcome.
Error model — two idioms, one decision. Distinguish business errors (→ not-found / client error) from infrastructure errors (→ server error). Languages that return errors use sentinels/typed errors (Go errors.Is(err, ErrNotFound), Rust Result); languages that throw use typed exceptions (NotFoundException) caught at the mapping layer (@ControllerAdvice, exception filter, error middleware). The mapping decision is identical; only the mechanism differs — and because the mapping layer is often a separate class, the test that covers it is usually a slice test.

Worker / Background-Job Variant

The same split applies to consumers and workers:

One integration test drives a real message/job through the real dispatcher and real infra (happy path).
Unit/slice tests mock the dispatch protocol to prove failure handling (retry, dead-letter, status transitions).
Create a fresh dispatcher/engine per test when registration is one-shot — some dispatchers reject or panic on duplicate handler registration.
Mock the full dispatch protocol the worker depends on (claim/next, resolve target, set status, succeed/fail).

Recommendations

Trade-offs, not hard rules:

Fixed error strings on server errors for contract-stable endpoints — do not leak internal error text. This keeps integration tests from coupling to internal messages.
Empty collections serialize as [], not null — verify in integration. (The language-specific mechanics, if any, live in the per-language reference.)
Update the dependency manifest/lockfile after adding a mocking library (and any tidy/restore step), or CI may silently skip the new tests.

When This Strategy Does NOT Apply

Pure functions (no external dependencies) — output-based tests directly.
Domain logic without infrastructure — unit tests with real objects, no mocks.
Utility code — simple tests, no strategy needed.

The dual strategy is specifically for code that sits at the boundary between your application and external infrastructure (databases, APIs, message queues).

Language References

If a reference matches your language or framework, read only that reference before writing tests. The core above tells you what belongs in each layer; the reference gives the idiomatic tooling, test shape, and language-specific gotchas — the core alone is not enough to get the idioms right.

Go (Gin, pgx, testify, testcontainers-go) — references/go.md
.NET / C# (ASP.NET Core; xUnit, Testcontainers for .NET, WebApplicationFactory, Moq) — references/dotnet.md
Java (Spring Boot; JUnit 5, Testcontainers @ServiceConnection, MockMvc, Mockito) — references/java.md

Adapt When No Reference Exists

No reference for your language/framework? Do not skip the strategy — adapt it. Follow this algorithm:

Identify the boundary adapter (handler/controller/consumer) and its transport contract (HTTP/gRPC/CLI/queue).
Identify the real external dependencies (DB, cache, broker, external API).
Write one happy-path test through the real wiring and real infrastructure. Testcontainers has libraries for most languages (Java, .NET, Go, Node, Python, Rust, and more); otherwise use an equivalent ephemeral real dependency and state the confidence gap.
Put each error-mapping branch in the smallest unit/slice test that still includes the mapping code.
Mock the semantic ports/use-cases — never the DB driver or framework internals.
Keep the Decision List above. Only the tooling changes, not the strategy.

Composing with Other Skills (Optional)

This skill is self-contained. If you also use them, these compose cleanly:

/test-namer — how to name the test (this skill decides where it goes; test-namer decides what to call it).
/vertical-slice-architecture — how to structure the feature directory.
/low-complexity — keeps test functions readable.

None are required to apply dual testing.

Checklist

For each new handler, feature, or worker:

[ ] Each handler depends on a narrow semantic port (or reuses an existing one), not the DB driver
[ ] The wiring point injects the port — real in prod/integration, mock in unit/slice
[ ] Constructors/factories do no external I/O at construction (unless the framework owns startup validation)
[ ] One integration test per happy path, through real wiring and real infrastructure
[ ] One unit/slice test per error-mapping branch (validation, not-found, infra error, resilience), with the port mocked
[ ] Happy paths are NOT duplicated in the unit/slice layer
[ ] The matching language reference was read — or, if none exists, the 6-step adaptation was applied

mryll/dual-testing

skills/dual-testing/SKILL.md

Language-agnostic strategy for testing code at the boundary with external infrastructure (databases, APIs, queues): integration tests with real infrastructure (e.g. Testcontainers) prove the full chain works for happy paths; unit/slice tests with mocks prove error-handling and mapping logic (domain error to status, input validation, infra failure). Works in any language/framework — Go, .NET/C#, Java, Python, TypeScript and more — with concrete references for Go, .NET (ASP.NET Core) and Java (Spring Boot) and an explicit path to adapt when no reference matches your language. Apply when designing a test strategy, creating a handler/feature/worker that needs tests, or deciding what type of test a scenario needs. Triggers: 'dual testing', 'integration vs unit', 'testcontainers vs mocks', 'what type of test', 'where should this test go', 'error path coverage'. Does NOT trigger on writing individual test assertions or test naming conventions (use test-namer for those).

3 stars

development

Updated Jun 5, 2026

$ install --global

skillsauth

npx skillsauth add mryll/skills dual-testing

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Jun 5, 2026, 6:34 AM57.8s10 files scanned

SKILL.md

name:: dual-testing
version:: 1.1.0
description:: Language-agnostic strategy for testing code at the boundary with external infrastructure (databases, APIs, queues): integration tests with real infrastructure (e.g. Testcontainers) prove the full chain works for happy paths; unit/slice tests with mocks prove error-handling and mapping logic (domain error to status, input validation, infra failure). Works in any language/framework — Go, .NET/C#, Java, Python, TypeScript and more — with concrete references for Go, .NET (ASP.NET Core) and Java (Spring Boot) and an explicit path to adapt when no reference matches your language. Apply when designing a test strategy, creating a handler/feature/worker that needs tests, or deciding what type of test a scenario needs. Triggers: 'dual testing', 'integration vs unit', 'testcontainers vs mocks', 'what type of test', 'where should this test go', 'error path coverage'. Does NOT trigger on writing individual test assertions or test naming conventions (use test-namer for those).

Dual Testing

Definitions

Integration test — exercises the real application boundary plus a real external dependency (real database, broker, etc., typically via Testcontainers). Proves the wiring and the happy-path behavior end to end.
Unit/slice test — the smallest test that still includes the mapping/adapter code (input validation, domain-error → outcome mapping), with the semantic dependencies mocked or faked. In some frameworks the mapping lives outside the handler class (an exception filter, @ControllerAdvice, error middleware), so the right test is a framework slice (in-process pipeline with the service mocked), not a pure class unit test.
The rule: never duplicate happy paths in both layers. Integration proves the chain; unit/slice proves the mapping. Some overlap on boundary scenarios (e.g. not-found) is fine because they exercise different concerns.

Decision List

Where a scenario belongs. The "outcome" is described generically — map it to your transport: HTTP status, gRPC status code, CLI exit code + stderr, or queue ack/nack/retry/dead-letter.

Happy path (successful create/read/update/delete) — Integration: proves real wiring. Outcome: success (e.g. HTTP 2xx).
Handler input validation (bad/missing field) — Unit/slice: pure mapping logic, no infra. Outcome: client error (e.g. HTTP 400).
Framework/middleware/filter validation (idempotency key, body binding, content negotiation) — Integration or slice that includes that pipeline: the check runs before your handler.
Auth/authz policy (middleware-owned) — Integration/slice including the middleware.
Resource not found — Both acceptable: integration proves the chain, unit/slice proves the mapping. Outcome: not-found (e.g. HTTP 404).
Infrastructure/DB error — Unit/slice: cannot force reliably with real infra. Outcome: server error (e.g. HTTP 500, or queue nack→retry/DLQ).
Circuit breaker open / resilience pattern — Unit/slice: deterministic only via a mock. Outcome: unavailable (e.g. HTTP 503).
Timeout / cancellation — Unit/slice: drive via a cancelled context/token.
Idempotency / deduplication — Integration: requires real state.
Notifications / events emitted — Integration: side effect of real infra.
Side effects in another store — Integration: verifies the real effect.
Empty-collection serialization ([] not null) — Integration: contract verified with real data.
Pure business/domain logic (defaults, calculations, optimistic-lock version handling) — Ordinary unit tests with real objects, no mocks or infra needed.

Testability Prerequisites (Principles)

Make the boundary mockable without coupling tests to infrastructure:

Depend on a narrow semantic port. Each handler needs a small abstraction for the data operations it uses (a use-case / port / repository interface), not the DB driver. If the project already has service interfaces, ports, mediators or handlers, mock those — do not add a redundant per-controller interface.
No external I/O at construction. A factory/constructor that wraps a connection must not ping or validate at construction time — unless the framework explicitly owns startup validation (e.g. a DI-container health check). This lets a test inject a broken or mock dependency and exercise the error path at request time.
Inject the port at the wiring point. The setup/registration code takes the abstraction; inject the real implementation in production and integration tests, a mock in unit/slice tests.
Mock the semantic port, not the driver. Mock domain methods (ListItems, DeleteItem); never mock Query/Rows/Tx. Driver-level mocks test SQL strings, not behavior, and are fragile.
Encapsulate a transaction in one method. When a handler orchestrates begin → queries → commit, put the whole transaction in one port method returning a domain result or a domain error. The handler only maps result → outcome.
Error model — two idioms, one decision. Distinguish business errors (→ not-found / client error) from infrastructure errors (→ server error). Languages that return errors use sentinels/typed errors (Go errors.Is(err, ErrNotFound), Rust Result); languages that throw use typed exceptions (NotFoundException) caught at the mapping layer (@ControllerAdvice, exception filter, error middleware). The mapping decision is identical; only the mechanism differs — and because the mapping layer is often a separate class, the test that covers it is usually a slice test.

Worker / Background-Job Variant

The same split applies to consumers and workers:

One integration test drives a real message/job through the real dispatcher and real infra (happy path).
Unit/slice tests mock the dispatch protocol to prove failure handling (retry, dead-letter, status transitions).
Create a fresh dispatcher/engine per test when registration is one-shot — some dispatchers reject or panic on duplicate handler registration.
Mock the full dispatch protocol the worker depends on (claim/next, resolve target, set status, succeed/fail).

Recommendations

Trade-offs, not hard rules:

Fixed error strings on server errors for contract-stable endpoints — do not leak internal error text. This keeps integration tests from coupling to internal messages.
Empty collections serialize as [], not null — verify in integration. (The language-specific mechanics, if any, live in the per-language reference.)
Update the dependency manifest/lockfile after adding a mocking library (and any tidy/restore step), or CI may silently skip the new tests.

When This Strategy Does NOT Apply

Pure functions (no external dependencies) — output-based tests directly.
Domain logic without infrastructure — unit tests with real objects, no mocks.
Utility code — simple tests, no strategy needed.

The dual strategy is specifically for code that sits at the boundary between your application and external infrastructure (databases, APIs, message queues).

Language References

Go (Gin, pgx, testify, testcontainers-go) — references/go.md
.NET / C# (ASP.NET Core; xUnit, Testcontainers for .NET, WebApplicationFactory, Moq) — references/dotnet.md
Java (Spring Boot; JUnit 5, Testcontainers @ServiceConnection, MockMvc, Mockito) — references/java.md

Adapt When No Reference Exists

No reference for your language/framework? Do not skip the strategy — adapt it. Follow this algorithm:

Identify the boundary adapter (handler/controller/consumer) and its transport contract (HTTP/gRPC/CLI/queue).
Identify the real external dependencies (DB, cache, broker, external API).
Write one happy-path test through the real wiring and real infrastructure. Testcontainers has libraries for most languages (Java, .NET, Go, Node, Python, Rust, and more); otherwise use an equivalent ephemeral real dependency and state the confidence gap.
Put each error-mapping branch in the smallest unit/slice test that still includes the mapping code.
Mock the semantic ports/use-cases — never the DB driver or framework internals.
Keep the Decision List above. Only the tooling changes, not the strategy.

Composing with Other Skills (Optional)

This skill is self-contained. If you also use them, these compose cleanly:

/test-namer — how to name the test (this skill decides where it goes; test-namer decides what to call it).
/vertical-slice-architecture — how to structure the feature directory.
/low-complexity — keeps test functions readable.

None are required to apply dual testing.

Checklist

For each new handler, feature, or worker:

[ ] Each handler depends on a narrow semantic port (or reuses an existing one), not the DB driver
[ ] The wiring point injects the port — real in prod/integration, mock in unit/slice
[ ] Constructors/factories do no external I/O at construction (unless the framework owns startup validation)
[ ] One integration test per happy path, through real wiring and real infrastructure
[ ] One unit/slice test per error-mapping branch (validation, not-found, infra error, resilience), with the port mocked
[ ] Happy paths are NOT duplicated in the unit/slice layer
[ ] The matching language reference was read — or, if none exists, the 6-step adaptation was applied

Related Skills

mryll/como-si-fuera-de-boca

tools

VerifiedTrustedCommunity

Explain anything — code, an error, a concept, or a non-technical topic — in the simplest, most plain-language way possible, ELI5-style, with a natural Río de la Plata (Argentine) voice that puts clarity first. Use ONLY when the user explicitly asks to have something dumbed down or simplified. Triggers (Spanish + English): 'explicámelo como si fuera de Boca' (or de River / de cualquier cuadro), 'explicámelo simple', 'explicalo fácil', 'más fácil', 'bajámelo un cambio', 'en criollo', 'como si tuviera 5 años', 'para tontos', 'ELI5', 'explain like I'm 5', 'dumb it down', 'in plain terms'. Optimized for technical material (code, architecture, tooling, errors) but the same method works for any topic. Do NOT use when the user wants full technical depth, a code review, or did not ask to simplify — this skill is for deliberate, on-request simplification, not for talking down to the user by default.

3SKILL.mdUpdated Jun 5, 2026

mryll/como-si-fuera-de-boca

mryll/codex-discuss

tools

VerifiedTrustedCommunity

Iterative non-code discussion between the local agent and Codex CLI on any open-ended topic: diet, fitness, writing, decisions, strategy, study plans, life choices, brainstorming. Orchestrates an automatic back-and-forth debate where both agents critique, propose alternatives, and iterate on the user's idea until reaching consensus. Codex CLI runs READ-ONLY, forms its own opinions, and normally does not navigate the filesystem unless the user provides file paths. Use when the user says discuss with codex, iterate with codex, consult codex, debate with codex, ask codex for a second opinion, get codex's take, or brainstorm with codex, including pasting or describing a plan, draft, idea, decision, or proposal and wanting a critical iterative review. Does NOT trigger on code review, plan-mode review of implementation plans, architecture discussions, or any technical software-engineering analysis; use codex-review for those.

3SKILL.mdUpdated May 15, 2026

mryll/codex-review

tools

VerifiedTrustedCommunity

Iterative code review and planning discussion between the local agent and Codex CLI. Orchestrates an automatic back-and-forth debate where both agents discuss findings, architecture decisions, or implementation plans until reaching consensus. Codex CLI runs READ-ONLY and never modifies files; model and reasoning effort come from the user's local Codex config. Supports plan mode: when the local agent has a plan ready, Codex evaluates and iterates on it before implementation, producing an updated consensus plan. Use when the user asks to review with codex, analyze with codex, discuss code with codex, iterate with codex, consult codex, ask codex, review the plan with codex, validate plan with codex, or any Codex CLI request for code review, architecture review, plan review, or implementation strategy. Does NOT trigger on non-code topics like diet, fitness, writing, life decisions, or general strategy; use codex-discuss for those.

3SKILL.mdUpdated Apr 21, 2026

mryll/explain-pr

development

VerifiedTrustedCommunity

Explain a GitHub Pull Request (PR) or GitLab Merge Request (MR) to the user in plain, easy-to-understand language: WHAT was done, WHY/what for, and HOW — with the relevant code snippets embedded. Invoke this proactively and automatically right after creating or finishing a PR/MR (e.g. after running `gh pr create`, `glab mr create`, or pushing a branch and opening a PR/MR), even if the user did not explicitly ask for an explanation. Also use whenever the user asks to explain, summarize, walk through, recap, or 'tell me what you did' about a PR/MR or the changes in a branch. Works with any coding agent and relies on the local git diff, so it does NOT require gh/glab to function. Do NOT use for unrelated code reviews, bug hunting, or writing the PR/MR description itself — this skill only explains finished work back to the user.

2SKILL.mdUpdated May 30, 2026

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/mryll/skills.git

# Copy into Claude Code skills folder (global)
cp -r skills/skills/dual-testing ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

mryll/skills

3 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT