antoniocascais/test-quality

Name: test-quality
Author: antoniocascais

skills/test-quality/SKILL.md

npx skillsauth add antoniocascais/claude-code-toolkit test-quality

Test Quality Guide

Apply these principles when writing or reviewing tests. High line coverage does NOT mean strong tests — tests must verify correctness, not just exercise code paths.

Principles Checklist

When generating tests, systematically apply each technique:

1. Boundary Value Analysis (BVA)

Test at the edges of valid ranges, not the middle.

- Lower bound, upper bound, and just past each bound
- For a range such as 0 <= i < 10: test 0, 9, 10 (and optionally -1)
- For strings: empty string, single char, max length, max+1
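
As a minimal sketch of the range example above, assume a hypothetical function `is_single_digit` that accepts integers from 0 up to but not including 10 (the function name and its lower bound of 0 are assumptions made for illustration). Each test sits on an edge or one step past it:

```python
def is_single_digit(i: int) -> bool:
    """Hypothetical function under test: True for 0 <= i < 10."""
    return 0 <= i < 10


def test_lower_bound_is_accepted():
    assert is_single_digit(0) is True


def test_just_below_lower_bound_is_rejected():
    assert is_single_digit(-1) is False


def test_upper_bound_is_accepted():
    assert is_single_digit(9) is True


def test_just_past_upper_bound_is_rejected():
    assert is_single_digit(10) is False
```

A value from the middle of the range, such as 5, would pass against most off-by-one bugs, which is why it adds little here.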

2. Equivalence Partitioning

Group inputs into classes that should behave identically. Test one representative per class.

- Valid vs invalid partitions
- Reduces redundant tests while maintaining coverage
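
One way to picture this, assuming a hypothetical `ticket_price` function with three valid age classes and one invalid class (the function, the prices, and the age cut-offs are all invented for illustration):

```python
def ticket_price(age: int) -> int:
    """Hypothetical function under test: three valid classes, one invalid."""
    if age < 0:
        raise ValueError("age must be non-negative")
    if age <= 12:
        return 5   # child
    if age <= 64:
        return 10  # adult
    return 7       # senior


# One representative per valid partition: (label, input, expected price).
VALID_REPRESENTATIVES = [
    ("child", 6, 5),
    ("adult", 30, 10),
    ("senior", 80, 7),
]


def test_each_valid_partition():
    for label, age, expected in VALID_REPRESENTATIVES:
        assert ticket_price(age) == expected, label


def test_invalid_partition_is_rejected():
    try:
        ticket_price(-5)
    except ValueError as exc:
        assert "non-negative" in str(exc)
    else:
        raise AssertionError("negative age should raise ValueError")
```

Adding ages 7, 8, and 9 would only repeat the child class. The edges between classes (12/13 and 64/65) are a separate concern for boundary value analysis.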

3. Decision Table Testing

For combinatorial logic (multiple conditions → different outcomes):

- Enumerate all condition combinations
- Especially important for business rules with compound conditions
- AI-generated tests frequently miss edge combinations, so be exhaustive
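
A small sketch of a decision table as data, assuming a hypothetical `shipping_fee` rule with three boolean conditions (the rule and its amounts are invented). The first test guards against the table itself missing a combination:

```python
from itertools import product


def shipping_fee(is_member: bool, over_threshold: bool, is_express: bool) -> int:
    """Hypothetical rule: base fee waived for members or large orders; express adds 10."""
    base = 0 if (is_member or over_threshold) else 5
    surcharge = 10 if is_express else 0
    return base + surcharge


# (is_member, over_threshold, is_express) -> expected fee
DECISION_TABLE = {
    (False, False, False): 5,
    (False, False, True): 15,
    (False, True, False): 0,
    (False, True, True): 10,
    (True, False, False): 0,
    (True, False, True): 10,
    (True, True, False): 0,
    (True, True, True): 10,
}


def test_table_covers_every_combination():
    assert set(DECISION_TABLE) == set(product([False, True], repeat=3))


def test_every_row_matches_expected_fee():
    for conditions, expected in DECISION_TABLE.items():
        assert shipping_fee(*conditions) == expected, conditions
```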

4. State Transition Testing

For stateful code (workflows, FSMs, connection pools):

- Test all valid state transitions
- Test INVALID transitions and verify they are rejected
- Test sequences: what happens after multiple transitions?
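
The following sketch assumes a hypothetical `Connection` class with three states (`new`, `open`, `closed`) and a custom `InvalidTransition` error. None of these names come from a real library. It covers a valid sequence, a rejected transition, and a check that a rejected transition leaves the state untouched:

```python
class InvalidTransition(Exception):
    """Raised when a transition is not in the allowed set (hypothetical)."""


class Connection:
    """Hypothetical state machine: new -> open -> closed, nothing else."""

    _ALLOWED = {("new", "open"), ("open", "closed")}

    def __init__(self):
        self.state = "new"

    def _move(self, target):
        if (self.state, target) not in self._ALLOWED:
            raise InvalidTransition(f"{self.state} -> {target}")
        self.state = target

    def open(self):
        self._move("open")

    def close(self):
        self._move("closed")


def test_valid_sequence_new_open_closed():
    conn = Connection()
    conn.open()
    assert conn.state == "open"
    conn.close()
    assert conn.state == "closed"


def test_close_before_open_is_rejected_and_state_unchanged():
    conn = Connection()
    try:
        conn.close()
    except InvalidTransition:
        assert conn.state == "new"
    else:
        raise AssertionError("closing a new connection should be rejected")


def test_reopen_after_close_is_rejected():
    conn = Connection()
    conn.open()
    conn.close()
    try:
        conn.open()
    except InvalidTransition as exc:
        assert str(exc) == "closed -> open"
    else:
        raise AssertionError("reopening a closed connection should be rejected")
```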

5. Error Path Testing — AI's Biggest Blind Spot

Explicitly test every failure mode:

- Null/undefined/empty inputs
- Malformed data (wrong types, invalid formats)
- Timeouts and network failures
- Permission denied / authorization failures
- Resource exhaustion (full disk, OOM)
- Concurrent access / race conditions
- Empty collections, single-element collections
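
As an illustration of the first two failure modes in the list, assume a hypothetical `parse_port` function (the name, the error messages, and the `assert_raises` helper are invented for this sketch). Each failure case checks both the exception type and part of the message:

```python
def parse_port(text):
    """Hypothetical function under test: parse a TCP port from a string."""
    if text is None:
        raise TypeError("port must be a string, got None")
    stripped = text.strip()
    if not stripped:
        raise ValueError("port is empty")
    if not (stripped.isascii() and stripped.isdigit()):
        raise ValueError(f"port is not a number: {stripped!r}")
    port = int(stripped)
    if not 1 <= port <= 65535:
        raise ValueError(f"port out of range: {port}")
    return port


def assert_raises(exc_type, fragment, func, *args):
    """Fail unless func(*args) raises exc_type with fragment in its message."""
    try:
        func(*args)
    except exc_type as exc:
        assert fragment in str(exc), f"unexpected message: {exc}"
    else:
        raise AssertionError(f"{exc_type.__name__} was not raised")


def test_error_paths():
    assert_raises(TypeError, "None", parse_port, None)        # null input
    assert_raises(ValueError, "empty", parse_port, "")        # empty input
    assert_raises(ValueError, "empty", parse_port, "   ")     # whitespace only
    assert_raises(ValueError, "not a number", parse_port, "http")
    assert_raises(ValueError, "not a number", parse_port, "-1")
    assert_raises(ValueError, "out of range", parse_port, "0")
    assert_raises(ValueError, "out of range", parse_port, "65536")


def test_happy_path_still_works():
    assert parse_port(" 443 ") == 443
```

Timeouts, permission failures, and race conditions usually need fakes or fault injection and are not shown here.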

6. Property-Based Testing

Define invariants that must ALWAYS hold, let the framework generate inputs:

- sort(x) output is always ordered, has the same length as x, and contains the same elements
- decode(encode(x)) == x (roundtrip)
- f(x) >= 0 for all valid x (domain constraints)

Tools by language:

- Python: hypothesis
- JS/TS: fast-check
- Java: jqwik
- Rust: proptest
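
To show the idea without any third-party dependency, here is a hand-rolled sketch that uses only the standard library. It is not how hypothesis or the other listed tools work internally. Those frameworks add smarter input generation and shrinking of failing cases, which this sketch lacks. The helper names and the choice of base64 for the roundtrip (written as decode(encode(x)) == x) are assumptions for illustration:

```python
import base64
import random
from collections import Counter


def random_int_list(rng: random.Random) -> list:
    """Generate a list of 0 to 20 integers, including the empty list."""
    return [rng.randint(-1000, 1000) for _ in range(rng.randint(0, 20))]


def check_sort_properties(trials: int = 200, seed: int = 0) -> None:
    rng = random.Random(seed)  # fixed seed keeps failures reproducible
    for _ in range(trials):
        xs = random_int_list(rng)
        out = sorted(xs)
        assert len(out) == len(xs)                        # same length
        assert all(a <= b for a, b in zip(out, out[1:]))  # ordered
        assert Counter(out) == Counter(xs)                # same elements


def check_roundtrip_property(trials: int = 200, seed: int = 0) -> None:
    rng = random.Random(seed)
    for _ in range(trials):
        data = bytes(rng.randrange(256) for _ in range(rng.randint(0, 64)))
        assert base64.b64decode(base64.b64encode(data)) == data


check_sort_properties()
check_roundtrip_property()
```

With a framework, the generator functions would be replaced by the framework's own input strategies and the loops by its test decorator.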

7. Assertion Quality

Every test MUST verify something meaningful:

- BAD: call function, assert no exception → proves nothing
- BAD: assert result is not null → barely proves anything
- GOOD: assert specific return value matches expected
- GOOD: assert side effects occurred (DB write, API call, event emitted)
- GOOD: assert error type AND message for failure cases
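
The contrast can be sketched with a hypothetical `apply_discount` function that returns a value and records an audit entry (the function, its message text, and the list-based audit log are assumptions standing in for a real side effect such as a DB write):

```python
def apply_discount(price: float, percent: float, audit_log: list) -> float:
    """Hypothetical function under test: returns a price and logs the change."""
    if not 0 <= percent <= 100:
        raise ValueError(f"percent must be between 0 and 100, got {percent}")
    discounted = round(price * (1 - percent / 100), 2)
    audit_log.append(("discount_applied", price, discounted))
    return discounted


def test_weak_not_none():
    # BAD: passes even if the arithmetic is completely wrong.
    assert apply_discount(200.0, 25, []) is not None


def test_strong_value_and_side_effect():
    log = []
    result = apply_discount(200.0, 25, log)
    assert result == 150.0                                  # specific value
    assert log == [("discount_applied", 200.0, 150.0)]      # side effect


def test_strong_error_type_and_message():
    log = []
    try:
        apply_discount(200.0, 150, log)
    except ValueError as exc:
        assert "between 0 and 100" in str(exc)              # type AND message
    else:
        raise AssertionError("ValueError was not raised")
    assert log == []                                        # nothing logged on failure
```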

8. AAA Pattern

Structure every test as: Arrange → Act → Assert

- One logical assertion per test (multiple assert calls are fine if testing one behavior)
- Test name describes the behavior being verified
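
A short sketch of the three phases, assuming a hypothetical `Cart` class that stores prices in whole cents (the class and its methods are invented for this example):

```python
class Cart:
    """Hypothetical class under test; prices are integers in cents."""

    def __init__(self):
        self._items = {}

    def add(self, sku: str, unit_price: int, quantity: int = 1) -> None:
        _, current_qty = self._items.get(sku, (unit_price, 0))
        self._items[sku] = (unit_price, current_qty + quantity)

    def total(self) -> int:
        return sum(price * qty for price, qty in self._items.values())


def test_total_sums_unit_price_times_quantity_across_items():
    # Arrange
    cart = Cart()
    cart.add("pen", 200, quantity=3)
    cart.add("book", 1000)

    # Act
    total = cart.total()

    # Assert
    assert total == 1600
```

The test name states the behavior, so a failure report reads as a sentence about what broke.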

9. Test Behavior, Not Implementation

- Test the public contract / API surface
- If a test mocks 3+ internal methods, it is probably too coupled to the implementation
- Refactors should not break tests unless behavior changes
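
For instance, assume a hypothetical `slugify` function built from two private helpers (all names here are invented). The tests call only the public function, so the helpers could be merged, renamed, or rewritten without any test changing:

```python
import re


def _strip_punctuation(text: str) -> str:
    """Private helper: an implementation detail, not tested directly."""
    return re.sub(r"[^\w\s-]", "", text)


def _collapse_whitespace(text: str) -> str:
    """Private helper: an implementation detail, not tested directly."""
    return re.sub(r"[\s_]+", "-", text.strip())


def slugify(title: str) -> str:
    """Public contract: lowercase, punctuation removed, words joined by '-'."""
    return _collapse_whitespace(_strip_punctuation(title)).lower()


def test_punctuation_is_removed_and_words_are_hyphenated():
    assert slugify("Hello, World!") == "hello-world"


def test_extra_whitespace_collapses_to_single_hyphen():
    assert slugify("  Test   Quality  Guide ") == "test-quality-guide"
```

A test that patched `_strip_punctuation` and asserted it was called once would break on a harmless refactor while proving nothing about the output.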

Mutation Testing

After writing tests, recommend running mutation testing to validate test strength:

- JS/TS: Stryker (npx stryker run)
- Python: mutmut (mutmut run)
- Java: PIT (mvn org.pitest:pitest-maven:mutationCoverage)
- Rust: cargo-mutants (cargo mutants)

A mutation score below 60% combined with high line coverage indicates weak tests: the code is being executed, but most injected bugs go undetected.
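
To make the score concrete, here is a hand-written illustration of what these tools automate. It does not use or imitate any of the tools listed above. The two mutants and both test suites are invented for the example. A mutant is "killed" when at least one test case fails against it:

```python
def original(i: int) -> bool:
    return i < 10


def mutant_boundary(i: int) -> bool:
    return i <= 10          # "<" flipped to "<="


def mutant_negated(i: int) -> bool:
    return not (i < 10)     # condition negated


def suite_passes(func, cases) -> bool:
    """True if func returns the expected value for every (input, expected) case."""
    return all(func(value) == expected for value, expected in cases)


def mutation_score(cases, mutants) -> float:
    """Fraction of mutants that make at least one case fail."""
    killed = sum(1 for mutant in mutants if not suite_passes(mutant, cases))
    return killed / len(mutants)


MUTANTS = [mutant_boundary, mutant_negated]
WEAK_CASES = [(5, True), (50, False)]     # mid-range values only
STRONG_CASES = [(9, True), (10, False)]   # boundary values

# Both suites pass on the original and execute its only line.
assert suite_passes(original, WEAK_CASES)
assert suite_passes(original, STRONG_CASES)

print(mutation_score(WEAK_CASES, MUTANTS))    # 0.5: the "<=" mutant survives
print(mutation_score(STRONG_CASES, MUTANTS))  # 1.0: both mutants are killed
```

Both suites give the same line coverage, yet only the boundary cases catch the flipped comparison.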

When Writing New Tests

1. Identify the function/module under test
2. List input partitions (valid classes, invalid classes)
3. For each partition, identify boundaries
4. Write happy path tests with specific assertions
5. Write error path tests for every failure mode
6. Consider: are there invariants suitable for property-based tests?
7. Check: would a mutation (flipping < to <=, removing a line) be caught?

When Reviewing Existing Tests

Flag these weaknesses:

- Tests that call code without meaningful assertions
- Missing boundary values
- No error/failure path tests
- Over-mocking (testing mocks, not behavior)
- Redundant tests in the same equivalence class

Test Strength Summary

After generating or reviewing tests, output a brief summary:

Test Strength:
- Boundaries: [covered/partial/missing] — list any gaps
- Error paths: [covered/partial/missing] — list untested failures
- Assertion quality: [strong/moderate/weak]
- Property-based candidates: [yes/no] — suggest if applicable
- Mutation resilience: [likely high/moderate/likely low]

Guides strong, effective unit test generation using proven testing techniques. Use when writing unit tests, reviewing test quality, improving existing tests, generating test cases, checking test coverage strength, or when tests exist but may be weak. Triggers on: unit test, test quality, test coverage, write tests, improve tests, review tests, test strength, mutation testing, boundary testing.

5 stars

testing

Updated Apr 4, 2026

npx skillsauth add antoniocascais/claude-code-toolkit test-quality

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean. Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

- Trivy (Container and dependency vulnerability scanner): Clean, 95%
- Semgrep (Static code analysis for vulnerabilities): Clean, 95%
- mcp-scan (Snyk) (Model Context Protocol security validation): Clean, 95%
- Snyk (dep) (Open source security scanning): Skipped, 50%
- Socket.dev (Supply chain security analysis): Skipped, 50%
- VirusTotal (Multi-engine malware detection): Skipped, 50%
- CrowdStrike (Advanced threat intelligence): Skipped, 50%
- OSV-Scanner (Open Source Vulnerability database check): Skipped, 50%
- OWASP Dep-Check: Skipped, 50%

Last scanned: Apr 23, 2026, 1:36 AM (1.7s, 1 file scanned)

Install manually

# Clone the repo
git clone https://github.com/antoniocascais/claude-code-toolkit.git

# Create the global Claude Code skills folder if it does not exist, then copy the skill into it
mkdir -p ~/.claude/skills
cp -r claude-code-toolkit/skills/test-quality ~/.claude/skills/

Repository: antoniocascais/claude-code-toolkit (5 stars)

Compatible with: Claude Code, OpenAI Codex CLI, ChatGPT