Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

bigeasyfreeman/test-strength

Name: test-strength
Author: bigeasyfreeman

skills/test-strength/SKILL.md

npx skillsauth add bigeasyfreeman/adlc test-strength

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Why This Exists

Passing tests are necessary but not sufficient. SWE-ABS-style test strength asks whether the generated verifier suite actually exercises the changed code and detects wrong behavior instead of merely replaying the happy path. This skill keeps coverage and mutation measurement deterministic, and uses LLM judgement only when surviving mutants still need interpretation.

Trigger

Run at phase 4 after qa passes and before slop_gate. In repos where slop_gate is still implicit, run after qa and before the next delivery gate.

Inputs

Use these inputs:

changed files from git diff
generated tests from .adlc/test_plan.json
language-appropriate mutation config

Mutation tooling is optional at the repo level, but not optional for a passing audit. If the repo language is unsupported or the standard mutator is unavailable, emit stuck instead of silently passing.

Judgement is not optional once mutants survive. Use mutant-materiality-judge via the active runtime's deep_judge slot to classify surviving mutants in batches.

Supported Language Detection

Detect the dominant repo language from changed files and repo conventions:

Python -> mutmut
JavaScript or TypeScript -> stryker
Rust -> cargo-mutants

If the changed files span multiple supported languages, audit the language that owns the generated tests in .adlc/test_plan.json and record the rationale in the report.

Audit Workflow

Read .adlc/test_plan.json and collect the generated test paths plus their target acceptance criteria.
Compute changed executable lines from git diff for the files under audit.
Run the generated tests with coverage scoped to the changed files.
Calculate coverage diff on changed executable lines only.
Detect the repo language and the standard mutation tool for that language.
Run mutation analysis on the changed files only.
If mutants survive, batch the surviving-mutant diffs and classify each survivor with mutant-materiality-judge as trivial or material.
Write .adlc/test_strength_report.json with thresholds, per-file coverage, mutant counts, surviving-mutant classifications, language detection rationale, and the verdict.

Gates

Coverage Diff

Threshold: >= 80% of changed executable lines must be covered by the generated tests.
Use the generated tests from .adlc/test_plan.json as the audit scope, not the whole suite.
Coverage is measured per changed file and summarized across the changed-file set.

Mutation Survival

Threshold: >= 60% mutation kill rate on changed files.
Use the language's standard mutator:
- Python -> mutmut
- JavaScript / TypeScript -> stryker
- Rust -> cargo-mutants
If the mutator is unsupported or unavailable, emit stuck.
If any mutants survive, classify those survivors with mutant-materiality-judge.
Any material surviving mutant forces the overall verdict to weak, even when the aggregate kill rate is still above 0.6.

Output

Write .adlc/test_strength_report.json.

{
  "$schema": "http://json-schema.org/draft-07/schema#",
  "title": "ADLC Test Strength Report",
  "type": "object",
  "additionalProperties": false,
  "required": [
    "report_path",
    "language",
    "language_detection_rationale",
    "coverage_threshold",
    "mutation_threshold",
    "files",
    "mutants_generated",
    "mutants_killed",
    "kill_rate",
    "verdict"
  ],
  "properties": {
    "report_path": {
      "type": "string",
      "const": ".adlc/test_strength_report.json"
    },
    "language": {
      "type": "string",
      "enum": ["python", "javascript", "typescript", "rust", "unknown"]
    },
    "language_detection_rationale": {
      "type": "string",
      "minLength": 1
    },
    "coverage_threshold": {
      "type": "number",
      "const": 0.8
    },
    "mutation_threshold": {
      "type": "number",
      "const": 0.6
    },
    "files": {
      "type": "array",
      "minItems": 1,
      "items": {
        "type": "object",
        "additionalProperties": false,
        "required": [
          "path",
          "changed_executable_lines",
          "covered_changed_lines",
          "coverage_ratio"
        ],
        "properties": {
          "path": {
            "type": "string",
            "minLength": 1
          },
          "changed_executable_lines": {
            "type": "integer",
            "minimum": 0
          },
          "covered_changed_lines": {
            "type": "integer",
            "minimum": 0
          },
          "coverage_ratio": {
            "type": "number",
            "minimum": 0,
            "maximum": 1
          }
        }
      }
    },
    "mutants_generated": {
      "type": "integer",
      "minimum": 0
    },
    "mutants_killed": {
      "type": "integer",
      "minimum": 0
    },
    "kill_rate": {
      "type": "number",
      "minimum": 0,
      "maximum": 1
    },
    "verdict": {
      "type": "string",
      "enum": ["pass", "weak", "stuck"]
    },
    "stuck_reason": {
      "type": "string"
    }
  }
}

Failure Modes

Coverage diff below 0.8 -> emit weak.
Mutation kill rate below 0.6 -> emit weak.
Any material surviving mutant -> emit weak.
Unsupported repo language -> emit stuck.
Standard mutator unavailable -> emit stuck.
Missing or invalid .adlc/test_plan.json -> emit stuck.

Weak findings are for test strengthening, not for waving away. The current DAG routes weak into the repair loop; that loop must return through test authoring before the next audit, and the retry budget is capped at test_strength_retry = 2.

Quality Gates

Before emitting pass, weak, or stuck, confirm:

.adlc/test_strength_report.json parses against the schema in this skill.
The report records coverage_threshold as 0.8 and mutation_threshold as 0.6.
The report includes a non-empty language_detection_rationale.
No threshold is silently defaulted or omitted.

Output Labels

pass: both thresholds met
weak: coverage or mutation strength below threshold
stuck: no supported language, no available standard mutator, or invalid audit input

bigeasyfreeman/test-strength

skills/test-strength/SKILL.md

Audits generated test strength on changed files using deterministic coverage and mutation measurement, then judges only surviving-mutant materiality.

testing

Updated Apr 23, 2026

$ install --global

skillsauth

npx skillsauth add bigeasyfreeman/adlc test-strength

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 23, 2026, 2:05 AM50.8s1 file scanned

SKILL.md

name:: test-strength
description:: Audits generated test strength on changed files using deterministic coverage and mutation measurement, then judges only surviving-mutant materiality.
contract_version:: 1.0.0
side_effect_profile:: mutating

Why This Exists

Trigger

Run at phase 4 after qa passes and before slop_gate. In repos where slop_gate is still implicit, run after qa and before the next delivery gate.

Inputs

Use these inputs:

changed files from git diff
generated tests from .adlc/test_plan.json
language-appropriate mutation config

Judgement is not optional once mutants survive. Use mutant-materiality-judge via the active runtime's deep_judge slot to classify surviving mutants in batches.

Supported Language Detection

Detect the dominant repo language from changed files and repo conventions:

Python -> mutmut
JavaScript or TypeScript -> stryker
Rust -> cargo-mutants

If the changed files span multiple supported languages, audit the language that owns the generated tests in .adlc/test_plan.json and record the rationale in the report.

Audit Workflow

Read .adlc/test_plan.json and collect the generated test paths plus their target acceptance criteria.
Compute changed executable lines from git diff for the files under audit.
Run the generated tests with coverage scoped to the changed files.
Calculate coverage diff on changed executable lines only.
Detect the repo language and the standard mutation tool for that language.
Run mutation analysis on the changed files only.
If mutants survive, batch the surviving-mutant diffs and classify each survivor with mutant-materiality-judge as trivial or material.
Write .adlc/test_strength_report.json with thresholds, per-file coverage, mutant counts, surviving-mutant classifications, language detection rationale, and the verdict.

Gates

Coverage Diff

Threshold: >= 80% of changed executable lines must be covered by the generated tests.
Use the generated tests from .adlc/test_plan.json as the audit scope, not the whole suite.
Coverage is measured per changed file and summarized across the changed-file set.

Mutation Survival

Threshold: >= 60% mutation kill rate on changed files.
Use the language's standard mutator:
- Python -> mutmut
- JavaScript / TypeScript -> stryker
- Rust -> cargo-mutants
If the mutator is unsupported or unavailable, emit stuck.
If any mutants survive, classify those survivors with mutant-materiality-judge.
Any material surviving mutant forces the overall verdict to weak, even when the aggregate kill rate is still above 0.6.

Output

Write .adlc/test_strength_report.json.

{
  "$schema": "http://json-schema.org/draft-07/schema#",
  "title": "ADLC Test Strength Report",
  "type": "object",
  "additionalProperties": false,
  "required": [
    "report_path",
    "language",
    "language_detection_rationale",
    "coverage_threshold",
    "mutation_threshold",
    "files",
    "mutants_generated",
    "mutants_killed",
    "kill_rate",
    "verdict"
  ],
  "properties": {
    "report_path": {
      "type": "string",
      "const": ".adlc/test_strength_report.json"
    },
    "language": {
      "type": "string",
      "enum": ["python", "javascript", "typescript", "rust", "unknown"]
    },
    "language_detection_rationale": {
      "type": "string",
      "minLength": 1
    },
    "coverage_threshold": {
      "type": "number",
      "const": 0.8
    },
    "mutation_threshold": {
      "type": "number",
      "const": 0.6
    },
    "files": {
      "type": "array",
      "minItems": 1,
      "items": {
        "type": "object",
        "additionalProperties": false,
        "required": [
          "path",
          "changed_executable_lines",
          "covered_changed_lines",
          "coverage_ratio"
        ],
        "properties": {
          "path": {
            "type": "string",
            "minLength": 1
          },
          "changed_executable_lines": {
            "type": "integer",
            "minimum": 0
          },
          "covered_changed_lines": {
            "type": "integer",
            "minimum": 0
          },
          "coverage_ratio": {
            "type": "number",
            "minimum": 0,
            "maximum": 1
          }
        }
      }
    },
    "mutants_generated": {
      "type": "integer",
      "minimum": 0
    },
    "mutants_killed": {
      "type": "integer",
      "minimum": 0
    },
    "kill_rate": {
      "type": "number",
      "minimum": 0,
      "maximum": 1
    },
    "verdict": {
      "type": "string",
      "enum": ["pass", "weak", "stuck"]
    },
    "stuck_reason": {
      "type": "string"
    }
  }
}

Failure Modes

Coverage diff below 0.8 -> emit weak.
Mutation kill rate below 0.6 -> emit weak.
Any material surviving mutant -> emit weak.
Unsupported repo language -> emit stuck.
Standard mutator unavailable -> emit stuck.
Missing or invalid .adlc/test_plan.json -> emit stuck.

Quality Gates

Before emitting pass, weak, or stuck, confirm:

.adlc/test_strength_report.json parses against the schema in this skill.
The report records coverage_threshold as 0.8 and mutation_threshold as 0.6.
The report includes a non-empty language_detection_rationale.
No threshold is silently defaulted or omitted.

Output Labels

pass: both thresholds met
weak: coverage or mutation strength below threshold
stuck: no supported language, no available standard mutator, or invalid audit input

Related Skills

bigeasyfreeman/build-feature

development

VerifiedTrustedCommunity

Orchestration skill: chains the full ADLC Build Loop. PRD → Brief → Council → Scaffold → Codegen → LDD → TDD → Council → PR. Use when implementing a new feature end-to-end.

1SKILL.mdUpdated Apr 22, 2026

bigeasyfreeman/build-feature

bigeasyfreeman/skills/helm-argocd-deployment

development

VerifiedTrustedCommunity

# Skill: Helm & ArgoCD Deployment > Validates Helm charts and generates ArgoCD Application manifests when the ADLC pipeline produces infrastructure or service code. Ensures every deployable artifact has correct chart structure, environment-specific values, and a GitOps-ready Application manifest before code review. --- ## Why This Exists Without deployment validation in the pipeline, common failures slip through to production: - **Helm charts fail `helm template`** because of missing values,

SKILL.mdUpdated May 5, 2026

bigeasyfreeman/skills/helm-argocd-deployment

bigeasyfreeman/verifier-semantic-judge

testing

VerifiedTrustedCommunity

Decide whether an intersecting verifier actually exercises the semantic change.

SKILL.mdUpdated Apr 23, 2026

bigeasyfreeman/verifier-semantic-judge

bigeasyfreeman/skills/ux-flow-builder

development

VerifiedTrustedCommunity

# Skill: UX Flow Builder > Generates user flow diagrams (Mermaid) from PRD personas and screen specifications. Surfaces dead ends, missing screens, and disconnected flows before design or engineering starts. Helps PMs think in screens, not features. --- ## Trigger - Automatically during PRD Phase 4 (Personas & Flows) to visualize the user journey - On-demand when the PM says "show me the flow" or "map the user journey" - During PRD evaluation to verify screen connectivity --- ## Input ```

SKILL.mdUpdated Apr 23, 2026

bigeasyfreeman/skills/ux-flow-builder

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/bigeasyfreeman/adlc.git

# Copy into Claude Code skills folder (global)
cp -r adlc/skills/test-strength ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

bigeasyfreeman/adlc

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT