Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

thkt/verify

Name: verify
Author: thkt

skills/verify/SKILL.md

npx skillsauth add thkt/dotclaude verify

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

/verify - Independent Outcome-Based Verification

Codex verifies independently in an isolated worktree. Claude Code orchestrates and synthesizes. Emits a binary Gate decision (Ready / NotReady) from reconciled static + dynamic evidence. No numeric score.

Rationalization Counters

| Excuse | Counter | | ------------------------------------ | ----------------------------------------------------------------- | | "Tests pass, so the code is correct" | Your tests, your environment. Independent verification is the gap | | "Codex will just find the same bugs" | Different model = different blind spots. That is the value | | "Adversarial testing takes too long" | Skip it if it does. Gate falls back to static-only mode | | "The code review already covered it" | Reviews read code. Verification runs code. Different evidence |

Input

| Arg | Value | Result | | ---- | ----------------------- | ------------------------- | | $1 | file path or directory | target mode | | $1 | omitted (changes exist) | diff mode (auto-detect) |

Mode Selection

| Condition | Mode | Scope | | ------------------------------------- | -------- | --------------------------- | | $1 is a file path or directory | target | Specified paths | | No $1, uncommitted changes exist | diff | Changed files (uncommitted) | | No $1, commits ahead of base branch | diff | Changed files (branch diff) | | No $1, no changes | — | Abort: "Nothing to verify" |

Base branch detection: main (default), override with --base <branch>.

Execution

| Phase | Action | Depends On | Detail | | ----- | ----------------------------------------- | ---------- | ------------------------------------------------------- | | 0 | Bootstrap worktree | — | references/bootstrap.md | | 1 | Evidence collection (parallel) | Phase 0 | references/phase-details.md § Phase 1 | | 2 | Deep verification (parallel) | Phase 1 | references/phase-details.md § Phase 2 | | 2.5 | Intent verification (adversarial results) | Phase 2 | references/phase-details.md § Phase 2.5 | | 3 | Evidence synthesis | Phase 2.5 | references/phase-details.md § Phase 3 | | final | Worktree cleanup | Always | references/phase-details.md § Cleanup |

Phase 0 constraints: Timeout 300s. On failure: skip Phase 1c, 2a → static-only verification. Log reason in report.

Parallel spawn rule: Phase 1 and Phase 2 must issue all Task / Bash / Codex exec calls concurrently within a single response. Sequential invocation negates the fan-out and doubles wall time.

Report

Gate rule canonical: references/gate-decision.md.

## Verification Report

| Field     | Value                                                  |
| --------- | ------------------------------------------------------ |
| gate      | Ready / NotReady                                       |
| mode      | diff (main) / diff (uncommitted) / target              |
| scope     | {file count} files                                     |
| bootstrap | success / failed: {reason}                             |

### Gate Decision

| Check       | Value                                       |
| ----------- | ------------------------------------------- |
| Build       | pass / fail / skipped                       |
| Tests       | pass / fail (N passed, M failed) / skipped  |
| Findings    | 0 / N high, M medium, L low                 |
| Adversarial | N/M passed / skipped                        |

### Blockers

[All reconciled findings + build/test failures + adversarial failures with Fix suggestions]

Empty: `(none)` when gate = Ready.

### Root Causes

[RC-001... with description, category, findings, action]

### Findings

[High / Medium severity tables with Source, File:Line, Description, Evidence]

### Adversarial Test Results

[test name, target, result, verdict per test]

### Outcome Evidence

[build/test pass/fail with stderr excerpts]

### Diff from previous

[Resolved / New / Carried from workspace/history/. "Legacy format — diff skipped" for Trust Score era reports.]

`<promise>PASS</promise>` is emitted by evidence-integrator when gate = Ready. Leader relays verbatim without regenerating.

Error Handling

| Error | Recovery | | --------------------------- | ----------------------------------------------- | | codex not installed | Print install instructions, abort | | Bootstrap timeout (300s) | Skip outcome + adversarial, static-only mode | | Codex review fails | Log error, proceed with audit reviewers only | | Codex exec timeout (600s) | Skip that phase, log in report | | Reviewer stall (120s) | Proceed without, log warning | | Challenger stall | Proceed with verifier only | | Verifier stall | Proceed with challenger only | | Integrator stall | Leader synthesizes manually (simplified report) | | No findings from any source | gate = Ready with note "no issues found" | | Worktree cleanup fails | Log warning, suggest manual cleanup |

Escalation

| Condition | Action | | ------------------------------------- | ---------------------------------- | | Any reconciled finding | Block merge, suggest /fix | | Architectural root causes found | Suggest /think for design review | | Adversarial tests reveal coverage gap | Suggest /code to add tests |

Verification

| Check | Required | | -------------------------------- | -------- | | Mode detected? | Yes | | Bootstrap attempted? | Yes | | Phase 1 produced evidence? | Yes | | Phase 2 challenger/verifier ran? | Yes | | Integrator produced report? | Yes | | Gate decision displayed? | Yes | | Worktree cleaned up? | Yes |

thkt/verify

skills/verify/SKILL.md

Independent outcome-based verification with Codex + audit reviewers. Emits binary Ready/NotReady gate from reconciled static + dynamic evidence. Use when user mentions 検証して, verify, 独立検証, outcome verification, gate decision, trust score (legacy), adversarial testing. Do NOT use for quick code review (use /polish) or static-only audit (use /audit).

9 stars

development

Updated Apr 20, 2026

$ install --global

skillsauth

npx skillsauth add thkt/dotclaude verify

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 20, 2026, 9:28 AM8.0s5 files scanned

SKILL.md

name:: verify
description:: |
allowed-tools:: Bash(codex:*), Bash(git worktree:*), Bash(git diff:*),
Bash(git status:: *), Bash(git log:*), Bash(git branch:*),
Bash(npm ci:: *), Bash(npm run:*), Bash(npm test:*),
Bash(cargo:: *), Bash(make:*), Bash(bun:*), Bash(pnpm:*),
Bash(yarn:: *), Bash(which:*), Bash(date:*), Bash(rm:*),
model:: opus
argument-hint:: [file paths or directory for target mode]
user-invocable:: true

/verify - Independent Outcome-Based Verification

Rationalization Counters

Input

Mode Selection

Base branch detection: main (default), override with --base <branch>.

Execution

Phase 0 constraints: Timeout 300s. On failure: skip Phase 1c, 2a → static-only verification. Log reason in report.

Parallel spawn rule: Phase 1 and Phase 2 must issue all Task / Bash / Codex exec calls concurrently within a single response. Sequential invocation negates the fan-out and doubles wall time.

Report

Gate rule canonical: references/gate-decision.md.

## Verification Report

| Field     | Value                                                  |
| --------- | ------------------------------------------------------ |
| gate      | Ready / NotReady                                       |
| mode      | diff (main) / diff (uncommitted) / target              |
| scope     | {file count} files                                     |
| bootstrap | success / failed: {reason}                             |

### Gate Decision

| Check       | Value                                       |
| ----------- | ------------------------------------------- |
| Build       | pass / fail / skipped                       |
| Tests       | pass / fail (N passed, M failed) / skipped  |
| Findings    | 0 / N high, M medium, L low                 |
| Adversarial | N/M passed / skipped                        |

### Blockers

[All reconciled findings + build/test failures + adversarial failures with Fix suggestions]

Empty: `(none)` when gate = Ready.

### Root Causes

[RC-001... with description, category, findings, action]

### Findings

[High / Medium severity tables with Source, File:Line, Description, Evidence]

### Adversarial Test Results

[test name, target, result, verdict per test]

### Outcome Evidence

[build/test pass/fail with stderr excerpts]

### Diff from previous

[Resolved / New / Carried from workspace/history/. "Legacy format — diff skipped" for Trust Score era reports.]

`<promise>PASS</promise>` is emitted by evidence-integrator when gate = Ready. Leader relays verbatim without regenerating.

Error Handling

Escalation

Verification

Related Skills

thkt/use-cli-herdr

tools

VerifiedTrustedCommunity

Delegate implementation to codex (coder) via the herdr-agentchat plugin and drive a two-pane conversation to completion.

11SKILL.mdUpdated Jul 27, 2026

thkt/scribe

development

VerifiedTrustedCommunity

Extract recurring patterns from past closed PRs/issues and the research findings in workspace/research/, verify them against the latest code, and propose them to docs/wiki/ via PR.

11SKILL.mdUpdated Jul 20, 2026

thkt/dr

development

VerifiedTrustedCommunity

Create Decision Records (DR) in MADR v4 format with auto-numbering.

11SKILL.mdUpdated Jul 15, 2026

thkt/shake

development

VerifiedTrustedCommunity

Detect flaky tests by shaking them — repeated runs under varied order, parallelism, and seed — plus a static smell scan that flags latent flakiness in tests that currently pass. Classify each target as confirmed-flaky, latent-flaky, or stable and fix the root cause without weakening the test. Do NOT use to fix a confirmed single bug (use /fix) or for static-only code review (use /audit).

11SKILL.mdUpdated Jul 13, 2026

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/thkt/dotclaude.git

# Copy into Claude Code skills folder (global)
cp -r dotclaude/skills/verify ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

thkt/dotclaude

9 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT