Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

tctinh/verification-reviewer

Name: verification-reviewer
Author: tctinh

packages/opencode-hive/skills/verification-reviewer/SKILL.md

npx skillsauth add tctinh/agent-hive verification-reviewer

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Verification Reviewer

Overview

Verify implementation claims by attempting to falsify them. Your job is not to confirm success; it is to find where success claims break down.

Core principle: Try to prove claims wrong. If you cannot, they are likely correct.

When to Use

Use this skill when:

Reviewing implementation changes that claim to be complete
Conducting post-merge verification of a task batch
A reviewer needs to independently confirm that acceptance criteria are met
Verifying that a bug fix actually resolves the reported symptom

Do not use this skill for:

Plan or documentation review (use the default Hygienic review path)
Code style or architecture review (use code-reviewer)
Pre-implementation planning

The Iron Law

RATIONALIZATIONS ARE NOT EVIDENCE

"The code looks correct" is not verification. "It should work because..." is not verification. "The tests pass" without showing test output is not verification.

Only command output, tool results, and observable behavior count as evidence.

Verification Protocol

For each claim in the implementation:

Identify the claim: What specific thing is being asserted?
Find the falsification test: What command or check would fail if the claim is wrong?
Run the test: Execute the command fresh. Do not rely on cached or previous results.
Record the evidence: Quote the relevant output.
Verdict: Does the evidence support or contradict the claim?

Verification Depth by Change Type

Not all changes carry equal risk. Scale verification effort accordingly:

| Change type | Verification depth | Examples | |---|---|---| | Config / docs / prompts | Spot-check: confirm the file exists, syntax is valid, key content is present | Skill files, AGENTS.md, prompt strings | | Logic changes | Targeted: run the relevant test suite, check edge cases mentioned in the plan | New utility function, bug fix, refactor | | API / interface changes | Broad: run full test suite, check downstream consumers, verify types compile | New tool, changed function signatures | | Data model / migration | Exhaustive: run tests, verify data integrity, check backward compatibility | Schema changes, serialization format changes |

Anti-Rationalization Checklist

Before accepting any verification result, check yourself:

| Rationalization | Reality | |---|---| | "The code looks correct to me" | Reading code is not running code | | "The author said it passes" | Author claims are hypotheses, not evidence | | "It passed last time" | Stale evidence is not evidence | | "The linter is clean" | Linting does not prove correctness | | "The types compile" | Type-checking does not prove runtime behavior | | "I ran a similar check" | Similar is not the same | | "It's a trivial change" | Trivial changes break builds regularly |

Output Format

## Verification Report

**Scope**: [What was reviewed - task name, PR, batch]

### Claims Verified

| # | Claim | Test | Evidence | Verdict |
|---|-------|------|----------|---------|
| 1 | [What was claimed] | [Command/check run] | [Output excerpt] | PASS / FAIL / INCONCLUSIVE |

### Summary

[1-3 sentences: overall assessment, any gaps, recommended actions]

### Unverifiable Claims

[List any claims that could not be independently verified and why]

Verification Failures

When a claim fails verification:

Report the actual output verbatim (do not summarize or interpret).
State what was expected vs what was observed.
Do not suggest fixes unless specifically asked. Your role is to identify the gap, not fill it.
Flag severity: Does this block the work, or is it a minor discrepancy?

Key Principles

Attempt falsification first. Look for reasons the claim might be wrong before looking for reasons it is right.
One claim, one test. Do not batch multiple claims into a single verification step.
Fresh runs only. Re-run commands; do not reuse output from previous sessions or other agents.
Quote output. Paraphrasing introduces interpretation. Quote the relevant lines.
Proportional effort. Match verification depth to change risk. Do not spend 30 minutes verifying a typo fix.

tctinh/verification-reviewer

packages/opencode-hive/skills/verification-reviewer/SKILL.md

Use when independently verifying implementation claims, post-merge review, or when a reviewer needs to falsify success assertions with command-and-output evidence

142 stars

testing

Updated Apr 22, 2026

$ install --global

skillsauth

npx skillsauth add tctinh/agent-hive verification-reviewer

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 22, 2026, 11:53 AM123.5s1 file scanned

SKILL.md

name:: verification-reviewer
description:: Use when independently verifying implementation claims, post-merge review, or when a reviewer needs to falsify success assertions with command-and-output evidence

Verification Reviewer

Overview

Verify implementation claims by attempting to falsify them. Your job is not to confirm success; it is to find where success claims break down.

Core principle: Try to prove claims wrong. If you cannot, they are likely correct.

When to Use

Use this skill when:

Reviewing implementation changes that claim to be complete
Conducting post-merge verification of a task batch
A reviewer needs to independently confirm that acceptance criteria are met
Verifying that a bug fix actually resolves the reported symptom

Do not use this skill for:

Plan or documentation review (use the default Hygienic review path)
Code style or architecture review (use code-reviewer)
Pre-implementation planning

The Iron Law

RATIONALIZATIONS ARE NOT EVIDENCE

"The code looks correct" is not verification. "It should work because..." is not verification. "The tests pass" without showing test output is not verification.

Only command output, tool results, and observable behavior count as evidence.

Verification Protocol

For each claim in the implementation:

Identify the claim: What specific thing is being asserted?
Find the falsification test: What command or check would fail if the claim is wrong?
Run the test: Execute the command fresh. Do not rely on cached or previous results.
Record the evidence: Quote the relevant output.
Verdict: Does the evidence support or contradict the claim?

Verification Depth by Change Type

Not all changes carry equal risk. Scale verification effort accordingly:

Anti-Rationalization Checklist

Before accepting any verification result, check yourself:

Output Format

## Verification Report

**Scope**: [What was reviewed - task name, PR, batch]

### Claims Verified

| # | Claim | Test | Evidence | Verdict |
|---|-------|------|----------|---------|
| 1 | [What was claimed] | [Command/check run] | [Output excerpt] | PASS / FAIL / INCONCLUSIVE |

### Summary

[1-3 sentences: overall assessment, any gaps, recommended actions]

### Unverifiable Claims

[List any claims that could not be independently verified and why]

Verification Failures

When a claim fails verification:

Report the actual output verbatim (do not summarize or interpret).
State what was expected vs what was observed.
Do not suggest fixes unless specifically asked. Your role is to identify the gap, not fill it.
Flag severity: Does this block the work, or is it a minor discrepancy?

Key Principles

Attempt falsification first. Look for reasons the claim might be wrong before looking for reasons it is right.
One claim, one test. Do not batch multiple claims into a single verification step.
Fresh runs only. Re-run commands; do not reuse output from previous sessions or other agents.
Quote output. Paraphrasing introduces interpretation. Quote the relevant lines.
Proportional effort. Match verification depth to change risk. Do not spend 30 minutes verifying a typo fix.

Related Skills

tctinh/writing-plans

development

VerifiedTrustedCommunity

Use when you have a spec or requirements for a multi-step task, before touching code

142SKILL.mdUpdated Apr 22, 2026

tctinh/verification-before-completion

data-ai

VerifiedTrustedCommunity

Use when about to claim work is complete, fixed, or passing, before committing or creating PRs - requires running verification commands and confirming output before making any success claims; evidence before assertions always

142SKILL.mdUpdated Apr 22, 2026

tctinh/verification-before-completion

tctinh/test-driven-development

development

VerifiedTrustedCommunity

Use when implementing any feature or bugfix, before writing implementation code

142SKILL.mdUpdated Apr 22, 2026

tctinh/test-driven-development

tctinh/systematic-debugging

development

VerifiedTrustedCommunity

Use when encountering any bug, test failure, or unexpected behavior, before proposing fixes

142SKILL.mdUpdated Apr 22, 2026

tctinh/systematic-debugging

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/tctinh/agent-hive.git

# Copy into Claude Code skills folder (global)
cp -r agent-hive/packages/opencode-hive/skills/verification-reviewer ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

tctinh/agent-hive

142 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT