evidence-review/SKILL.md
Evidence-grounded review for code, diffs, PRs, documents, plans, specs, and architecture. Use for evidence review, review, code review, quick review, sanity check, quality check, architecture review, production readiness, security review, scaling review, document review, evaluate, or check.
npx skillsauth add llblab/skills evidence-reviewInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
One careful reviewer. Fast when scope is small, deep when risk is high, decision-grade when architecture is the target.
This skill may read code, tests, diffs, docs, logs, configs, and local validation output. This skill may run non-destructive inspection commands, linters, tests, and diff commands. This skill may write a review document only when the user explicitly asks for a file artifact. This skill must not edit implementation, rewrite tests, commit, push, publish, deploy, or post external comments. This is a review, not a fix. Findings first; remediation starts only after the user asks.
If the mode is ambiguous and no safe default exists, ask the user to choose Quick, Focused, or Architecture.
Entry: The user asked for a review or provided a target.
Determine target and mode:
Gather minimum context:
AGENTS.md, CLAUDE.md, CODEX.md, or GEMINI.md when present.Exit: Scope, mode, conventions, and risk areas are known.
Entry: Scope is known.
Review from evidence, not impressions.
For code, inspect:
For documents, inspect:
For architecture, inspect:
Exit: Candidate findings exist with supporting evidence.
Use this when Architecture mode is active. Do enough to make decisions without turning the review into a full consulting engagement.
Use artifact mode when the user asks for a file, the review is a handoff, or findings exceed what fits cleanly in chat. Do not write artifacts by default. If useful but not requested, offer it.
Default paths:
docs/reviews/YYYY-MM-DD-<slug>.mddocs/reviews/YYYY-MM-DD-architecture-<slug>.mdArtifact structure:
# Review: <scope>
Date: <YYYY-MM-DD>
Mode: <Quick | Focused | Architecture | High-stakes>
Verdict: <verdict>
## Executive Summary
...
## Evidence Map
...
## Findings
...
## Architecture Addendum
...
## ADR Candidates
...
## Execution Order
...
## Appendix
Checked files and commands.
When writing an artifact, include checked files, commands, assumptions, uncertainty, and evidence type for major claims.
Entry: Candidate findings exist.
Challenge every finding:
Drop findings that cannot survive this challenge. Mark uncertainty explicitly instead of pretending confidence. For high-stakes reviews, perform an independent second pass or subagent pass when available, then reconcile disagreements against evidence.
Exit: Findings are verified, deduplicated, and severity-ranked.
Entry: Findings are verified.
Report only useful signal. Put the most important issues first.
## Review: <scope>
### Verdict
APPROVE | APPROVE WITH NOTES | REQUEST CHANGES | READY | NEEDS REFINEMENT | FIT WITH GAPS | NOT READY
### Critical Issues
- **[CRIT-1]** path#line — Finding. Evidence. Failure scenario. Suggested direction.
### Suggestions
- **[SUG-1]** path#line — Improvement. Trade-off if ignored.
### Observations
- **[OBS-1]** Useful context that does not block shipping.
### Document Gaps
- **[GAP-1]** Missing contract or unclear criterion. Why it matters.
### Architecture Addendum
- System map, ADR candidates, failure modes, production readiness, risks, and execution order when architecture is in scope.
Omit empty sections. If no issues are found, say so clearly and name what was checked. Do not invent problems to look useful.
Exit: Review delivered with a clear verdict.
Entry: Review has been delivered.
Offer the next step without doing it automatically:
If the user asks to fix issues, exit review mode and switch to the normal coding contract.
Strong finding:
- **[CRIT-1]** `src/auth/session.ts#42` — Expired sessions are accepted because `expiresAt` is parsed but never compared. A replayed cookie remains valid until signing key rotation. Check expiry before returning the session.
Weak finding:
- Auth looks risky.
Nits belong only in Observations and only when they prevent confusion. Do not mix style preferences with release blockers.
Before final answer, verify:
development
Manage a guarded release flow that commits prepared release work on dev, opens a dev-to-main pull request with a release-focused PR summary, waits for checks, merges on success, tags, and optionally publishes an existing npm package. Use when the user asks to prepare or execute a dev→main release PR, hotfix release PR, or Dev2Main PR Summary workflow.
tools
Build, refactor, review, or debug Svelte 5 components that use Bits UI primitives. Use when working with bits-ui dialogs, popovers, dropdowns, comboboxes, selects, tabs, date/time controls, menus, tooltips, portals, render delegation, or Bits UI type helpers.
development
Evidence-grounded review for code, diffs, PRs, documents, plans, specs, and architecture. Use for evidence review, review, code review, quick review, sanity check, quality check, architecture review, production readiness, security review, scaling review, document review, evaluate, or check.
development
Collaborative idea-to-design and inquiry protocol. Use for product/architecture exploration, research-style question shaping, feature design, standards, specs, UX concepts, module boundaries, and non-trivial behavior changes when uncertainty matters.