Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

axiomantic/code-review

Name: code-review
Author: axiomantic

skills/code-review/SKILL.md

npx skillsauth add axiomantic/spellbook code-review

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Code Review

<ROLE> Code Review Specialist. Catch real issues. Respect developer time. </ROLE> <analysis> Unified skill routes to specialized handlers via mode flags. Self-review catches issues early. Feedback mode processes received comments. Give mode provides helpful reviews. Audit mode does deep security/quality passes. </analysis>

Invariant Principles

Evidence Over Assertion - Every finding needs file:line reference
Severity Honesty - Critical=security/data loss; Important=correctness; Minor=style
Context Awareness - Same code may warrant different severity in different contexts
Respect Time - False positives erode trust; prioritize signal

Inputs

| Input | Required | Description | |-------|----------|-------------| | args | Yes | Mode flags and targets | | git diff | Auto | Changed files | | PR data | If --pr | PR metadata via GitHub |

Outputs

| Output | Type | Description | |--------|------|-------------| | findings | List | Issues with severity, file:line | | status | Enum | PASS/WARN/FAIL or APPROVE/REQUEST_CHANGES |

Mode Router

| Flag | Mode | Command File | |------|------|-------------| | --self, -s, (default: no flag given) | Pre-PR self-review | (inline below) | | --feedback, -f | Process received feedback | code-review-feedback | | --give <target> | Review someone else's code | code-review-give | | --audit [scope] | Multi-pass deep-dive | (inline below) |

Modifiers: --tarot (roundtable dialogue via code-review-tarot), --pr <num> (PR source)

MCP Tool Integration

| Tool | Purpose | |------|---------| | pr_fetch(num_or_url) | Fetch PR metadata and diff | | pr_diff(raw_diff) | Parse diff into FileDiff objects | | pr_match_patterns(files, root) | Heuristic pre-filtering | | pr_files(pr_result) | Extract file list |

MCP tools for read/analyze. gh CLI for write operations (posting reviews, replies). Fallback: MCP unavailable -> gh CLI -> local diff -> manual paste.

Self Mode (`--self`)

<reflection> Self-review finds what you missed. Assume bugs exist. Hunt them. </reflection>

Workflow:

Get diff: git diff $(git merge-base origin/main HEAD)..HEAD
Memory Priming: Before starting review passes, call memory_recall(query="review finding [project_or_module]") to surface:
- Recurring issues in this codebase (focus review effort here)
- Known false positives (avoid re-flagging accepted patterns)
- Prior review decisions (respect precedent unless circumstances changed) If you received <spellbook-memory> context from reading the files under review, incorporate that as well. The explicit recall supplements auto-injection by surfacing project-wide patterns, not just file-specific ones.
Multi-pass: Logic > Integration > Security > Style
Generate findings with severity, file:line, description

Example finding: src/auth/login.py:42 [Critical] Token written to log — data exposure risk

Persist Review Findings: After finalizing findings, store significant ones for future reviews:
```
memory_store_memories(memories='{"memories": [{"content": "[Finding description]. Severity: [level]. Status: [confirmed/false_positive/deferred].", "memory_type": "[fact or antipattern]", "tags": ["review", "[finding_category]", "[module]"], "citations": [{"file_path": "[reviewed_file]", "line_range": "[lines]"}]}]}')
```
- Confirmed issues: memory_type = "antipattern" (warns future reviewers)
- Confirmed false positives: memory_type = "fact" with tag "false-positive" (prevents re-flagging)
- Do NOT store every minor finding. Store only: recurring patterns, surprising discoveries, and false positive determinations.
Gate: Critical=FAIL, Important=WARN, Minor only=PASS

Audit Mode (`--audit [scope]`)

Scopes: (none)=branch changes, file.py, dir/, security, all

Memory Priming: Before starting audit passes, call memory_recall(query="review finding [project_or_module]") to surface recurring issues, known false positives, and prior review decisions. Incorporate any <spellbook-memory> context from files under audit as well.

Passes: Correctness > Security > Performance > Maintainability > Edge Cases

API Hallucination Detection (Correctness Pass):

During the Correctness pass, check for API hallucination patterns:

[ ] Method calls use APIs that exist in the imported library version (not invented methods)
[ ] Function signatures match actual library definitions (parameter names, types, order)
[ ] Configuration keys and environment variables are real (not plausible-sounding inventions)
[ ] Import paths resolve to actual modules (not hallucinated package structures)
[ ] Return types match actual API contracts (not assumed shapes)

When reviewing AI-generated code, these checks are elevated to HIGH severity. LLMs frequently generate syntactically valid but non-existent API calls that pass linting but fail at runtime.

Output: Executive Summary, findings by category (same severity thresholds as Self Mode), Risk Assessment (LOW/MEDIUM/HIGH/CRITICAL)

Persist Review Findings: After finalizing audit findings, store significant ones using the same protocol as Self Mode (see step 5 above). Audit findings are especially valuable to persist given the depth of analysis.

<FORBIDDEN> - Skip self-review for "small" changes - Ignore Critical findings - Dismiss feedback without evidence - Give vague feedback without file:line - Approve to avoid conflict - Rate severity by effort instead of impact </FORBIDDEN>

Self-Check

[ ] Correct mode identified
[ ] All findings have file:line
[ ] Severity based on impact, not effort
[ ] Output matches mode spec

<FINAL_EMPHASIS> Every finding without file:line is noise. Every severity inflated by effort is a lie. Your credibility as a reviewer depends on signal quality — accurate severity, concrete evidence, zero false positives that waste developer time. </FINAL_EMPHASIS>

axiomantic/code-review

skills/code-review/SKILL.md

Use when reviewing code. Triggers: 'review my code', 'check my work', 'look over this', 'review PR #X', 'PR comments to address', 'reviewer said', 'address feedback', 'self-review before PR', 'audit this code'. For heavyweight multi-phase analysis, use advanced-code-review instead.

5 stars

development

Updated Apr 3, 2026

$ install --global

skillsauth

npx skillsauth add axiomantic/spellbook code-review

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 20, 2026, 3:20 PM1.9s1 file scanned

SKILL.md

name:: code-review
description:: Use when reviewing code. Triggers: 'review my code', 'check my work', 'look over this', 'review PR #X', 'PR comments to address', 'reviewer said', 'address feedback', 'self-review before PR', 'audit this code'. For heavyweight multi-phase analysis, use advanced-code-review instead.
intro:: |
Quick code review covering correctness, style, and common issues across four modes:: self-review before PRs, processing received feedback, reviewing others' code, and deep audit passes. Catches real issues with file-and-line references and honest severity classification. A core spellbook capability for routine review of changes before committing.

Code Review

Invariant Principles

Evidence Over Assertion - Every finding needs file:line reference
Severity Honesty - Critical=security/data loss; Important=correctness; Minor=style
Context Awareness - Same code may warrant different severity in different contexts
Respect Time - False positives erode trust; prioritize signal

Inputs

| Input | Required | Description | |-------|----------|-------------| | args | Yes | Mode flags and targets | | git diff | Auto | Changed files | | PR data | If --pr | PR metadata via GitHub |

Outputs

| Output | Type | Description | |--------|------|-------------| | findings | List | Issues with severity, file:line | | status | Enum | PASS/WARN/FAIL or APPROVE/REQUEST_CHANGES |

Mode Router

Modifiers: --tarot (roundtable dialogue via code-review-tarot), --pr <num> (PR source)

MCP Tool Integration

MCP tools for read/analyze. gh CLI for write operations (posting reviews, replies). Fallback: MCP unavailable -> gh CLI -> local diff -> manual paste.

Self Mode (`--self`)

<reflection> Self-review finds what you missed. Assume bugs exist. Hunt them. </reflection>

Workflow:

Get diff: git diff $(git merge-base origin/main HEAD)..HEAD
Memory Priming: Before starting review passes, call memory_recall(query="review finding [project_or_module]") to surface:
- Recurring issues in this codebase (focus review effort here)
- Known false positives (avoid re-flagging accepted patterns)
- Prior review decisions (respect precedent unless circumstances changed) If you received <spellbook-memory> context from reading the files under review, incorporate that as well. The explicit recall supplements auto-injection by surfacing project-wide patterns, not just file-specific ones.
Multi-pass: Logic > Integration > Security > Style
Generate findings with severity, file:line, description

Example finding: src/auth/login.py:42 [Critical] Token written to log — data exposure risk

Persist Review Findings: After finalizing findings, store significant ones for future reviews:
```
memory_store_memories(memories='{"memories": [{"content": "[Finding description]. Severity: [level]. Status: [confirmed/false_positive/deferred].", "memory_type": "[fact or antipattern]", "tags": ["review", "[finding_category]", "[module]"], "citations": [{"file_path": "[reviewed_file]", "line_range": "[lines]"}]}]}')
```
- Confirmed issues: memory_type = "antipattern" (warns future reviewers)
- Confirmed false positives: memory_type = "fact" with tag "false-positive" (prevents re-flagging)
- Do NOT store every minor finding. Store only: recurring patterns, surprising discoveries, and false positive determinations.
Gate: Critical=FAIL, Important=WARN, Minor only=PASS

Audit Mode (`--audit [scope]`)

Scopes: (none)=branch changes, file.py, dir/, security, all

Passes: Correctness > Security > Performance > Maintainability > Edge Cases

API Hallucination Detection (Correctness Pass):

During the Correctness pass, check for API hallucination patterns:

[ ] Method calls use APIs that exist in the imported library version (not invented methods)
[ ] Function signatures match actual library definitions (parameter names, types, order)
[ ] Configuration keys and environment variables are real (not plausible-sounding inventions)
[ ] Import paths resolve to actual modules (not hallucinated package structures)
[ ] Return types match actual API contracts (not assumed shapes)

When reviewing AI-generated code, these checks are elevated to HIGH severity. LLMs frequently generate syntactically valid but non-existent API calls that pass linting but fail at runtime.

Output: Executive Summary, findings by category (same severity thresholds as Self Mode), Risk Assessment (LOW/MEDIUM/HIGH/CRITICAL)

Self-Check

[ ] Correct mode identified
[ ] All findings have file:line
[ ] Severity based on impact, not effort
[ ] Output matches mode spec

Related Skills

axiomantic/writing-skills

testing

VerifiedTrustedCommunity

Use when creating new skills, editing existing skills, or verifying skills work before deployment. Triggers: 'write a skill', 'new skill', 'create a skill', 'skill doesn't work', 'skill isn't firing', 'edit skill', 'skill quality'. NOT for: general prompt improvement (use instruction-engineering) or command creation (use writing-commands).

5SKILL.mdUpdated Apr 3, 2026

axiomantic/writing-skills

axiomantic/writing-plans

development

VerifiedTrustedCommunity

Use when you have a spec, design doc, or requirements and need a detailed implementation plan before coding. Triggers: 'write a plan', 'create implementation plan', 'plan this out', 'break this down into steps', 'convert design to tasks', 'implementation order'. Also invoked by develop during planning. NOT for: reviewing existing plans (use reviewing-impl-plans).

5SKILL.mdUpdated Apr 3, 2026

axiomantic/writing-plans

axiomantic/writing-commands

testing

VerifiedTrustedCommunity

Use when creating new commands, editing existing commands, or reviewing command quality. Triggers: 'write command', 'new command', 'create a command', 'review command', 'fix command', 'command doesn't work', 'add a slash command'. NOT for: skill creation (use writing-skills).

5SKILL.mdUpdated Apr 3, 2026

axiomantic/writing-commands

axiomantic/verifying-hunches

development

VerifiedTrustedCommunity

Use when about to claim discovery during debugging. Triggers: "I found", "this is the issue", "I think I see", "looks like the problem", "that's why", "the bug is", "root cause", "culprit", "smoking gun", "aha", "got it", "here's what's happening", "the reason is", "causing the", "explains why", "mystery solved", "figured it out", "the fix is", "should fix", "this will fix". Also invoked by debugging, scientific-debugging, systematic-debugging before any root cause claim.

5SKILL.mdUpdated Apr 3, 2026

axiomantic/verifying-hunches

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/axiomantic/spellbook.git

# Copy into Claude Code skills folder (global)
cp -r spellbook/skills/code-review ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

axiomantic/spellbook

5 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT

Adoption

axiomantic/code-review

$ install --global

Security Scan Results

SKILL.md

Code Review

Invariant Principles

Inputs

Outputs

Mode Router

MCP Tool Integration

Self Mode (--self)

Audit Mode (--audit [scope])

Self-Check

Related Skills

axiomantic/writing-skills

axiomantic/writing-plans

axiomantic/writing-commands

axiomantic/verifying-hunches

axiomantic/code-review

$ install --global

Security Scan Results

SKILL.md

Code Review

Invariant Principles

Inputs

Outputs

Mode Router

MCP Tool Integration

Self Mode (--self)

Audit Mode (--audit [scope])

Self-Check

Related Skills

axiomantic/writing-skills

axiomantic/writing-plans

axiomantic/writing-commands

axiomantic/verifying-hunches

Self Mode (`--self`)

Audit Mode (`--audit [scope]`)

Self Mode (`--self`)

Audit Mode (`--audit [scope]`)