Codex Code Review

Trigger

Keywords: review, PR, code review, second opinion, audit, check

When NOT to Use

Document review (use doc-review)
Security-specific review (use security-review)
Test coverage review (use test-review)
Just want to understand code (use code-explore)

Variants

| Variant | Command | Scope | Pre-checks |
|---------|---------|-------|------------|
| Fast | /codex-review-fast | Diff only | None |
| Full | /codex-review | Diff + local checks | lint:fix + build |
| Branch | /codex-review-branch | Full branch | None |

Shared Workflow

Step 0 (PENDING) → Collect changes → [Pre-checks if Full] → Dual Review (Codex + Task) → Await Results → Aggregate → Emit Gate → Loop if Blocked

Step 0: Dual Review Init (Fail-closed)

Execute: bash scripts/emit-review-gate.sh PENDING

This sets review_mode=dual and aggregate_gate.executed=false in the state file, which makes the flow fail-closed: if the process crashes before Step 4.5, stop-guard blocks.
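As a rough illustration of the fail-closed idea, a guard could refuse to pass unless the state file proves that a gate was emitted and that it is READY. This is a minimal sketch, not the actual stop-guard: the function name `stop_guard_allows`, the JSON layout, and the state file path are assumptions; only the field names `aggregate_gate.executed` and `aggregate_gate.gate` come from this document.

```python
import json
from pathlib import Path


def stop_guard_allows(state_path: Path) -> bool:
    """Return True only when a gate was emitted and it is READY.

    Hypothetical helper: a missing file, unreadable JSON, or a missing
    field all count as "block", which is what fail-closed means here.
    """
    try:
        state = json.loads(state_path.read_text(encoding="utf-8"))
    except (OSError, ValueError):
        return False  # no readable state, so do not let the run pass
    if not isinstance(state, dict):
        return False
    gate = state.get("aggregate_gate")
    if not isinstance(gate, dict):
        return False
    # PENDING leaves executed=false, so a crash before Step 4.5 still blocks.
    return gate.get("executed") is True and gate.get("gate") == "READY"
```

The point of the design is that the default answer is "block"; only an explicit READY written in Step 4.5 flips it.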

Step 1: Collect Change Metadata

Collect metadata only — Codex reads the actual diffs and file contents itself via sandbox access.

| Variant | Collection Method |
|---------|-------------------|
| Fast | CHANGED_FILES: git diff --name-only HEAD + DIFF_STAT: git diff --stat HEAD |
| Full | Same as Fast |
| Branch | Same + CURRENT_BRANCH + BASE_BRANCH + COMMIT_COUNT |

Codex independently reads full diffs and file contents via git diff HEAD -- <file> + cat (per research instructions).
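The Fast/Full collection step could be sketched as below. The two git commands are the ones listed in the table; the helper names `run_git` and `collect_change_metadata` and the injectable `run` parameter are assumptions made for illustration, and the extra Branch fields are left out.

```python
import subprocess
from typing import Callable, Dict, List, Union


def run_git(args: List[str]) -> str:
    """Run a git subcommand in the current directory and return its stdout."""
    result = subprocess.run(
        ["git", *args], capture_output=True, text=True, check=True
    )
    return result.stdout


def collect_change_metadata(
    run: Callable[[List[str]], str] = run_git,
) -> Dict[str, Union[str, List[str]]]:
    """Gather file names and diff stats only; diff bodies are not read here."""
    name_output = run(["diff", "--name-only", "HEAD"])
    changed = [line for line in name_output.splitlines() if line]
    stat = run(["diff", "--stat", "HEAD"]).strip()
    return {"CHANGED_FILES": changed, "DIFF_STAT": stat}
```

Passing `run` as a parameter keeps the function testable without a repository; in normal use it falls back to the real git call.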

Step 1.5: Feature Context & AC Detection (Spec-Driven Review)

Execute: bash scripts/resolve-feature.sh → parse JSON output.

| Field | Use |
|-------|-----|
| has_requests | Gate: only proceed if true |
| docs_path | Glob for request docs |
| confidence | Require >= medium |

If has_requests=true AND confidence in (high, medium):

Glob ${docs_path}/requests/*.md, sort descending, take latest
Read latest request doc
Extract ## Acceptance Criteria section (parse - [ ] / - [x] items)
Filter out quality-gate ACs matching: /codex-review-fast, /codex-review-doc, /codex-review, /precommit, /precommit-fast, /pr-review
Cap: max 20 ACs (truncate with "... and N more" note)
Build SPEC_CHECKLIST variable, set REQUEST_DOC_PATH

Graceful degradation: resolve-feature fails / no requests / no AC section / parse error → SPEC_CHECKLIST = null (skip silently).
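The extraction, filtering, and capping steps above might look like the following. This is a sketch under assumptions: `build_spec_checklist` is a hypothetical name, the section is assumed to end at the next `## ` heading, and a quality-gate AC is detected by a plain substring match on the listed commands.

```python
import re
from typing import List, Optional

QUALITY_GATE_COMMANDS = (
    "/codex-review-fast", "/codex-review-doc", "/codex-review",
    "/precommit", "/precommit-fast", "/pr-review",
)
MAX_ACS = 20
CHECKBOX = re.compile(r"^\s*- \[( |x|X)\] (.+)$")


def build_spec_checklist(markdown: str) -> Optional[str]:
    """Return the filtered AC checklist, or None when nothing usable is found."""
    in_section = False
    items: List[str] = []
    for line in markdown.splitlines():
        if line.startswith("## "):
            # Assumption: the AC section runs until the next level-2 heading.
            in_section = line.strip().lower() == "## acceptance criteria"
            continue
        if not in_section:
            continue
        match = CHECKBOX.match(line)
        if match is None:
            continue
        text = match.group(2).strip()
        if any(cmd in text for cmd in QUALITY_GATE_COMMANDS):
            continue  # quality-gate ACs are not spec requirements
        items.append(f"- [{match.group(1).lower()}] {text}")
    if not items:
        return None  # graceful degradation: skip silently
    kept = items[:MAX_ACS]
    extra = len(items) - MAX_ACS
    if extra > 0:
        kept.append(f"... and {extra} more")
    return "\n".join(kept)
```

Returning None rather than raising matches the "skip silently" behaviour: the caller can inject the checklist only when it exists.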

Step 1.6: Deferred Finding Context

Prerequisite: Requires R4 (Nit History Persistence) hook-side write path. Until R4 is deployed, .claude_nit_history.json will not exist and this step is a no-op (graceful degradation).

Read .claude_nit_history.json (if exists):

Filter .deferred[] entries where last_seen + ttl_days > now
Sanitize each entry before injection (mandatory):
- canonical_issue <= 120 chars (truncate)
- Strip markdown control chars (**, backticks, #, >, |)
- No raw code snippets; only file:line references
- No secrets/tokens/passwords/API keys (per rules/security.md)
- Reject entries with shell metacharacters (;, &, |, backtick, $()
Format as <deferred_context> XML block (max 10 entries)
Store in DEFERRED_CONTEXT variable

Inject into all prompt variants after research instructions, before Review Dimensions:

${DEFERRED_CONTEXT ? DEFERRED_CONTEXT : ''}

Format:

<deferred_context>
Previously deferred (do not re-report without new evidence):
- [Nit] src/service.ts | naming convention (deferred 2x)
- [P2] src/utils.ts | error handling pattern (deferred 1x)
</deferred_context>

Graceful degradation: file missing / invalid JSON / no entries / sanitization failure → DEFERRED_CONTEXT = null (skip silently).
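A possible shape for the filter, sanitize, and format steps is sketched below. Several things here are assumptions rather than the real schema: the entry fields `severity`, `file`, and `count`, the ISO 8601 format of `last_seen`, and the choice to run the shell-metacharacter rejection before stripping markdown characters. Secret detection (per rules/security.md) is not covered by this sketch.

```python
from datetime import datetime, timedelta, timezone
from typing import List, Optional

MAX_ENTRIES = 10
MAX_CHARS = 120
SHELL_META = (";", "&", "|", "`", "$(")
# Backtick and | never reach this stage because they are rejected first.
MARKDOWN_CONTROL = ("**", "#", ">")


def sanitize_text(raw: str) -> Optional[str]:
    """Return a cleaned string, or None when the entry must be rejected."""
    if any(token in raw for token in SHELL_META):
        return None
    cleaned = raw
    for token in MARKDOWN_CONTROL:
        cleaned = cleaned.replace(token, "")
    cleaned = " ".join(cleaned.split())
    return cleaned[:MAX_CHARS] or None


def build_deferred_context(history: object, now: datetime) -> Optional[str]:
    """Build the <deferred_context> block, or None to skip silently."""
    if not isinstance(history, dict):
        return None
    lines: List[str] = []
    try:
        for entry in history.get("deferred", []):
            last_seen = datetime.fromisoformat(entry["last_seen"])
            if last_seen + timedelta(days=entry["ttl_days"]) <= now:
                continue  # TTL expired
            fields = [
                sanitize_text(str(entry[key]))
                for key in ("severity", "file", "canonical_issue")
            ]
            if None in fields:
                continue  # rejected entry
            severity, path, issue = fields
            count = int(entry["count"])
            lines.append(f"- [{severity}] {path} | {issue} (deferred {count}x)")
    except (KeyError, TypeError, ValueError):
        return None  # malformed history: degrade instead of failing the review
    if not lines:
        return None
    body = "\n".join(lines[:MAX_ENTRIES])
    header = "Previously deferred (do not re-report without new evidence):"
    return f"<deferred_context>\n{header}\n{body}\n</deferred_context>"
```

A call such as `build_deferred_context(history, datetime.now(timezone.utc))` would then feed the DEFERRED_CONTEXT variable, with None meaning "inject nothing".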

Step 2: Pre-checks (Full variant only)

{LINT_FIX_COMMAND}
{BUILD_COMMAND}

These placeholders are resolved from the host project's CLAUDE.md or package.json scripts. Record results as LOCAL_CHECKS.

Step 3: Dual Review (Parallel Dispatch)

Case A: First review (no --continue)

Launch two reviewers in parallel (single message, multiple tool calls):

Codex MCP (primary): Use mcp__codex__codex with variant-specific prompt:

| Variant | Prompt Template |
|---------|-----------------|
| Fast | references/codex-prompt-fast.md |
| Full | references/codex-prompt-full.md |
| Branch | references/codex-prompt-branch.md |

Config: sandbox: 'read-only', approval-policy: 'never'

Save the returned threadId.

Secondary reviewer: Use Task tool with reviewer selection cascade:

| Priority | Reviewer | subagent_type | Condition |
|----------|----------|---------------|-----------|
| 1 | pr-review-toolkit:code-reviewer | pr-review-toolkit:code-reviewer | Default choice |
| 2 | strict-reviewer | strict-reviewer | Priority 1 fails/times out |
| 3 | Codex-only (degraded) | - | Both unavailable |

Selection: Try priority 1 first. If Task fails or times out (30s), try priority 2. If both unavailable, fall back to Codex-only (degraded mode — proceed with Codex results only, apply degradation matrix from references/review-common.md).
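The selection cascade can be pictured as a simple ordered fallback. The sketch below is illustrative only: `run_secondary` and the dispatcher callables are hypothetical stand-ins for the Task tool, and each dispatcher is assumed to enforce the timeout itself and raise on failure.

```python
from typing import Callable, Optional, Sequence, Tuple

# A reviewer is a name plus a callable that takes a timeout in seconds and
# returns the raw findings text (hypothetical interface).
Reviewer = Tuple[str, Callable[[float], str]]


def run_secondary(
    reviewers: Sequence[Reviewer], timeout_s: float = 30.0
) -> Tuple[Optional[str], Optional[str]]:
    """Try reviewers in priority order and return (name, findings).

    (None, None) signals degraded mode: proceed with Codex results only.
    """
    for name, dispatch in reviewers:
        try:
            return name, dispatch(timeout_s)
        except Exception:
            continue  # failure or timeout: fall through to the next priority
    return None, None
```

Returning the reviewer name alongside the findings lets the aggregation step decide whether Severity Mapping is needed for that reviewer's output.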

Task prompt (provide changed file list + diff stats, request P0/P1/P2/Nit findings in standard output format):

Review the code changes for correctness, security, performance, and maintainability issues.

## Changed Files
<git diff --name-only output>

## Diff Stats
<git diff --stat output>

Read the actual diffs and file contents yourself to perform the review.

Before reporting findings, independently verify each one:
1. Evidence check: what specific code proves it's real? (file:line)
2. Context check: did you read enough surrounding code?
3. False positive check: could it be intentional design?
4. Severity check: could it be more severe than initially assessed?
5. Gap check: what related issues might you have overlooked?
Only report findings that survive all 5 checks.

Output findings in this format:
- [P0/P1/P2/Nit] file:line issue description → fix recommendation

Group by severity. Include a final gate: ✅ Ready (no P0/P1) or ⛔ Blocked (has P0/P1).

Case B: Loop review (has --continue)

Codex: Use mcp__codex__codex-reply with re-review template from references/review-common.md
Secondary: Re-dispatch in parallel (same mechanism as first pass, fresh context). Always dispatched in v1 — no skip exception. Cycle resets on any code edit.

Step 3.5: Await Codex + Reconcile Secondary

Codex is the blocking reviewer — await its result for the initial gate. Secondary runs in background (run_in_background: true) and is non-blocking:

| Secondary Status | Action |
|-----------------|--------|
| Completed before Codex | Include in aggregation (Step 4) |
| Completed after Codex, before precommit | Reconcile at pre-precommit checkpoint |
| Still running at precommit | Proceed with Codex gate (authoritative); if late result has P0/P1, re-open fix→re-review loop |
| Failed/timed out | Apply degradation matrix per references/review-common.md § Dual Reviewer Aggregation |

Step 4: Consolidate Output (Dual Mode)

Normalize both sets of findings to unified format: [severity] file:line description → fix
- Codex findings: already in standard format
- toolkit findings: apply Severity Mapping (see references/review-common.md § Severity Mapping)
- strict-reviewer findings: already use P0/P1/P2/Nit
Deduplicate using key = file + canonical_issue_text (ignore line ±5 difference)
- Same key → keep highest severity (P0 > P1 > P2 > Nit)
Tag source: source = codex | toolkit | both
Sort: P0 → P1 → P2 → Nit
Gate decision: any P0/P1 → BLOCKED; else → READY

Output format includes source tag:

- [P0] file:line issue → fix [source: both]
- [P1] file:line issue → fix [source: codex]
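The normalize, deduplicate, tag, sort, and gate steps can be sketched as follows. Assumptions to note: the `Finding` dataclass and the function names are hypothetical, `canonical` is a stand-in normalization (lowercase plus collapsed whitespace) because this document does not define canonical_issue_text, and findings are assumed to be already mapped to P0/P1/P2/Nit.

```python
from dataclasses import dataclass, replace
from typing import Dict, List, Tuple

SEVERITY_RANK = {"P0": 0, "P1": 1, "P2": 2, "Nit": 3}
LINE_TOLERANCE = 5


@dataclass(frozen=True)
class Finding:
    severity: str
    file: str
    line: int
    issue: str
    fix: str
    source: str  # "codex" or "toolkit" before merging


def canonical(issue: str) -> str:
    """Stand-in normalization for the dedup key (an assumption)."""
    return " ".join(issue.lower().split())


def aggregate(findings: List[Finding]) -> Tuple[List[Finding], str]:
    """Merge duplicate findings, sort by severity, and derive the gate."""
    buckets: Dict[Tuple[str, str], List[Finding]] = {}
    for new in findings:
        bucket = buckets.setdefault((new.file, canonical(new.issue)), [])
        for i, old in enumerate(bucket):
            if abs(old.line - new.line) <= LINE_TOLERANCE:
                # Same key within tolerance: keep the higher severity.
                best = new if SEVERITY_RANK[new.severity] < SEVERITY_RANK[old.severity] else old
                source = old.source if old.source == new.source else "both"
                bucket[i] = replace(best, source=source)
                break
        else:
            bucket.append(new)
    merged = [f for bucket in buckets.values() for f in bucket]
    merged.sort(key=lambda f: (SEVERITY_RANK[f.severity], f.file, f.line))
    blocked = any(f.severity in ("P0", "P1") for f in merged)
    return merged, "BLOCKED" if blocked else "READY"


def format_finding(f: Finding) -> str:
    return f"- [{f.severity}] {f.file}:{f.line} {f.issue} → {f.fix} [source: {f.source}]"
```

The returned gate string is what would be passed to scripts/emit-review-gate.sh in the next step.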

Step 4.5: Emit Review Gate

Execute: bash scripts/emit-review-gate.sh READY or bash scripts/emit-review-gate.sh BLOCKED

This updates aggregate_gate.executed=true and aggregate_gate.gate in the state file.

Then output the standard gate sentinel:

✅ Ready — if READY (no P0/P1)
⛔ Blocked — if BLOCKED (has P0/P1)

Shared Definitions

See references/review-common.md for:

Severity levels (P0/P1/P2/Nit)
Review dimensions
Merge gate definitions
Re-review prompt template
Gate sentinels (hook + behavior-layer)
Dual Reviewer Aggregation (severity mapping, deduplication, degradation matrix, source attribution)

Review Loop

⚠️ @CLAUDE.md auto-loop: fix → re-review → ... → ✅ PASS ⚠️

Blocked → fix P0/P1 → /codex-review-fast --continue <threadId> → repeat until Ready.
Ready + P2/Nit → batch fix → 1 Codex --continue verify → evaluate (see rules/auto-loop.md P2/Nit Quality Sweep).

3 rounds on same issue → report blocker, request intervention.

Dual Mode Loop Behavior

| Reviewer | Loop Behavior |
|----------|---------------|
| Codex MCP | Stateful → mcp__codex__codex-reply(threadId) continues context |
| Secondary | Re-dispatched every iteration (fresh context). Always dispatched in v1 (no skip exception). |

Codex gate is authoritative for timing. Secondary runs non-blocking in background. Aggregation reconciled at pre-precommit checkpoint. Any code edit resets the review cycle — both reviewers must re-run.

Pre-precommit Checkpoint

Before triggering /precommit, reconcile any pending secondary result:

| Condition | Action |
|-----------|--------|
| Task completed + has P0/P1 | Re-emit BLOCKED → fix → re-review (Codex --continue + Secondary fresh) |
| Task completed + no P0/P1 | Union aggregate → proceed to precommit |
| Task still running | Proceed with Codex gate (authoritative); if late result has P0/P1, re-open fix→re-review loop |
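The checkpoint conditions reduce to a small decision function. In this sketch the function name and the returned labels are made up for illustration; only the three branches come from the conditions listed for this checkpoint.

```python
def checkpoint_action(task_completed: bool, has_p0_or_p1: bool) -> str:
    """Map the secondary reviewer's status to the next action (labels are hypothetical)."""
    if not task_completed:
        # The Codex gate is authoritative; a late P0/P1 re-opens the loop later.
        return "PROCEED_WITH_CODEX_GATE"
    if has_p0_or_p1:
        return "REEMIT_BLOCKED"
    return "UNION_AND_PROCEED"
```

Checking completion first means a still-running secondary never holds up precommit, which is the non-blocking behaviour described in Step 3.5.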

Verification

[ ] Each issue tagged with severity (P0/P1/P2/Nit)
[ ] Gate is clear (✅ Ready / ⛔ Blocked)
[ ] Issues include: file:line, description, fix suggestion
[ ] Codex performed independent project research
[ ] Branch variant: dimension rating table included

References

Shared definitions: references/review-common.md
Fast prompt: references/codex-prompt-fast.md
Full prompt: references/codex-prompt-full.md
Branch prompt: references/codex-prompt-branch.md
Research instructions: references/codex-research-instructions.md

Examples

Input: /codex-review-fast
Action: emit PENDING → git diff → Codex + Task(code-reviewer) parallel → aggregate → emit gate → P0/P1/P2/Nit + Gate

Input: /codex-review --focus "auth"
Action: emit PENDING → lint:fix → build → git diff → Codex + Task parallel (focus: auth) → aggregate → emit gate

Input: /codex-review-branch origin/develop
Action: emit PENDING → branch diff + history → Codex + Task parallel → aggregate → emit gate → Rating table + Findings + Gate

Input: /codex-review-fast (Codex unavailable)
Action: emit PENDING → git diff → Task(code-reviewer) only → degraded aggregate → emit gate + ⚠️ warning

