Raw-Requirement Dual-Agent Collaboration

Use this skill when the user explicitly wants multiple agents to work from the same original request, or when the risk of requirement paraphrase drift is high.

When to Use

Dual-agent mode is a boundary protocol. Hardness lives in the harness UserPromptSubmit hook (rule 3), not this skill — the hook enforces whether dual must run; this skill describes how to run it.

Boundary triggers (hook-enforced)

The three boundaries that the harness hook gates as hard:

(a) Requirement boundary: major direction shift, cross-stage alignment, or overturning a ≥3-day-old self-authored locked design decision as a load-bearing argument. → requirement-side alignment review.
(b) Decision node: overturning a plan premise that previously passed dual, switching paths, or rejecting an approach. → plan-prior review.
(c) Post-implementation verification: after substantive code or readout production, the reviewer independently re-runs the source-code trace and independently reproduces empirical numbers in the readout. Acceptance gate: reviewer's independent re-derivation converges on the same number / same trace conclusion before the verdict enters paper-facing context. Substitutes that don't qualify: executor's self-assessment; single-source numbers; caveats embedded in commit body; "I caught the issue myself" as a stand-in for an independent second opinion.

Cross-session burden: if (a)/(b)/(c) triggered in a prior session and dual wasn't run, the next session inheriting the work runs the missed dual before advancing any paper-facing claim derived from the unreviewed work. The harness hook re-injects this expectation each session.

Other useful triggers (skill-level recommendation, not hook-gated)

High paraphrase-drift risk: the user's raw request has subtle wording that an executor's restatement is likely to lose.
Explicit user request: the user wants two agents on the same raw request.

When NOT to Use

Routine implementation, debugging, probes (in the implementation flow, before the (c) post-impl checkpoint), variant trials in progress, single-file edits, Q&A, governance-meta-doc tweaks.
Tasks already clear enough that an extra reviewer adds no value AND none of (a)/(b)/(c) is triggered.
Tasks with critical requirement ambiguity — clarify with the user first; entering dual mode on an ambiguous brief just multiplies the misreading.

Note: "routine implementation" is NOT an excuse to skip (c). Once code/readout is finalized, (c) triggers regardless of how routine the implementation felt.

Core defaults

Both executor and reviewer receive the raw user request verbatim (no compression before forwarding).
Reviewer judges against "raw request + acceptance checklist + actual output," not executor's paraphrase alone.
Executor and reviewer have asymmetric roles — avoid duplicate work.
State key assumptions explicitly — don't package guesses as confirmed facts.
If reviewer finds deviation, report the specific gap, evidence, and impact — vague endorsement is not enough.
Any factual claim that affects scope / implementation / review verdict / final status / conflict outcome is either verified with a traceable trail or explicitly marked as an assumption. See Verification Discipline below.

Default Role Assignment

Claude: planner, task framer, final synthesizer
Executor: implementation, local verification, deliverable production
Reviewer: independent requirement decomposition, acceptance checklist, deviation check

When Claude serves as reviewer, write the acceptance checklist independently before seeing executor output.

Standard Process

Preserve raw request
- Keep user's original words verbatim — do not compress before forwarding.
Claude generates acceptance checklist
- Goals
- Non-goals
- Constraints
- High-risk misunderstanding points
- Minimum completion criteria
Codex independently generates its own acceptance checklist
- Dispatch Codex with the raw user request only (no Claude checklist) and ask: "Read the user request and produce your own acceptance checklist: goals, non-goals, constraints, high-risk misunderstanding points, minimum completion criteria."
- Claude compares both checklists. If they diverge on a key point, classify the divergence:
  - Requirement ambiguity → surface to the user before proceeding
  - Factual disagreement about codebase / data state → Claude performs the minimal verification (read source / grep / inspect artifact) before proceeding; the verified result is binding
Dispatch executor task
- Raw request verbatim
- Merged checklist (reconciled from step 2-3)
- Constraints
- Expected output
- Non-goals
- Known verified facts (with trail) — anything verified during steps 2-3
- Open assumptions requiring verification — anything not yet verified that would change implementation choices
Claude reviews executor output against
- Raw request verbatim
- Merged acceptance checklist
- Executor's actual output
- Verification trail for any claim that affects acceptance, rejection, or major criticism — Claude independently verifies critical claims (don't rely on executor's trail alone)
Final summary distinguishes three statuses
- Met
- Partially met or assumption-dependent
- Not met, deviated, or unverifiable

Two-File Handoff

See codex-orchestration/SKILL.md § 3 for the Two-File Handoff defaults (file naming, large-prompt handling, parallel dispatch threshold). The Executor / Reviewer Task Packet templates below are what goes into /tmp/codex_task_<ID>.md.

BRIEF §0 DR check (added 2026-05-22)

Every cowork BRIEF (_BRIEF_FROM_CLAUDE.md or equivalent) must open with §0 DR-check line immediately after the front-matter metadata block:

This row is VISIBLE to the reviewer (codex / second agent), NOT a Claude-side meta. Showing it does NOT bias the substantive lean because DR-check is a meta-decision, not the cowork question itself.

When DR=yes

Spec design for a new sprint phase or sub-phase (e.g. D1/D2/D5 family, Component design)
Cross-discipline concept borrowing (clinical→PHM, ML→aerospace, etc.)
P0 production issue exposing a spec gap (today's NaN-robust cowork)
Novelty defense (paper contribution claim)
Paper-grade metric / report / threshold design

When DR=no (opt-out)

Mechanical refactor / rename / typo / formatting
Code review of already-spec-locked implementation (spec-fidelity check, not redesign)
Status / readout cowork (no decision)
Workflow / governance topic (e.g. this rule itself, hook design, dispatch wrapper)
Sprint boundary commit cowork where the methodology was already DR-anchored
Trivial bug fix on a non-methodology surface

DR=maybe / defer

Surface the trade-off in the cowork verdict so user can adjudicate. Default if uncertain: write "maybe; need user confirm" and proceed with the substantive lean — do NOT block.

Project-local hook (if enabled)

Project may install scripts/hooks/dr_trigger_advisory.sh (PreToolUse on Write/Edit for cowork BRIEF paths) that flags BRIEFs lacking a §0 DR check line. The hook is warn-only and honors  file-scope markers.

Calibration history

| Sprint moment | DR commissioned on time? | |---|---| | S2.1 feature engineering | yes (user-initiated) | | S2.3 model layer | yes (user-initiated) | | D5 ingest (Innovation 2 + 3) | yes (user-initiated) | | D5.1.5 NaN-robust | barely (Claude raised AFTER writing BRIEF; trigger was P0 production discovery) |

The §0 DR-check requirement was introduced after the D5.1.5 calibration showed the "Claude + user both forget" failure mode.

Verification Discipline

When a factual claim about the codebase, data, or completed work could change scope, implementation, review verdict, final status, or conflict outcome, the claim is either verified with a traceable trail or explicitly marked as an assumption.

When to verify

Claims that affect checklist content, implementation direction, review verdict, final summary, or conflict resolution.
Any rejection or approval of a critical claim made by the other party.

When verification is NOT required

General knowledge (language semantics, library defaults, well-known patterns)
The user's just-spoken words (re-reading the message is enough)
Style preferences and subjective "this is cleaner" judgments
Implementation details that don't affect any verdict

Who performs verification

The agent making the claim performs the first verification using the cheapest sufficient method
The agent challenging or relying on a critical claim independently verifies it — don't trust the other party's trail alone for verdict-affecting claims
Conflict resolution: confidence and persistence are not tiebreakers; verified evidence is

How (ordered by cost — prefer the cheapest sufficient)

Re-read the user message verbatim
Read source with file path + line numbers
grep / rg for symbol or pattern
Inspect artifact (CSV / JSON / NPZ row + field)
git log / git blame for historical claims
Run a narrow probe (small script, single test case)

If verification cost would exceed roughly 5 minutes (heavy experiment, external system access), scope it down or escalate to the user; do not silently skip.

What counts as a sufficient trail

path/to/file.py:123-145 (file + line range)
command + key output excerpt
artifact path + row/field/value
Screenshots reserved for UI / external system claims

Where the trail is recorded

In the report (reviewer findings, executor completion notes, final summary)
Format: claim → evidence → implication
No separate audit log file required

Failure mode

If verification cannot be completed (no read access, external system, blocked), mark the item assumption-dependent or blocked with the missing access named explicitly.
Don't present an unverifiable claim as confirmed.

Executor Task Packet Template (`/tmp/codex_task.md`)

Role: Executor
(Raw user request is in /tmp/codex_user_context.md — read it first.)

Goal:
<objective only — do not rewrite business goals>

Constraints:
- <constraint 1>
- <constraint 2>

Expected output:
- <code / documentation / tests / plan / explanation>

Non-goals:
- <non-goal 1>
- <non-goal 2>

Working rules:
- State key assumptions explicitly
- Do not expand scope unilaterally
- Verify any factual claim that affects implementation choices or completion status (cite file:line, command output, or artifact path)
- Report only decision-relevant verification results with a traceable trail — do not log every action
- Mark unresolved facts as assumption-dependent or blocked rather than guessing

Reviewer Task Packet Template (`/tmp/codex_task.md`)

Role: Reviewer
(Raw user request is in /tmp/codex_user_context.md — read it first.)

Review target:
<executor's plan or actual output>

What to produce:
- Requirement decomposition
- Acceptance checklist
- Potential misunderstanding points
- Deviations from raw request
- Missing items / over-implementation / risks

Review rules:
- Do not substitute executor's paraphrase for user's request
- First judge "was the right problem solved," then "was it solved well"
- If the request itself has critical ambiguity, state how that ambiguity affects the conclusion
- Do not approve or reject a decision-critical factual claim without independent verification
- When challenging executor on factual grounds, cite the verification trail (file:line, command + output, or artifact path)
- Distinguish requirement ambiguity (escalate to user) from factual disagreement (verify against ground truth)

Acceptance Checklist Template

Input: what the user explicitly asked for
Processing: does the approach directly serve that goal
Output: does the deliverable match the request
Boundaries: any unauthorized scope expansion, missed constraints, or changed non-goals
Verification: were key behaviors actually verified
Risks: what unverified assumptions remain

Conflict Resolution

First classify the disagreement:
- Requirement interpretation dispute → go back to the raw request and compare item by item; if ambiguity remains, surface it to the user — neither role may decide unilaterally
- Factual dispute about code, data, artifacts, or completed work → resolve by verification against ground truth; confidence or persistence is not a tiebreaker
- Output visibility dispute → provide the complete output before continuing review
For factual disputes, the party challenging a claim provides counter-evidence (file:line, command + output, or artifact path) or shows that the original trail is insufficient; vague rejection doesn't move the verdict.
If verification is impossible with current access, report the conflict as unresolved and assumption-dependent rather than picking a side.
After verification, both parties update their conclusion to match the verified ground truth, even if it contradicts their original position.

Stuck Detection & Recovery

When Claude or Codex is making no forward progress, use this self-diagnosis protocol instead of retrying blindly.

Stuck signals (any one triggers diagnosis)

Same tool call attempted 3+ times with identical or near-identical parameters
Two consecutive correction rounds that produce no meaningful change in output
Context usage exceeds 80% with no clear path to completion
Codex returns errors or timeouts on 2+ consecutive calls

Diagnosis steps (in order)

Name the symptom: which stuck signal fired?
Classify the cause: | Symptom | Likely cause | Verification | |---------|-------------|-------------| | Repeated identical tool calls | Loop — same approach keeps failing | Check if the error message is identical each time | | Correction rounds produce no change | Requirement ambiguity or misaligned checklist | Re-read the raw user request | | Context >80% | Too much intermediate reasoning accumulated | Check TodoWrite for completed items that can be compacted | | Codex errors/timeouts | Service issue or prompt too broad | Try a narrower prompt; if still fails, fall back to Claude |
Apply the smallest reversible fix:
- Loop → change approach (different tool, different file, different hypothesis)
- Ambiguity → escalate to user with specific question
- Context overflow → /compact at the nearest phase boundary, then re-read critical files
- Codex unavailable → Claude proceeds directly (per codex-orchestration § Fallback Protocol)
If fix doesn't work after 1 attempt → stop and report to user: "I'm stuck. Symptom: X. Tried: Y. Need your input on Z."

Anti-pattern: silent retry loops

Avoid retrying the same failing operation more than twice without changing something. The third attempt should use a different approach, or escalate to the user.

Anti-Patterns

Both agents only receive executor's paraphrase of the request
Reviewer only gives style comments without checking goal alignment
Forcing dual-agent on trivial tasks where cost exceeds benefit
Describing process or governance improvements as model/algorithm/detector performance gains
Treating a decision-critical claim as true because one agent sounds more confident
Marking work as satisfied, broken, or verified without a traceable evidence trail
Conceding to the other party's claim without verifying it first

Final Report Format

A final report typically includes:

Whether dual-agent mode was used
Reason for using or not using it
Executor output summary
Reviewer's key findings
Whether the raw request was satisfied
Ambiguities still requiring user decision
Key decision-critical claims with evidence trail or explicit assumption status
Experience distillation (1-2 sentences): did this task reveal a reusable pattern, a new failure mode, or a codebase invariant that should be captured? If yes, suggest where to record it:
- New or updated skill rule → name the skill and section
- New MEMORY.md entry → draft the one-liner
- New mandatory read → suggest the file and task area for docs/architecture-map.md
- Nothing worth capturing → state "No new reusable pattern identified" (this is a valid answer; do not force extraction)

Raw-Requirement Dual-Agent Collaboration

Use this skill when the user explicitly wants multiple agents to work from the same original request, or when the risk of requirement paraphrase drift is high.

When to Use

Boundary triggers (hook-enforced)

The three boundaries that the harness hook gates as hard:

(a) Requirement boundary: major direction shift, cross-stage alignment, or overturning a ≥3-day-old self-authored locked design decision as a load-bearing argument. → requirement-side alignment review.
(b) Decision node: overturning a plan premise that previously passed dual, switching paths, or rejecting an approach. → plan-prior review.
(c) Post-implementation verification: after substantive code or readout production, the reviewer independently re-runs the source-code trace and independently reproduces empirical numbers in the readout. Acceptance gate: reviewer's independent re-derivation converges on the same number / same trace conclusion before the verdict enters paper-facing context. Substitutes that don't qualify: executor's self-assessment; single-source numbers; caveats embedded in commit body; "I caught the issue myself" as a stand-in for an independent second opinion.

Other useful triggers (skill-level recommendation, not hook-gated)

High paraphrase-drift risk: the user's raw request has subtle wording that an executor's restatement is likely to lose.
Explicit user request: the user wants two agents on the same raw request.

When NOT to Use

Routine implementation, debugging, probes (in the implementation flow, before the (c) post-impl checkpoint), variant trials in progress, single-file edits, Q&A, governance-meta-doc tweaks.
Tasks already clear enough that an extra reviewer adds no value AND none of (a)/(b)/(c) is triggered.
Tasks with critical requirement ambiguity — clarify with the user first; entering dual mode on an ambiguous brief just multiplies the misreading.

Note: "routine implementation" is NOT an excuse to skip (c). Once code/readout is finalized, (c) triggers regardless of how routine the implementation felt.

Core defaults

Both executor and reviewer receive the raw user request verbatim (no compression before forwarding).
Reviewer judges against "raw request + acceptance checklist + actual output," not executor's paraphrase alone.
Executor and reviewer have asymmetric roles — avoid duplicate work.
State key assumptions explicitly — don't package guesses as confirmed facts.
If reviewer finds deviation, report the specific gap, evidence, and impact — vague endorsement is not enough.
Any factual claim that affects scope / implementation / review verdict / final status / conflict outcome is either verified with a traceable trail or explicitly marked as an assumption. See Verification Discipline below.

Default Role Assignment

Claude: planner, task framer, final synthesizer
Executor: implementation, local verification, deliverable production
Reviewer: independent requirement decomposition, acceptance checklist, deviation check

When Claude serves as reviewer, write the acceptance checklist independently before seeing executor output.

Standard Process

Preserve raw request
- Keep user's original words verbatim — do not compress before forwarding.
Claude generates acceptance checklist
- Goals
- Non-goals
- Constraints
- High-risk misunderstanding points
- Minimum completion criteria
Codex independently generates its own acceptance checklist
- Dispatch Codex with the raw user request only (no Claude checklist) and ask: "Read the user request and produce your own acceptance checklist: goals, non-goals, constraints, high-risk misunderstanding points, minimum completion criteria."
- Claude compares both checklists. If they diverge on a key point, classify the divergence:
  - Requirement ambiguity → surface to the user before proceeding
  - Factual disagreement about codebase / data state → Claude performs the minimal verification (read source / grep / inspect artifact) before proceeding; the verified result is binding
Dispatch executor task
- Raw request verbatim
- Merged checklist (reconciled from step 2-3)
- Constraints
- Expected output
- Non-goals
- Known verified facts (with trail) — anything verified during steps 2-3
- Open assumptions requiring verification — anything not yet verified that would change implementation choices
Claude reviews executor output against
- Raw request verbatim
- Merged acceptance checklist
- Executor's actual output
- Verification trail for any claim that affects acceptance, rejection, or major criticism — Claude independently verifies critical claims (don't rely on executor's trail alone)
Final summary distinguishes three statuses
- Met
- Partially met or assumption-dependent
- Not met, deviated, or unverifiable

Two-File Handoff

BRIEF §0 DR check (added 2026-05-22)

Every cowork BRIEF (_BRIEF_FROM_CLAUDE.md or equivalent) must open with §0 DR-check line immediately after the front-matter metadata block:

This row is VISIBLE to the reviewer (codex / second agent), NOT a Claude-side meta. Showing it does NOT bias the substantive lean because DR-check is a meta-decision, not the cowork question itself.

When DR=yes

Spec design for a new sprint phase or sub-phase (e.g. D1/D2/D5 family, Component design)
Cross-discipline concept borrowing (clinical→PHM, ML→aerospace, etc.)
P0 production issue exposing a spec gap (today's NaN-robust cowork)
Novelty defense (paper contribution claim)
Paper-grade metric / report / threshold design

When DR=no (opt-out)

Mechanical refactor / rename / typo / formatting
Code review of already-spec-locked implementation (spec-fidelity check, not redesign)
Status / readout cowork (no decision)
Workflow / governance topic (e.g. this rule itself, hook design, dispatch wrapper)
Sprint boundary commit cowork where the methodology was already DR-anchored
Trivial bug fix on a non-methodology surface

DR=maybe / defer

Surface the trade-off in the cowork verdict so user can adjudicate. Default if uncertain: write "maybe; need user confirm" and proceed with the substantive lean — do NOT block.

Project-local hook (if enabled)

Calibration history

The §0 DR-check requirement was introduced after the D5.1.5 calibration showed the "Claude + user both forget" failure mode.

Verification Discipline

When to verify

Claims that affect checklist content, implementation direction, review verdict, final summary, or conflict resolution.
Any rejection or approval of a critical claim made by the other party.

When verification is NOT required

General knowledge (language semantics, library defaults, well-known patterns)
The user's just-spoken words (re-reading the message is enough)
Style preferences and subjective "this is cleaner" judgments
Implementation details that don't affect any verdict

Who performs verification

The agent making the claim performs the first verification using the cheapest sufficient method
The agent challenging or relying on a critical claim independently verifies it — don't trust the other party's trail alone for verdict-affecting claims
Conflict resolution: confidence and persistence are not tiebreakers; verified evidence is

How (ordered by cost — prefer the cheapest sufficient)

Re-read the user message verbatim
Read source with file path + line numbers
grep / rg for symbol or pattern
Inspect artifact (CSV / JSON / NPZ row + field)
git log / git blame for historical claims
Run a narrow probe (small script, single test case)

If verification cost would exceed roughly 5 minutes (heavy experiment, external system access), scope it down or escalate to the user; do not silently skip.

What counts as a sufficient trail

path/to/file.py:123-145 (file + line range)
command + key output excerpt
artifact path + row/field/value
Screenshots reserved for UI / external system claims

Where the trail is recorded

In the report (reviewer findings, executor completion notes, final summary)
Format: claim → evidence → implication
No separate audit log file required

Failure mode

If verification cannot be completed (no read access, external system, blocked), mark the item assumption-dependent or blocked with the missing access named explicitly.
Don't present an unverifiable claim as confirmed.

Executor Task Packet Template (`/tmp/codex_task.md`)

Role: Executor
(Raw user request is in /tmp/codex_user_context.md — read it first.)

Goal:
<objective only — do not rewrite business goals>

Constraints:
- <constraint 1>
- <constraint 2>

Expected output:
- <code / documentation / tests / plan / explanation>

Non-goals:
- <non-goal 1>
- <non-goal 2>

Working rules:
- State key assumptions explicitly
- Do not expand scope unilaterally
- Verify any factual claim that affects implementation choices or completion status (cite file:line, command output, or artifact path)
- Report only decision-relevant verification results with a traceable trail — do not log every action
- Mark unresolved facts as assumption-dependent or blocked rather than guessing

Reviewer Task Packet Template (`/tmp/codex_task.md`)

Role: Reviewer
(Raw user request is in /tmp/codex_user_context.md — read it first.)

Review target:
<executor's plan or actual output>

What to produce:
- Requirement decomposition
- Acceptance checklist
- Potential misunderstanding points
- Deviations from raw request
- Missing items / over-implementation / risks

Review rules:
- Do not substitute executor's paraphrase for user's request
- First judge "was the right problem solved," then "was it solved well"
- If the request itself has critical ambiguity, state how that ambiguity affects the conclusion
- Do not approve or reject a decision-critical factual claim without independent verification
- When challenging executor on factual grounds, cite the verification trail (file:line, command + output, or artifact path)
- Distinguish requirement ambiguity (escalate to user) from factual disagreement (verify against ground truth)

Acceptance Checklist Template

Input: what the user explicitly asked for
Processing: does the approach directly serve that goal
Output: does the deliverable match the request
Boundaries: any unauthorized scope expansion, missed constraints, or changed non-goals
Verification: were key behaviors actually verified
Risks: what unverified assumptions remain

Conflict Resolution

First classify the disagreement:
- Requirement interpretation dispute → go back to the raw request and compare item by item; if ambiguity remains, surface it to the user — neither role may decide unilaterally
- Factual dispute about code, data, artifacts, or completed work → resolve by verification against ground truth; confidence or persistence is not a tiebreaker
- Output visibility dispute → provide the complete output before continuing review
For factual disputes, the party challenging a claim provides counter-evidence (file:line, command + output, or artifact path) or shows that the original trail is insufficient; vague rejection doesn't move the verdict.
If verification is impossible with current access, report the conflict as unresolved and assumption-dependent rather than picking a side.
After verification, both parties update their conclusion to match the verified ground truth, even if it contradicts their original position.

Stuck Detection & Recovery

When Claude or Codex is making no forward progress, use this self-diagnosis protocol instead of retrying blindly.

Stuck signals (any one triggers diagnosis)

Same tool call attempted 3+ times with identical or near-identical parameters
Two consecutive correction rounds that produce no meaningful change in output
Context usage exceeds 80% with no clear path to completion
Codex returns errors or timeouts on 2+ consecutive calls

Diagnosis steps (in order)

Name the symptom: which stuck signal fired?
Classify the cause: | Symptom | Likely cause | Verification | |---------|-------------|-------------| | Repeated identical tool calls | Loop — same approach keeps failing | Check if the error message is identical each time | | Correction rounds produce no change | Requirement ambiguity or misaligned checklist | Re-read the raw user request | | Context >80% | Too much intermediate reasoning accumulated | Check TodoWrite for completed items that can be compacted | | Codex errors/timeouts | Service issue or prompt too broad | Try a narrower prompt; if still fails, fall back to Claude |
Apply the smallest reversible fix:
- Loop → change approach (different tool, different file, different hypothesis)
- Ambiguity → escalate to user with specific question
- Context overflow → /compact at the nearest phase boundary, then re-read critical files
- Codex unavailable → Claude proceeds directly (per codex-orchestration § Fallback Protocol)
If fix doesn't work after 1 attempt → stop and report to user: "I'm stuck. Symptom: X. Tried: Y. Need your input on Z."

Anti-pattern: silent retry loops

Avoid retrying the same failing operation more than twice without changing something. The third attempt should use a different approach, or escalate to the user.

Anti-Patterns

Both agents only receive executor's paraphrase of the request
Reviewer only gives style comments without checking goal alignment
Forcing dual-agent on trivial tasks where cost exceeds benefit
Describing process or governance improvements as model/algorithm/detector performance gains
Treating a decision-critical claim as true because one agent sounds more confident
Marking work as satisfied, broken, or verified without a traceable evidence trail
Conceding to the other party's claim without verifying it first

Final Report Format

A final report typically includes:

Whether dual-agent mode was used
Reason for using or not using it
Executor output summary
Reviewer's key findings
Whether the raw request was satisfied
Ambiguities still requiring user decision
Key decision-critical claims with evidence trail or explicit assumption status
Experience distillation (1-2 sentences): did this task reveal a reusable pattern, a new failure mode, or a codebase invariant that should be captured? If yes, suggest where to record it:
- New or updated skill rule → name the skill and section
- New MEMORY.md entry → draft the one-liner
- New mandatory read → suggest the file and task area for docs/architecture-map.md
- Nothing worth capturing → state "No new reusable pattern identified" (this is a valid answer; do not force extraction)

Adoption

develata/dual-agent-original-request-review

$ install --global

Security Scan Results

SKILL.md

Raw-Requirement Dual-Agent Collaboration

When to Use

Boundary triggers (hook-enforced)

Other useful triggers (skill-level recommendation, not hook-gated)

When NOT to Use

Core defaults

Default Role Assignment

Standard Process

Two-File Handoff

BRIEF §0 DR check (added 2026-05-22)

When DR=yes

When DR=no (opt-out)

DR=maybe / defer

Project-local hook (if enabled)

Calibration history

Verification Discipline

When to verify

When verification is NOT required

Who performs verification

How (ordered by cost — prefer the cheapest sufficient)

What counts as a sufficient trail

Where the trail is recorded

Failure mode

Executor Task Packet Template (/tmp/codex_task.md)

Reviewer Task Packet Template (/tmp/codex_task.md)

Acceptance Checklist Template

Conflict Resolution

Stuck Detection & Recovery

Stuck signals (any one triggers diagnosis)

Diagnosis steps (in order)

Anti-pattern: silent retry loops

Anti-Patterns

Final Report Format

Related Skills

develata/codex-tokens

develata/user-explanation-5step

develata/terminology-audit

develata/codex-dispatch-watchdog

develata/dual-agent-original-request-review

$ install --global

Security Scan Results

SKILL.md

Raw-Requirement Dual-Agent Collaboration

When to Use

Boundary triggers (hook-enforced)

Other useful triggers (skill-level recommendation, not hook-gated)

When NOT to Use

Core defaults

Default Role Assignment

Standard Process

Two-File Handoff

BRIEF §0 DR check (added 2026-05-22)

When DR=yes

When DR=no (opt-out)

DR=maybe / defer

Project-local hook (if enabled)

Calibration history

Verification Discipline

When to verify

When verification is NOT required

Who performs verification

How (ordered by cost — prefer the cheapest sufficient)

What counts as a sufficient trail

Where the trail is recorded

Failure mode

Executor Task Packet Template (/tmp/codex_task.md)

Reviewer Task Packet Template (/tmp/codex_task.md)

Acceptance Checklist Template

Conflict Resolution

Stuck Detection & Recovery

Stuck signals (any one triggers diagnosis)

Diagnosis steps (in order)

Anti-pattern: silent retry loops

Anti-Patterns

Final Report Format

Executor Task Packet Template (`/tmp/codex_task.md`)

Reviewer Task Packet Template (`/tmp/codex_task.md`)

Executor Task Packet Template (`/tmp/codex_task.md`)

Reviewer Task Packet Template (`/tmp/codex_task.md`)