Stepwise Codex Orchestration

Execute tasks by invoking the codex agent. Handles both single prompts and multi-step task documents (markdown files or GitHub issues). Processes actions sequentially and continues autonomously until all work is complete or an unresolvable blocker is encountered.

This command operates in fully autonomous mode. Once started, it MUST NOT pause, stop, or wait for user input until completion or blocking error.

Boy Scout Principle

Each orchestration loop must leave the repo and task source in a better state: close out stale TODOs, fix adjacent lint/test issues, and tidy drafts discovered while processing
When parsing multi-step docs uncovers duplicated items or outdated instructions, clean them immediately so downstream runs inherit precise guidance
Include cleanup proof (updated task docs, commit hashes, CI logs) in the per-step evidence you report back to the user

OPERATING MODES

Mode 1: Single Prompt Execution

User provides an explicit task as a string argument.

Example: /codex "Implement error handling in parser module"

The command will:

Create temporary task document with single item
Invoke codex agent with prompt as objective
Agent creates draft, calls Codex CLI, reviews, fixes, commits, pushes
Command receives result and reports to user

Mode 2: Multi-Step Task Document

User provides path to markdown file or GitHub issue with TODO items.

Example: /codex tasks/refactor.md or /codex 42

The command will:

Parse all TODO items from source
Execute each item in sequence (see full procedure below)
Autonomous loop until all items complete
Final comprehensive report

CORE OPERATING RULES

Autonomous Execution

Execute continuously through all steps without interruption
Do not pause between steps
Do not ask for user confirmation or approval
Only stop for unresolvable blockers that require user intervention

Evidence-Based Progress

Verify every step completion with concrete evidence (test output, build logs, artifacts)
Update task document after each completed step
Report progress concisely to user after each step
Never claim completion without proof

Precise Task Management

Process exactly one task step per Codex agent invocation
Perform self-review and apply follow-up fixes immediately after each step
Continue looping until every task item is fully satisfied
Escalate only if blocker cannot be resolved autonomously

Architectural Discipline

Keep orchestration responsibilities separate: this command coordinates, the codex agent drafts, and downstream tools execute so that each layer stays singly focused
When preparing context, pass only minimal contracts needed by the agent to maintain weak coupling and allow independent evolution of command and agent logic
If a task spans unrelated domains, split it into discrete invocations so every execution path respects the single responsibility principle

REQUIRED INPUTS

When user invokes /codex:

TASK_INPUT (required): One of:
- Explicit prompt string (quoted): /codex "Add tests for module X"
- Path to markdown task file: /codex tasks/refactor.md
- GitHub issue number/URL: /codex 42 or /codex https://github.com/user/repo/issues/42
MISSION_CLASS (optional): implementation, review, triage, etc.

INPUT DETECTION

The command automatically detects the input type:

Check if input is a file path (exists on disk) -> Mode 2: Multi-Step
Check if input is numeric or GitHub issue URL -> Mode 2: Multi-Step
Otherwise treat as explicit prompt string -> Mode 1: Single Prompt

STEP-BY-STEP PROCEDURE

1. Initialize

For Mode 1 (Single Prompt):

Create temporary task document with single TODO item
Set item text to user's prompt
Proceed to step 2 with single-item list

For Mode 2 (Multi-Step):

Load TASK_INPUT (file or GitHub issue)
Parse all actionable TODO items
Enumerate items in explicit execution order
Verify document is well-formed and unambiguous

2. For Each Step (Loop Until Complete)

a. Prepare Context

Extract current step objective
Identify completed steps (for context)
Identify remaining steps (for planning)
Gather repository state (branch, workspace status)

b. Invoke Codex Agent

Call codex agent with current step as mission
Provide full context (completed/remaining steps, repo state, constraints)
Agent creates draft, calls Codex CLI, reviews output, fixes issues, commits, and pushes
Wait for agent to return JSON result

c. Process Results

Parse JSON response from agent
Verify evidence exists for claimed success
Agent has already reviewed, fixed issues, and committed/pushed
If blocked/failed, assess if autonomous resolution possible
If blocker is resolvable, create fix step and continue
If blocker is unresolvable, escalate to user with full context

d. Update and Report

Update TASK_SOURCE with progress (checkboxes, status, evidence links)
Commit task source update if modified
Report brief status to user (2-3 sentences with evidence)

e. Continue Immediately

Proceed to next step without waiting for user input
Repeat loop until no steps remain

3. Completion

Verify all task items marked complete
Confirm all evidence captured
Provide comprehensive final report to user

CONTEXT PASSED TO CODEX AGENT

Each agent invocation receives:

CURRENT_STEP
The single task objective for this iteration

GLOBAL_CONTEXT
- Task source: {{task_source}}
- Mission class: {{mission_class}}
- Completed steps: {{completed_steps}}
- Remaining steps: {{remaining_steps}}
- Repository root: {{repo_root}}
- Current branch: {{branch_name}}
- Workspace summary: {{workspace_summary}}
- Linked references: {{linked_references}}

CONSTRAINTS
- All CLAUDE.md standards (size limits, style rules, git discipline)
- Evidence required for success claims
- Explicit file staging only
- [Task-specific constraints]

VERIFICATION
- Primary test command
- Build/lint commands if applicable

EXPECTED_OUTPUT
JSON report with: status, summary, changes, tests, artifacts, follow_up, notes

The agent is responsible for:

Creating detailed draft with gaps marked
Gathering complete context for Codex CLI
Rendering the Codex CLI prompt template
Executing codex exec once
Capturing and parsing results
Reviewing Codex output for correctness, completeness, cruft
Autonomously fixing any issues found
Updating task file/issue if it exists
Staging, committing, and pushing changes
Returning structured JSON to this command

USER REPORTING

After each step completion:

Summarize in 2-3 sentences what changed
Include evidence references (test output, build logs, file paths)
Note any follow-up actions applied during review
Do not wait for acknowledgment; proceed immediately to next step

After all steps complete:

Provide comprehensive wrap-up
Confirm task document reflects full completion
List all commits pushed
Highlight any remaining follow-up items

BLOCKER HANDLING

If a step cannot be completed autonomously:

Assess Resolvability
- Can the blocker be fixed by adjusting approach?
- Is additional information available in repo/docs?
- Can constraints be satisfied with alternative implementation?
Attempt Autonomous Fix (if possible)
- Create new step to address blocker
- Execute fix step using same loop
- Resume original step after fix
Escalate (if truly unresolvable)
- Capture complete failure context
- Mark task item as blocked in source document
- Report to user with:
  - Step objective
  - Blocker description
  - Evidence of failure
  - Attempted resolutions
  - Required user action

ALIGNMENT WITH CODEX AGENT

The codex agent (agents/codex.md) handles:

Single Codex CLI execution per invocation
Prompt template rendering
Evidence collection and validation
JSON result formatting
Post-execution review and fixes
Git operations (stage, commit, push)

This command handles:

Multi-step orchestration
Task document parsing and metadata updates
Step sequencing and context management
Autonomous loop control
User progress reporting
Blocker escalation decisions

The division ensures:

Agent does ALL the work for one step (draft -> codex -> review -> fix -> commit)
Command orchestrates multiple steps and updates task metadata
No duplication of responsibilities
Clear handoff protocol (command -> agent -> command)

Stepwise Codex Orchestration

This command operates in fully autonomous mode. Once started, it MUST NOT pause, stop, or wait for user input until completion or blocking error.

Boy Scout Principle

Each orchestration loop must leave the repo and task source in a better state: close out stale TODOs, fix adjacent lint/test issues, and tidy drafts discovered while processing
When parsing multi-step docs uncovers duplicated items or outdated instructions, clean them immediately so downstream runs inherit precise guidance
Include cleanup proof (updated task docs, commit hashes, CI logs) in the per-step evidence you report back to the user

OPERATING MODES

Mode 1: Single Prompt Execution

User provides an explicit task as a string argument.

Example: /codex "Implement error handling in parser module"

The command will:

Create temporary task document with single item
Invoke codex agent with prompt as objective
Agent creates draft, calls Codex CLI, reviews, fixes, commits, pushes
Command receives result and reports to user

Mode 2: Multi-Step Task Document

User provides path to markdown file or GitHub issue with TODO items.

Example: /codex tasks/refactor.md or /codex 42

The command will:

Parse all TODO items from source
Execute each item in sequence (see full procedure below)
Autonomous loop until all items complete
Final comprehensive report

CORE OPERATING RULES

Autonomous Execution

Execute continuously through all steps without interruption
Do not pause between steps
Do not ask for user confirmation or approval
Only stop for unresolvable blockers that require user intervention

Evidence-Based Progress

Verify every step completion with concrete evidence (test output, build logs, artifacts)
Update task document after each completed step
Report progress concisely to user after each step
Never claim completion without proof

Precise Task Management

Process exactly one task step per Codex agent invocation
Perform self-review and apply follow-up fixes immediately after each step
Continue looping until every task item is fully satisfied
Escalate only if blocker cannot be resolved autonomously

Architectural Discipline

Keep orchestration responsibilities separate: this command coordinates, the codex agent drafts, and downstream tools execute so that each layer stays singly focused
When preparing context, pass only minimal contracts needed by the agent to maintain weak coupling and allow independent evolution of command and agent logic
If a task spans unrelated domains, split it into discrete invocations so every execution path respects the single responsibility principle

REQUIRED INPUTS

When user invokes /codex:

TASK_INPUT (required): One of:
- Explicit prompt string (quoted): /codex "Add tests for module X"
- Path to markdown task file: /codex tasks/refactor.md
- GitHub issue number/URL: /codex 42 or /codex https://github.com/user/repo/issues/42
MISSION_CLASS (optional): implementation, review, triage, etc.

INPUT DETECTION

The command automatically detects the input type:

Check if input is a file path (exists on disk) -> Mode 2: Multi-Step
Check if input is numeric or GitHub issue URL -> Mode 2: Multi-Step
Otherwise treat as explicit prompt string -> Mode 1: Single Prompt

STEP-BY-STEP PROCEDURE

1. Initialize

For Mode 1 (Single Prompt):

Create temporary task document with single TODO item
Set item text to user's prompt
Proceed to step 2 with single-item list

For Mode 2 (Multi-Step):

Load TASK_INPUT (file or GitHub issue)
Parse all actionable TODO items
Enumerate items in explicit execution order
Verify document is well-formed and unambiguous

2. For Each Step (Loop Until Complete)

a. Prepare Context

Extract current step objective
Identify completed steps (for context)
Identify remaining steps (for planning)
Gather repository state (branch, workspace status)

b. Invoke Codex Agent

Call codex agent with current step as mission
Provide full context (completed/remaining steps, repo state, constraints)
Agent creates draft, calls Codex CLI, reviews output, fixes issues, commits, and pushes
Wait for agent to return JSON result

c. Process Results

Parse JSON response from agent
Verify evidence exists for claimed success
Agent has already reviewed, fixed issues, and committed/pushed
If blocked/failed, assess if autonomous resolution possible
If blocker is resolvable, create fix step and continue
If blocker is unresolvable, escalate to user with full context

d. Update and Report

Update TASK_SOURCE with progress (checkboxes, status, evidence links)
Commit task source update if modified
Report brief status to user (2-3 sentences with evidence)

e. Continue Immediately

Proceed to next step without waiting for user input
Repeat loop until no steps remain

3. Completion

Verify all task items marked complete
Confirm all evidence captured
Provide comprehensive final report to user

CONTEXT PASSED TO CODEX AGENT

Each agent invocation receives:

CURRENT_STEP
The single task objective for this iteration

GLOBAL_CONTEXT
- Task source: {{task_source}}
- Mission class: {{mission_class}}
- Completed steps: {{completed_steps}}
- Remaining steps: {{remaining_steps}}
- Repository root: {{repo_root}}
- Current branch: {{branch_name}}
- Workspace summary: {{workspace_summary}}
- Linked references: {{linked_references}}

CONSTRAINTS
- All CLAUDE.md standards (size limits, style rules, git discipline)
- Evidence required for success claims
- Explicit file staging only
- [Task-specific constraints]

VERIFICATION
- Primary test command
- Build/lint commands if applicable

EXPECTED_OUTPUT
JSON report with: status, summary, changes, tests, artifacts, follow_up, notes

The agent is responsible for:

Creating detailed draft with gaps marked
Gathering complete context for Codex CLI
Rendering the Codex CLI prompt template
Executing codex exec once
Capturing and parsing results
Reviewing Codex output for correctness, completeness, cruft
Autonomously fixing any issues found
Updating task file/issue if it exists
Staging, committing, and pushing changes
Returning structured JSON to this command

USER REPORTING

After each step completion:

Summarize in 2-3 sentences what changed
Include evidence references (test output, build logs, file paths)
Note any follow-up actions applied during review
Do not wait for acknowledgment; proceed immediately to next step

After all steps complete:

Provide comprehensive wrap-up
Confirm task document reflects full completion
List all commits pushed
Highlight any remaining follow-up items

BLOCKER HANDLING

If a step cannot be completed autonomously:

Assess Resolvability
- Can the blocker be fixed by adjusting approach?
- Is additional information available in repo/docs?
- Can constraints be satisfied with alternative implementation?
Attempt Autonomous Fix (if possible)
- Create new step to address blocker
- Execute fix step using same loop
- Resume original step after fix
Escalate (if truly unresolvable)
- Capture complete failure context
- Mark task item as blocked in source document
- Report to user with:
  - Step objective
  - Blocker description
  - Evidence of failure
  - Attempted resolutions
  - Required user action

ALIGNMENT WITH CODEX AGENT

The codex agent (agents/codex.md) handles:

Single Codex CLI execution per invocation
Prompt template rendering
Evidence collection and validation
JSON result formatting
Post-execution review and fixes
Git operations (stage, commit, push)

This command handles:

Multi-step orchestration
Task document parsing and metadata updates
Step sequencing and context management
Autonomous loop control
User progress reporting
Blocker escalation decisions

The division ensures:

Agent does ALL the work for one step (draft -> codex -> review -> fix -> commit)
Command orchestrates multiple steps and updates task metadata
No duplication of responsibilities
Clear handoff protocol (command -> agent -> command)

Adoption

krystophny/codex

$ install --global

Security Scan Results

SKILL.md

Stepwise Codex Orchestration

Boy Scout Principle

OPERATING MODES

Mode 1: Single Prompt Execution

Mode 2: Multi-Step Task Document

CORE OPERATING RULES

Autonomous Execution

Evidence-Based Progress

Precise Task Management

Architectural Discipline

REQUIRED INPUTS

INPUT DETECTION

STEP-BY-STEP PROCEDURE

1. Initialize

2. For Each Step (Loop Until Complete)

a. Prepare Context

b. Invoke Codex Agent

c. Process Results

d. Update and Report

e. Continue Immediately

3. Completion

CONTEXT PASSED TO CODEX AGENT

USER REPORTING

BLOCKER HANDLING

ALIGNMENT WITH CODEX AGENT

Related Skills

krystophny/pi

krystophny/opencode

krystophny/slopvault

krystophny/sloptools

krystophny/codex

$ install --global

Security Scan Results

SKILL.md

Stepwise Codex Orchestration

Boy Scout Principle

OPERATING MODES

Mode 1: Single Prompt Execution

Mode 2: Multi-Step Task Document

CORE OPERATING RULES

Autonomous Execution

Evidence-Based Progress

Precise Task Management

Architectural Discipline

REQUIRED INPUTS

INPUT DETECTION

STEP-BY-STEP PROCEDURE

1. Initialize

2. For Each Step (Loop Until Complete)

a. Prepare Context

b. Invoke Codex Agent

c. Process Results

d. Update and Report

e. Continue Immediately

3. Completion

CONTEXT PASSED TO CODEX AGENT

USER REPORTING

BLOCKER HANDLING

ALIGNMENT WITH CODEX AGENT

Related Skills

krystophny/pi

krystophny/opencode

krystophny/slopvault

krystophny/sloptools