skills/fleet/SKILL.md
Parallel campaign orchestrator. Runs multiple campaigns in coordinated waves within a single session. Spawns 2-3 agents per wave in isolated worktrees, collects discoveries, shares context between waves. Use when work decomposes into 3+ independent streams that can run simultaneously.
npx skillsauth add SethGammon/Citadel fleetInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Use for 3+ independent work streams that can run simultaneously in isolated worktrees. Do NOT use for single-file scope, linear work, or when a marshal or skill suffices.
Use when: Running 2+ independent work streams in parallel — tasks with non-overlapping file scopes that can execute simultaneously.
Don't use when: Work must execute sequentially or accumulate findings across phases (use /archon), a single orchestrated session is enough (use /marshal), or the task is simple enough for a bare skill.
| Command | Behavior |
|---|---|
| /fleet [direction] | Decompose direction into parallel streams, execute in waves |
| /fleet [path-to-spec] | Read a spec file, decompose into streams |
| /fleet continue | Resume from the last fleet session file |
| /fleet (no args) | Health diagnostic → work queue → execute |
| /fleet --quick [task1]; [task2] | Lightweight parallel mode for solo devs — 2+ tasks, single wave, auto-merge, no session file |
| /fleet --speculative N [direction] | Try N different approaches to the same task in parallel — see Speculative Mode below |
.planning/campaigns/ for active campaigns.planning/coordination/claims/ for external claims.planning/momentum.json exists, run
node .citadel/scripts/momentum-read.cjs
and read the output. Use the active scopes and recurring decisions to inform
work queue prioritization. Skip silently if the file is absent or output is empty.Wave context restoration: Use the Claude Code Compaction API to restore fleet session context at the start of each session. Do NOT read
.claude/compact-state.json— that pattern is deprecated in favour of server-side compaction (available on Opus 4.6+). Fleet session files (.planning/fleet/session-{slug}.md) remain the source of truth for inter-wave discovery relay; compaction handles agent memory, not campaign state. If the Compaction API is unavailable, fall back to reading the fleet session file's Continuation State directly.
node .citadel/scripts/telemetry-log.cjs --event campaign-start --agent fleet --session {session-slug}
node .citadel/scripts/momentum-watch-start.cjs
The watcher runs in the background and re-synthesizes momentum.json within 500ms of any new discovery write. Safe to call if already running — only one watcher runs per project.
Produce a ranked list of campaigns with:
| Column | Purpose | |---|---| | Campaign name | What this stream does | | Scope | Which directories it touches | | Dependencies | What must complete before this can start | | Wave | Which wave to assign it to | | Agent type | What kind of agent to spawn |
Rules for work queue:
node scripts/fleet-steward.js --session .planning/fleet/session-{slug}.md
and use its READY TO RUN, BLOCKED, MERGE NEXT, MERGE BLOCKED, and
SCOPE CONFLICTS sections as the operational DAG.READINESS BLOCKED, do not spawn that task in a
high-autonomy wave unless a human verified the worktree and you pass
--override-readiness intentionally.For each wave:
Prepare context for each agent:
.claude/agent-context/rules-summary.md.planning/map/index.json exists): run
node scripts/map-index.js --slice "<agent's scope keywords>" --max-files 15
and inject the generated === MAP SLICE === block. If the index does
not exist, skip silently.momentum.json fresh at each
wave boundary via node .citadel/scripts/momentum-read.cjs and inject as a
=== PRIOR SESSION CONTEXT === block. Re-reading (rather than reusing the
Step 1 snapshot) picks up discoveries written by parallel Fleet sessions in
other terminals. If the output is empty, skip silently.node scripts/sandbox-provider.js status --provider worktree --worktree {path}Log wave start:
node .citadel/scripts/telemetry-log.cjs --event wave-start --agent fleet --session {session-slug} --meta '{"wave":N,"agents":["name1","name2"]}'
Spawn agents with isolation: "worktree":
Agent(
prompt: "{full context + direction}",
isolation: "worktree",
mode: "bypassPermissions"
)
Collect results from all agents in the wave
4.5. Validate wave results — spawn one Phase Validator per agent in parallel (validators are Haiku, read-only, effort: low):
Agent(
subagent_type: "citadel:phase-validator",
prompt: "Campaign: {session-slug}. Wave {N} agent: {agent-name}.
Exit conditions: {agent's scope goal and any stated conditions}.
HANDOFF: {agent's full handoff text}",
effort: "low"
)
Collect all validator verdicts. For each agent:
verdict: "pass": mark agent validated in session file. Proceed.verdict: "fail": check retry counter for this agent (max 2 retries in fleet;
single-session so lower budget than Archon's 3):
conditions_failed and suggestions appended to its prompt.
Collect its result and re-validate. Decrement counter.partial in session file. Log
validator_halt: {agent-name} wave {N} — {conditions_failed}. Continue.Run all validators for the wave in a single parallel batch — do not validate sequentially. The cost is proportional to the wave size, not multiplicative.
4.75. Validate task exit evidence if the Fleet session file has an
## Exit Evidence table:
node scripts/evidence-validate.js --file .planning/fleet/session-{slug}.md --target task:{id}
Failed required evidence creates a repair task or blocks advancement based on its retries remaining.
Log per-agent results:
node .citadel/scripts/telemetry-log.cjs --event agent-complete --agent {agent-name} --session {session-slug} --status {success|partial|failed}
Compress discoveries for each agent:
node .citadel/scripts/compress-discovery.cjs on each output.planning/fleet/briefs/6b. Write persistent discovery records for each agent (cross-session memory):
node .citadel/scripts/discovery-write.cjs \
--session {session-slug} \
--agent {agent-name} \
--wave {wave-number} \
--status {success|partial|failed} \
--scope "{comma-separated-scope-dirs}" \
--handoff "{json-array-of-handoff-items}" \
--decisions "{json-array-of-decisions}" \
--files "{json-array-of-files-touched}" \
--failures "{json-array-of-failures}"
Log wave complete:
node .citadel/scripts/telemetry-log.cjs --event wave-complete --agent fleet --session {session-slug} --meta '{"wave":N,"status":"complete"}'
Merge branches from worktrees:
node scripts/fleet-steward.js --session .planning/fleet/session-{slug}.mdMERGE NEXTUpdate session file with wave results and accumulated discoveries
After all waves:
node scripts/run-with-timeout.js 300 <typecheck-cmd>wave_test_fail: true in the session file.completednode .citadel/scripts/telemetry-log.cjs --event campaign-complete --agent fleet --session {session-slug}
node .citadel/scripts/momentum-synthesize.cjs
5.5. Propagate knowledge — for each campaign that completed this session, run:
npm run propagate -- --campaign {slug}
Run once per completed campaign slug (not per wave). If multiple campaigns
completed, run for each slug. If npm run propagate is unavailable, note each
slug in the fleet session file under ## Pending Propagation.
6. Output final HANDOFF
Create at .planning/fleet/session-{slug}.md:
# Fleet Session: {name}
Status: active | needs-continue | completed
Started: {ISO timestamp}
Direction: {original direction}
## Work Queue
| # | Campaign | Scope | Deps | Status | Wave | Agent | Branch | Evidence |
|---|----------|-------|------|--------|------|-------|--------|----------|
| 1 | {name} | {dirs} | none | pending | - | - | - | - |
## Wave N Results
### Agent: {name}
**Status:** complete | partial | failed
**Built:** ... **Decisions:** ... **Files:** ...
## Shared Context (Discovery Relay)
- {cross-agent finding → what Wave N+1 should know}
## Continuation State
Next wave: N Blocked items: ... Auto-continue: true
Before assigning agents to a wave:
src/api/ and src/api/auth/ OVERLAP (parent/child)src/api/ and src/ui/ do NOT overlap (siblings)(read-only) scopes never conflictAlso check .planning/coordination/claims/ for external claims.
Effort hints for wave agents (use the effort parameter, not budget_tokens):
| Agent Type | Effort | ~Tokens |
|---|---|---|
| Fleet scouts (research, mapping, audit) | medium | ~100K each |
| Execution agents (build, refactor, implement) | high | ~250K each |
| Verify agents (typecheck, visual-verify, QA) | low | ~60K each |
The effort parameter is GA as of April 2026 and produces ~20–40% token reduction
compared to manually tuned budget_tokens values. Always prefer effort for new wave definitions.
Sub-agents can hang indefinitely on tool calls. Fleet must enforce execution time limits at the orchestrator level.
| Agent Type | Default Timeout | Override Key |
|---|---|---|
| Skill-level agents | 10 minutes | agentTimeouts.skill |
| Research scouts | 15 minutes | agentTimeouts.research |
| Build agents | 30 minutes | agentTimeouts.build |
Timeouts are configurable in harness.json:
{
"agentTimeouts": {
"skill": 600000,
"research": 900000,
"build": 1800000
}
}
On timeout: log agent-timeout event, extract partial HANDOFF if present, retry once with simplified prompt (Wave 1 critical scope only), skip otherwise. Never block the wave. Record Status: timed out in session file.
Read timeout values from harness.json → agentTimeouts.{skill|research|build} (defaults: 600000/900000/1800000ms).
Parallel agents writing to the same .planning/ directory can silently overwrite
each other. Each shared resource has a declared merge strategy:
| Resource | Strategy | Rule |
|---|---|---|
| .planning/discoveries/*.md | append-only | Never overwrite an existing discovery file. Each agent writes its own uniquely-named file: {agent-id}-{timestamp}.md. |
| .planning/fleet/briefs/*.md | append-only | Each agent writes its own brief. Fleet coordinator reads all of them. |
| .planning/fleet/session-{slug}.md | lock-on-write | Only the Fleet coordinator updates the session file. Agents never write to it directly — they emit HANDOFFs that the coordinator reads and transcribes. |
| .planning/campaigns/{slug}.md | lock-on-write | Only the owning Archon instance updates this file. Fleet agents that produce campaign-adjacent work report it in their HANDOFF and let Archon update. |
| .planning/telemetry/*.jsonl | append-only | Append-only log. Multiple agents may append simultaneously — JSONL format guarantees each line is an atomic write. |
| .planning/wiki/_staging/*.jsonl | append-only | Each agent writes a uniquely-named staging file. Compile step resolves all of them in order. No concurrent write conflicts possible. |
| .planning/coordination/claims/*.json | lock-on-write | Each agent owns its own claim file (named by instance ID). No sharing; no conflicts. |
Enforcement: Before Step 3 (spawn agents), remind each agent in its context injection which paths it may write to and what the strategy is for each. Agents that attempt to modify lock-on-write resources they don't own must be blocked via scope claim verification.
For decisions that cannot easily be undone — campaign completion approval, campaign merge to main, fleet abort — spawn 3 judgment agents in parallel and require 2/3 agreement before proceeding.
When to vote:
completed when any wave is partialfail (even after retries)How to vote:
Vote prompt: "Fleet session {slug}, Wave {N}. The proposed decision is: {decision}.
Evidence: {handoff summaries, validator results}.
Should we proceed? Respond with JSON: {verdict: 'proceed'|'block', reason: '...'}"
Spawn 3 Phase Validators with the vote prompt. Collect results. Tally:
If voting agents time out: count the timeout as a proceed vote (conservative — don't let validator failure park the fleet).
Skip voting when the decision is clearly safe (all waves complete, all validators pass).
Every agent spawned by Fleet must have a unique instance ID.
Format: fleet-{session-slug}-{wave}-{agent-index}
Example: fleet-auth-refactor-w1-a3 (wave 1, agent 3)
The instance ID is:
.fleet-instance-idBefore spawning: compare all agent scopes pairwise (directory scopes overlap any file inside them). On overlap: merge tasks, narrow scopes, or sequence. NEVER proceed with overlapping scopes.
After each wave: read .planning/coordination/claims/, verify each instance is still alive (worktree exists + HANDOFF present). Release orphaned claims, return uncompleted scope to next wave's queue.
.planning/fleet/ does not exist: Create the directory before writing the session file..planning/ does not exist: Create .planning/fleet/ before starting. If .planning/coordination/ is absent, skip scope claim registration.partial. Log wave_validator_halt. Escalate to the user before proceeding to the next wave — partial wave results may invalidate downstream wave assumptions.node scripts/fleet-steward.js --session .planning/fleet/session-{slug}.md --mark-failed {id} --reason "{reason}" --write
to mark the row failed and add a repair task with inherited dependencies./fleet --speculative N [direction]
Try N different approaches to the same task simultaneously. Each approach gets its own worktree and branch. When all finish, the user picks the winner; losers are archived (not deleted).
Before spawning, enumerate N distinct approaches. Each approach must:
Each agent gets:
speculative/{session-slug}/{strategy-label}branch and worktree_status: active in its campaign frontmatterSpawn with isolation: "worktree". Scope overlap rules do NOT apply between speculative
agents — they will all touch the same files intentionally.
After all agents complete, for each:
node scripts/run-with-timeout.js 300 <typecheck-cmd>Present a comparison table to the user:
| Strategy | Branch | Typecheck | Key Decision | Notable Tradeoffs | |----------|--------|-----------|--------------|-------------------|
If ALL N approaches fail typecheck: present the comparison table with all entries marked FAIL typecheck. Ask the user to pick the least-broken approach or abort. Do not proceed to Step 4 without a user decision.
When the user picks a winner:
worktree_status: merged, proceed with normal mergeworktree_status: archived. Do NOT delete branches.# Optional: tag losers for clarity
git tag archive/{loser-branch} {loser-branch}
Add ## Speculative Comparison to session file: direction, N strategies, comparison table (strategy/branch/status/typecheck/notes), winner, merge timestamp.
/fleet --quick [task1]; [task2]; [task3]
| Property | Standard Fleet | Quick Mode |
|---|---|---|
| Min streams | 3 | 2 |
| Min complexity | 4 | 3 |
| Waves | Multi-wave with discovery relay | Single wave only |
| Session file | Written to .planning/fleet/ | Skipped — results reported inline |
| Discovery briefs | Compressed to .planning/fleet/briefs/ | Skipped |
| Merge | Per-wave confirmation | Auto-merge if no conflicts |
| Scope claim | Written to coordination/ | Skipped |
--quick argument (semicolon-separated)isolation: "worktree"/do routes to --quick mode (not standard fleet) when:
Entry from /do confirmation prompt: user chose yes (1) or always (2). Preferences stored under consent.fleetSpawn in harness.json via readConsent/writeConsent.
Red actions require explicit confirmation regardless of trust level.
Read trust level from harness.json:
Update the session file, then output:
---HANDOFF---
- Fleet session: {name} — {waves completed} waves, {agents} agents total
- Built: {summary of all wave results}
- Discoveries: {key cross-agent findings}
- Merge conflicts: {count and resolution}
- Next: {remaining work if any}
- Reversibility: amber -- multi-wave merges, revert each wave's merge commit
---
development
First-run experience for the harness. Three modes: Recommended (guided, ~3 min), Full Tour (guided + skill walkthrough, ~8 min), and Express (zero questions, ~30 sec). Installs hooks first, detects stack, configures harness.json, runs a live demo on real code, and prints a reference card.
development
Knowledge compiler. Extracts patterns, decisions, and anti-patterns from completed campaigns and evolve cycles, then compiles them into structured wiki pages that integrate with existing knowledge rather than appending isolated files. Implements flush→compile→lint pipeline. Auto-triggered by /postmortem and /evolve Phase 6.
tools
Unified router that auto-routes user intent to the right orchestrator or skill. Classifies input by scope, complexity, persistence needs, and parallelism, then dispatches to the cheapest path that can handle it: direct command, skill, marshal, archon, or fleet. Single entry point for all work.
data-ai
Real-time harness observability dashboard. Reads campaigns, fleet sessions, telemetry, and pending queues to present a snapshot of harness state at a glance. Invoked by /dashboard, /do status, or phrases like "what's happening" and "show activity".