Swarm

Subagent orchestration: delegated review, quorum consensus, scoped locks, clean-context merge, and post-merge review.

Purpose

Run subagents safely and predictably through reusable orchestration contracts.

Swarm is independent. It must not require concrete sibling skill names, private repositories, local model aliases, or a specific tool registry layout. Local agents may bind the contracts to their own tools, command templates, model names, and review protocols.

Core Concepts

Orchestrator: Main agent. Coordinates subagents, owns safety, and writes final artifacts.
Subagent: Bounded delegated model call for review, audit, merge, or scoped implementation.
Scope: File, directory, module, or logical domain that a subagent operates on.
Lock: Read or write ownership of a scope for bounded time.
Quorum: Multiple independent subagents reviewing the same target.
Merger: Clean-context fifth subagent that synthesizes raw quorum outputs.
Reviewer: Post-merge reviewer that checks report quality, not the original domain by default.
Async Run Adapter: Local async binding that starts, tracks, lists, tails, and cancels swarm runs through a generic lifecycle runtime.
Async Run: A local lifecycle envelope around a command-template swarm composer or utility. It owns state, logs, status, cancellation, and observability, not swarm semantics.
Lens: A deliberately narrow cognitive role assigned to one subagent, such as security, tests, architecture, economics, or operator UX.
Task Card: A bounded implementation assignment with goal, allowed files, avoided files, expected output, and validation gates.
Component Capability: An abstract adapter operation such as launcher, reviewer, verifier, merger, quorum, checkpoint, follow-up, judge, or normalizer. Swarm may target these capabilities, but local adapters bind them to concrete tools, recipes, command templates, async runs, or services.
Coordinator Checkpoint: A deliberate subagent pause where the subagent preserves its working context, sends a bounded question or status to the orchestrator, receives a coordinator reply, and continues in the same subagent context.
Evidence Checkpoint: A deliberate stop where a subagent records sources, assumptions, confidence, contradictions, or blocking evidence gaps before synthesis.
Integrator: The human or agent that merges isolated branches/worktrees into the shared target and owns conflict resolution.

Swarm Principle

One agent should hold one lens. Many independent lenses make the whole judgement stronger. Parallel lenses make the whole judgement faster.

Swarm is useful because focused subagents usually perform better than one overloaded model trying to reason about every concern at once. Each subagent receives the same target but a narrower mandate, reducing cognitive load and making misses less correlated. The merger then synthesizes the independent observations into one decision-grade artifact.

Use swarm when the work benefits from parallel cognitive lenses, not merely because more models are available. A good swarm decomposes the task into review surfaces that are independently meaningful, then preserves agreement, disagreement, and minority high-impact findings.

Swarm Shapes

Distinguish breadth from confidence before launching subagents.

Lens Swarm = different lenses on one object
Quorum     = one lens, multiple independent judges

Lens Swarm optimizes coverage. It asks which aspects matter and sends focused agents through different mandates: architecture, security, tests, UX, release, docs, economics, governance, or operations. Use it for brainstorming, product design, architecture exploration, development planning, and broad risk review.

Quorum optimizes confidence. It keeps the lens constant and varies models, seeds, prompts, or independent instances so several judges answer the same question. Use it mostly for verification: high-stakes review, security verdicts, ambiguous bug hypotheses, publish readiness, and final confidence checks.

Lens Swarm of Quorums optimizes both breadth and confidence. Each important lens gets its own mini-quorum, and a merger synthesizes across lenses while preserving vote counts and minority high-impact findings. This is expensive and should be reserved for high-stakes systems such as money movement, blockchain protocols, governance, security boundaries, migrations, or public releases.

Decision rule:

Need breadth?     Use Lens Swarm.
Need confidence?  Use Quorum.
Need both?        Use Lens Swarm of Quorums.

Work Modes

Swarm is not only for review. The same lens discipline supports ideation, research-style exploration, planning, implementation coordination, and verification.

Brainstorm Swarm

Purpose: turn many possible futures into one recommended direction.

Typical lenses:

product/user value
operator UX
protocol or platform cleanliness
implementation pragmatism
adoption/growth
skeptic/failure modes
constraints and portability

Merger output:

recommended direction
alternatives considered and rejected
open questions
first implementation slice
risks and validation gates

Brainstorm swarms should not vote mechanically. They should preserve creative tension and synthesize a direction that fits constraints.

Research Swarm

Purpose: turn a broad question into evidence-backed synthesis without letting one agent's search path dominate the answer.

Typical lenses:

question/scope formulation
source discovery
source verification
synthesis and contradiction mapping
devil's advocate / counter-evidence
ethics, policy, or stakeholder impact when relevant

Research swarm rules:

Scope first: the coordinator defines the question, non-goals, source classes, and stop condition before launching evidence work.
Search and verification are separate lenses when stakes are high; the searcher finds candidates, the verifier checks quality and support.
Every material claim in synthesis must trace to a source note, inspected artifact, or explicit uncertainty statement.
Contradictory evidence is preserved as a first-class output, not hidden by consensus language.
Evidence checkpoints block synthesis when sources are insufficient, unverifiable, conflicted, or ethically unsafe to use.

Merger output:

bounded research question
evidence map with confidence and source quality
areas of convergence and contradiction
limitations and not-checkable claims
recommended decision or next evidence slice

Research swarms are for inquiry and synthesis, not automatic publication pipelines. Keep report formatting local to the caller's domain.

Development Swarm

Purpose: turn one accepted direction into coordinated slices.

Recommended flow:

Planner → Lens Swarm → Merger → Scoped Implementers → Integrator → Review Swarm

Use scoped write locks or a repository-local soft-lock manifest when multiple implementers may edit files. Keep implementation agents bounded by module, artifact, or responsibility: code, tests, docs, examples, migration, or release notes. Prefer one owner per writable scope and run verification with fresh reviewers after implementation.

Small-Team Development Swarm

Also known as MAWP: Multi-Agent Worktree Protocol.

For 2–4 implementation agents, avoid building a heavyweight orchestrator. Use the minimal pattern:

1 project
→ 1 shared backlog
→ 2–4 isolated worktrees/branches
→ each agent owns one small task card
→ conflicts trigger structured reports
→ one integrator merges

Agents should not concurrently mutate the same shared checkout. Prefer isolated branches or worktrees:

git worktree add ../agent-a -b agent/a-task
git worktree add ../agent-b -b agent/b-task
git worktree add ../agent-c -b agent/c-task

Prefer splitting by mutation class, not by unrelated features:

2 agents: implementation; tests/docs/review
3 agents: implementation; tests; docs + review
4 agents: implementation; tests; docs/examples; integrator/refactor/audit

Stable pattern:

Agent A mutates behavior
Agent B verifies behavior
Agent C describes behavior
Agent D integrates behavior

Avoid the dangerous pattern where several agents implement unrelated features, refactor architecture, and clean shared types at the same time.

Every implementation agent receives a Task Card:

# Task

Goal:

- What needs to be done.

Allowed files:

- path/or/glob

Avoid files:

- path/or/glob

Expected output:

- patch
- short summary
- tests/checks run
- touched files list

For backlog-driven branch swarms, use stable task IDs and one scope file per agent. The coordinator partitions independent task cards, writes scope files with allowed files, avoided files, exit criteria, checks, branch name, and handoff path, then starts a local async-run adapter that runs each subagent in an isolated clone or worktree branch. Success means the expected branch contains a verified commit and has been pushed; partial branch failures are degraded runs for coordinator inspection, not automatic total failure.

For 2–4 agents, a simple .agents/locks.md soft-lock manifest is often enough. The manifest states each agent's task and owned files. Agents read it before starting, add their section before editing, avoid other agents' owned files, and remove or mark the section done after completion.

Use exclusive ownership for public API and central contract files. Examples: package manifests, schema files, public type modules, runtime roots, protocol specs, migration heads, and generated-client sources. If a task touches exclusive files, assign them to one agent or to the integrator.

Conflict handling is bounded:

No conflict       → agent handoff → integrator review → merge
Merge conflict   → both agents write Conflict Report → resolver merges
Semantic conflict→ stop affected workers → integrator replans
Architecture conflict → invalidate/split backlog task

Conflict reports exchange semantic deltas, not open-ended discussion: what changed, why, what must be preserved, what can be discarded, and suggested resolution. See docs/development-swarm.md for templates.

No agent may silently expand task scope. Do not opportunistically refactor unrelated code or edit outside declared files. If an out-of-scope change is needed, record it as a backlog item or ask the integrator to replan.

When a subagent is blocked by a coordinator-only decision, prefer a coordinator checkpoint over guessing or discarding context if the local runtime supports resumable subagent sessions. The checkpoint should contain the question, why coordinator input is needed, options considered, recommended option, risk if guessed, and state to preserve. The coordinator should answer only the checkpoint question; the subagent resumes in the same context and records the decision in its handoff.

Review Swarm

Purpose: turn one result into many risk lenses and a decision-grade verdict.

Use lens swarm for broad coverage, quorum for confidence on one critical judgement, or both for high-stakes releases. The final report should separate consensus findings, minority findings, merger findings, risks, and recommended next actions.

A review swarm synthesis must not fabricate claims. Every final finding should trace to a reviewer note, checked artifact, command output, source, or explicit merger rationale. Devil's Advocate critical findings must be preserved or explicitly disproved with evidence.

Lens Catalog

Choose lenses by risk. Do not run every lens by default; select the smallest set that covers the failure modes of the work.

General Software Lenses

Architecture: module boundaries, dependency direction, cohesion, coupling, extensibility, and long-term ownership.
Correctness: functional behavior, edge cases, invariants, state transitions, and input/output contracts.
Bug Hunter: likely defects, race conditions, null/empty cases, off-by-one errors, and broken assumptions.
Security: injection, privilege boundaries, path handling, secrets, deserialization, unsafe shell/process use, and abuse cases.
Tests: coverage quality, missing regressions, fixture realism, flaky risk, and validation gates.
Performance: algorithmic cost, latency, memory, IO, batching, backpressure, and hot paths.
Concurrency: locks, cancellation, timeouts, idempotency, retries, ordering, and stale state.
Data Integrity: migrations, schema compatibility, durability, rollback, deduplication, and corruption risk.
API Compatibility: public contract drift, backwards compatibility, versioning, deprecation, and client impact.
Operator UX: observability, error messages, status, logs, recovery paths, safe defaults, and support burden.
Developer UX: onboarding, local setup, naming, examples, docs, type ergonomics, and debugging clarity.
Product UX: user journey, surprising behavior, accessibility, copy, affordances, and feedback loops.
Release: changelog truth, version bump, package contents, CI gates, publish risk, and rollback readiness.
Documentation: README accuracy, examples, conceptual consistency, and stale references.
Compliance/Policy: licenses, privacy, retention, consent, audit trails, and organizational constraints.
Incident Response: blast radius, detection, containment, rollback, forensics, and runbook quality.
SRE/Reliability: availability, graceful degradation, monitoring, quotas, rate limits, and dependency failure.
Maintainability: simplicity, naming, duplication, dead code, testability, and future change cost.
Spec Consistency: whether code, docs, tests, prompts, and changelog describe the same behavior.

Complex System / Blockchain Lenses

Use these for blockchain, distributed systems, financial protocols, governance, or adversarial environments.

Protocol Invariants: conservation laws, supply rules, ledger consistency, finality assumptions, and impossible states.
Consensus Safety: fork choice, quorum thresholds, validator behavior, equivocation, reorg handling, and liveness tradeoffs.
Economic Security: incentives, MEV, griefing, sybil cost, fee dynamics, slashing, reward leakage, and manipulation paths.
Smart Contract Safety: reentrancy, authorization, upgradeability, storage layout, oracle trust, token standards, and invariant tests.
Cryptography: key management, signature/domain separation, nonce use, randomness, hash commitments, and proof assumptions.
Cross-Chain/Bridge: message replay, finality mismatch, validator set drift, withdrawal delays, and custody assumptions.
Governance: proposal lifecycle, quorum rules, timelocks, emergency powers, capture risk, and voter/operator UX.
Treasury/Accounting: balance reconciliation, fee distribution, rounding, precision, reserves, and auditability.
Adversarial Simulation: attacker goals, cheapest exploit path, denial-of-service vectors, sandwiching, and liquidation games.
Network/P2P: peer discovery, gossip propagation, eclipse risk, bandwidth limits, spam resistance, and partition behavior.
Node Operations: sync, snapshots, pruning, backups, observability, upgrades, config safety, and rollback.
Formal Methods: invariant specification, model checking candidates, property tests, and proof gaps.
Regulatory Surface: custody, KYC/AML implications, sanctions, securities risk, data retention, and jurisdictional assumptions.

Tool Contracts

These are abstract tool contracts. Register them in whatever local tool layer exists. Concrete syntax is an adapter detail outside this skill.

swarm_review: single subagent review.
swarm_quorum: multi-model independent review.
swarm_claim: acquire a scoped read/write lock.
swarm_release: release a scoped lock.
swarm_merge: synthesize raw quorum outputs.
swarm_post_merge_review: review the merged report quality.

File Handoff

Prefer path handoff over attachment syntax. Put the target path directly in the subagent prompt:

Review this local file: /path/to/file.md. First read it from disk.

Attachment syntax is optional local optimization, not a swarm contract. Path handoff keeps the skill portable across wrappers without attachment arguments.

`swarm_review`

Single subagent review.

Inputs:

scope: file path, directory, module, or logical domain.
model: model identifier in the local environment.
prompt: optional review lens or task-specific instructions.
thinking: optional reasoning depth.
tools: optional local tool allowlist.
timeout: timeout in seconds.

Behavior:

Optionally claim a read lock for the scope.
Spawn one subagent with a prompt that includes the scope path.
Require the subagent to read the target from disk before judging it.
Return the subagent's raw output.
Release the lock if one was claimed.

Reference prompt:

Review this local scope: <scope>.
First read it from disk.
Use the local review protocol if available.
Report white spots, contradictions, evidence, and risks.

Async Run Adapter

Async run management is an adapter concern, not a portable Swarm script requirement. For non-trivial asynchronous agentic work, use an async-run flow when the local runtime supports it: command-template execution plus a thin detached lifecycle envelope.

start: Launch a swarm run in the background and return run metadata.
status: Report whether the run is running, done, degraded, or failed.
tail: Show recent structured run events or raw logs.
list: Show known runs.
cancel: Stop an owned active run when the adapter can prove pid ownership.

Purpose: Keep the user interface responsive while reviewers, merger, and post-merge reviewer run. Generic run state belongs to a local async lifecycle runtime; swarm-specific execution stays in atomic utilities or command-template composition. The orchestrator should start the run, return metadata, then inspect status/tail after terminal events instead of blocking on sleeps or foreground waits.

Resumable checkpoint goal: Advanced adapters should strive to support a paused subagent that can ask the orchestrator for input and then resume in the same subagent context. The portable contract is a structured coordinator checkpoint plus a coordinator reply; the mechanism may be a TTY session, persistent model session, message queue, or runtime-specific resume token. If the runtime cannot preserve context, degrade to a handoff artifact and a new subagent, and mark the context loss explicitly.

Progress contract: async runs should expose structured state such as progress.json, events.jsonl, logs, and final result metadata. Local tools should read these files through async-run verbs instead of scraping process output.

Minimum state: an adapter should expose run_id, status, timestamps, state directory or output directory, recent events, stdout/stderr logs, and final result metadata.

Terminal statuses: done, failed, timeout, and cancelled are terminal. running and degraded are observable non-terminal states.

Cancellation boundary: cancel only an owned active run when pid ownership or runtime ownership can be verified. Stale pid reuse must fail closed.

Reference binding: Use a local generic async-run runtime or tool registry adapter. If the local runtime exposes a single action tool, bind these verbs as actions rather than adding more Swarm scripts. Swarm scripts themselves should stay atomic and narrowly specialized.

`swarm_quorum`

Multi-model review by independent subagents.

Inputs:

scope: target path or domain.
models: 2-6 model identifiers.
prompt: shared review lens.
thinking: per-model or shared reasoning depth.
timeout: per-subagent or whole-quorum timeout.
merge_mode: merge behavior, default consensus-first.
merger_model: explicit model for the clean merger.

Behavior:

Claim a shared read lock when the target is local and stable.
Spawn one subagent per model concurrently unless a local rate limit requires throttling.
Give every subagent the same target path and review lens.
Preserve every raw output.
Release the lock.
Run a clean-context merger subagent.
Store or return the merged report.
Optionally run post-merge review.

Reference binding: Implement this contract through a local adapter such as a command-template composer, registered tool, or async run. The portable Swarm skill intentionally does not ship a pi -p quorum runner.

Serious quorum rule: For high-stakes review, the merger is not the current orchestrator. The merger is a fifth clean-context subagent.

Lock Protocol

Locks prevent subagents from interfering with shared scopes. Locks are optional for read-only review, but required for concurrent mutation.

Lock Fields

scope: Scope identifier, usually a path or domain label.
owner: Subagent or quorum run that owns the lock.
type: read or write.
acquired: Acquisition time.
ttl: Time-to-live in seconds.
expires: Expiry time derived from acquisition plus TTL.

Conflict Rules

read request with no lock: allow.
read request with existing read: allow.
read request with existing write: conflict unless stale.
write request with no lock: allow.
write request with existing read or write: conflict unless stale.

Lifecycle

acquire: prune stale locks, then check conflicts.
hold: run the bounded subagent task.
release: remove the entry.
expire: any later lock operation may prune stale entries.

TTL rule: Every lock must have a TTL. A lock without expiry is invalid.

Reference implementation: scripts/swarm-lock.mjs exposes claim, release, status, and prune commands for local adapters and tests.

Validation

After changing Swarm scripts or adapter contracts, run:

node scripts/_self-test.mjs

This smoke gate checks lock behavior and script-boundary invariants: no broad coordinator and no pi -p-coupled quorum runner in portable Swarm scripts.

Merge Protocol

The merger is not a passive formatter. The merger is the final synthesis agent and can materially affect the output by choosing labels, grouping findings, deciding severity, preserving minority signals, and adding grounded judgement.

Merger influence: High.

Merger role: Read all raw outputs, deduplicate, rank, preserve dissent, and produce one decision-grade artifact. The merger may add its own finding only as merger finding and only when grounded in evidence.

Merger isolation: For serious quorum work, the merger should be a dedicated clean-context subagent with explicit model, thinking level, and minimal access. The orchestrator provides the input bundle and writes the final artifact after the merger returns.

Orchestrator merge exception: The orchestrator may merge quick, low-stakes, or exploratory quorum runs. The report must say the merge was not isolated.

Merge Parameters

merger_model: Explicit synthesis model. Required for quorum reports.
merger_context: Clean or inherited context. Default clean for serious quorum.
merger_tools: Tool access granted to merger. Prefer none if raw text is embedded; otherwise read-only access to raw outputs.
merger_thinking: Reasoning depth. Use medium normally, high for architecture, security, money, governance, migrations, or specs.
merge_mode: Synthesis style. Default consensus-first.
consensus_threshold: Votes needed to promote. Use 3/4 for critical promotion and 2/4 for major discussion by default.
minority_policy: Treatment of unique findings. Preserve high-impact, evidence-backed minority findings.
attribution: Which model found what. Required for quorum reports.
raw_retention: Keep raw outputs until the final report is accepted.

Merge Modes

faithful: Minimal editing; preserves subagent wording and uncertainty.
consensus-first: Groups by agreement and preserves dissent separately.
risk-first: Promotes high-impact minority findings even without consensus.
design-synthesis: Converts findings into decisions and repair order.

Merge Guardrails

Preserve raw outputs until the final report is accepted.
Do not hide model disagreement.
Do not drop high-impact minority findings only for lack of consensus.
Do not over-promote repeated low-value nitpicks.
Keep model attribution for every major finding.
Keep merger identity explicit: model, context, thinking, and tools.
Separate consensus finding, minority finding, and merger finding.
Separate current-quorum votes from prior-run corroboration.

Post-Merge Review

Run a reviewer on the merged report when the output will drive code, architecture, security, money, governance, migrations, or specifications.

Purpose: Review the review. The reviewer checks whether the merger preserved evidence, ranked severity honestly, avoided hallucinated synthesis, kept minority high-impact findings, and produced an actionable artifact.

Reviewer input:

target scope
raw subagent outputs
merged report
merge parameters
intended use of the report

Reviewer output: Meta-findings about report quality, not a second domain review unless explicitly requested.

Post-Merge Review Lens

Evidence: Does each major finding trace to raw outputs or clearly marked merger evidence?
Severity: Are labels justified by failure impact?
Consensus: Was agreement counted honestly?
Purity: Is current quorum separate from prior context?
Minority: Were unique high-impact findings preserved?
Bias: Did the merger impose an unsupported narrative?
Actionability: Are repair order, scope, and gates named?
Consistency: Do maps, headers, and attribution agree?
Compression: Were caveats or disagreements lost?

Post-Merge Decisions

Accept: Report is decision-grade.
Accept with notes: Report is useful, with caveats.
Revise merge: Rerun merger with adjusted parameters.
Rerun quorum: Raw outputs are too weak, divergent, or under-scoped.
Escalate: Run cross-merge with another clean merger.

Error Handling

LOCK_CONFLICT: Retry, back off, or choose another scope.
SUBAGENT_TIMEOUT: Keep partial results and release locks by TTL.
COORDINATOR_INPUT_REQUIRED: Pause only if the adapter can preserve subagent context; otherwise write a checkpoint artifact and stop degraded for coordinator replanning.
MODEL_FAILURE: Continue with fewer votes and mark degraded quorum.
MERGE_FAILURE: Return INSUFFICIENT_DATA or rerun merger.
REVIEW_FAILURE: Keep merged report and mark post-review missing.

Composition Contract

This skill composes with capabilities, not concrete sibling skills:

local tool registry for reusable command-template adapters
local review and implementation protocols
scoped lock standard with TTL
subagent runner with file-read access
artifact writer owned by the orchestrator

Do not make swarm depend on a specific sibling skill, repository, model alias, or tool registry. The orchestrator may tell subagents to use the local review protocol when one exists.

Portability Lens

Lock protocol: Mandatory kernel.
Quorum review: Optional capability.
Clean merger: Mandatory for serious quorum.
Post-merge review: Required for high-stakes outputs.
Command templates: Local adapter.
Test scripts: Protocol validation aid.

Operational tools should be registered at runtime through the local tool layer. Ad-hoc shell orchestration is acceptable only as an experiment or fallback. See docs/adapters.md for local adapter examples.

Swarm

Subagent orchestration: delegated review, quorum consensus, scoped locks, clean-context merge, and post-merge review.

Purpose

Run subagents safely and predictably through reusable orchestration contracts.

Core Concepts

Orchestrator: Main agent. Coordinates subagents, owns safety, and writes final artifacts.
Subagent: Bounded delegated model call for review, audit, merge, or scoped implementation.
Scope: File, directory, module, or logical domain that a subagent operates on.
Lock: Read or write ownership of a scope for bounded time.
Quorum: Multiple independent subagents reviewing the same target.
Merger: Clean-context fifth subagent that synthesizes raw quorum outputs.
Reviewer: Post-merge reviewer that checks report quality, not the original domain by default.
Async Run Adapter: Local async binding that starts, tracks, lists, tails, and cancels swarm runs through a generic lifecycle runtime.
Async Run: A local lifecycle envelope around a command-template swarm composer or utility. It owns state, logs, status, cancellation, and observability, not swarm semantics.
Lens: A deliberately narrow cognitive role assigned to one subagent, such as security, tests, architecture, economics, or operator UX.
Task Card: A bounded implementation assignment with goal, allowed files, avoided files, expected output, and validation gates.
Component Capability: An abstract adapter operation such as launcher, reviewer, verifier, merger, quorum, checkpoint, follow-up, judge, or normalizer. Swarm may target these capabilities, but local adapters bind them to concrete tools, recipes, command templates, async runs, or services.
Coordinator Checkpoint: A deliberate subagent pause where the subagent preserves its working context, sends a bounded question or status to the orchestrator, receives a coordinator reply, and continues in the same subagent context.
Evidence Checkpoint: A deliberate stop where a subagent records sources, assumptions, confidence, contradictions, or blocking evidence gaps before synthesis.
Integrator: The human or agent that merges isolated branches/worktrees into the shared target and owns conflict resolution.

Swarm Principle

One agent should hold one lens. Many independent lenses make the whole judgement stronger. Parallel lenses make the whole judgement faster.

Swarm Shapes

Distinguish breadth from confidence before launching subagents.

Lens Swarm = different lenses on one object
Quorum     = one lens, multiple independent judges

Decision rule:

Need breadth?     Use Lens Swarm.
Need confidence?  Use Quorum.
Need both?        Use Lens Swarm of Quorums.

Work Modes

Swarm is not only for review. The same lens discipline supports ideation, research-style exploration, planning, implementation coordination, and verification.

Brainstorm Swarm

Purpose: turn many possible futures into one recommended direction.

Typical lenses:

product/user value
operator UX
protocol or platform cleanliness
implementation pragmatism
adoption/growth
skeptic/failure modes
constraints and portability

Merger output:

recommended direction
alternatives considered and rejected
open questions
first implementation slice
risks and validation gates

Brainstorm swarms should not vote mechanically. They should preserve creative tension and synthesize a direction that fits constraints.

Research Swarm

Purpose: turn a broad question into evidence-backed synthesis without letting one agent's search path dominate the answer.

Typical lenses:

question/scope formulation
source discovery
source verification
synthesis and contradiction mapping
devil's advocate / counter-evidence
ethics, policy, or stakeholder impact when relevant

Research swarm rules:

Scope first: the coordinator defines the question, non-goals, source classes, and stop condition before launching evidence work.
Search and verification are separate lenses when stakes are high; the searcher finds candidates, the verifier checks quality and support.
Every material claim in synthesis must trace to a source note, inspected artifact, or explicit uncertainty statement.
Contradictory evidence is preserved as a first-class output, not hidden by consensus language.
Evidence checkpoints block synthesis when sources are insufficient, unverifiable, conflicted, or ethically unsafe to use.

Merger output:

bounded research question
evidence map with confidence and source quality
areas of convergence and contradiction
limitations and not-checkable claims
recommended decision or next evidence slice

Research swarms are for inquiry and synthesis, not automatic publication pipelines. Keep report formatting local to the caller's domain.

Development Swarm

Purpose: turn one accepted direction into coordinated slices.

Recommended flow:

Planner → Lens Swarm → Merger → Scoped Implementers → Integrator → Review Swarm

Small-Team Development Swarm

Also known as MAWP: Multi-Agent Worktree Protocol.

For 2–4 implementation agents, avoid building a heavyweight orchestrator. Use the minimal pattern:

1 project
→ 1 shared backlog
→ 2–4 isolated worktrees/branches
→ each agent owns one small task card
→ conflicts trigger structured reports
→ one integrator merges

Agents should not concurrently mutate the same shared checkout. Prefer isolated branches or worktrees:

git worktree add ../agent-a -b agent/a-task
git worktree add ../agent-b -b agent/b-task
git worktree add ../agent-c -b agent/c-task

Prefer splitting by mutation class, not by unrelated features:

2 agents: implementation; tests/docs/review
3 agents: implementation; tests; docs + review
4 agents: implementation; tests; docs/examples; integrator/refactor/audit

Stable pattern:

Agent A mutates behavior
Agent B verifies behavior
Agent C describes behavior
Agent D integrates behavior

Avoid the dangerous pattern where several agents implement unrelated features, refactor architecture, and clean shared types at the same time.

Every implementation agent receives a Task Card:

# Task

Goal:

- What needs to be done.

Allowed files:

- path/or/glob

Avoid files:

- path/or/glob

Expected output:

- patch
- short summary
- tests/checks run
- touched files list

Conflict handling is bounded:

No conflict       → agent handoff → integrator review → merge
Merge conflict   → both agents write Conflict Report → resolver merges
Semantic conflict→ stop affected workers → integrator replans
Architecture conflict → invalidate/split backlog task

Review Swarm

Purpose: turn one result into many risk lenses and a decision-grade verdict.

Lens Catalog

Choose lenses by risk. Do not run every lens by default; select the smallest set that covers the failure modes of the work.

General Software Lenses

Architecture: module boundaries, dependency direction, cohesion, coupling, extensibility, and long-term ownership.
Correctness: functional behavior, edge cases, invariants, state transitions, and input/output contracts.
Bug Hunter: likely defects, race conditions, null/empty cases, off-by-one errors, and broken assumptions.
Security: injection, privilege boundaries, path handling, secrets, deserialization, unsafe shell/process use, and abuse cases.
Tests: coverage quality, missing regressions, fixture realism, flaky risk, and validation gates.
Performance: algorithmic cost, latency, memory, IO, batching, backpressure, and hot paths.
Concurrency: locks, cancellation, timeouts, idempotency, retries, ordering, and stale state.
Data Integrity: migrations, schema compatibility, durability, rollback, deduplication, and corruption risk.
API Compatibility: public contract drift, backwards compatibility, versioning, deprecation, and client impact.
Operator UX: observability, error messages, status, logs, recovery paths, safe defaults, and support burden.
Developer UX: onboarding, local setup, naming, examples, docs, type ergonomics, and debugging clarity.
Product UX: user journey, surprising behavior, accessibility, copy, affordances, and feedback loops.
Release: changelog truth, version bump, package contents, CI gates, publish risk, and rollback readiness.
Documentation: README accuracy, examples, conceptual consistency, and stale references.
Compliance/Policy: licenses, privacy, retention, consent, audit trails, and organizational constraints.
Incident Response: blast radius, detection, containment, rollback, forensics, and runbook quality.
SRE/Reliability: availability, graceful degradation, monitoring, quotas, rate limits, and dependency failure.
Maintainability: simplicity, naming, duplication, dead code, testability, and future change cost.
Spec Consistency: whether code, docs, tests, prompts, and changelog describe the same behavior.

Complex System / Blockchain Lenses

Use these for blockchain, distributed systems, financial protocols, governance, or adversarial environments.

Protocol Invariants: conservation laws, supply rules, ledger consistency, finality assumptions, and impossible states.
Consensus Safety: fork choice, quorum thresholds, validator behavior, equivocation, reorg handling, and liveness tradeoffs.
Economic Security: incentives, MEV, griefing, sybil cost, fee dynamics, slashing, reward leakage, and manipulation paths.
Smart Contract Safety: reentrancy, authorization, upgradeability, storage layout, oracle trust, token standards, and invariant tests.
Cryptography: key management, signature/domain separation, nonce use, randomness, hash commitments, and proof assumptions.
Cross-Chain/Bridge: message replay, finality mismatch, validator set drift, withdrawal delays, and custody assumptions.
Governance: proposal lifecycle, quorum rules, timelocks, emergency powers, capture risk, and voter/operator UX.
Treasury/Accounting: balance reconciliation, fee distribution, rounding, precision, reserves, and auditability.
Adversarial Simulation: attacker goals, cheapest exploit path, denial-of-service vectors, sandwiching, and liquidation games.
Network/P2P: peer discovery, gossip propagation, eclipse risk, bandwidth limits, spam resistance, and partition behavior.
Node Operations: sync, snapshots, pruning, backups, observability, upgrades, config safety, and rollback.
Formal Methods: invariant specification, model checking candidates, property tests, and proof gaps.
Regulatory Surface: custody, KYC/AML implications, sanctions, securities risk, data retention, and jurisdictional assumptions.

Tool Contracts

These are abstract tool contracts. Register them in whatever local tool layer exists. Concrete syntax is an adapter detail outside this skill.

swarm_review: single subagent review.
swarm_quorum: multi-model independent review.
swarm_claim: acquire a scoped read/write lock.
swarm_release: release a scoped lock.
swarm_merge: synthesize raw quorum outputs.
swarm_post_merge_review: review the merged report quality.

File Handoff

Prefer path handoff over attachment syntax. Put the target path directly in the subagent prompt:

Review this local file: /path/to/file.md. First read it from disk.

Attachment syntax is optional local optimization, not a swarm contract. Path handoff keeps the skill portable across wrappers without attachment arguments.

`swarm_review`

Single subagent review.

Inputs:

scope: file path, directory, module, or logical domain.
model: model identifier in the local environment.
prompt: optional review lens or task-specific instructions.
thinking: optional reasoning depth.
tools: optional local tool allowlist.
timeout: timeout in seconds.

Behavior:

Optionally claim a read lock for the scope.
Spawn one subagent with a prompt that includes the scope path.
Require the subagent to read the target from disk before judging it.
Return the subagent's raw output.
Release the lock if one was claimed.

Reference prompt:

Review this local scope: <scope>.
First read it from disk.
Use the local review protocol if available.
Report white spots, contradictions, evidence, and risks.

Async Run Adapter

start: Launch a swarm run in the background and return run metadata.
status: Report whether the run is running, done, degraded, or failed.
tail: Show recent structured run events or raw logs.
list: Show known runs.
cancel: Stop an owned active run when the adapter can prove pid ownership.

Minimum state: an adapter should expose run_id, status, timestamps, state directory or output directory, recent events, stdout/stderr logs, and final result metadata.

Terminal statuses: done, failed, timeout, and cancelled are terminal. running and degraded are observable non-terminal states.

Cancellation boundary: cancel only an owned active run when pid ownership or runtime ownership can be verified. Stale pid reuse must fail closed.

`swarm_quorum`

Multi-model review by independent subagents.

Inputs:

scope: target path or domain.
models: 2-6 model identifiers.
prompt: shared review lens.
thinking: per-model or shared reasoning depth.
timeout: per-subagent or whole-quorum timeout.
merge_mode: merge behavior, default consensus-first.
merger_model: explicit model for the clean merger.

Behavior:

Claim a shared read lock when the target is local and stable.
Spawn one subagent per model concurrently unless a local rate limit requires throttling.
Give every subagent the same target path and review lens.
Preserve every raw output.
Release the lock.
Run a clean-context merger subagent.
Store or return the merged report.
Optionally run post-merge review.

Serious quorum rule: For high-stakes review, the merger is not the current orchestrator. The merger is a fifth clean-context subagent.

Lock Protocol

Locks prevent subagents from interfering with shared scopes. Locks are optional for read-only review, but required for concurrent mutation.

Lock Fields

scope: Scope identifier, usually a path or domain label.
owner: Subagent or quorum run that owns the lock.
type: read or write.
acquired: Acquisition time.
ttl: Time-to-live in seconds.
expires: Expiry time derived from acquisition plus TTL.

Conflict Rules

read request with no lock: allow.
read request with existing read: allow.
read request with existing write: conflict unless stale.
write request with no lock: allow.
write request with existing read or write: conflict unless stale.

Lifecycle

acquire: prune stale locks, then check conflicts.
hold: run the bounded subagent task.
release: remove the entry.
expire: any later lock operation may prune stale entries.

TTL rule: Every lock must have a TTL. A lock without expiry is invalid.

Reference implementation: scripts/swarm-lock.mjs exposes claim, release, status, and prune commands for local adapters and tests.

Validation

After changing Swarm scripts or adapter contracts, run:

node scripts/_self-test.mjs

This smoke gate checks lock behavior and script-boundary invariants: no broad coordinator and no pi -p-coupled quorum runner in portable Swarm scripts.

Merge Protocol

Merger influence: High.

Orchestrator merge exception: The orchestrator may merge quick, low-stakes, or exploratory quorum runs. The report must say the merge was not isolated.

Merge Parameters

merger_model: Explicit synthesis model. Required for quorum reports.
merger_context: Clean or inherited context. Default clean for serious quorum.
merger_tools: Tool access granted to merger. Prefer none if raw text is embedded; otherwise read-only access to raw outputs.
merger_thinking: Reasoning depth. Use medium normally, high for architecture, security, money, governance, migrations, or specs.
merge_mode: Synthesis style. Default consensus-first.
consensus_threshold: Votes needed to promote. Use 3/4 for critical promotion and 2/4 for major discussion by default.
minority_policy: Treatment of unique findings. Preserve high-impact, evidence-backed minority findings.
attribution: Which model found what. Required for quorum reports.
raw_retention: Keep raw outputs until the final report is accepted.

Merge Modes

faithful: Minimal editing; preserves subagent wording and uncertainty.
consensus-first: Groups by agreement and preserves dissent separately.
risk-first: Promotes high-impact minority findings even without consensus.
design-synthesis: Converts findings into decisions and repair order.

Merge Guardrails

Preserve raw outputs until the final report is accepted.
Do not hide model disagreement.
Do not drop high-impact minority findings only for lack of consensus.
Do not over-promote repeated low-value nitpicks.
Keep model attribution for every major finding.
Keep merger identity explicit: model, context, thinking, and tools.
Separate consensus finding, minority finding, and merger finding.
Separate current-quorum votes from prior-run corroboration.

Post-Merge Review

Run a reviewer on the merged report when the output will drive code, architecture, security, money, governance, migrations, or specifications.

Reviewer input:

target scope
raw subagent outputs
merged report
merge parameters
intended use of the report

Reviewer output: Meta-findings about report quality, not a second domain review unless explicitly requested.

Post-Merge Review Lens

Evidence: Does each major finding trace to raw outputs or clearly marked merger evidence?
Severity: Are labels justified by failure impact?
Consensus: Was agreement counted honestly?
Purity: Is current quorum separate from prior context?
Minority: Were unique high-impact findings preserved?
Bias: Did the merger impose an unsupported narrative?
Actionability: Are repair order, scope, and gates named?
Consistency: Do maps, headers, and attribution agree?
Compression: Were caveats or disagreements lost?

Post-Merge Decisions

Accept: Report is decision-grade.
Accept with notes: Report is useful, with caveats.
Revise merge: Rerun merger with adjusted parameters.
Rerun quorum: Raw outputs are too weak, divergent, or under-scoped.
Escalate: Run cross-merge with another clean merger.

Error Handling

LOCK_CONFLICT: Retry, back off, or choose another scope.
SUBAGENT_TIMEOUT: Keep partial results and release locks by TTL.
COORDINATOR_INPUT_REQUIRED: Pause only if the adapter can preserve subagent context; otherwise write a checkpoint artifact and stop degraded for coordinator replanning.
MODEL_FAILURE: Continue with fewer votes and mark degraded quorum.
MERGE_FAILURE: Return INSUFFICIENT_DATA or rerun merger.
REVIEW_FAILURE: Keep merged report and mark post-review missing.

Composition Contract

This skill composes with capabilities, not concrete sibling skills:

local tool registry for reusable command-template adapters
local review and implementation protocols
scoped lock standard with TTL
subagent runner with file-read access
artifact writer owned by the orchestrator

Do not make swarm depend on a specific sibling skill, repository, model alias, or tool registry. The orchestrator may tell subagents to use the local review protocol when one exists.

Portability Lens

Lock protocol: Mandatory kernel.
Quorum review: Optional capability.
Clean merger: Mandatory for serious quorum.
Post-merge review: Required for high-stakes outputs.
Command templates: Local adapter.
Test scripts: Protocol validation aid.

Adoption

llblab/swarm

$ install --global

Security Scan Results

SKILL.md

Swarm

Purpose

Core Concepts

Swarm Principle

Swarm Shapes

Work Modes

Brainstorm Swarm

Research Swarm

Development Swarm

Small-Team Development Swarm

Review Swarm

Lens Catalog

General Software Lenses

Complex System / Blockchain Lenses

Tool Contracts

File Handoff

swarm_review

Async Run Adapter

swarm_quorum

Lock Protocol

Lock Fields

Conflict Rules

Lifecycle

Validation

Merge Protocol

Merge Parameters

Merge Modes

Merge Guardrails

Post-Merge Review

Post-Merge Review Lens

Post-Merge Decisions

Error Handling

Composition Contract

Portability Lens

Related Skills

llblab/extra-self

llblab/dev2main-release

llblab/bits-ui

llblab/re-review

llblab/swarm

$ install --global

Security Scan Results

SKILL.md

Swarm

Purpose

Core Concepts

Swarm Principle

Swarm Shapes

Work Modes

Brainstorm Swarm

Research Swarm

Development Swarm

Small-Team Development Swarm

Review Swarm

Lens Catalog

General Software Lenses

Complex System / Blockchain Lenses

Tool Contracts

File Handoff

swarm_review

Async Run Adapter

swarm_quorum

Lock Protocol

Lock Fields

Conflict Rules

Lifecycle

Validation

Merge Protocol

Merge Parameters

Merge Modes

Merge Guardrails

Post-Merge Review

Post-Merge Review Lens

Post-Merge Decisions

Error Handling

`swarm_review`

`swarm_quorum`

`swarm_review`

`swarm_quorum`