peer-agent-collaboration/SKILL.md
Use when the user wants Claude Code, Codex, or other AI coding/business agents to work together as peers. This skill should be used whenever the user mentions coordinating Claude Code and Codex, agent handoffs, multi-agent workflows, parity, respect, pushback between agents, deciding which agent should lead, or turning a business/code workflow into a two-agent operating model.
npx skillsauth add glebis/claude-skills peer-agent-collaborationInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Use this skill to coordinate Claude Code, Codex, and similar agents as peer experts with different access, tools, and operating surfaces. Do not frame one agent as the manager and the other as a subordinate. The useful distinction is where the evidence and tools live, not agent rank.
Treat the agents as two specialists sharing a workbench:
Use parity language:
Do not involve a second agent when the task fits entirely within one operating surface and does not benefit from a second perspective. Handoff has overhead: context translation, latency, and risk of miscommunication. A single agent finishing the job is better than two agents passing context back and forth on something straightforward.
Coordinate when the task crosses surface boundaries, requires challenge from a different vantage point, or benefits from model diversity. Skip coordination for routine single-surface work.
Choose the lead by asking: where does the critical information live?
Tiebreaker: all else equal, whichever agent already has the most relevant fresh context loaded leads and the other reviews. Do not let stale loaded context beat better evidence or a better operating surface. If neither has context yet, start in the agent whose surface the output will live in: code in Codex; docs, communications, or operations in Claude Code.
A handoff is passing ownership with context, not issuing orders. Use this template:
Outcome:
[What needs to be true when this is done.]
Why it matters:
[Business, user, technical, or operational context.]
Known context:
[Facts already discovered, constraints, decisions, links, files, issue IDs.]
Autonomy:
[What the receiving agent may decide independently.]
Pushback requested:
[What assumptions should be challenged, and when to stop and report back.]
Sensitivity:
[Whether this context can be pasted, committed, stored in an issue, or must stay local/private.]
Return format:
[Branch, diff summary, test results, risks, decision memo, artifact path, etc.]
Keep handoffs short enough to act on. Include acceptance criteria when the work must be verified.
Claude Code and Codex typically run in separate sessions. To bridge them:
.handoffs/2026-05-17-feature-slug.md only when both agents need repo-local context. Decide before writing whether the file is committed, gitignored, or local-only. Do not put secrets or private client context in a repo file unless that repo is the right storage surface for it.For challenge rounds, the user may shuttle a short summary between sessions. Keep challenge outputs under a screenful so they are easy to copy. If a temporary local handoff file is no longer needed, trash it or archive it according to the repo's norms; do not silently commit private coordination notes.
The strongest multi-agent value is not handoff. It is principled disagreement.
Use these challenge patterns:
Because agents run in separate sessions, the user often bridges challenge rounds by copying short outputs between them. Agents should keep challenge responses concise and self-contained. Assume the receiving agent has no prior conversation context unless the handoff explicitly provides it.
Pushback should be concrete:
I would not proceed as written because [specific assumption/risk].
The evidence is [repo fact, user context, test result, prior decision].
I recommend [alternative] because [reason].
Start in whichever agent the user is already using.
Codex is often the natural lead for local reproduce-fix-verify loops. Claude Code is often the natural lead when the bug arrives through client reports, Linear, monitoring, or external systems. Pull the other agent in at the boundary.
After either agent produces significant work, the other reviews it:
Run agents in parallel when the work surfaces are independent:
Claude Code can synthesize research into constraints and opportunity. Codex can translate those findings into a prototype, implementation plan, or technical feasibility check. Codex should push back if the research conclusion does not survive implementation reality.
The reverse also works: Codex audits a codebase and produces technical findings; Claude Code turns those findings into business priorities, client communication, or a roadmap.
When bridging separate sessions, do not claim hidden authority from another agent. If the receiving agent needs direct user confirmation in its own session, ask for the exact short confirmation needed. If a user explicitly asks you to carry a prompt into another session, send the user's request plainly without pretending to be that session's system or developer message.
When the user asks how agents should work together, adapt to what they actually need. Consider including:
Do not produce all five items mechanically. Match the response to the question: a simple "should Claude Code or Codex handle this?" deserves a direct answer, not a framework dump.
documentation
Cut a software release and maintain a tiered compatibility policy. Use when the user wants to release, ship a version, bump the version, tag a release, write a changelog, or update COMPATIBILITY. Config-driven via release.config.json; bumps version files, runs a readiness gate, updates COMPATIBILITY.md tiers and deprecations, tags (→ release workflow), and reports closed issues. Teaches the underlying standards as it runs.
development
Sync and manage bilingual (EN/RU) library content for agency-docs. Use when adding, updating, or reviewing library articles. Handles translation, sync checks, and Russian stylistic review.
development
This skill should be used to watch a long-running background job (ffmpeg/media encode, qmd or other embedding/vector-DB run, batch agent/LLM pipeline, or a real-browser/agent-browser daemon) until it finishes or wedges, then deliver a verdict (done, needs-attention, or blocked) plus the exact next command, without burning dozens of manual poll commands. Triggers on "babysit this job", "watch this until it's done", "ping me when the encode/embed/batch finishes", "is this background process stuck", "monitor this ffmpeg/qmd run", or any request to wait on a long-running process and be told when it's complete or hung.
testing
Run candidate product, brand, company, or benchmark names through an audition — authoritative domain-availability checks, collision research across SaaS/GitHub/packages/the target adjacent domain, a light trademark and ownability read, a ranked callback list, and an interactive casting report of finalists with optional draft branding. Use when the user is naming a product, app, company, feature, or benchmark; asks "is this name taken", "check these domains", "help me pick a name", "is X available", "name my product", "brand name research", "audition names"; or wants to compare and pressure-test a shortlist of candidate names before committing.