.claude/skills/deeper-research/SKILL.md
Use when /deep-research isn't thorough enough, when a topic needs adversarial challenge and deep-dive elaboration, or when producing a polished research report for a complex design decision. 6-phase funnel with 8-10 parallel survey agents plus adversarial review.
npx skillsauth add ahrav/gossip-rs deeper-researchInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
A six-phase evidence-gathering funnel for problems where the cost of a wrong
design exceeds the cost of thorough research. Doubles the agent count of
/deep-research (21-23 agents vs 7), approaches the problem from 8-10
independent lenses, and adds adversarial review to challenge conclusions before
they reach the implementation plan.
The funnel narrows progressively: a wide initial survey generates raw findings, synthesis distills them, parallel deep-dives and adversarial challenges stress- test the synthesis from both directions, a final synthesis reconciles all streams, and an integrator grounds everything in the codebase with full traceability from finding IDs to implementation steps.
/deep-research isn't enough: you need adversarial challenge of
findings, deeper dives into gaps, and full traceability/deep-research for standard research tasks (7 agents, 3 phases)/design-tournament when the problem is understood and you need to
explore implementation approaches/deeper-research <problem statement>
/deeper-research --lenses=10 <problem> # Full 10 lenses (default: 8)
/deeper-research --lenses=5 <problem> # Minimal (5 core lenses)
/deeper-research --no-adversarial <problem> # Skip Phase 4
/deeper-research --skip-deep-dive <problem> # Skip Phase 3
If no argument is given, ask the user for the problem statement before proceeding.
Orchestrator: Problem Decomposition (inline, produces Research Brief)
|
Phase 1: Wide Survey (8-10 parallel agents)
|
Phase 2: First Synthesis (1 agent)
|
+--- Phase 3: Deep-Dives (3-5 parallel) ---+
| | <- run in parallel
+--- Phase 4: Adversarial (4 parallel) ----+
| |
+--------------------------------------------+
|
Phase 5: Final Synthesis (1 agent)
|
Phase 6: Integration (1 agent)
Key structural decision: Phase 3 and Phase 4 run in parallel in a single message. This saves a serial step and creates an information firewall — adversarial agents challenge Phase 2's conclusions independently of Phase 3's elaborations, preventing anchoring bias.
Every finding gets a traceable ID used through all subsequent phases:
P1.{agent#}.F{n} — Phase 1, Agent 4, Finding 3 -> P1.4.F3
S1.F{n} — First Synthesis findings
P3.{agent#}.F{n} — Deep-dive findings
P4.{agent#}.F{n} — Adversarial findings
S2.F{n} — Final Synthesis findings
Every step in Phase 6's implementation plan cites these IDs.
| Phase | Agents | Max Per Agent | Total to Next Phase | |-------|--------|---------------|---------------------| | 1 | 8-10 | ~3000 tokens | ~24-30k -> Phase 2 | | 2 | 1 | ~6000 tokens | 6k -> Phases 3, 4, 5 | | 3 | 3-5 | ~4000 tokens | ~12-20k -> Phase 5 | | 4 | 4 | ~2500 tokens | ~10k -> Phase 5 | | 5 | 1 | ~8000 tokens | 8k -> Phase 6 | | 6 | 1 | unconstrained | final output |
Used by ALL agents across all phases.
| Level | Label | Description | Example | |-------|-------|-------------|---------| | 5 | Proven at scale | Battle-tested in production systems handling similar workloads | FoundationDB's simulation testing, TigerBeetle's storage engine | | 4 | Peer-reviewed | Published in reputable venue with formal analysis | OSDI/SOSP paper with proofs | | 3 | Implemented & tested | Open-source implementation with benchmarks/tests | Well-maintained crate with >1k stars, comprehensive test suite | | 2 | Documented practice | Technical blog from credible engineering org | Blog post from Cloudflare, Datadog, AWS engineering | | 1 | Anecdotal | Forum discussion, personal blog, Stack Overflow answer | Useful for leads but needs corroboration |
Agents should weight evidence by domain credibility alongside evidence strength. Higher-credibility domains require less corroboration; lower ones require more.
| Tier | Score | Domains | Treatment | |------|-------|---------|-----------| | High (80-100) | Auto-trust for factual claims | arxiv.org, usenix.org, dl.acm.org, ieee.org, nature.com, official project docs, RFC specs | Core evidence; cite directly | | Moderate (60-79) | Trust with corroboration | Engineering blogs (Cloudflare, AWS, Datadog, Discord), conference talks, well-known tech media (arstechnica) | Good lead; corroborate with another source | | Low (40-59) | Leads only | Medium posts, personal blogs, Stack Overflow, forum discussions, unknown domains | Use only when nothing stronger exists; flag as WEAK | | Suspect (<40) | Verify before citing | Content farms, SEO-optimized listicles, anonymous posts, domains with sensational patterns | Do NOT cite unless verified against primary source |
When scoring evidence, the final weight is:
weight = evidence_strength × credibility_tier_multiplier × corroboration_count
Where credibility multiplier: High=1.0, Moderate=0.8, Low=0.5, Suspect=0.1.
These rules apply to ALL agents across ALL phases. Violation makes findings worthless.
Before launching Phase 1, the orchestrator (you) produces a Structured Research Brief. This is NOT a separate agent — do this inline.
Establish current date: Run date +%Y-%m-%d via Bash to get today's
date. Use this year for all date-filtered searches and recency checks. Do
NOT assume a year from training data.
Parse the problem statement and identify:
Quick codebase scan: Use Glob, Grep, and Read to gather:
Select lenses: Based on the problem, select which of the 10 research
lenses are active (default: 8 core lenses; with --lenses=10 add the two
optional lenses; with --lenses=5 use lenses 1-5 only).
Produce the Research Brief in this format:
## Research Brief
### Problem Statement
{user's problem, restated for clarity}
### Current Date
{YYYY-MM-DD from Step 0 — agents use this for recency filtering}
### Sub-Problems
1. {sub-problem 1}
2. {sub-problem 2}
...
### Key Search Terms
- {term 1}: {why it matters}
- {term 2}: {why it matters}
...
### Codebase Context
- {file path}: {what it contains and why it's relevant}
...
### Active Lenses
{numbered list of selected lenses with brief rationale for optional ones}
### Constraints
- {constraint from problem or codebase}
...
Include this Research Brief in every Phase 1 agent's prompt.
All phase agent prompts (Phases 1-6, output format, and collecting instructions) are in references/phase-prompts.md.
The prompts follow this progression:
| Phase | Agents | Role | Input | |-------|--------|------|-------| | 1 | 8-10 parallel | Wide survey from independent lenses | Research Brief | | 2 | 1 | Cross-reference + gap identification | All Phase 1 reports | | 3 | 3-5 parallel | Deep-dive into Phase 2 gaps | Phase 2 targets | | 4 | 4 parallel | Adversarial challenge of Phase 2 | Phase 2 synthesis | | 5 | 1 | Reconcile all streams | Phases 2+3+4 | | 6 | 1 | Map to implementation plan | Phase 5 synthesis |
After Phase 2 synthesizes Phase 1 findings, before launching Phases 3 and 4, the orchestrator (you) evaluates whether the research scope needs adjustment.
Refinement is triggered when ANY of these signals appear in Phase 2's output:
Compare the Research Brief's Sub-Problems against Phase 2's findings:
Update the Research Brief (inline, not a new agent):
Adjust Phase 3 deep-dive targets: Phase 2 already identifies these, but the orchestrator can reorder or replace targets based on the updated brief.
Optional targeted gap-fill: If refinement reveals a critical gap that Phase 1 missed entirely (not just underexplored — missed), launch 1-2 targeted WebSearch calls before Phase 3/4. Time-box to 2 minutes.
If Phase 4 adversarial agents identify a critical knowledge gap — not a writing or framing issue, but a factual gap where the design would be built on unverified assumptions — the orchestrator triggers a targeted retrieval loop before proceeding to Phase 5.
Loop-back fires when ANY adversarial agent reports:
These standards apply to the final output from each phase. The orchestrator (you) verifies them before proceeding to the next phase.
For long research runs, intermediate results should be persisted to disk to survive context compaction.
After each phase completes, write a summary file:
.claude/research-state/{run-id}/
brief.md — Research Brief (after Orchestrator)
phase1-summary.md — Concatenated Phase 1 agent reports
phase2-synthesis.md — Phase 2 output
phase3-dives.md — Concatenated Phase 3 deep-dive reports
phase4-adversarial.md — Concatenated Phase 4 reports
phase5-final.md — Phase 5 final synthesis
phase6-plan.md — Phase 6 implementation plan
Use a stable identifier: {date}-{first-5-words-of-problem-slugified}.
Example: 2026-04-08-gossip-protocol-partition-tolerance.
/deeper-research <problem> # Default: 8 lenses
/deeper-research --lenses=10 <problem> # Full 10 lenses
/deeper-research --lenses=5 <problem> # Minimal (5 core lenses)
/deeper-research --no-adversarial <problem> # Skip Phase 4
/deeper-research --skip-deep-dive <problem> # Skip Phase 3
| Config | Phase 1 | Phase 2 | Phase 3 | Phase 4 | Phase 5 | Phase 6 | Total | |--------|---------|---------|---------|---------|---------|---------|-------| | Default (8 lenses) | 8 | 1 | 3-5 | 4 | 1 | 1 | 18-20 | | Full (10 lenses) | 10 | 1 | 3-5 | 4 | 1 | 1 | 20-22 | | Minimal (5 lenses) | 5 | 1 | 3-5 | 4 | 1 | 1 | 15-17 | | No adversarial | 8 | 1 | 3-5 | 0 | 1 | 1 | 14-16 | | Skip deep-dive | 8 | 1 | 0 | 4 | 1 | 1 | 15 | | Both skipped | 8 | 1 | 0 | 0 | 1 | 1 | 11 |
--no-adversarial: Phase 4 is skipped entirely. Phase 5 synthesizes
Phase 2 + Phase 3 only (no adversarial reconciliation). Phase 6 has no
adversarial concerns to address.--skip-deep-dive: Phase 3 is skipped entirely. Phase 4 still runs
(challenging Phase 2). Phase 5 synthesizes Phase 2 + Phase 4 only./deep-research instead.--lenses=10 for cross-cutting concerns: When the problem spans
multiple domains (e.g., a data structure that needs both formal correctness
AND API ergonomics), the optional lenses provide crucial coverage.--lenses=5 when you need more depth, not breadth: If the problem
is narrow but deep, 5 lenses with deep-dives gives better results than 10
surface-level surveys.--no-adversarial) for
exploratory research where you don't need verified conclusions./design-tournament: Use deeper-research to
establish the evidence base, then design-tournament to explore implementation
approaches grounded in that evidence./deeper-research --lenses=10 with
all phases, then feed the output into /design-tournament for implementation
exploration. This gives maximum coverage at ~28 total agents.development
Deep first-principles code explanation that builds real understanding through phased walkthroughs with diagrams. Covers algorithms, data structures, memory layout, concurrency patterns, and performance tricks — especially for systems code in Rust. Use whenever the user asks to explain, walk through, break down, deep dive into, or understand code. Trigger on "how does this work", "what's happening here", "teach me about this", "why is it done this way", or when the user references a file with @ and wants to understand it. Proactively use when examining code involving lock-free algorithms, atomics/CAS, memory ordering,
development
Use when creating implementation-ready beads tasks that need testing strategy, optimal implementation approach, and documentation requirements baked in — composes /create-task with parallel enrichment agents that analyze the codebase and produce concrete test specifications, algorithm/data-structure guidance, and doc quality standards so implementing agents don't need to re-research
development
--- name: autoresearch description: Autonomous Goal-directed Iteration. Apply Karpathy's autoresearch principles to ANY task. Loops autonomously — modify, verify, keep/discard, repeat. Supports bounded iteration via Iterations: N inline config. version: 1.9.11 --- # Claude Autoresearch — Autonomous Goal-directed Iteration Inspired by [Karpathy's autoresearch](https://github.com/karpathy/autoresearch). Applies constraint-driven autonomous iteration to ANY work — not just ML research. **Core id
development
Use when implementing a new feature and assessing coverage gaps, during periodic test hygiene, when test suites feel bloated, or before merging code that changes coordination or hot paths. Two-phase assess-then-improve testing pipeline.