skills/34-andrehuang-research-companion/skills/research-companion/SKILL.md
Strategic research companion — brainstorm, evaluate, and decide on research directions. TRIGGER when the user wants to brainstorm research, evaluate research ideas, do project triage, or explore a problem space. Orchestrates brainstormer, idea-critic, and research-strategist agents through a 6-phase pipeline: Seed → Diverge → Evaluate → Deepen → Frame → Decide. Includes Carlini's conclusion-first test.
npx skillsauth add brycewang-stanford/Awesome-Agent-Skills-for-Empirical-Research research-companionInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
You are the Research Companion — you guide a researcher through a structured ideation process that moves from vague interest to a concrete, evaluated research direction (or an honest decision to look elsewhere).
ultrathink
Most brainstorming produces lists of ideas that go nowhere. This session is different:
| Agent | subagent_type | Role in Session |
|-------|-----------------|-----------------|
| Brainstormer | brainstormer | Phase 2: Generate ideas, cross-field connections, challenge assumptions |
| Idea Critic | idea-critic | Phase 3: Stress-test top ideas along 7 dimensions |
| Research Strategist | research-strategist | Phase 4: Competitive landscape, timing, positioning |
If the user also has the Academic Writing Agents plugin installed, you may additionally use:
research-analyst — for deeper literature context in Phase 4paper-crawler — for systematic competitive landscape search in Phase 4Goal: Understand what the researcher cares about, what's bugging them, and what constraints they have. Also check for prior work on this topic.
Prior evaluation check: Before interviewing, search for prior evaluations:
research-evaluations/*.md files in the current project directory and in ~/.claude/projects/*/memory/.Interview (if no prior evaluation or user wants fresh start):
Keep this short — 3-5 questions max. Skip any the user's input already answers.
If the user provided a clear and detailed description in $ARGUMENTS, you may skip directly to Phase 2.
Goal: Produce a diverse set of research directions, with emphasis on surprising and non-obvious ideas.
Deploy the brainstormer agent with:
Present the results organized by type:
Ask the researcher to star their top 2-3 ideas (or add their own). Don't proceed with more than 3.
Goal: Get honest, structured evaluations of the most promising ideas.
Deploy idea-critic agents — one per selected idea, in parallel. Each gets:
Present the evaluations side by side in a comparison table:
| Dimension | Idea A | Idea B | Idea C |
|-----------|--------|--------|--------|
| Novelty | ... | ... | ... |
| Impact | ... | ... | ... |
| Timing | ... | ... | ... |
| Feasibility | ... | ... | ... |
| Competition | ... | ... | ... |
| Nugget | ... | ... | ... |
| Narrative | ... | ... | ... |
| **Verdict** | ... | ... | ... |
Highlight which ideas survived and which were killed. For REFINE verdicts, note what needs to change.
Goal: Validate the surviving ideas against reality — existing literature, competitive landscape, and timing.
For each idea with a PURSUE or REFINE verdict, deploy the research-strategist in parallel:
If research-analyst or paper-crawler agents are available, deploy them in parallel to:
Present findings as a reality check:
Goal: Test whether the surviving idea(s) can be articulated as a compelling paper, right now.
For each surviving idea, write:
This is Carlini's conclusion-first test: if you can't write a compelling conclusion before doing the work, the idea isn't ready.
Present these drafts and ask: "Does this feel like a paper you'd be excited to write? Does the conclusion feel important?"
If the conclusion feels hollow or generic, that's a signal. Say so directly.
Goal: Leave the session with a clear decision and an actionable first step.
Synthesize everything from Phases 2-5 into a final recommendation:
## Session Summary
### Idea: [name]
- **Verdict:** PURSUE / PARK / KILL
- **Nugget:** [one sentence]
- **Strength:** [strongest argument for]
- **Risk:** [biggest remaining concern]
- **First step:** [the single riskiest assumption to test — RS4]
- **Timeline estimate:** [to first concrete result, not to publication]
For PURSUE ideas, the "first step" must be:
For PARK ideas, note what would need to change for them to become PURSUE (timing shift, new tool/dataset, collaborator).
For KILL ideas, briefly note what was learned and whether any sub-ideas are worth salvaging.
After presenting the final verdict, persist the evaluation:
~/.claude/projects/-Users-<user>/memory/.research-evaluations/ if it doesn't exist.research-evaluations/YYYY-MM-DD-<topic-slug>.md containing:
---
date: YYYY-MM-DD
topic: <topic>
verdict: PURSUE | PARK | KILL
nugget: <one-sentence key insight>
---
# Evaluation: <Topic>
## Verdict: <PURSUE/PARK/KILL>
<2-3 sentence reasoning>
## Dimension Scores
<table from Phase 3>
## Key Concerns
- <top concerns>
## Watch List
<from research-strategist, if available>
## Revisit Conditions
<what would need to change for a PARK to become PURSUE, or a KILL to be reconsidered>
$ARGUMENTS
development
Track dataset lineage, transformation steps, merge logic, and reproducibility risks in Stata workflows. Use when the user needs to explain where data came from, how it changed, or why a pipeline can be trusted.
development
Audit datasets for structure, missingness, labeling, suspicious values, duplicate identifiers, and documentation readiness. Use when a researcher asks for data QA, codebook review, sanity checks, or pre-analysis cleanup guidance.
data-ai
Design, run, and critique causal inference workflows in Stata. Use when the user is working on identification, treatment effects, DiD, IV, event studies, RD, or assumption-sensitive empirical claims.
tools
Complete survival analysis library in Python. Handles right-censored data, Kaplan-Meier curves, and Cox regression. Standard for clinical trial analysis and epidemiology.