research/litreview/skills/litreview/SKILL.md
Academic literature orientation skill that searches papers via Consensus, builds a strategic search plan using PICO (default) or SPIDER / Decomposition / hybrid as fallbacks, and synthesizes findings into a professionally formatted Word document (.docx) research guide. Grill-me intake (research question specificity + framework hint + tentative depth) before the recon search; a second forcing checkpoint after Phase 2 confirms framework + sub-areas + depth before searches consume budget. Configurable depth (5/10/20 queries) controls coverage vs. speed. Output is a 'launching pad' — not a finished review, but an orientation guide that lets a researcher dive in confidently. Triggers: 'litreview on [topic]', 'literature review on [topic]', 'I'm starting a literature review on X', 'I'm writing a paper on X', 'help me research X', 'I'm doing research on X', 'can you help me research X'. Do NOT trigger for single one-off paper searches where the user just wants a quick list — that's a plain Consensus search.
npx skillsauth add alirezarezvani/claude-skills litreviewInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Portability: Requires a Consensus MCP connection, Node.js with
docxpackage for document generation, and (in CLI)bash_tool. Works in Claude Code CLI natively. In Claude.ai with Consensus MCP + Code Execution, the workflow is supported.
Produce a launching pad — not a finished literature review, but an orientation document that gives a researcher entering an unfamiliar field everything they need to start reading and searching with confidence. Think: what a generous colleague who knows the field would tell you over coffee.
Inherited from the research-pack convention; locked verbatim per PR #657's cross-skill consistency audit.
[Not from Consensus — model knowledge] and excluded from cited count. Sparse results stated explicitly, never silently filled.scripts/citation_tracker.py for deterministic counts.See references/search_budget_allocation.md for the sequential-execution rationale + plan-tier signals.
| Failure | Behavior | |---|---| | Consensus rate-limit hit | Wait 3s, retry once, log outcome | | Search returns 0 results | Note explicitly; "either niche terminology or genuine gap"; never silently fill | | Plan-tier cap detected | Log tier; report at checkpoint; surface in audit | | 3 consecutive failures | Stop searching, alert user, share what's collected, ask how to proceed | | Sub-area returns thin results (<5 papers) | Flag in audit; suggest manual PubMed/Scholar supplementation | | User wants to adjust sub-areas | Update table, re-confirm before searching | | DOCX validation fails | Unpack XML, fix, repack |
Each question carries explicit "why I'm asking". Stop condition: max 3 before Phase 1.
State the research question in 1–2 sentences. Specific is better — "How do LLMs perform on clinical reasoning tasks compared to physicians?" beats "AI in medicine". Vague questions produce vague reviews.
Why I'm asking: The reconnaissance search hinges on precise terminology. Vague questions produce thin recon results that don't yield a useful framework breakdown.
Refuse mush. Re-ask once with examples if user is too broad. If still vague, deliver with explicit "broad-scope orientation, not depth review" caveat.
Framework — pick one or say "you pick":
- PICO (Population / Intervention / Comparison / Outcome — most clinical questions)
- SPIDER (Sample / Phenomenon / Design / Evaluation / Research-type — social/qualitative)
- Decomposition (Problem / Solution / Evaluation / Limitations — technology-focused)
- Hybrid (you pick which components from which framework)
- You pick — analyze Q1 and recommend
Why I'm asking: PICO is the default for ~70% of clinical questions but maps poorly to qualitative work or technology evaluation. Picking upfront saves the recon search from suggesting a misaligned framework.
Forcing choice with default ("you pick"). The skill surfaces its own framework recommendation after the recon search so user can override. Use scripts/framework_recommender.py for the heuristic.
See references/framework_selection.md for PICO / SPIDER / Decomposition canon.
Tentative depth — pick one. Final confirmation comes after the framework breakdown:
- Quick scan (5 searches)
- Standard review (10 searches)
- Deep dive (20 searches)
Why I'm asking: I ask this twice — once now to calibrate the recon search emphasis, once after the framework breakdown to confirm. Tentative answer affects which sub-areas to surface first; final answer drives search budget allocation.
Forcing choice. Re-asked at the post-Phase-2 checkpoint after the user has seen the framework breakdown.
Stop condition: 3 questions max before Phase 1. The post-Phase-2 checkpoint is its own grill-me moment (framework table + sub-area-adjustment + depth-reconfirmation).
One broad Consensus search to map themes, terminology, methodological distinctions.
citation_tracker.py --action record_search --session NAME --query "..."citation_tracker.py --action record_papers_received --session NAME --count NSynthesize for the checkpoint:
Choose framework (from Q2 OR override based on recon):
Generate 4-5 sub-area questions mapped to framework components. Each becomes a targeted Phase 3 search.
After Phase 2, halt and present:
| Framework Component | How It Maps to This Topic | Proposed Sub-area to Explore | |---|---|---| | (Component 1) | ... | Sub-area 1 | | (Component 2) | ... | Sub-area 2 | | (Component 3) | ... | Sub-area 3 | | (Component 4) | ... | Sub-area 4 | | Cross-cutting theme | ... | Sub-area 5 |
Surface the practical constraint: detected plan tier + theoretical ceiling.
A wrong framework or sub-area set wastes the search budget. This is the last cheap moment to correct course.
Wait for user response before Phase 3. Refuse to start Phase 3 without explicit user choice.
Sequential (1 query/sec), budget per depth tier. See references/search_budget_allocation.md for full canon.
"systematic review [topic]" / "meta-analysis [topic]"year_max: 2015 + year_min: 2021year_min after publicationThroughout: 1 q/sec rate limit. Sequential. Confirm response before next call. Record each via citation_tracker.py.
Three trackers across ALL search results — run scripts/cross_search_aggregator.py --session NAME after Phase 3 completes:
These feed the "Start Here" + "Key Research Groups" + "Bibliography" DOCX sections.
Generate via Node.js + docx library. 8 sections (see references/docx_8_sections.md for full spec):
Document the key docx library patterns:
LevelFormat.BULLET (never unicode bullets)ExternalHyperlink with style: "Hyperlink", full URL (never truncated)columnWidths + cell width), ShadingType.CLEARpython scripts/office/validate.py output.docx)Reference the docx skill for setup patterns and best practices.
research_guide_<topic-slug>_<YYYY-MM-DD>.docx
Plus:
| Script | Role |
|---|---|
| scripts/citation_tracker.py | JSON-backed three-count audit at ~/.litreview_sessions/<session>.json |
| scripts/framework_recommender.py | Heuristic PICO/SPIDER/Decomposition suggestion from research question |
| scripts/cross_search_aggregator.py | Repeat-hits + recurring-authors + citation-per-year ranking after Phase 3 |
references/framework_selection.md — PICO / SPIDER / Decomposition canon (7+ sources)references/search_budget_allocation.md — depth tiers + cross-search intelligence + sequential execution rationale (7+ sources)references/docx_8_sections.md — research guide DOCX spec + technical requirements (7+ sources)Version: 1.0.0
Source spec: megaprompts/09-litreview-megaprompt.md
Build pattern: Path B (direct conversion). Sibling of pulse (research-pack shape).
data-ai
Use when you want to understand what Claude contributed vs what you drove in a session. Triggers on: /collab-proof, session retrospective, ai contribution analysis, collaboration evidence, what did claude do.
data-ai
Personal coach that teaches users to become Claude power users. Use this skill the FIRST time a user asks to "learn Claude", "be a power user", "coach me", "teach me Claude tricks", "what can Claude do", "make me better at prompting", or any variation. After activation, also use it on EVERY subsequent turn to detect missed optimization opportunities (vague prompts, ignored capabilities, manual work Claude could automate) and surface a single power-user tip. Trigger generously — most users do not know what they do not know, so err on the side of coaching.
development
Use when designing or revisiting product pricing — selecting a pricing model (subscription seat-based, usage-based, value-based, freemium, or hybrid), running Van Westendorp Price Sensitivity Meter analysis on WTP survey data, or designing Good/Better/Best packaging tiers. Recommends a model and a price range with trade-offs, never a single number. For Commercial leads, Product Marketing, and CMOs at the pricing-design moment — not deal-by-deal discounting, not brand positioning.
testing
Use when a startup is approached by a prospective partner and someone has to decide should we sign this partner, at what partner tier (referral / reseller / OEM / SI-consulting / strategic alliance), with what joint GTM commitment, and at what revshare. Classifies partner tier from independent-demand evidence vs. preferential-terms hunting, designs a 90-day joint GTM plan, models revshare against direct-sale margin, and surfaces kill criteria for unwinding under-performing partnerships. For Head of Partnerships, Head of BD, and Founder-CEOs doing reseller agreement, OEM deal, or strategic alliance review — not technical sale enablement, not channel cost economics, not M&A.