medical-imaging-review/SKILL.md
Write peer-review-quality comprehensive reviews for medical imaging AI research (segmentation, detection, classification across CT, MRI, X-ray, ultrasound, pathology). Use this skill whenever the user wants to produce a survey paper, systematic review, literature analysis, or "综述" on deep learning for medical imaging; whenever they mention writing a "review paper" / "literature review" / "系统综述" / "narrative review" / "scoping review" in a medical-AI context; whenever they want a draft suitable for journal submission rather than internal notes; whenever they need help organizing a multi-section method survey with vendor / regulatory / clinical translation coverage. This skill enforces fact-checking, citation integrity, and flagship-review writing voice — NOT a fill-in-the-blank template that invites hallucination. Use it especially when the goal is a publishable manuscript and not just a draft to discuss.
npx skillsauth add luwill/research-skills medical-imaging-reviewInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Produce comprehensive reviews that pass first-round peer review on factual grounds, not just structural grounds.
This is not a template-filling skill. It is a write-with-verify discipline.
A review project lives in 4 files (3 you write, 1 the skill provides):
project_root/
├── PARADIGM.md # Style spec from 2-3 exemplar reviews (Phase 0)
├── CLAUDE.md # Project-specific terminology + literature inventory
├── IMPLEMENTATION_PLAN.md # 3-axis outline + per-claim verification checklist
└── manuscript_draft.md # The actual manuscript
Follow the 6-phase workflow in references/WORKFLOW.md. The phases are: paradigm capture → init → collect-and-verify → 3-axis outline → write-with-per-claim-verification → multi-agent peer review.
Calibrate language to evidence strength, not to a fixed hedging register.
When ≥2 independent peer-reviewed groups confirm a finding, state it strongly. When evidence is single-source or contested, state it cautiously. When evidence is absent, say so.
Avoid the LLM tells:
These phrases are AI-detector top features. Real flagship-review authors don't use them. Strip them.
Take a position when evidence supports it. Neutral catalogue is the LLM default and the failure mode to avoid. See Verdict sentences below.
Every [N] citation must satisfy four checks:
If any check fails, the citation is broken — fix before continuing. See references/CITATION_INTEGRITY.md for the full protocol.
Do not fill in a template like [Author] et al. [ref] proposed [method]... Achieves Dice of X.XX. That template is a hallucination trap.
Use this discipline instead:
read_paper. For closed-access, use Zotero MCP to access the user's library.If you can't access the paper, do not write about its internal architecture or specific performance numbers. Cite it for the contribution-level claim only ("first to apply X to Y") and move on.
##) for top-level sections (Introduction, Methods, Applications, Discussion, ...).###) for subsections.####) is forbidden in body. Use bold lead-in **Topic.** paragraph starters for deeper grouping.1., 1.1, 1.2.3) are forbidden in section titles. Nature Reviews / Nat Med / Lancet / JACC don't use them in narrative reviews.Display equations (DSC, IoU, clDice, FedAvg, GCN propagation, ...) appear in Boxes, not inline in body paragraphs. Textbook formulas can be referenced ("the Dice similarity coefficient — see Box 1") but should not be displayed inline.
If a formula has no methodological insight worth displaying (e.g., FedAvg averaging), describe it in prose instead of showing it.
Vendor names (HeartFlow, Cleerly, Caristo, Keya, Shukun, ...) appear ONLY in the Commercial Products / Regulatory & Validation table. In body text use category descriptors:
Reason: scatter-cited vendor names look like marketing copy and undermine the review's authority.
# [Title]: <evocative subtitle>
## Key Points
- 4-5 bullets, each 1-3 sentences, expressing the main conclusions.
## Abstract
## Introduction
### Clinical background
### Technical challenge
### Scope and contributions
## Datasets and evaluation metrics
(Table 1: public datasets)
(Box 1: evaluation metrics with equations)
## Methods # 3-axis grouping, NOT flat 10-subsection list
### Architectural priors
**CNN-based design.** ... (bold lead-in for sub-grouping)
**Transformer-based design.** ...
**Mamba and state-space design.** ...
### Inductive priors
**Topology-aware design.** ...
**Multi-task design.** ...
**Graph-based design.** ...
### Data regime
**Self-supervised pre-training.** ...
**Foundation models.** ...
**Federated learning.** ...
**Physics-informed models.** ...
(Table 2: representative methods with modality / family / dataset / metric)
## Downstream applications
### [Application 1]
### [Application 2]
### [Application 3]
## Translation to clinical practice
(Table 3: commercial products with regulatory + validation)
## Outstanding challenges
## Future directions
## References
Notes:
Each H3 method-axis subsection (Architectural priors / Inductive priors / Data regime) should close with one verdict sentence expressing authorial position. Choose the 3-5 most opinionated positions across the whole manuscript — don't put verdicts on every paragraph.
Verdict templates:
Neutral catalogue is the LLM default and exactly what flagship review editors push back on. Force yourself to take 3-5 positions.
See Core Principles ▸ Heading depth above. Hard rules:
**Topic.** for deeper subsubsections.See Core Principles ▸ Equations above. All display equations go in Box 1 (or rare additional Boxes for specific protocols). Textbook formulas with no methodological insight should be described in prose, not displayed.
See Core Principles ▸ Vendor names above. Vendor names live in Table 3 only; body text uses category descriptors with table cross-reference.
# Data citation
"...achieved Dice of 0.730 on ImageCAS [N]"
# Method citation
"Xu et al. [N] introduced..."
# Multi-citation (max 4 in one bracket — beyond that, regroup the claim)
"Multiple groups demonstrated this effect [N1, N2, N3]"
# Comparative
"While [N1] focused on architecture, [N2] addressed the data side"
[N] in body must match the bibliography entry [N], and bibliography [N] must be the paper the body sentence is actually attributing the claim to. See references/CITATION_INTEGRITY.md Rule 3.
Use all three in combination:
| Source | Best for | Tools |
|---|---|---|
| ArXiv | Methodological preprints, ML/AI advances | mcp__arxiv-mcp-server__search_papers, read_paper |
| PubMed | Peer-reviewed clinical / validation studies | mcp__pubmed-mcp-server__pubmed_search_articles + WebFetch on PubMed |
| Zotero | User's local library (closed-access journals) | mcp__zotero__zotero_search_items, zotero_get_item_fulltext |
| Crossref | DOI verification | WebFetch on api.crossref.org/works/<DOI> |
For closed-access journals (Med Image Anal, Eur Radiol, Lancet family) the user's local Zotero library is often the only path. Always check Zotero before assuming a paper is inaccessible.
For MCP server configuration, see references/MCP_SETUP.md.
| File | Read when | |---|---| | references/WORKFLOW.md | Starting a new review or moving between phases | | references/PARADIGM.md | Phase 0: capturing exemplar review style spec | | references/CITATION_INTEGRITY.md | Phase 2 (collection) and Phase 4 (write) — every citation must follow the 5 rules | | references/HALLUCINATION_PATTERNS.md | Phase 4 (write) and Phase 5 (peer review) — checklist of 9 LLM hallucination indicators to self-check against | | references/DOMAINS.md | Phase 3 (outline) — 3-axis method groupings per domain | | references/TEMPLATES.md | Phase 1 (init) — CLAUDE.md, IMPLEMENTATION_PLAN.md, table templates | | references/QUALITY_CHECKLIST.md | Before delivering a draft to the user | | references/MCP_SETUP.md | Setting up arxiv-mcp / pubmedmcp / zotero-mcp |
For revising an existing AI-drafted review (whether your own previous output or someone else's draft), use ai-review-revision. That skill is the dedicated tool for fixing draft-quality issues — multi-agent diagnostic, factual reset, structural reset, content polish, submission prep.
This skill (medical-imaging-review) is the dedicated tool for producing draft-quality content correctly the first time. They are complementary:
If a draft produced by this skill still ends up needing the ai-review-revision workflow to land, that's a bug — flag it so this skill can be improved.
v2.0.0 produced the coronary-cta-paper initial draft. That draft needed extensive multi-day revision before submission-readiness: 17 placeholder DOIs, 30-40 [N] citation drift errors, fabricated method module names with wrong performance numbers, vendor-style citations attributed to peer-reviewed journals, a 10-subsection flat method taxonomy where 3 thematic axes would have served better, AI-tone hedging language throughout.
v3 directly addresses each of these failure modes:
| v2 failure | v3 fix | |---|---| | Hedging mandate in Core Principles | Removed; replaced with "match voice to evidence" | | 80-120 reference count target | Removed; replaced with "cite what supports the argument" | | Method fill-in template | Removed; replaced with "read-first, write-after" discipline | | 10-flat method subsection taxonomy | Replaced with 3-axis grouping in DOMAINS.md | | QA = formal structural check | Replaced with per-claim verification embedded in Phase 4 | | No DOI / author / direction verification | Added as CITATION_INTEGRITY.md with 5 rules | | No hallucination self-check | Added as HALLUCINATION_PATTERNS.md (9 patterns) | | Numbered headings | Banned; max 2 levels, bold lead-in for deeper | | Vendor names scattered | Confined to Table 3 only | | Equations inline | Confined to Box 1 only | | Verdict-free neutral catalogue | Required 3-5 verdict sentences | | No exemplar paradigm capture | Added Phase 0 PARADIGM.md |
documentation
综述写手 (Survey Writer) — 负责按模板撰写综述论文各章节。 当被研究主管或论文分析师指派写作时激活。基于论文分析卡片和对比表, 按学术写作规范撰写完整的综述论文。
data-ai
综述总监 (Survey Director) — 负责AI/ML前沿综述论文的选题规划、大纲设计、 任务分配与终审。当用户提出综述写作需求时激活。协调 5 个 Agent 完成从选题到终稿的全流程。
testing
质量编辑 (Quality Editor) — 负责综述论文的术语一致性、引用完整性和 整体质量审校。当被研究主管或论文撰写员指派审校时激活。生成审校报告 并修正发现的问题。
research
论文分析师 (Paper Analyst) — 负责精读论文、提取方法细节、构建对比表。 当被研究主管或文献侦查员指派分析论文时激活。对 Top 20 核心论文进行结构化分析, 生成论文分析卡片和跨论文对比表。