papermill/skills/review/SKILL.md
Launch a multi-agent editorial review with 8 specialists (2 literature scouts plus 6 domain reviewers) orchestrated by an area-chair agent. Produces a unified report plus specialist reports in .papermill/reviews/. Updates .papermill/state.md.
npx skillsauth add queelius/claude-anvil reviewInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Launch a comprehensive multi-agent review of a research paper. The review system uses 8 specialist agents orchestrated by an area chair to evaluate logic, novelty, methodology, prose, citations, and formatting — grounded in literature context.
Read .papermill/state.md (Read tool) for:
If .papermill/state.md does not exist, the review can still proceed by reading the manuscript directly — but the review will be less targeted without thesis and venue context. Note this limitation to the user and suggest running /papermill:init first for best results.
Locate the manuscript files:
.papermill/state.md for format and any recorded manuscript path.*.tex, *.Rmd, paper.md, manuscript.md.Read the manuscript to confirm it has enough content for review.
Before launching the review, ask the user:
I'll launch a multi-agent review with specialists covering:
- Logic & proofs — mathematical correctness and argument structure
- Novelty — contribution evaluation against the literature
- Methodology — experimental design and statistical rigor
- Prose — writing quality and narrative structure
- Citations — reference accuracy and completeness
- Formatting — build verification and venue compliance
Are there specific areas you want me to focus on, or should I run the full review?
If the user specifies focus areas, note them for the orchestrator. If they want the full review, proceed with all specialists.
Launch the reviewer agent (Task tool with subagent_type: "papermill:reviewer").
Pass the agent:
.papermill/state.md (if it exists)The agent will:
.papermill/reviews/YYYY-MM-DD/After the agent completes, read .papermill/reviews/YYYY-MM-DD/review.md (Read tool).
Present a summary to the user:
Review Complete
Recommendation: [ready | minor-revision | major-revision | not-ready]
| Severity | Count | |----------|-------| | Critical | N | | Major | M | | Minor | P | | Suggestions | Q |
Top findings:
- [Most important finding]
- [Second most important finding]
- [Third most important finding]
The full report is at
.papermill/reviews/YYYY-MM-DD/review.md. Individual specialist reports are in the same directory.
Then ask: "Would you like to go through the findings in detail, or address the critical issues first?"
Update .papermill/state.md (Edit tool):
Add a review record to review_history:
review_history:
- date: "YYYY-MM-DD"
type: "multi-agent-review"
findings_major: N
findings_minor: M
recommendation: "ready | minor-revision | major-revision | not-ready"
notes: "Brief summary of key findings"
report_path: ".papermill/reviews/YYYY-MM-DD/review.md"
Append a timestamped note to the markdown body.
Based on the recommendation, suggest the most relevant next step:
/papermill:polish to prepare, then /papermill:venue if no venue is selected yet./papermill:review./papermill:outline. If issues are with the argument, suggest /papermill:thesis. Then re-run /papermill:review./papermill:thesis. Insufficient evidence → /papermill:experiment or /papermill:simulation. Missing related work → /papermill:prior-art. Proof gaps → /papermill:proof.tools
This skill should be used when the user asks about research directions, open problems, future work, or follow-up research from the academic literature. Trigger phrases include "open problems in X", "what's next for Y", "future work for paper Z", "research directions on T", "salient follow-up research", "what should I work on next in", "find me follow-up research unrelated to my prior work", "broad survey of W", "neglected directions in V". Routes the request to the right Vista MCP tool, reads back the structured paper sections, and synthesizes research directions in the conversation.
testing
Discover latent themes in the metafunctor corpus. Use when the user wants to find implicit through-lines across their blog posts, surface recurring ideas they have not consciously framed as a series, or get candidate themes for a synthesis post. Reads titles, descriptions, and tags; proposes 3 themes that connect 4+ posts each, each with a single-sentence through-line. Output is a proposal, not a draft. Trigger phrases include "find themes in my posts", "what threads connect my writing", "latent themes in my corpus", "scribe".
development
Use when drafting prose sections for a bookwright (technical non-fiction) project. Encodes the Bernoulli-textbook workflow: atom-outward design, deferral discipline, running threads, page budgets, Path A subagent pattern, header comment block convention.
testing
Use when drafting or executing a paired notebook for a bookwright chapter. Covers when notebooks are required vs optional, numerical-sanity-target convention, exec-from-fresh-kernel requirement, and stack-specific execution commands (Jupyter/R Markdown/Quarto).