skills/figure-results-review/SKILL.md
Review ML/AI result figures, captions, LaTeX wrappers, and visual style. Use for paper plots, figure screenshots, result narratives, and venue-ready figure polish.
npx skillsauth add a-green-hand-jack/ml-research-skills figure-results-review
Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
Audit figures, plots, captions, and result narratives before they become paper evidence or meeting material.
Use this skill when:
- figures/fig_name.pdf or figures/fig_name.png plus a LaTeX wrapper such as figures/fig_name.tex

Do not use this skill to design experiments from scratch. Use experiment-design-planner before results exist. Use result-diagnosis when the primary issue is why a result is surprising or broken. Use conference-writing-adapter when the main task is prose style after the evidence is already accepted.
Pair this skill with:
- paper-result-asset-builder when a paper-facing figure needs to be generated or regenerated from CSV result files before visual review
- paper-evidence-gap-miner when the figure review reveals a missing result and existing CSVs may already contain the needed evidence
- paper-evidence-board when figures must be linked to paper claims, sections, reviewer risks, and actions
- result-diagnosis when a plotted result is suspicious, unstable, negative, or contradictory
- baseline-selection-audit when the visual exposes missing, weak, or unfair baselines
- experiment-design-planner when the fix requires new experiments, ablations, controls, or metrics
- experiment-report-writer when raw results need a structured report before figure review
- conference-writing-adapter when the final figure narrative or visual style must be adapted to a target venue
- research-project-memory when claim/evidence/provenance/risk/action/handoff updates should persist across sessions

<installed-skill-dir>/
├── SKILL.md
├── templates/
│ └── visual-style.md
└── references/
├── caption-and-narrative.md
├── claim-support.md
├── memory-writeback.md
├── paper-visual-style.md
├── report-template.md
├── statistical-evidence.md
├── style-memory.md
└── visual-integrity.md
- references/claim-support.md, references/visual-integrity.md, and references/statistical-evidence.md.
- references/paper-visual-style.md and references/style-memory.md when figures are intended for a paper, slide deck, rebuttal, camera-ready, or venue-specific rewrite.
- templates/visual-style.md when initializing paper/.agent/visual-style.md.
- references/caption-and-narrative.md when revising captions, result prose, slide text, or paper figure callouts.
- references/report-template.md before writing the final review.
- references/memory-writeback.md when the project has memory/, component .agent/ folders, or the user asks for persistent project memory.
- submit-paper.

Collect:
- figures/fig_name.pdf or figures/fig_name.png
- figures/fig_name.tex, if the paper uses wrapper files
- CLM-###, EVD-###, FIG-###, TAB-###, RSK-###, or ACT-###

Rewrite the intended evidence relation:
This figure is supposed to show that [claim] because [metric/comparison/trend] under [setup].
If that sentence cannot be written, route to paper-evidence-board before polishing the visual.
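The gate above can be sketched in a few lines of Python. This is a minimal illustration and not part of the skill: the function names `evidence_relation` and `next_step` are hypothetical, and the single `evidence` argument stands in for the `[metric/comparison/trend]` slot of the template.

```python
from typing import Optional

# Template copied from the evidence-relation sentence in this document.
TEMPLATE = "This figure is supposed to show that {claim} because {evidence} under {setup}."


def evidence_relation(claim: str, evidence: str, setup: str) -> Optional[str]:
    """Return the filled sentence, or None if any slot is empty (hypothetical helper)."""
    parts = {"claim": claim, "evidence": evidence, "setup": setup}
    if any(not value.strip() for value in parts.values()):
        return None
    return TEMPLATE.format(**{key: value.strip() for key, value in parts.items()})


def next_step(claim: str, evidence: str, setup: str) -> str:
    """Return the sentence if it can be written, otherwise the routing target."""
    sentence = evidence_relation(claim, evidence, setup)
    return sentence if sentence is not None else "route: paper-evidence-board"
```

An empty slot means the figure has no stated evidence relation yet, so the sketch returns the routing target and the visual is not polished first.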
For paper figures, identify the bundle by shared stem:
figures/fig_name.pdf or figures/fig_name.png # rendered asset
figures/fig_name.tex # LaTeX wrapper
Inspect both layers when available:
- .tex: \includegraphics, width, placement, \caption{}, \label{}, subfigure layout, notes, and whether the asset filename matches the intended figure
- \input{} or \include{} and how the main text calls it out

If a wrapper exists without a matching asset, or an asset exists without a wrapper, flag the bundle as incomplete. If multiple asset formats exist, identify which one the wrapper includes.
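The bundle check can be sketched as a grouping of filenames by stem. This is a minimal sketch under stated assumptions: the helper names `group_bundles` and `bundle_status` and the status strings are hypothetical, and only the .pdf, .png, and .tex extensions named in this document are considered.

```python
from pathlib import PurePosixPath

ASSET_SUFFIXES = {".pdf", ".png"}  # rendered assets named in this document
WRAPPER_SUFFIX = ".tex"            # LaTeX wrapper


def group_bundles(filenames):
    """Group figure files by shared stem (hypothetical helper)."""
    bundles = {}
    for name in sorted(filenames):
        path = PurePosixPath(name)
        if path.suffix not in ASSET_SUFFIXES and path.suffix != WRAPPER_SUFFIX:
            continue  # ignore files that are neither asset nor wrapper
        entry = bundles.setdefault(path.stem, {"assets": [], "wrapper": None})
        if path.suffix == WRAPPER_SUFFIX:
            entry["wrapper"] = path.name
        else:
            entry["assets"].append(path.name)
    return bundles


def bundle_status(bundle):
    """Classify one bundle; the status strings are illustrative only."""
    if bundle["wrapper"] is None:
        return "incomplete: asset without wrapper"
    if not bundle["assets"]:
        return "incomplete: wrapper without asset"
    if len(bundle["assets"]) > 1:
        return "check: multiple asset formats, confirm which one the wrapper includes"
    return "complete"
```

For a real directory, the filenames could come from `(p.name for p in pathlib.Path("figures").iterdir())`. The sketch only pairs files by name; confirming which asset the wrapper actually includes still requires reading the wrapper.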
For each figure, produce a visual description before writing or judging the caption.
The visual description should state:
Do not put the full visual description into the paper caption. Use it as the audit record that checks whether the caption and paper prose are faithful to the figure.
Read references/claim-support.md.
For each figure, answer:
Assign one status:
- supports-claim
- supports-narrower-claim
- ambiguous
- contradicts-claim
- diagnostic-only
- not-ready

Read references/visual-integrity.md.
Check:
Flag any issue that could cause a reviewer to misread the result.
Read references/paper-visual-style.md and references/style-memory.md when the output is paper-facing.
Check:
If the paper has no visual style policy, propose one from templates/visual-style.md and record it in paper/.agent/visual-style.md or .agent/conference-writing/project-style.md when appropriate.
For typography and final sizing, check the contract rather than the notebook preview:
- \columnwidth, \linewidth, or \textwidth
- plot_style.yaml and paper-side visual-style.md agree when both exist

If a style issue is discovered, classify it as lesson, preference, project contract, or reusable skill rule candidate before writing it back.
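One way to read the sizing contract from the wrapper rather than from a notebook preview is to extract the `\includegraphics` options directly. The sketch below is an assumption about how such a check might look: `inspect_wrapper` is a hypothetical helper, the regular expressions handle only simple single-line `\includegraphics[...]{...}` and `\label{...}` forms, and treating a width as "relative" when it mentions `\columnwidth`, `\linewidth`, or `\textwidth` is my reading of the checklist, not a rule stated by the skill.

```python
import re

# Matches \includegraphics with optional [options] and a {path}; simple cases only.
INCLUDE_RE = re.compile(r"\\includegraphics(?:\[([^\]]*)\])?\{([^}]*)\}")
LABEL_RE = re.compile(r"\\label\{([^}]*)\}")
RELATIVE_WIDTHS = ("\\columnwidth", "\\linewidth", "\\textwidth")


def inspect_wrapper(tex_source):
    """Collect included assets, their width options, and labels (hypothetical helper)."""
    includes = []
    for options, asset in INCLUDE_RE.findall(tex_source):
        width = None
        for option in options.split(","):
            key, _, value = option.partition("=")
            if key.strip() == "width":
                width = value.strip()
        includes.append({
            "asset": asset.strip(),
            "width": width,
            # True when the width is expressed against a LaTeX layout length.
            "relative_width": width is not None
            and any(unit in width for unit in RELATIVE_WIDTHS),
        })
    return {"includes": includes, "labels": LABEL_RE.findall(tex_source)}
```

A fixed width such as `3in`, or no width at all, shows up as `relative_width: False`, which is a prompt to check the final rendered size by hand.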
Read references/statistical-evidence.md.
Check:
If the plot lacks necessary uncertainty, decide whether to rerun, add error bars, weaken the claim, or move the result to appendix/diagnostic status.
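When per-seed values are available, the numbers behind an error bar can be computed before deciding how to proceed. The following is a minimal sketch, assuming independent runs of the same configuration. `summarize_runs` is a hypothetical helper, and the standard error of the mean is only one possible uncertainty measure; the skill does not prescribe it.

```python
import math
import statistics


def summarize_runs(values):
    """Mean, sample standard deviation, and standard error over repeated runs."""
    n = len(values)
    mean = statistics.fmean(values)
    if n < 2:
        # A single run carries no spread estimate; report that explicitly.
        return {"n": n, "mean": mean, "std": None, "sem": None}
    std = statistics.stdev(values)  # sample standard deviation (n - 1 denominator)
    return {"n": n, "mean": mean, "std": std, "sem": std / math.sqrt(n)}
```

A result with `std` of `None` means the plot rests on one run, which points toward a rerun, a weaker claim, or diagnostic status, not toward drawing an error bar.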
Read references/caption-and-narrative.md when output text needs revision.
For each figure, produce:
- .tex, plotting parameters, experiment parameters, and source certainty

Captions should not oversell. They should state the setup, comparison, metric, and takeaway.
For every issue, route to one or more actions:
- fix-wrapper: wrong asset path, stale caption, label mismatch, width/crop/layout issue, or missing subfigure mapping in figures/*.tex
- edit-figure: labels, ordering, scale, legend, layout, or visual emphasis
- rewrite-caption: setup, metric, takeaway, caveat, or claim alignment
- write-description: missing visual description or missing provenance record
- rewrite-results-text: nearby paper prose overclaims or misses the takeaway
- define-visual-style: missing or inconsistent paper visual style policy
- record-style-lesson: new typography, sizing, legend, marker, color, export, or wrapper lesson should be appended to style memory before becoming a hard rule
- restyle-figure: color, marker, line width, font size, symbol, panel layout, or emphasis
- build-result-asset: raw CSV evidence exists but the paper-facing figure or wrapper needs to be generated with provenance
- mine-existing-results: the figure lacks evidence that may already exist in CSVs or reports
- rerun: missing seeds, variance, baseline, metric, or protocol after existing results are checked
- diagnose-result: suspicious, negative, unstable, or contradictory pattern
- baseline-audit: missing or unfair baseline
- narrow-claim: evidence only supports a smaller statement
- move-to-appendix: useful but not central enough for main paper
- cut: visual does not support a paper need

Name the next skill when appropriate.
Read references/report-template.md.
If saving to a project and no path is given, use:
docs/results/figure_results_review_YYYY-MM-DD_<short-name>.md
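The default path pattern can be filled in programmatically. This is a small sketch, not the skill's own tooling: `report_path` is a hypothetical helper, and lowercasing the short name and replacing non-alphanumeric runs with hyphens is an assumption, since the document only gives the `<short-name>` placeholder.

```python
import re
from datetime import date
from typing import Optional


def report_path(short_name: str, day: Optional[date] = None) -> str:
    """Build the default report path for a given short name and date."""
    day = day or date.today()
    # Assumed slug rule: lowercase, runs of other characters become one hyphen.
    slug = re.sub(r"[^a-z0-9]+", "-", short_name.lower()).strip("-")
    return f"docs/results/figure_results_review_{day.isoformat()}_{slug}.md"
```

Passing the date explicitly keeps the result reproducible, for example `report_path("Ablation Curves", date(2025, 1, 31))`.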
The report must include:
- .tex, paper callout location, label

Read references/memory-writeback.md when memory exists.
Update the smallest useful set of entries:
- memory/evidence-board.md: figure evidence status, rendered asset, wrapper .tex, setup, plotting parameters, experiment parameters, and linked claims
- memory/claim-board.md: claims supported, narrowed, contradicted, or not ready
- memory/risk-board.md: reviewer risks from visual ambiguity, missing uncertainty, weak baselines, or overclaiming
- memory/action-board.md: figure edits, reruns, caption fixes, result diagnosis, or claim revisions
- paper/.agent/: figure map, asset/wrapper pairings, paper locations, visual descriptions, caption state, provenance gaps, and stale visual warnings
- paper/.agent/visual-style.md or paper/.agent/style-lessons.md: style lessons, preferences, and project contracts for typography, sizing, encodings, exports, and wrapper behavior
- .agent/conference-writing/project-style.md: venue-facing figure style decisions when conference adaptation is active
- .agent/worktree-status.md: result-generation or plotting tasks and exit conditions

Use certainty labels:
- verified for values checked against raw data, logs, or source figures
- user-stated for user-supplied context
- inferred for reviewer-risk and narrative judgments
- unverified for visual or statistical claims that could not be inspected

Before finalizing:
- figures/*.tex