skills/experiment-story-writer/SKILL.md
Turn ML/AI tables, figures, ablations, and metrics into claim-aware results prose. Use for result paragraphs, figure/table narrative, and provisional metrics.
npx skillsauth add a-green-hand-jack/ml-research-skills experiment-story-writerInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Write experiment and results sections as evidence stories. This skill maps claims to figures, tables, ablations, metrics, and paragraphs, then drafts or revises result prose that explains what the evidence supports.
Use this skill for:
Do not use this skill to design new experiments from scratch. Use paper-writing-memory-manager to record result-prose dependencies, stale claim locations, and open result-writing threads. Use paper-result-asset-builder when raw CSV results still need to become paper-facing tables or figures. Use paper-evidence-gap-miner when the result story exposes a missing claim support and existing results should be searched before new compute. Use experiment-design-planner for truly new experiment planning, baseline-selection-audit for baseline fairness, figure-results-review for visual audit, table-results-review for table audit, and paper-evidence-board for claim/evidence inventory.
<installed-skill-dir>/
├── SKILL.md
├── references/
│ ├── result-narrative-patterns.md
│ └── mixed-results-and-placeholders.md
└── templates/
└── experiment-story-plan.md
references/result-narrative-patterns.md.references/mixed-results-and-placeholders.md when results are negative, mixed, unstable, provisional, or still running.templates/experiment-story-plan.md when creating paper/.agent/experiment-story-plan.md.paper/.agent/writing-contract.md, paper/.agent/writing-memory/, paper/.agent/paper-evidence-board.md, paper/.agent/evidence-completion-plan.md, paper/.agent/result-inventory.md, paper/.agent/result-asset-provenance.md, paper/.agent/provisional-results.md, paper/.agent/visual-style.md, and figure/table review reports when present.sections/experiments.tex, sections/results.tex, figures/*.tex, tables/*.tex, and experiment logs or reports under docs/results/, docs/reports/, or docs/runs/ when present.result-diagnosis or narrow the claim before writing strong prose.Extract:
## Experiment Story Snapshot
- Target venue:
- Paper archetype:
- Primary claims:
- Figures:
- Tables:
- Main metrics:
- Baselines:
- Comparison-affecting protocol details:
- Ablations:
- Verified results:
- User-stated results:
- Provisional or missing results:
- Mixed or negative results:
- Claims at risk:
Use claim IDs when available. If no IDs exist, assign local CLM-TMP-* IDs and suggest syncing to paper-evidence-board.
Read references/result-narrative-patterns.md and choose a section order:
The section order should follow the paper's main claim, not the chronological order in which experiments were run.
Create or update:
paper/.agent/experiment-story-plan.md
If there is no paper/ directory and the current directory is the paper repo, save to:
.agent/experiment-story-plan.md
Use templates/experiment-story-plan.md.
For each figure, table, and result paragraph, record:
For each paragraph:
Avoid listing numbers without interpretation.
Avoid writing around unfairness or hidden advantages. If the result depends on extra compute, data reuse, tuning budget, seed/slice selection, metric choice, or different hardware, disclose the scope or route to baseline-selection-audit / paper-evidence-board before final result prose.
When results are not final:
[[PROVISIONAL_RESULT:...]]When results are mixed:
result-diagnosisBefore finalizing:
paper-evidence-gap-miner, then to experiment-design-planner only if existing CSV results cannot fill the gappaper-writing-memory-manager — record which result paragraphs are drafted, which figures/tables are referenced, which claims now have prose support, and any provisional result placeholders in paper/.agent/provisional-results.mdtesting
Bootstrap project-local ml-research-skills. Use from global installs when creating a new ML research project, enabling this collection in an existing ML research repo, or deciding whether to install the full bundle locally. Route to project-init for new projects; do not handle paper or experiment work directly.
development
Route project operations tasks — git, memory, bootstrap, remote, workspace, code review, timeline, ops — to the correct skill. Use when the task involves commits, pushes, worktrees, project memory, enabling project-local skills, SSH/server coordination, sidecar runners, or audits. Do not solve the ops task directly.
testing
Route ML/AI paper writing tasks to the correct skill — contract planning, prose drafting, section writing, consistency editing, review simulation, rebuttal, submission, or citation work. Use when the task involves writing, revising, reviewing, or submitting a paper instead of guessing between paper-writing-assistant, paper-writing-contract-planner, paper-reviewer-simulator, auto-paper-improvement-loop, or citation skills. Do not draft prose directly.
data-ai
Project-local router for ML research skill selection. Use inside an initialized ML research project, or while maintaining this skill repo, when the user describes an ML research/paper/experiment/discovery/ops/release workflow and may not know the skill; route to a domain router or high-signal leaf. Do not use for generic non-ML projects.