skills/experiment-report-writer/SKILL.md
Write structured experiment reports from notes, configs, logs, metrics, tables, and figures. Use for result analysis, research updates, and presentation-ready summaries.
npx skillsauth add a-green-hand-jack/ml-research-skills experiment-report-writerInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Turn experiment evidence into a clear research report that a reader can evaluate without rerunning the experiment.
Use this skill to write a standalone document, a section for a paper or lab note, a mentor-facing update, or a presentation-ready experiment summary.
Pair this skill with research-project-memory when completed results should update project claims, evidence, risks, actions, figures, or worktree decisions.
<installed-skill-dir>/
├── SKILL.md
└── templates/
└── experiment-report.md
templates/experiment-report.md as the default Markdown skeleton when saving a report.Identify the report mode:
single-experiment: one run or one controlled comparisonablation-report: several variants testing one factorbatch-summary: many related runs from a sweep or experiment batchmentor-update: concise progress report with decision-oriented discussionpaper-section: polished text intended to become part of a paperAlso identify:
If the user gives no format, default to Markdown. If they ask for a file and no path is given, use:
docs/reports/experiment_report_YYYY-MM-DD.md
Prefer primary evidence over memory.
Look for:
Useful local checks include:
git rev-parse --short HEAD
find . -maxdepth 3 -type f \( -name "*.yaml" -o -name "*.yml" -o -name "*.json" -o -name "*.csv" -o -name "*.md" \)
find . -maxdepth 4 -type f \( -name "*.png" -o -name "*.jpg" -o -name "*.pdf" -o -name "*.svg" \)
If the user only provides informal notes, use them but label missing reproducibility details explicitly.
Before drafting, organize the experiment into:
For ablations or sweeps, make the independent variable explicit and keep the comparison fair.
Use these sections unless the user requests a different format:
# [Experiment Report Title]
## Summary
## 1. Experiment Motivation
## 2. Experiment Setup
## 3. Core Algorithm or Method
## 4. Metrics
## 5. Results
## 6. How to Read the Figures
## 7. Interpretation
## 8. Conclusion and Discussion
## 9. Limitations and Caveats
## 10. Next Steps
## Reproducibility Notes
If there is no core algorithm, write "Not applicable" and briefly explain whether the experiment changes data, hyperparameters, evaluation, infrastructure, or analysis instead.
If there are no figures, omit "How to Read the Figures" or replace it with "How to Read the Tables" when tables are the main evidence.
Write 3-6 bullets covering:
Explain the research or engineering reason for the experiment:
Include enough detail to reproduce or audit the run:
Use a table for parameters when there are more than five important settings.
Describe the algorithm only at the level needed to understand the experiment:
Do not over-explain standard background unless the audience needs it.
For each metric, explain:
Flag metrics that can conflict with each other.
Present results before interpretation.
Use:
Always identify the baseline and report absolute values plus meaningful deltas when possible.
For every figure, explain:
If an axis is log-scaled, normalized, clipped, or unitless, say so explicitly.
Connect the observed results back to the motivation:
Use cautious wording when there is only one seed, weak statistical evidence, or missing controls.
State the practical conclusion:
Include risks that could change the conclusion:
Recommend concrete follow-ups:
Tie each next step to the uncertainty it resolves.
If the project uses research-project-memory, write back the result after the report is drafted:
memory/evidence-board.md: completed EVD-### summary, source paths, linked claim IDs, limitations, and certaintymemory/provenance-board.md: source-to-evidence traceability from run pointers, CSVs, reports, aggregation rules, and produced paper-facing candidatesmemory/claim-board.md: move linked claims through the lifecycle, such as supported, weakened, revised, evidence-needed, provisional, or cutmemory/risk-board.md: close mitigated risks or add new risks exposed by the resultmemory/action-board.md: next steps from the report, including rerun, write, revise-method, park, or kill decisionsmemory/handoff-board.md: create a handoff when results are ready for paper-result-asset-builder, paper-evidence-board, result-diagnosis, or writingmemory/phase-dashboard.md: update the active gate if the project moved into evidence production, paper asset building, drafting, or regressed because the result weakened a claim.agent/worktree-status.md "Local Hot Results": always record the result here first when working in a code-worktree; this is the default write target for in-progress and intermediate results<ProjectRoot>/memory/hot-results.md: graduate here only when the result is confirmed, ranks among the project's top 5–7, and directly changes a project-level claim; do not write here for in-progress or worktree-local workmemory/current-status.md: latest reliable experiment state and next session entry pointDo not write an interpretation as a measured fact. Use observed for metrics from logs/tables and inferred for explanations.
Before finalizing, check that:
testing
Bootstrap project-local ml-research-skills. Use from global installs when creating a new ML research project, enabling this collection in an existing ML research repo, or deciding whether to install the full bundle locally. Route to project-init for new projects; do not handle paper or experiment work directly.
development
Route project operations tasks — git, memory, bootstrap, remote, workspace, code review, timeline, ops — to the correct skill. Use when the task involves commits, pushes, worktrees, project memory, enabling project-local skills, SSH/server coordination, sidecar runners, or audits. Do not solve the ops task directly.
testing
Route ML/AI paper writing tasks to the correct skill — contract planning, prose drafting, section writing, consistency editing, review simulation, rebuttal, submission, or citation work. Use when the task involves writing, revising, reviewing, or submitting a paper instead of guessing between paper-writing-assistant, paper-writing-contract-planner, paper-reviewer-simulator, auto-paper-improvement-loop, or citation skills. Do not draft prose directly.
data-ai
Project-local router for ML research skill selection. Use inside an initialized ML research project, or while maintaining this skill repo, when the user describes an ML research/paper/experiment/discovery/ops/release workflow and may not know the skill; route to a domain router or high-signal leaf. Do not use for generic non-ML projects.