papermill/skills/experiment/SKILL.md
Design rigorous experiments for a research paper: hypothesis formulation, variable identification, methodology selection, and success criteria. Produces a structured experiment plan with reproducibility in mind.
npx skillsauth add queelius/claude-anvil experimentInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Help the researcher design rigorous experiments or computational studies. Good experiments are hypothesis-driven, reproducible, and have clear success criteria before they are run.
Read .papermill/state.md (Read tool) for:
If .papermill/state.md does not exist, ask the user what claim the experiments should test. Experiment design can proceed without the state file — suggest running /papermill:init afterward to register experiments persistently.
Scan the repository for existing code (Glob tool) in research/, code/, scripts/, experiments/, or analysis/ directories.
Ask the user: "What specific claim or aspect of your thesis do these experiments need to support?"
Different contribution types need different experimental approaches:
| Contribution | Experimental approach | |-------------|----------------------| | Theorem/proof | Numerical validation of theoretical predictions | | Algorithm | Runtime/accuracy benchmarks against baselines | | Statistical method | Monte Carlo simulations with known ground truth | | Empirical finding | Controlled experiments with statistical tests | | Framework/model | Case studies demonstrating applicability |
For each experiment, specify:
State the expected outcome in falsifiable terms. "We expect X to be Y under conditions Z" -- not "we want to show our method works."
Define before running what constitutes support for the hypothesis. This prevents post-hoc rationalization.
Check for and warn about:
If .papermill/state.md exists, update it (Edit tool) by adding to the experiments list. If it does not exist, skip registration and suggest running /papermill:init to persist the experiment.
experiments:
- name: "descriptive-name"
type: "simulation | benchmark | case-study | ablation"
hypothesis: "Expected outcome in one sentence"
status: "planned | running | completed | failed"
script: "path/to/script.R"
last_run: null
Append a timestamped note documenting the experiment design.
Based on the experiment type, suggest the most relevant next step:
/papermill:simulation for detailed simulation design — it covers sample sizes, convergence diagnostics, and result presentation."/papermill:proof to verify the theory before running experiments."/papermill:review once the results are written up."development
Force a research-agent run to conclude. Launches the researcher in synthesis mode: it reads state.md and log.md, writes .research/synthesis.md with outcome, key findings, failed approaches, open questions, and recommendations, then exits. Use when current results are good enough or the agent is stalling.
data-ai
Show the current state of an in-flight research-agent run from .research/state.md, log.md, and attempts/. Read-only summary of cycles, sub-problems, hypothesis statuses, eval trend, and current focus.
testing
Resume an interrupted research-agent run. Re-launches the researcher with instructions to read .research/state.md and log.md, reorient, and continue from the documented current focus. Use after a context compression, session restart, or explicit pause.
tools
When and how to use the repoindex plugin surface (MCP tools, agents, slash commands) for collection queries, release prep, activity summaries, and tag discipline. Use when users ask repoindex questions, mention their repo catalog, or want to know which repoindex tool fits their task.