skills/execution/SKILL.md
Use when experiments are running or just completed, or user shares results and wants to decide what to do next.
npx skillsauth add moralespanitz/research-loop executionInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Read the active session:
ls -t .research-loop/sessions/
cat .research-loop/sessions/<latest>/lab_notebook.md
Summarize the state:
"You're in session [slug]. [N] experiments run. Best so far: [metric]. Last decision: [continue/pivot/kill]."
Create todos with TodoWrite:
Task 1: Annotate run #N — record result, mechanistic explanation, decision
Task 2: Update knowledge graph
Task 3: Write conclusion paragraph (before checking if result matched prediction)
Task 4: Decide — continue / pivot / kill
Read the latest result:
tail -1 .research-loop/sessions/<slug>/autoresearch.jsonl | python3 -m json.tool
Ask the researcher three questions — one at a time:
Q1:
"What happened? Walk me through the result — metric value, direction, was it what you expected?"
Q2:
"Why do you think it happened? I want a mechanistic explanation, not 'the model improved'. What did the change actually do?"
Q3:
"What does this tell you about the next step?"
Append the full exchange to lab_notebook.md:
## Run #N — <node name>
Date: <date>
Mutation: <what changed>
Result: <metric value> (Δ <delta> from baseline)
Researcher explanation: <their answer to Q2>
Causal annotation: <your synthesis of why>
Decision: <continue / pivot / kill>
Next mutation rationale: <why>
Also append a node to knowledge_graph.md:
## [node name] → [result] → [next]
- Mutation: <what changed>
- Result: <metric> Δ<delta>
- Why it worked/failed: <mechanistic>
- Implication: <what to try next>
Apply this tree — ask the researcher first, then give your recommendation:
Improved in last 5 runs?
├── YES → continue this direction
└── NO
├── > 10 runs total with no improvement?
│ └── YES → KILL. Update status, move to next hypothesis.
└── NO → PIVOT. Suggest a different mutation direction.
Show your recommendation explicitly:
"My recommendation: [continue/pivot/kill]. Here's why: [one sentence]."
Update lab_notebook.md status:
## Status
<date>: Run #N complete. Decision: <continue/pivot/kill>. Reason: <why>
Declare success when ALL of these are true:
Then say:
"You have enough to write the paper. Run
/writeor load thewriting-papersskill."
testing
Plan and execute a structured replication workflow for a paper, claim, or benchmark with environment selection and integrity checks.
testing
End-to-end paper generation pipeline ported from AutoResearchClaw (Aiming Lab). 14 phases covering topic initiation through export/publish, with human- in-the-loop gates and quality gating at each handoff. Use this when the user wants a full paper pipeline run — topic to submission-ready manuscript. Delegates to researcher/reviewer/writer/verifier subagents for stage execution and to autonomous-iteration for experiment optimization loops.
testing
Run a structured literature review on a topic using parallel search, evidence tables with quality scoring, and primary-source synthesis.
development
Publication-quality figure generation for research papers. Decision agent selects figure type (code plot vs architecture diagram). Generates Matplotlib/Seaborn code for quantitative figures with iterative improvement loop. Style-matches conference templates (NeurIPS, ICML, ICLR). Use when the paper-pipeline reaches the figure generation phase, or when a user requests figures for an existing draft.