Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

a-green-hand-jack/auto-paper-improvement-loop

Name: auto-paper-improvement-loop
Author: a-green-hand-jack

skills/auto-paper-improvement-loop/SKILL.md

npx skillsauth add a-green-hand-jack/ml-research-skills auto-paper-improvement-loop

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Auto Paper Improvement Loop

Run controlled, multi-round review → implement → recompile cycles on a paper draft. Each review round uses a fresh context to prevent confirmation bias; an edit-whitelist gates what may be changed; state is checkpointed after each round so sessions can resume.

Use this skill when:

a paper draft needs iterative writing quality improvement beyond a single-pass consistency edit
reviewer independence matters: prior-context reviews inflate scores and miss real problems
certain parts of the paper (theorems, numerics, citations) should be frozen during a writing pass
a long improvement session may span multiple agent sessions and needs crash recovery
you want a logged diff of what changed between draft versions

Do not use this skill as a substitute for real reviewer feedback — use paper-reviewer-simulator first to identify structural risks. Do not use this skill to make decisions about experimental results — use result-diagnosis or research-results-auditor before running improvement loops.

Pair this skill with:

paper-reviewer-simulator before the first loop round to identify high-priority issues
paper-draft-consistency-editor for a single targeted pass when full multi-round iteration is not needed
paper-writing-assistant when a round's review flags sections that need substantial rewriting
submit-paper after the final round to verify submission readiness

Skill Directory Layout

<installed-skill-dir>/
├── SKILL.md
└── templates/
    └── improvement-log.md

Progressive Loading

Read templates/improvement-log.md before starting a new loop.
Read paper-writing-assistant/references/edit-whitelist-contract.md to select or customize the edit whitelist preset for this loop.
Read paper/.agent/writing-contract.md when it exists to understand protected invariants.
Read paper/.agent/PAPER_IMPROVEMENT_STATE.json when resuming an interrupted loop.

Core Principles

Reviewer independence is non-negotiable. A reviewer that continues from the writer's session context produces inflated scores. Each review sub-task must start with no memory of prior rounds or the author's intentions — only the paper text.

Edit-whitelist prevents scope creep. A writing-quality pass should not silently introduce new claims, new citations, or new numerical values. Declare what is frozen before the loop starts.

Two rounds is usually enough. Round 1 catches the most obvious issues. Round 2 catches what round 1's fixes introduced. A third round rarely finds genuinely new problems and risks over-polishing.

Checkpoint after every round. Multi-round loops over long documents take time. Write state after each completed round.

Step 1 — Configure the Loop

Decide before starting:

Rounds: 2 (default) | 1 (quick) | 3 (high-stakes submission)
Mode: writing | theory | format | full
Edit whitelist — FROZEN (may not be changed):
  - [ ] Theorem/lemma/proof bodies
  - [ ] Any numerical result values
  - [ ] Citation keys and reference list
  - [ ] Section structure and ordering
Edit whitelist — ALLOWED:
  - [ ] Prose rewording for clarity and flow
  - [ ] Paragraph restructuring within sections
  - [ ] Caption rewording
  - [ ] Transition sentences
  - [ ] Notation consistency fixes

Save the configuration and a snapshot of the current PDF (or .tex hash) as the baseline.

Step 2 — Initialize State

Create paper/.agent/PAPER_IMPROVEMENT_STATE.json:

{
  "loop_id": "<paper-dir>-<YYYY-MM-DD>",
  "rounds_planned": 2,
  "rounds_completed": 0,
  "mode": "writing",
  "edit_whitelist_frozen": ["theorems", "numerics", "citations"],
  "baseline_tex_hash": "<sha256>",
  "round_summaries": [],
  "status": "in-progress"
}

Step 3 — Run a Review Round (Reviewer Independence Protocol)

For each round:

Prepare a self-contained review prompt that includes only the paper text — no prior review history, no author intent, no session context.
Run the review as an isolated task using sidecar-task-runner with a fresh Codex session (codex exec --ephemeral), or explicitly start a new Claude session with no continuity. Never continue from the current agent session to run the review.
The review prompt should ask for:
- section-by-section clarity issues (confusing sentences, missing transitions)
- argument flow problems (claims not supported by the evidence in that section)
- presentation issues (figures/tables not mentioned in prose, undefined notation)
- format violations against the writing contract
- a ranked list of the 5 most impactful fixes
Save the review output to paper/.agent/sidecars/improvement-round-<N>/output.md.

Step 4 — Implement Fixes (Edit-Whitelist Enforcement)

For each fix from the review:

Check whether the fix touches a frozen category. If yes, log the rejection:
```
REJECTED: [fix description] — touches frozen category [category]
```
For allowed fixes, implement them in the .tex source.
Recompile to confirm the paper builds without errors.
Log the implemented changes in the round summary.

Step 5 — Restatement Regression Check (Theory Mode)

When mode includes theory:

For each theorem/lemma in the main paper, locate its corresponding restatement in the appendix.
Confirm the statement body is byte-identical or semantically equivalent.
Flag any drift between main-paper statement and appendix restatement.

This check catches accidental divergence introduced by prose edits near theorem environments.

Step 6 — Update State and Log

After each round, update PAPER_IMPROVEMENT_STATE.json:

{
  "rounds_completed": <N>,
  "round_summaries": [
    {
      "round": 1,
      "review_output": "paper/.agent/sidecars/improvement-round-1/output.md",
      "fixes_implemented": <count>,
      "fixes_rejected": <count>,
      "recompile_status": "success"
    }
  ]
}

Write a human-readable log using templates/improvement-log.md.

Step 7 — Final Format Check

After all rounds:

Page count is within venue limit
No duplicate \label{} keys
No \ref{} to undefined labels
No obvious overfull hbox warnings in the compile log (check for Overfull \hbox lines > 10pt)
Abstract word count is within venue limit if specified

Step 8 — Route Next Steps

submit-paper: run final submission preflight after the loop
paper-reviewer-simulator: run a fresh simulation if structural issues were found during the loop
paper-writing-assistant: draft new content for sections flagged as needing substantial work
Mark status: "complete" in PAPER_IMPROVEMENT_STATE.json

Crash Recovery

If the loop is interrupted:

Read paper/.agent/PAPER_IMPROVEMENT_STATE.json to find rounds_completed.
Resume from the next round. Do not re-run completed rounds.
If recompile_status is not success for the last completed round, fix the compile error before continuing.

Final Sanity Check

Before marking the loop complete:

all planned rounds are done
each round's review used a fresh context (no continuation)
edit-whitelist rejections are logged
paper compiles cleanly
improvement log is saved
PAPER_IMPROVEMENT_STATE.json has status: "complete"

a-green-hand-jack/auto-paper-improvement-loop

skills/auto-paper-improvement-loop/SKILL.md

Run multi-round review-implement-recompile improvement cycles on a paper draft. Use when a draft needs iterative writing quality passes with reviewer independence (fresh context per review round), edit-whitelist gating, and crash-resumable state. Distinct from paper-reviewer-simulator (report only) and paper-draft-consistency-editor (single pass).

4 stars

development

Updated May 16, 2026

$ install --global

skillsauth

npx skillsauth add a-green-hand-jack/ml-research-skills auto-paper-improvement-loop

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: May 16, 2026, 4:24 AM226.5s2 files scanned

SKILL.md

name:: auto-paper-improvement-loop
description:: Run multi-round review-implement-recompile improvement cycles on a paper draft. Use when a draft needs iterative writing quality passes with reviewer independence (fresh context per review round), edit-whitelist gating, and crash-resumable state. Distinct from paper-reviewer-simulator (report only) and paper-draft-consistency-editor (single pass).
argument-hint:: [paper-dir] [--rounds <N>] [--edit-whitelist <ops>] [--mode writing|theory|format]
allowed-tools:: Read, Write, Edit, Bash, Glob

Auto Paper Improvement Loop

Use this skill when:

a paper draft needs iterative writing quality improvement beyond a single-pass consistency edit
reviewer independence matters: prior-context reviews inflate scores and miss real problems
certain parts of the paper (theorems, numerics, citations) should be frozen during a writing pass
a long improvement session may span multiple agent sessions and needs crash recovery
you want a logged diff of what changed between draft versions

Pair this skill with:

paper-reviewer-simulator before the first loop round to identify high-priority issues
paper-draft-consistency-editor for a single targeted pass when full multi-round iteration is not needed
paper-writing-assistant when a round's review flags sections that need substantial rewriting
submit-paper after the final round to verify submission readiness

Skill Directory Layout

<installed-skill-dir>/
├── SKILL.md
└── templates/
    └── improvement-log.md

Progressive Loading

Read templates/improvement-log.md before starting a new loop.
Read paper-writing-assistant/references/edit-whitelist-contract.md to select or customize the edit whitelist preset for this loop.
Read paper/.agent/writing-contract.md when it exists to understand protected invariants.
Read paper/.agent/PAPER_IMPROVEMENT_STATE.json when resuming an interrupted loop.

Core Principles

Edit-whitelist prevents scope creep. A writing-quality pass should not silently introduce new claims, new citations, or new numerical values. Declare what is frozen before the loop starts.

Two rounds is usually enough. Round 1 catches the most obvious issues. Round 2 catches what round 1's fixes introduced. A third round rarely finds genuinely new problems and risks over-polishing.

Checkpoint after every round. Multi-round loops over long documents take time. Write state after each completed round.

Step 1 — Configure the Loop

Decide before starting:

Rounds: 2 (default) | 1 (quick) | 3 (high-stakes submission)
Mode: writing | theory | format | full
Edit whitelist — FROZEN (may not be changed):
  - [ ] Theorem/lemma/proof bodies
  - [ ] Any numerical result values
  - [ ] Citation keys and reference list
  - [ ] Section structure and ordering
Edit whitelist — ALLOWED:
  - [ ] Prose rewording for clarity and flow
  - [ ] Paragraph restructuring within sections
  - [ ] Caption rewording
  - [ ] Transition sentences
  - [ ] Notation consistency fixes

Save the configuration and a snapshot of the current PDF (or .tex hash) as the baseline.

Step 2 — Initialize State

Create paper/.agent/PAPER_IMPROVEMENT_STATE.json:

{
  "loop_id": "<paper-dir>-<YYYY-MM-DD>",
  "rounds_planned": 2,
  "rounds_completed": 0,
  "mode": "writing",
  "edit_whitelist_frozen": ["theorems", "numerics", "citations"],
  "baseline_tex_hash": "<sha256>",
  "round_summaries": [],
  "status": "in-progress"
}

Step 3 — Run a Review Round (Reviewer Independence Protocol)

For each round:

Prepare a self-contained review prompt that includes only the paper text — no prior review history, no author intent, no session context.
Run the review as an isolated task using sidecar-task-runner with a fresh Codex session (codex exec --ephemeral), or explicitly start a new Claude session with no continuity. Never continue from the current agent session to run the review.
The review prompt should ask for:
- section-by-section clarity issues (confusing sentences, missing transitions)
- argument flow problems (claims not supported by the evidence in that section)
- presentation issues (figures/tables not mentioned in prose, undefined notation)
- format violations against the writing contract
- a ranked list of the 5 most impactful fixes
Save the review output to paper/.agent/sidecars/improvement-round-<N>/output.md.

Step 4 — Implement Fixes (Edit-Whitelist Enforcement)

For each fix from the review:

Check whether the fix touches a frozen category. If yes, log the rejection:
```
REJECTED: [fix description] — touches frozen category [category]
```
For allowed fixes, implement them in the .tex source.
Recompile to confirm the paper builds without errors.
Log the implemented changes in the round summary.

Step 5 — Restatement Regression Check (Theory Mode)

When mode includes theory:

For each theorem/lemma in the main paper, locate its corresponding restatement in the appendix.
Confirm the statement body is byte-identical or semantically equivalent.
Flag any drift between main-paper statement and appendix restatement.

This check catches accidental divergence introduced by prose edits near theorem environments.

Step 6 — Update State and Log

After each round, update PAPER_IMPROVEMENT_STATE.json:

{
  "rounds_completed": <N>,
  "round_summaries": [
    {
      "round": 1,
      "review_output": "paper/.agent/sidecars/improvement-round-1/output.md",
      "fixes_implemented": <count>,
      "fixes_rejected": <count>,
      "recompile_status": "success"
    }
  ]
}

Write a human-readable log using templates/improvement-log.md.

Step 7 — Final Format Check

After all rounds:

Page count is within venue limit
No duplicate \label{} keys
No \ref{} to undefined labels
No obvious overfull hbox warnings in the compile log (check for Overfull \hbox lines > 10pt)
Abstract word count is within venue limit if specified

Step 8 — Route Next Steps

submit-paper: run final submission preflight after the loop
paper-reviewer-simulator: run a fresh simulation if structural issues were found during the loop
paper-writing-assistant: draft new content for sections flagged as needing substantial work
Mark status: "complete" in PAPER_IMPROVEMENT_STATE.json

Crash Recovery

If the loop is interrupted:

Read paper/.agent/PAPER_IMPROVEMENT_STATE.json to find rounds_completed.
Resume from the next round. Do not re-run completed rounds.
If recompile_status is not success for the last completed round, fix the compile error before continuing.

Final Sanity Check

Before marking the loop complete:

all planned rounds are done
each round's review used a fresh context (no continuation)
edit-whitelist rejections are logged
paper compiles cleanly
improvement log is saved
PAPER_IMPROVEMENT_STATE.json has status: "complete"

Related Skills

a-green-hand-jack/ml-research-bootstrap

testing

VerifiedTrustedCommunity

Bootstrap project-local ml-research-skills. Use from global installs when creating a new ML research project, enabling this collection in an existing ML research repo, or deciding whether to install the full bundle locally. Route to project-init for new projects; do not handle paper or experiment work directly.

4SKILL.mdUpdated May 26, 2026

a-green-hand-jack/ml-research-bootstrap

a-green-hand-jack/project-ops-router

development

VerifiedTrustedCommunity

Route project operations tasks — git, memory, bootstrap, remote, workspace, code review, timeline, ops — to the correct skill. Use when the task involves commits, pushes, worktrees, project memory, enabling project-local skills, SSH/server coordination, sidecar runners, or audits. Do not solve the ops task directly.

4SKILL.mdUpdated May 19, 2026

a-green-hand-jack/project-ops-router

a-green-hand-jack/paper-writing-router

testing

VerifiedTrustedCommunity

Route ML/AI paper writing tasks to the correct skill — contract planning, prose drafting, section writing, consistency editing, review simulation, rebuttal, submission, or citation work. Use when the task involves writing, revising, reviewing, or submitting a paper instead of guessing between paper-writing-assistant, paper-writing-contract-planner, paper-reviewer-simulator, auto-paper-improvement-loop, or citation skills. Do not draft prose directly.

4SKILL.mdUpdated May 19, 2026

a-green-hand-jack/paper-writing-router

a-green-hand-jack/ml-research-router

data-ai

VerifiedTrustedCommunity

Project-local router for ML research skill selection. Use inside an initialized ML research project, or while maintaining this skill repo, when the user describes an ML research/paper/experiment/discovery/ops/release workflow and may not know the skill; route to a domain router or high-signal leaf. Do not use for generic non-ML projects.

4SKILL.mdUpdated May 19, 2026

a-green-hand-jack/ml-research-router

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/a-green-hand-jack/ml-research-skills.git

# Copy into Claude Code skills folder (global)
cp -r ml-research-skills/skills/auto-paper-improvement-loop ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

a-green-hand-jack/ml-research-skills

4 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT