Adoption

Agent Skills are supported by leading AI development tools.

VS Code, Gemini CLI, GitHub, Goose, Amp, Cursor, Claude Code, Letta, OpenCode, Claude, OpenAI Codex, Factory

wanshuiyin/research-review

Name: research-review
Author: wanshuiyin

skills/skills-codex-claude-review/research-review/SKILL.md

npx skillsauth add wanshuiyin/Auto-claude-code-research-in-sleep research-review


Override for Codex users who want Claude Code, not a second Codex agent, to act as the reviewer. Install this package after skills/skills-codex/*.

This reviewer is a different model family from the Codex executor. Every overlay trace/audit records:
review_independence: cross-family
acceptance_status: accepted

Research Review via `claude-review` MCP (high-rigor review)

Claude overlay assurance: this route is a different model family from the Codex executor and records review_independence: cross-family plus acceptance_status: accepted.

Get a multi-round critical review of research work from an external LLM with maximum reasoning depth.

Constants

REVIEWER_MODEL = claude-review — Claude reviewer invoked through the local claude-review MCP bridge. Set CLAUDE_REVIEW_MODEL if you need a specific Claude model override.
REVIEWER_BACKEND = claude-review — reviews route through the claude-review MCP (Claude family; cross-family for a Codex executor).
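
A minimal sketch of how a wrapper script might read the CLAUDE_REVIEW_MODEL override. The helper name resolve_reviewer_model and the choice to return None when no override is set are assumptions for illustration; the bridge's actual default model is not stated here.

```python
import os
from typing import Mapping, Optional

REVIEWER_BACKEND = "claude-review"  # value taken from the Constants above


def resolve_reviewer_model(env: Mapping[str, str] = os.environ) -> Optional[str]:
    """Return the CLAUDE_REVIEW_MODEL override, or None to use the bridge default.

    Hypothetical helper: an empty or whitespace-only value is treated as unset.
    """
    override = env.get("CLAUDE_REVIEW_MODEL", "").strip()
    return override or None
```

Passing the environment as a mapping keeps the helper easy to exercise without touching the real process environment.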

Context: $ARGUMENTS

Prerequisites

Install the base Codex-native skills first: copy skills/skills-codex/* into ~/.codex/skills/.
Then install this overlay package: copy skills/skills-codex-claude-review/* into ~/.codex/skills/ and allow it to overwrite the same skill names.

codex mcp add claude-review -- python3 ~/.codex/mcp-servers/claude-review/server.py

This gives Codex access to mcp__claude-review__review_start, mcp__claude-review__review_reply_start, and mcp__claude-review__review_status.
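
The two copy steps above can be sketched in Python as follows. The function name install_skills and its parameters are hypothetical; the sketch assumes the repository layout named in the prerequisites (skills/skills-codex and skills/skills-codex-claude-review) and copies the overlay last so that it overwrites skills with the same name.

```python
import shutil
from pathlib import Path


def install_skills(repo_root: Path, codex_home: Path) -> None:
    """Copy the base skills, then the overlay, into <codex_home>/skills."""
    dest = codex_home / "skills"
    dest.mkdir(parents=True, exist_ok=True)
    # Order matters: the overlay package is copied last so its files win.
    for package in ("skills-codex", "skills-codex-claude-review"):
        src = repo_root / "skills" / package
        # dirs_exist_ok=True merges into existing directories and
        # overwrites files that share a name (Python 3.8+).
        shutil.copytree(src, dest, dirs_exist_ok=True)


# Example call (paths are assumptions):
# install_skills(Path("Auto-claude-code-research-in-sleep"), Path.home() / ".codex")
```

Registering the MCP server with codex mcp add remains a separate step.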

Workflow

Step 1: Gather Research Context

Before calling the external reviewer, compile a comprehensive briefing:

Read project narrative documents (e.g., STORY.md, README.md, paper drafts)
Read any memory/notes files for key findings and experiment history
Identify: core claims, methodology, key results, known weaknesses

Step 2: Initial Review (Round 1)

Send a detailed prompt with ultra reasoning:

mcp__claude-review__review_start:
  prompt: |
    [Full research context + specific questions]
    Please act as a senior ML reviewer (NeurIPS/ICML level). Start from the
    assumption that the work is broken somewhere — your job is to find where.
    Be adversarial. Trust nothing the author tells you — verify everything
    yourself. Identify:
    1. Logical gaps or unjustified claims
    2. Missing experiments that would strengthen the story
    3. Narrative weaknesses
    4. Whether the contribution is sufficient for a top venue
    Please be brutally honest.

After this start call, immediately save the returned jobId and poll mcp__claude-review__review_status with a bounded waitSeconds until done=true. Treat the completed status payload's response as the reviewer output, and save the completed threadId for any follow-up round.
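
The polling step can be sketched as a bounded loop. Here review_status is a hypothetical Python callable standing in for the mcp__claude-review__review_status tool, and max_total_seconds is an added safety limit that the skill itself does not specify. The sketch assumes each status call blocks for up to waitSeconds on its own, so the loop does not sleep between calls.

```python
import time
from typing import Any, Callable, Dict


def wait_for_review(
    job_id: str,
    review_status: Callable[..., Dict[str, Any]],
    wait_seconds: int = 30,
    max_total_seconds: int = 1800,
) -> Dict[str, Any]:
    """Poll until the job reports done, then return the completed payload."""
    deadline = time.monotonic() + max_total_seconds
    while True:
        # Each call is assumed to wait at most wait_seconds before returning.
        status = review_status(jobId=job_id, waitSeconds=wait_seconds)
        if status.get("done"):
            # The completed payload carries the reviewer output ("response")
            # and the "threadId" to save for follow-up rounds.
            return status
        if time.monotonic() >= deadline:
            raise TimeoutError(f"review job {job_id} did not finish in time")
```

A caller would save result["threadId"] alongside result["response"] before starting Round 2.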

Step 3: Iterative Dialogue (Rounds 2-N)

To continue the conversation, call mcp__claude-review__review_reply_start with the threadId saved from the completed round, then poll mcp__claude-review__review_status with the returned jobId until done=true:

mcp__claude-review__review_reply_start:
  threadId: [saved reviewer id from Step 2]
  prompt: |
    Please continue the review using the revised materials below.

    Revised files:
    - /absolute/path/to/file1
    - /absolute/path/to/file2

    Focus on unresolved weaknesses and whether the revision actually fixed them.
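
As a sketch, the follow-up prompt above could be assembled from a list of revised files. The helper build_reply_prompt is hypothetical; it rejects relative paths because the template lists absolute ones, and it checks them with POSIX path rules.

```python
from pathlib import PurePosixPath
from typing import Iterable


def build_reply_prompt(revised_files: Iterable[str]) -> str:
    """Format the follow-up prompt, listing revised files by absolute path."""
    lines = [
        "Please continue the review using the revised materials below.",
        "",
        "Revised files:",
    ]
    for name in revised_files:
        if not PurePosixPath(name).is_absolute():
            raise ValueError(f"expected an absolute path, got {name!r}")
        lines.append(f"- {name}")
    lines += [
        "",
        "Focus on unresolved weaknesses and whether the revision actually fixed them.",
    ]
    return "\n".join(lines)
```

The returned string would be passed as the prompt field of the reply call, together with the saved threadId.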

For each round:

Respond to criticisms with evidence/counterarguments
Ask targeted follow-ups on the most actionable points
Request specific deliverables: experiment designs, paper outlines, claims matrices

Key follow-up patterns:

"If we reframe X as Y, does that change your assessment?"
"What's the minimum experiment to satisfy concern Z?"
"Please design the minimal additional experiment package (highest acceptance lift per GPU week)"
"Please write a mock NeurIPS/ICML review with scores"
"Give me a results-to-claims matrix for possible experimental outcomes"

Step 4: Convergence

Stop iterating when:

Both sides agree on the core claims and their evidence requirements
A concrete experiment plan is established
The narrative structure is settled
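
One way to make the stopping rule explicit is a small state object. This sketch assumes all three conditions must hold before stopping, and it adds a max_rounds cap that is not part of the skill; the names ReviewState and should_continue are illustrative.

```python
from dataclasses import dataclass


@dataclass
class ReviewState:
    claims_agreed: bool = False        # both sides agree on claims and evidence
    experiment_plan_set: bool = False  # a concrete experiment plan exists
    narrative_settled: bool = False    # the narrative structure is settled

    def converged(self) -> bool:
        """True only when every convergence condition holds (an assumption)."""
        return (
            self.claims_agreed
            and self.experiment_plan_set
            and self.narrative_settled
        )


def should_continue(state: ReviewState, round_number: int, max_rounds: int = 5) -> bool:
    """Keep iterating until convergence or the assumed round cap is reached."""
    return not state.converged() and round_number < max_rounds
```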

Step 5: Document Everything

Save the full interaction and conclusions to a review document in the project root:

Round-by-round summary of criticisms and responses
Final consensus on claims, narrative, and experiments
Claims matrix (what claims are allowed under each possible outcome)
Prioritized TODO list with estimated compute costs
Paper outline if discussed

Update project memory/notes with key review conclusions.
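
A minimal sketch of assembling the review document from the items listed above. The function name, argument shapes, and plain-text layout are assumptions; the skill does not prescribe a file format.

```python
from typing import Dict, List, Tuple


def render_review_document(
    rounds: List[Tuple[str, str]],
    consensus: str,
    claims_matrix: Dict[str, str],
    todos: List[Tuple[str, str]],
    outline: str = "",
) -> str:
    """Build a self-contained review document as plain text."""
    parts = ["Research Review", "", "Round-by-round summary"]
    for number, (criticism, response) in enumerate(rounds, start=1):
        parts.append(f"Round {number}: criticism: {criticism} | response: {response}")
    parts += ["", "Final consensus", consensus, "", "Claims matrix"]
    # Each entry maps a possible experimental outcome to the claim it allows.
    for outcome, allowed_claim in claims_matrix.items():
        parts.append(f"If {outcome}: {allowed_claim}")
    parts += ["", "Prioritized TODO list"]
    for priority, (task, compute) in enumerate(todos, start=1):
        parts.append(f"{priority}. {task} (estimated compute: {compute})")
    if outline:  # the outline section is included only if one was discussed
        parts += ["", "Paper outline", outline]
    return "\n".join(parts) + "\n"
```

Writing the returned text to a file in the project root keeps the document readable without the original conversation.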

If the directive "— composed: <canonical-report-path>" is explicitly present, fold the consensus, claims matrix, TODOs, and trace links into that report instead of writing a standalone review document. Without the directive, write the standalone review as documented; never infer composed mode from an existing file. The "— standalone" directive always wins. See output-composition.md.

Step 6: Review Tracing

Save a trace for every mcp__claude-review__review_start, mcp__claude-review__review_reply_start, or oracle-pro review call following ../shared-references/review-tracing.md. Record the reviewer route, saved threadId, prompt summary, raw response path, decisions, and action items. This preserves the Claude mainline Review Tracing semantics while using Codex-native reviewer calls.
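
A sketch of what a per-call trace record might contain, using the fields named above plus the two overlay assurance fields. The JSON layout, key names, and file naming are assumptions; the authoritative schema lives in ../shared-references/review-tracing.md, which is not reproduced here.

```python
import json
from datetime import datetime, timezone
from pathlib import Path
from typing import List


def save_review_trace(
    trace_dir: Path,
    route: str,
    thread_id: str,
    prompt_summary: str,
    raw_response_path: str,
    decisions: List[str],
    action_items: List[str],
) -> Path:
    """Write one JSON trace file for a reviewer call and return its path."""
    record = {
        "reviewer_route": route,
        "threadId": thread_id,
        "prompt_summary": prompt_summary,
        "raw_response_path": raw_response_path,
        "decisions": decisions,
        "action_items": action_items,
        # Overlay assurance fields recorded on every trace.
        "review_independence": "cross-family",
        "acceptance_status": "accepted",
    }
    trace_dir.mkdir(parents=True, exist_ok=True)
    # Hypothetical naming scheme: a UTC timestamp with microseconds.
    stamp = datetime.now(timezone.utc).strftime("%Y%m%dT%H%M%S%fZ")
    path = trace_dir / f"review-trace-{stamp}.json"
    path.write_text(json.dumps(record, indent=2), encoding="utf-8")
    return path
```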

Key Rules

Always ask the Claude reviewer for strict, high-rigor feedback in every review round.
Send comprehensive context in Round 1: the external model cannot read your files.
Be honest about weaknesses; hiding them leads to worse feedback.
Push back on criticisms you disagree with, but accept valid ones.
Focus on ACTIONABLE feedback: "what experiment would fix this?"
Document the completed threadId for potential future resumption.
The review document should be self-contained (readable without the conversation).

Prompt Templates

For initial review:

"I'm going to present a complete ML research project for your critical review. Please act as a senior ML reviewer (NeurIPS/ICML level)..."

For experiment design:

"Please design the minimal additional experiment package that gives the highest acceptance lift per GPU week. Our compute: [describe]. Be very specific about configurations."

For paper structure:

"Please turn this into a concrete paper outline with section-by-section claims and figure plan."

For claims matrix:

"Please give me a results-to-claims matrix: what claim is allowed under each possible outcome of experiments X and Y?"

For mock review:

"Please write a mock NeurIPS review with: Summary, Strengths, Weaknesses, Questions for Authors, Score, Confidence, and What Would Move Toward Accept."

wanshuiyin/research-review

skills/skills-codex-claude-review/research-review/SKILL.md

Get a deep critical review of research from Claude via claude-review MCP. Use when user says "review my research", "help me review", "get external review", or wants critical feedback on research ideas, papers, or experimental results.

13,243 stars

tools

Updated Jul 11, 2026

$ install --global

skillsauth

npx skillsauth add wanshuiyin/Auto-claude-code-research-in-sleep research-review

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Trivy (container and dependency vulnerability scanner): Clean, 95%
Semgrep (static code analysis for vulnerabilities): Clean, 95%
mcp-scan (Snyk) (Model Context Protocol security validation): Clean, 95%
VirusTotal (multi-engine malware detection): Error, 70%
Snyk (dep) (open source security scanning): Skipped, 50%
Socket.dev (supply chain security analysis): Skipped, 50%
CrowdStrike (advanced threat intelligence): Skipped, 50%
OSV-Scanner (Open Source Vulnerability database check): Skipped, 50%
OWASP Dep-Check: Skipped, 50%

Last scanned: Jul 11, 2026, 5:11 AM | 145.6s | 1 file scanned


Related Skills

wanshuiyin/web-debug-search

development

Verified, Trusted, Community

Search GitHub Issues and Discussions for software errors, version compatibility problems, and exact error-string matches. Use for debugging and discovery only; results are not paper-citation evidence.

13,732 | SKILL.md | Updated Jul 23, 2026

wanshuiyin/integrity-forensics

testing

Verified, Trusted, Community

Run the Anti-Autoresearch integrity-forensics sweep (span-anchored evidence ledger → GPT auditors propose findings → deterministic rules-only adjudicator) against a paper via a SHA-pinned thin launcher — then convert the verdict into a typed policy gate (BLOCK/WARN/NO_NEW_BLOCKER) and an append-only obligations ledger. Use when user says "integrity forensics", "forensic audit this paper", "投稿前自查诚信", "审这篇论文的诚信", or says "anti-autoresearch" when the upstream repo's own skills are not installed. Also invoked by /paper-writing (submission self-forensics, default ON), /peer-review (forensic appendix), /resubmit-pipeline.

13,401 | SKILL.md | Updated Jul 13, 2026

wanshuiyin/meta-apply

testing

Verified, Trusted, Community

Privileged applier that LANDS meta-optimize / corpus-audit patches the user approved — the ONLY skill permitted to mutate the skill corpus from a self-modification proposal, with cross-model jury and human approval at landing. Use when the user says "meta apply", "/meta-apply", "land the staged patches", "应用优化", after a /meta-optimize run.

13,401 | SKILL.md | Updated May 31, 2026


Install manually


# Clone the repo
git clone https://github.com/wanshuiyin/Auto-claude-code-research-in-sleep.git

# Copy into Claude Code skills folder (global)
cp -r Auto-claude-code-research-in-sleep/skills/skills-codex-claude-review/research-review ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

wanshuiyin/Auto-claude-code-research-in-sleep

13,243 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT
