Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

a-green-hand-jack/reference-corpus-analyzer

Name: reference-corpus-analyzer
Author: a-green-hand-jack

skills/reference-corpus-analyzer/SKILL.md

npx skillsauth add a-green-hand-jack/ml-research-skills reference-corpus-analyzer

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Reference Corpus Analyzer

Synthesize a literature corpus into a structured comparison matrix. This skill answers: across these N papers, who does what, how do they differ, and where is the open space?

Use this skill when:

related-work writing requires a side-by-side method comparison across multiple papers
you want to identify the closest 3–5 papers and understand exactly how they differ from each other
a literature survey needs trend identification across publication years or venues
you want to map open gaps across a set of existing approaches before writing the related-work section
you have 5+ source cards and want a comparison table rather than individual summaries

Do not use this skill to create per-paper source cards — use reference-reading-summarizer for that. Do not use this skill to link a paper to your project's claims — use reference-project-synthesizer for that. Use this skill after source cards exist; avoid re-reading raw PDFs unless a card is insufficient.

Pair this skill with:

reference-reading-summarizer upstream: produce source cards before running corpus analysis
reference-project-synthesizer upstream or downstream: link individual papers to project memory before or after comparison
related-work-positioning-writer downstream: use the comparison matrix to write novelty-boundary paragraphs
baseline-selection-audit downstream: use the ranking to identify must-have baselines
literature-review-sprint when the corpus is not yet assembled and a broader topic survey is needed first

Skill Directory Layout

<installed-skill-dir>/
├── SKILL.md
└── templates/
    └── comparison-matrix.md

Progressive Loading

Read reference/cards/ to find available source cards before reading raw sources.
Read reference/.agent/source-index.md or reference/.agent/reference-index.md to get the corpus inventory.
Read memory/claim-board.md when the comparison should be anchored to specific project claims.
Read templates/comparison-matrix.md before writing the output matrix.

Core Principles

Tiered depth: read deeply only the top-N closest papers; skim the rest for placement.
The comparison matrix is a project artifact, not a free-text essay — it should be queryable.
Ranking closest work requires a clear criterion: task overlap, method overlap, or claim overlap.
Gaps should be stated specifically: "no paper does X under constraint Y" beats "X is underexplored".
Do not invent paper properties not stated in the source card or the paper itself.

Step 1 — Assemble the Corpus

Read reference/.agent/source-index.md (or reference-index.md) to list available sources.

For each source, record:

source ID and title
card availability: has-card, no-card, partial-card
initial relevance estimate: core, related, background, tangential

If cards are missing for sources that appear highly relevant, route to reference-reading-summarizer first.

Step 2 — Select Tiered Read Depth

Assign read depth to each source:

| Tier | Sources | Read depth | |---|---|---| | Deep | Top 3–5 closest by task + method overlap | Full source card; re-read raw source if card is insufficient | | Standard | Next 5–10 related works | Source card only | | Survey | Remaining background papers | Title + abstract + card summary |

Criterion for "closest": same task, same claim type, overlapping method family, or shared benchmark.

Step 3 — Build the Comparison Matrix

Read templates/comparison-matrix.md.

Dimensions to compare (select those relevant to the project):

Task / problem: what problem does the paper address?
Method family: what is the core mechanism (attention, diffusion, RL, prompting, etc.)?
Key innovation: what is the single thing this paper claims to contribute?
Benchmark / dataset: what is it evaluated on?
Primary metric: what metric is reported?
Best reported result: the headline number (with venue and year for context)
Limitations acknowledged: what does the paper say it cannot do?
Relationship to our work: closer / complementary / orthogonal / superseded by ours

For each tier-1 (deep) paper, also add:

Closest claim to ours: the specific claim that most overlaps with our paper's contribution
Key differentiator: in one sentence, how our work differs from this paper

Save to reference/corpus-analysis-<date>.md.

Step 4 — Rank Closest Work

Produce a ranked list of the top-5 closest papers with:

Rank: 1
Paper: [title] ([venue year])
Overlap: task=high / method=medium / claim=high
Closest claim: [their specific claim that overlaps ours]
Differentiator: [one sentence: how we differ]
Novelty risk: high / medium / low
Reviewer action: cite as closest work / cite as baseline / cite as background

A paper with novelty risk: high and method=high overlap is the paper whose related-work paragraph needs the clearest boundary statement.

Step 5 — Identify Gaps and Trends

Gaps: what combinations of (task, method, constraint, benchmark) are not yet addressed by any paper in the corpus?

Gap: no paper addresses [X] under [constraint Y] with [method family Z]
Evidence: papers A, B, C address X but not under Y; papers D, E address Y but not X
Opportunity: our work fills this gap by [brief description]

Trends (optional, for survey mode):

Which method families are gaining / losing papers over the last 3 years?
Which benchmarks are becoming standard vs. falling out of use?
What claims were controversial 2 years ago and are now accepted?

Step 6 — Write Memory Writeback

Update reference/.agent/source-index.md with read-depth assignments
Update memory/risk-board.md for any high-novelty-risk closest-work findings
Update memory/claim-board.md if the comparison changes the novelty framing of a claim
The comparison matrix itself is saved in reference/ — do not copy it into memory/

Final Sanity Check

Before finishing:

tier-1 papers have been read at full card depth
the comparison matrix has consistent dimensions across all papers
top-5 closest-work ranking has differentiators in one sentence each
gaps are stated specifically (not as vague "future work" language)
novelty-risk papers are flagged for related-work-positioning-writer

a-green-hand-jack/reference-corpus-analyzer

skills/reference-corpus-analyzer/SKILL.md

Produce a multi-paper comparison matrix across a literature corpus with tiered read depth. Use when multiple papers need to be compared side-by-side for method differences, performance gaps, closest-work ranking, or trend identification — distinct from per-paper source cards (reference-reading-summarizer) and single-paper project linking (reference-project-synthesizer).

4 stars

research

Updated May 16, 2026

$ install --global

skillsauth

npx skillsauth add a-green-hand-jack/ml-research-skills reference-corpus-analyzer

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: May 16, 2026, 4:25 AM149.7s2 files scanned

SKILL.md

name:: reference-corpus-analyzer
description:: Produce a multi-paper comparison matrix across a literature corpus with tiered read depth. Use when multiple papers need to be compared side-by-side for method differences, performance gaps, closest-work ranking, or trend identification — distinct from per-paper source cards (reference-reading-summarizer) and single-paper project linking (reference-project-synthesizer).
argument-hint:: [project-dir] [--corpus <path>] [--top-n <N>] [--mode compare|rank|trend|gap]
allowed-tools:: Read, Write, Edit, Bash, Glob, WebSearch, WebFetch

Reference Corpus Analyzer

Synthesize a literature corpus into a structured comparison matrix. This skill answers: across these N papers, who does what, how do they differ, and where is the open space?

Use this skill when:

related-work writing requires a side-by-side method comparison across multiple papers
you want to identify the closest 3–5 papers and understand exactly how they differ from each other
a literature survey needs trend identification across publication years or venues
you want to map open gaps across a set of existing approaches before writing the related-work section
you have 5+ source cards and want a comparison table rather than individual summaries

Pair this skill with:

reference-reading-summarizer upstream: produce source cards before running corpus analysis
reference-project-synthesizer upstream or downstream: link individual papers to project memory before or after comparison
related-work-positioning-writer downstream: use the comparison matrix to write novelty-boundary paragraphs
baseline-selection-audit downstream: use the ranking to identify must-have baselines
literature-review-sprint when the corpus is not yet assembled and a broader topic survey is needed first

Skill Directory Layout

<installed-skill-dir>/
├── SKILL.md
└── templates/
    └── comparison-matrix.md

Progressive Loading

Read reference/cards/ to find available source cards before reading raw sources.
Read reference/.agent/source-index.md or reference/.agent/reference-index.md to get the corpus inventory.
Read memory/claim-board.md when the comparison should be anchored to specific project claims.
Read templates/comparison-matrix.md before writing the output matrix.

Core Principles

Tiered depth: read deeply only the top-N closest papers; skim the rest for placement.
The comparison matrix is a project artifact, not a free-text essay — it should be queryable.
Ranking closest work requires a clear criterion: task overlap, method overlap, or claim overlap.
Gaps should be stated specifically: "no paper does X under constraint Y" beats "X is underexplored".
Do not invent paper properties not stated in the source card or the paper itself.

Step 1 — Assemble the Corpus

Read reference/.agent/source-index.md (or reference-index.md) to list available sources.

For each source, record:

source ID and title
card availability: has-card, no-card, partial-card
initial relevance estimate: core, related, background, tangential

If cards are missing for sources that appear highly relevant, route to reference-reading-summarizer first.

Step 2 — Select Tiered Read Depth

Assign read depth to each source:

Criterion for "closest": same task, same claim type, overlapping method family, or shared benchmark.

Step 3 — Build the Comparison Matrix

Read templates/comparison-matrix.md.

Dimensions to compare (select those relevant to the project):

Task / problem: what problem does the paper address?
Method family: what is the core mechanism (attention, diffusion, RL, prompting, etc.)?
Key innovation: what is the single thing this paper claims to contribute?
Benchmark / dataset: what is it evaluated on?
Primary metric: what metric is reported?
Best reported result: the headline number (with venue and year for context)
Limitations acknowledged: what does the paper say it cannot do?
Relationship to our work: closer / complementary / orthogonal / superseded by ours

For each tier-1 (deep) paper, also add:

Closest claim to ours: the specific claim that most overlaps with our paper's contribution
Key differentiator: in one sentence, how our work differs from this paper

Save to reference/corpus-analysis-<date>.md.

Step 4 — Rank Closest Work

Produce a ranked list of the top-5 closest papers with:

Rank: 1
Paper: [title] ([venue year])
Overlap: task=high / method=medium / claim=high
Closest claim: [their specific claim that overlaps ours]
Differentiator: [one sentence: how we differ]
Novelty risk: high / medium / low
Reviewer action: cite as closest work / cite as baseline / cite as background

A paper with novelty risk: high and method=high overlap is the paper whose related-work paragraph needs the clearest boundary statement.

Step 5 — Identify Gaps and Trends

Gaps: what combinations of (task, method, constraint, benchmark) are not yet addressed by any paper in the corpus?

Gap: no paper addresses [X] under [constraint Y] with [method family Z]
Evidence: papers A, B, C address X but not under Y; papers D, E address Y but not X
Opportunity: our work fills this gap by [brief description]

Trends (optional, for survey mode):

Which method families are gaining / losing papers over the last 3 years?
Which benchmarks are becoming standard vs. falling out of use?
What claims were controversial 2 years ago and are now accepted?

Step 6 — Write Memory Writeback

Update reference/.agent/source-index.md with read-depth assignments
Update memory/risk-board.md for any high-novelty-risk closest-work findings
Update memory/claim-board.md if the comparison changes the novelty framing of a claim
The comparison matrix itself is saved in reference/ — do not copy it into memory/

Final Sanity Check

Before finishing:

tier-1 papers have been read at full card depth
the comparison matrix has consistent dimensions across all papers
top-5 closest-work ranking has differentiators in one sentence each
gaps are stated specifically (not as vague "future work" language)
novelty-risk papers are flagged for related-work-positioning-writer

Related Skills

a-green-hand-jack/ml-research-bootstrap

testing

VerifiedTrustedCommunity

Bootstrap project-local ml-research-skills. Use from global installs when creating a new ML research project, enabling this collection in an existing ML research repo, or deciding whether to install the full bundle locally. Route to project-init for new projects; do not handle paper or experiment work directly.

4SKILL.mdUpdated May 26, 2026

a-green-hand-jack/ml-research-bootstrap

a-green-hand-jack/project-ops-router

development

VerifiedTrustedCommunity

Route project operations tasks — git, memory, bootstrap, remote, workspace, code review, timeline, ops — to the correct skill. Use when the task involves commits, pushes, worktrees, project memory, enabling project-local skills, SSH/server coordination, sidecar runners, or audits. Do not solve the ops task directly.

4SKILL.mdUpdated May 19, 2026

a-green-hand-jack/project-ops-router

a-green-hand-jack/paper-writing-router

testing

VerifiedTrustedCommunity

Route ML/AI paper writing tasks to the correct skill — contract planning, prose drafting, section writing, consistency editing, review simulation, rebuttal, submission, or citation work. Use when the task involves writing, revising, reviewing, or submitting a paper instead of guessing between paper-writing-assistant, paper-writing-contract-planner, paper-reviewer-simulator, auto-paper-improvement-loop, or citation skills. Do not draft prose directly.

4SKILL.mdUpdated May 19, 2026

a-green-hand-jack/paper-writing-router

a-green-hand-jack/ml-research-router

data-ai

VerifiedTrustedCommunity

Project-local router for ML research skill selection. Use inside an initialized ML research project, or while maintaining this skill repo, when the user describes an ML research/paper/experiment/discovery/ops/release workflow and may not know the skill; route to a domain router or high-signal leaf. Do not use for generic non-ML projects.

4SKILL.mdUpdated May 19, 2026

a-green-hand-jack/ml-research-router

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/a-green-hand-jack/ml-research-skills.git

# Copy into Claude Code skills folder (global)
cp -r ml-research-skills/skills/reference-corpus-analyzer ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

a-green-hand-jack/ml-research-skills

4 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT