agentic/code/frameworks/research-complete/skills/corpus-snapshot/SKILL.md
Generate a corpus snapshot report — computes dimensions, topology, degree distribution, delta from previous. Helps with cluster, chain, and gap analysis sections.
npx skillsauth add jmagly/aiwg corpus-snapshotInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Generate a point-in-time snapshot of the research corpus with computed metrics and analysis. Reads a snapshot template, fills [COMPUTE] sections with data, assists with [ANALYZE] sections, and writes the completed report.
/corpus-snapshot--compute-only (optional)Only compute data sections — skip analysis sections. Faster, fully automated.
--delta-only (optional)Only compute the delta from the previous snapshot. Useful for tracking session progress.
--template <path> (optional)Custom template path. Default: .aiwg/reports/corpus-snapshot-template.md.
--format (optional)Output format: full (default for the report file), summary (terminal), json (programmatic).
Before generating a snapshot, the following should be current:
| Prerequisite | Command | Gates on |
|-------------|---------|----------|
| Citation edges complete | /citation-backfill | Topology metrics |
| Indices up to date | /corpus-index-build | Group counts, hub analysis |
| Stub rate < 10% | /research-quality-audit | Snapshot validity |
If prerequisites are stale, the snapshot will include warnings.
Scan the corpus and compute:
Dimensions:
Topology (from citation-network index):
Degree Distribution:
Quality Distribution:
Compare current metrics against the most recent snapshot:
Delta from previous snapshot (2026-04-10):
Papers: +12 (360 → 372)
Edges: +87 (1,160 → 1,247)
Density: +0.001 (0.008 → 0.009)
New topics: +2 (gui-agents, code-generation)
Stubs fixed: 23 (88 → 65)
New hubs: REF-364 (entered top 10)
Read the snapshot template and fill sections:
[COMPUTE] sections — fully automated:
[ANALYZE] sections — agent-assisted:
Write the completed snapshot to:
.aiwg/reports/corpus-snapshot-YYYY-MM-DD.md
With frontmatter:
---
type: corpus-snapshot
date: 2026-04-13
papers: 372
edges: 1247
density: 0.009
components: 9
stub_rate: 0.17
previous: corpus-snapshot-2026-04-10.md
---
Corpus Snapshot Generated
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
Papers: 372 (+12) | Edges: 1,247 (+87)
Density: 0.009 | Components: 9
Hub: REF-016 (34) | Isolated: 3
GRADE: 33% High, 24% Mod, 26% Low, 16% VLow
Stubs: 65 (17%) | Full text: 54%
Delta highlights:
+12 papers inducted
+87 citation edges (backfill)
-23 stubs (expanded)
+2 new topics
Written to: .aiwg/reports/corpus-snapshot-2026-04-13.md
The default template uses markers for computed vs analyzed sections:
# Corpus Snapshot — [DATE]
## Dimensions
[COMPUTE: dimensions-table]
## Topology
[COMPUTE: topology-metrics]
## Degree Distribution
[COMPUTE: degree-histogram]
## Quality Distribution
[COMPUTE: grade-distribution]
[COMPUTE: depth-distribution]
## Delta
[COMPUTE: delta-from-previous]
## Cluster Analysis
[ANALYZE: describe main clusters, their themes, and notable papers]
## Citation Chains
[ANALYZE: identify significant citation chains and their meaning]
## Gaps and Opportunities
[ANALYZE: summarize disconnected areas and bridge opportunities]
## Recommendations
[ANALYZE: what should be inducted next, what needs expansion]
| Component | Relationship |
|-----------|-------------|
| corpus-index-build | Reads index metrics (topology, hubs, components) |
| research-quality-audit | Reads depth distribution; gates if stub rate > 10% |
| citation-backfill | Must run before snapshot for accurate topology |
| research-gap-detect | Cluster data feeds into gap narrative |
| research-status | Snapshot is the detailed version of the health score |
# Full snapshot with analysis
/corpus-snapshot
# Just data, no analysis sections
/corpus-snapshot --compute-only
# Delta from previous snapshot only
/corpus-snapshot --delta-only
# Custom template
/corpus-snapshot --template .aiwg/reports/custom-template.md
# JSON metrics for dashboards
/corpus-snapshot --format json
data-ai
Report which research-corpus radar sidecars are overdue for refresh. Computes staleness (days since last refresh vs the cadence window) for every radar, sorted most-overdue-first. Runs via `aiwg corpus radar-status`.
data-ai
Aggregate research-corpus radar sidecars into a corpus or per-cluster freshness report — totals, overdue count, per-cluster / per-GRADE / per-trajectory breakdowns, an overdue table, and per-radar rationale snippets. Runs via `aiwg corpus radar-report`.
testing
Scaffold radar/freshness sidecars for research-corpus REFs. Pulls title/authors from the citation sidecar and GRADE from the analysis doc, defaults the refresh cadence from GRADE and the cluster from a corpus-local map, and stamps documentation/radar/REF-XXX-radar.md. Runs via `aiwg corpus radar-init`.
data-ai
Compute an entity's publication trajectory — per-year paper counts, topic drift, hot-streak detection (≥3 consecutive A-grade years), and career phase. Runs via `aiwg corpus profile-temporal`.