Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

mathews-tom/literature-review

Name: literature-review
Author: mathews-tom

skills/literature-review/SKILL.md

npx skillsauth add mathews-tom/armory literature-review

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Literature Review

Systematic discovery, extraction, and synthesis of academic research on a defined topic.

When to use this skill vs. others

| Need | Skill | | ----------------------------------------------------- | ---------------------------------- | | Survey a research area, synthesize multiple papers | literature-review (this skill) | | Critique a single paper's methodology and claims | research-critique | | Audit a manuscript's formatting, structure, citations | manuscript-review | | Verify a manuscript's numbers trace to code | manuscript-provenance | | Search arXiv for papers matching a query | arxiv-search (utility) |

Workflow

Phase 1: Scope Definition

Before searching, establish the review boundaries:

Research question — What specific question does the review answer? Vague topics produce vague reviews. "What techniques exist for X" is weaker than "How do methods for X compare on metric Y across domains Z?"
Inclusion criteria — Define what counts:
- Date range (e.g., 2020–present)
- Publication type (peer-reviewed, preprints, both)
- Domains/categories (e.g., cs.CL, cs.AI)
- Minimum relevance threshold
Exclusion criteria — Define what does not count:
- Tangentially related work
- Non-primary sources (blog posts, tutorials) unless explicitly included
- Duplicate or superseded versions
Expected output — What form should the review take? Narrative synthesis, tabular comparison, gap analysis, annotated bibliography, or related-work section?

Present the scope to the user for confirmation before proceeding.

Phase 2: Search & Discovery

Execute searches across available sources. Use multiple queries with varying specificity to avoid single-query blind spots.

Primary source: arXiv (via arxiv-search utility)

uv run --with arxiv python scripts/arxiv_search.py "QUERY" --max-results 30 --sort-by relevance

Vary queries systematically:

Broad topic query: "retrieval augmented generation"
Field-scoped query: ti:retrieval AND abs:generation AND cat:cs.CL
Author-anchored query: au:lewis AND abs:retrieval (when key authors are known)
Recency query: same terms with --sort-by submitted

Secondary sources (via web search/fetch):

Semantic Scholar API: https://api.semanticscholar.org/graph/v1/paper/search?query=QUERY&limit=20&fields=title,authors,abstract,year,citationCount,externalIds
Google Scholar (via web search): site:scholar.google.com QUERY
Connected Papers (for citation graph exploration): https://www.connectedpapers.com/search?q=QUERY

Snowball strategy:

Forward snowball: find papers that cite a key paper (Semantic Scholar citations endpoint)
Backward snowball: follow the references of key papers
Use citation count as a signal for influence, not quality

Phase 3: Screening & Filtering

For each discovered paper, apply the inclusion/exclusion criteria from Phase 1.

Produce a screening table:

| # | ID | Title | Authors | Year | Relevant? | Reason | | --- | ---------- | ----------- | ------------- | ---- | --------- | ------------------------------- | | 1 | 2301.07041 | Paper Title | Author et al. | 2023 | Yes | Directly addresses RQ | | 2 | 2302.12345 | Other Paper | Author B | 2023 | No | Tangential — focuses on X not Y |

Rules:

Screen on title + abstract first. Read full paper only for borderline cases.
When uncertain, include. It is cheaper to drop a paper later than to miss it.
Track exclusion reasons — they inform the review's limitations section.
Flag papers that appear in multiple search queries as likely high-relevance.

Phase 4: Data Extraction

For each included paper, extract a structured record:

- id: "2301.07041"
  title: "Paper Title"
  authors: ["Author One", "Author Two"]
  year: 2023
  venue: "NeurIPS 2023"
  research_question: "How does X affect Y?"
  methodology: "Controlled experiment with N=1000"
  key_findings:
    - "Finding 1 with quantitative result"
    - "Finding 2 with effect size"
  limitations: "Single-domain evaluation"
  relevance_to_rq: "Directly compares methods A and B on metric Y"
  citation_count: 142

Extraction discipline:

Record what the paper demonstrates, not what it claims to demonstrate.
Distinguish empirical findings (data-backed) from interpretive claims (author's framing).
Note methodology details that enable cross-paper comparison (datasets, metrics, baselines).
If the paper is available via pdf_url, read it for extraction. Do not extract from abstracts alone for included papers.

Phase 5: Synthesis

Transform extracted records into structured analysis. The synthesis method depends on the output format requested in Phase 1.

Thematic synthesis — Group papers by theme, approach, or finding:

Identify recurring themes across papers
Note where papers agree, disagree, or address different aspects
Highlight methodological trends (what approaches are gaining/losing traction)

Comparative synthesis — Build comparison tables:

| Method | Paper(s) | Dataset | Metric | Result | Limitations | | -------- | -------- | ------- | ------ | ------ | --------------- | | Method A | [1], [3] | D1 | F1 | 0.85 | Domain-specific | | Method B | [2], [4] | D1, D2 | F1 | 0.82 | Requires X |

Chronological synthesis — Map the evolution of the field:

What was the state of knowledge at time T?
What shifted and why?
Where is the field heading?

Gap analysis — Identify what is missing:

Questions raised but not answered by existing work
Methodological gaps (no one has tried approach X on problem Y)
Domain gaps (studied in domain A but not B)
Contradictions between studies that remain unresolved

Phase 6: Output

Produce the review document in the format specified in Phase 1.

Standard structure for a narrative review:

Introduction — Research question, scope, and motivation for the review
Search methodology — Databases searched, queries used, inclusion/exclusion criteria, screening results (N found → N screened → N included)
Findings — Thematic or chronological synthesis of included papers
Discussion — Cross-cutting analysis, trends, contradictions, gaps
Limitations of this review — Search scope restrictions, potential biases, papers not accessible
References — Full citation list for all included papers

Standard structure for a tabular review:

Summary table (all included papers with key metadata)
Comparison matrix (methods × metrics × results)
Gap analysis table (questions × coverage)
Reference list

Citation format: Default to Author et al. (Year) in-text with full references at the end. Adapt to the user's specified format (APA, Chicago, IEEE) if requested.

Quality Checks

Before delivering the review, verify:

[ ] Every included paper has a structured extraction record
[ ] Every claim in the synthesis is traceable to at least one extracted finding
[ ] The gap analysis identifies at least one concrete research opportunity
[ ] The review acknowledges its own limitations (search scope, access, biases)
[ ] Citation format is consistent throughout
[ ] No paper is cited that was excluded during screening
[ ] Contradictions between papers are noted, not silently resolved by picking a side

Edge Cases

| Situation | Adaptation | | ----------------------------------------------------- | --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | | Very few papers found (<5) | The field may be nascent. Note this explicitly. Broaden search terms or check if the topic goes by different terminology. Consider adjacent fields. | | Too many papers found (>100) | Tighten inclusion criteria. Consider limiting to top venues, recent years, or high-citation papers. Produce a scoping review rather than exhaustive review. | | User provides a paper list instead of a topic | Skip Phase 2 (Search). Start from Phase 3 (Screening) with the provided list. | | User wants a related-work section for their own paper | Tailor synthesis to position the user's contribution. Organize by approaches the user's work builds on, alternatives it competes with, and gaps it fills. | | No full-text access to key papers | Extract from abstracts and note the limitation. Do not fabricate methodology details. Flag which papers were abstract-only in the extraction records. | | Interdisciplinary topic | Search across multiple category prefixes. Note when different fields use different terminology for the same concept. | | User asks for a "quick" literature review | Reduce Phase 2 to a single search query, Phase 3 to title-only screening, Phase 4 to abstract-only extraction. Label the output as a preliminary survey, not a systematic review. |

mathews-tom/literature-review

skills/literature-review/SKILL.md

Systematic literature review workflow: scope, search (arXiv, Semantic Scholar, Google Scholar), screen, extract, synthesize, and identify gaps. Triggers on: "literature review", "survey the literature", "related work", "systematic review", "synthesize the research", "find papers about", "research gap analysis".

221 stars

development

Updated May 4, 2026

$ install --global

skillsauth

npx skillsauth add mathews-tom/armory literature-review

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: May 4, 2026, 7:05 AM148.9s5 files scanned

SKILL.md

name:: literature-review
description:: Systematic literature review workflow: scope, search (arXiv, Semantic Scholar, Google Scholar), screen, extract, synthesize, and identify gaps. Triggers on: "literature review", "survey the literature", "related work", "systematic review", "synthesize the research", "find papers about", "research gap analysis".
version:: 1.0.1
category:: review
tags:: [academic, literature, synthesis, citations]
difficulty:: intermediate

Literature Review

Systematic discovery, extraction, and synthesis of academic research on a defined topic.

When to use this skill vs. others

Workflow

Phase 1: Scope Definition

Before searching, establish the review boundaries:

Research question — What specific question does the review answer? Vague topics produce vague reviews. "What techniques exist for X" is weaker than "How do methods for X compare on metric Y across domains Z?"
Inclusion criteria — Define what counts:
- Date range (e.g., 2020–present)
- Publication type (peer-reviewed, preprints, both)
- Domains/categories (e.g., cs.CL, cs.AI)
- Minimum relevance threshold
Exclusion criteria — Define what does not count:
- Tangentially related work
- Non-primary sources (blog posts, tutorials) unless explicitly included
- Duplicate or superseded versions
Expected output — What form should the review take? Narrative synthesis, tabular comparison, gap analysis, annotated bibliography, or related-work section?

Present the scope to the user for confirmation before proceeding.

Phase 2: Search & Discovery

Execute searches across available sources. Use multiple queries with varying specificity to avoid single-query blind spots.

Primary source: arXiv (via arxiv-search utility)

uv run --with arxiv python scripts/arxiv_search.py "QUERY" --max-results 30 --sort-by relevance

Vary queries systematically:

Broad topic query: "retrieval augmented generation"
Field-scoped query: ti:retrieval AND abs:generation AND cat:cs.CL
Author-anchored query: au:lewis AND abs:retrieval (when key authors are known)
Recency query: same terms with --sort-by submitted

Secondary sources (via web search/fetch):

Semantic Scholar API: https://api.semanticscholar.org/graph/v1/paper/search?query=QUERY&limit=20&fields=title,authors,abstract,year,citationCount,externalIds
Google Scholar (via web search): site:scholar.google.com QUERY
Connected Papers (for citation graph exploration): https://www.connectedpapers.com/search?q=QUERY

Snowball strategy:

Forward snowball: find papers that cite a key paper (Semantic Scholar citations endpoint)
Backward snowball: follow the references of key papers
Use citation count as a signal for influence, not quality

Phase 3: Screening & Filtering

For each discovered paper, apply the inclusion/exclusion criteria from Phase 1.

Produce a screening table:

Rules:

Screen on title + abstract first. Read full paper only for borderline cases.
When uncertain, include. It is cheaper to drop a paper later than to miss it.
Track exclusion reasons — they inform the review's limitations section.
Flag papers that appear in multiple search queries as likely high-relevance.

Phase 4: Data Extraction

For each included paper, extract a structured record:

- id: "2301.07041"
  title: "Paper Title"
  authors: ["Author One", "Author Two"]
  year: 2023
  venue: "NeurIPS 2023"
  research_question: "How does X affect Y?"
  methodology: "Controlled experiment with N=1000"
  key_findings:
    - "Finding 1 with quantitative result"
    - "Finding 2 with effect size"
  limitations: "Single-domain evaluation"
  relevance_to_rq: "Directly compares methods A and B on metric Y"
  citation_count: 142

Extraction discipline:

Record what the paper demonstrates, not what it claims to demonstrate.
Distinguish empirical findings (data-backed) from interpretive claims (author's framing).
Note methodology details that enable cross-paper comparison (datasets, metrics, baselines).
If the paper is available via pdf_url, read it for extraction. Do not extract from abstracts alone for included papers.

Phase 5: Synthesis

Transform extracted records into structured analysis. The synthesis method depends on the output format requested in Phase 1.

Thematic synthesis — Group papers by theme, approach, or finding:

Identify recurring themes across papers
Note where papers agree, disagree, or address different aspects
Highlight methodological trends (what approaches are gaining/losing traction)

Comparative synthesis — Build comparison tables:

Chronological synthesis — Map the evolution of the field:

What was the state of knowledge at time T?
What shifted and why?
Where is the field heading?

Gap analysis — Identify what is missing:

Questions raised but not answered by existing work
Methodological gaps (no one has tried approach X on problem Y)
Domain gaps (studied in domain A but not B)
Contradictions between studies that remain unresolved

Phase 6: Output

Produce the review document in the format specified in Phase 1.

Standard structure for a narrative review:

Introduction — Research question, scope, and motivation for the review
Search methodology — Databases searched, queries used, inclusion/exclusion criteria, screening results (N found → N screened → N included)
Findings — Thematic or chronological synthesis of included papers
Discussion — Cross-cutting analysis, trends, contradictions, gaps
Limitations of this review — Search scope restrictions, potential biases, papers not accessible
References — Full citation list for all included papers

Standard structure for a tabular review:

Summary table (all included papers with key metadata)
Comparison matrix (methods × metrics × results)
Gap analysis table (questions × coverage)
Reference list

Citation format: Default to Author et al. (Year) in-text with full references at the end. Adapt to the user's specified format (APA, Chicago, IEEE) if requested.

Quality Checks

Before delivering the review, verify:

[ ] Every included paper has a structured extraction record
[ ] Every claim in the synthesis is traceable to at least one extracted finding
[ ] The gap analysis identifies at least one concrete research opportunity
[ ] The review acknowledges its own limitations (search scope, access, biases)
[ ] Citation format is consistent throughout
[ ] No paper is cited that was excluded during screening
[ ] Contradictions between papers are noted, not silently resolved by picking a side

Edge Cases

Related Skills

mathews-tom/chart-clarity

testing

VerifiedTrustedCommunity

Create, review, and restyle data visualizations using Edward Tufte principles: high data-ink ratio, direct labels, range-frame axes, small multiples, accessible color, responsive charts, and honest comparisons. Triggers on: "create a chart", "style this chart", "review this graph", "Tufte chart", "data visualization", "Recharts", "Plotly", "matplotlib", "Chart.js", "ECharts", "D3". Use when generating or critiquing charts, dashboards, sparklines, and data tables.

242SKILL.mdUpdated Jun 6, 2026

mathews-tom/chart-clarity

mathews-tom/stacked-prs

testing

VerifiedTrustedCommunity

Manages dependent branch stacks and stacked pull requests using safe Git topology rules. Triggers on: "create stacked PRs", "publish this stack", "sync my PR stack", "rebase this stack", "merge the stack", "retarget child PRs", "split this branch into stacked PRs", "validate this stack", "cleanup stacked branches". Use when local branches or one source branch need to become a dependency-ordered PR stack with correct parent bases, validation, synchronization, merge order, and cleanup.

242SKILL.mdUpdated May 23, 2026

mathews-tom/stacked-prs

mathews-tom/project-context-setup

development

VerifiedTrustedCommunity

Scaffolds per-repository agent context so coding agents share the same issue tracker rules, triage label vocabulary, domain glossary, ADR layout, and handoff conventions. Triggers on: "set up project context", "configure agent docs", "create CONTEXT.md", "setup agent workflow", "agent issue tracker setup", "triage labels", "domain glossary for agents". Use when a repo needs durable context files before planning, triage, debugging, TDD, architecture review, or multi-agent implementation.

230SKILL.mdUpdated May 12, 2026

mathews-tom/project-context-setup

mathews-tom/task-decomposer

testing

VerifiedTrustedCommunity

Produces phased task boards from feature requests: dependency-mapped work items, parallelization flags, risk flags, edge cases, test matrices. Triggers on: "decompose this feature", "task breakdown with dependencies", "phased implementation plan", "work breakdown structure". NOT for effort estimates, use estimate-calibrator.

230SKILL.mdUpdated Apr 6, 2026

mathews-tom/task-decomposer

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/mathews-tom/armory.git

# Copy into Claude Code skills folder (global)
cp -r armory/skills/literature-review ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

mathews-tom/armory

221 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT