skills/paper-to-skill/SKILL.md
Converts research papers into executable skill packages via document conversion, critical analysis, and co-evolutionary refinement. Triggers on: "convert this paper to a skill", "paper-to-skill", "extract methodology from paper", "make a skill from this paper". NOT for literature review, use research-critique.
npx skillsauth add mathews-tom/armory paper-to-skillInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Transform research papers into production-grade skill packages. The pipeline extracts the actionable methodology from a paper, structures it as a skill specification, and feeds it through co-evolutionary refinement to produce a validated package.
This closes the loop between research and practice: a paper published today can become an executable skill tomorrow, without manual authoring.
| File | Contents | Load When |
| ------------------------------------- | ------------------------------------------------- | --------- |
| references/extraction-patterns.md | Patterns for extracting methodology from papers | Always |
to-markdown skill (for PDF/document conversion)research-critique skill (for paper analysis)test-engineer agent (for co-evolutionary skill generation)Accept the paper in any supported format:
| Input Format | Action |
| --------------------- | ---------------------------------------------------------- |
| arXiv ID (e.g., 2604.01687) | Fetch via https://arxiv.org/abs/<id>, convert PDF |
| arXiv URL | Extract ID, fetch and convert |
| PDF file path | Convert using to-markdown skill |
| URL to paper | Fetch via WebFetch, convert if PDF |
| Pasted text | Use directly |
For PDF conversion, invoke the to-markdown skill:
Convert this PDF to clean markdown, preserving section structure, tables, equations, and algorithm pseudocode. Drop references section but keep inline citations.
Invoke the research-critique skill on the converted paper:
Analyze this paper focusing on:
- Core contribution: what is the novel methodology?
- Algorithm description: extract the step-by-step procedure
- Input/output specification: what goes in, what comes out?
- Key parameters and their valid ranges
- Claimed results and the evidence supporting them
- Failure modes and limitations acknowledged by the authors
- Prerequisites and dependencies (tools, data, compute)
The critique output becomes the foundation for the skill specification.
From the critique output, build a structured skill specification:
specification:
name: <kebab-case derived from paper's methodology name>
domain: <paper's application domain>
source_paper:
title: <paper title>
arxiv_id: <if available>
url: <paper URL>
authors: <first author et al.>
date: <publication date>
capabilities:
- <capability 1 derived from the methodology>
- <capability 2>
- <capability 3>
input_format: <what the skill accepts>
output_format: <what the skill produces>
algorithm_steps:
- step: 1
description: <from paper's algorithm>
parameters: [<key params with ranges>]
- step: 2
description: <next step>
failure_modes:
- <from paper's limitations section>
example_tasks:
- <task 1 the methodology would solve>
- <task 2>
- <task 3>
Extraction rules:
See references/extraction-patterns.md for patterns specific to common paper types.
Hand off the specification to the test-engineer agent for co-evolutionary generation:
Evolve a skill for: [specification.domain]
Capabilities: [specification.capabilities] Algorithm: [specification.algorithm_steps] Input: [specification.input_format] Output: [specification.output_format] Failure modes: [specification.failure_modes] Example tasks: [specification.example_tasks]
Source: [specification.source_paper.title] ([specification.source_paper.url])
The test-engineer runs its full co-evolutionary loop (generate → verify → oracle → refine) using the specification as the task description.
Ensure the generated skill properly attributes the source paper:
source: <paper_url> to the metadata## Attribution
This skill implements the methodology from:
> <paper title>
> <authors>
> <venue/arxiv, date>
> <URL>
references/ directory linking to themmanifest.yamlThe complete skill package at skills/<name>/:
SKILL.md with attribution and paper-derived workflowevals/cases.yaml with assertions generated by the co-evolutionary loopreferences/ with extraction patterns and source materialsevals/evolution-log.yaml from the test-engineer's refinement process| Error | Resolution | | ------------------------------------ | --------------------------------------------------------- | | Paper has no clear algorithm | Extract the methodology from the experiments section | | Paper is purely theoretical | Report: no actionable methodology; suggest literature-review instead | | PDF conversion fails | Try alternative: fetch HTML version or request user paste text | | Paper methodology requires data/compute | Note in skill's prerequisites; skill may be a workflow template only | | test-engineer budget exhausted | Return best-scoring iteration with manual review warning |
testing
Create, review, and restyle data visualizations using Edward Tufte principles: high data-ink ratio, direct labels, range-frame axes, small multiples, accessible color, responsive charts, and honest comparisons. Triggers on: "create a chart", "style this chart", "review this graph", "Tufte chart", "data visualization", "Recharts", "Plotly", "matplotlib", "Chart.js", "ECharts", "D3". Use when generating or critiquing charts, dashboards, sparklines, and data tables.
testing
Manages dependent branch stacks and stacked pull requests using safe Git topology rules. Triggers on: "create stacked PRs", "publish this stack", "sync my PR stack", "rebase this stack", "merge the stack", "retarget child PRs", "split this branch into stacked PRs", "validate this stack", "cleanup stacked branches". Use when local branches or one source branch need to become a dependency-ordered PR stack with correct parent bases, validation, synchronization, merge order, and cleanup.
development
Scaffolds per-repository agent context so coding agents share the same issue tracker rules, triage label vocabulary, domain glossary, ADR layout, and handoff conventions. Triggers on: "set up project context", "configure agent docs", "create CONTEXT.md", "setup agent workflow", "agent issue tracker setup", "triage labels", "domain glossary for agents". Use when a repo needs durable context files before planning, triage, debugging, TDD, architecture review, or multi-agent implementation.
testing
Produces phased task boards from feature requests: dependency-mapped work items, parallelization flags, risk flags, edge cases, test matrices. Triggers on: "decompose this feature", "task breakdown with dependencies", "phased implementation plan", "work breakdown structure". NOT for effort estimates, use estimate-calibrator.