plugins/v1tamins/skills/v1-canon2skill/SKILL.md
Use when turning a textbook, PDF, blog post, article, paper, course, notes, transcript, or other source material into suggested agent skills or skill improvements. Triggers on "what skills could come from this", "extract skills from", "turn this into skills", "skill ideas from this source".
npx skillsauth add v1-io/v1tamins v1-canon2skillInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Turn large source material into evidence-backed recommendations for new or improved agent skills.
Produce recommendations, not final skill files, unless the user explicitly asks to build them.
Default output:
## Recommended Skills
| Rank | Skill | Placement | Why It Exists | Evidence Coverage | Build Shape |
| --- | --- | --- | --- | --- | --- |
## Best First Skill
[One recommendation with trigger, core workflow, likely files, and validation approach.]
## Coverage Matrix
| Source Area | Covered By | Evidence | Gaps |
| --- | --- | --- | --- |
## Do Not Build
[Useful ideas that are too factual, narrow, generic, private, or not procedural enough.]
## Data Gaps
[Extraction failures, missing figures/tables, OCR uncertainty, inaccessible pages, or source limits.]
Include concise citations to source sections, page numbers, URLs, headings, timestamps, or chunk ids wherever possible.
Identify the source type and preserve provenance before summarizing.
| Input | Ingestion | | --- | --- | | URL or blog post | Fetch the page, preserve URL, title, headings, and publication date if visible. | | Plain text or Markdown | Save or reference the text with heading boundaries intact. | | Born-digital PDF | Extract text with page numbers. Prefer structured Markdown extraction when available. | | Scanned PDF | Run OCR first if local tools are available; otherwise flag OCR as a data gap. | | Slides or images | Extract speaker notes/text where possible and flag visual-only material as a gap unless image understanding is available. |
For PDFs, try available local tools in this order and record what worked:
command -v pymupdf4llm || true
command -v python3 || true
command -v pdftotext || true
command -v ocrmypdf || true
command -v tesseract || true
command -v swift || true
Use deterministic extraction first. Use the model to repair structure, headings, and noisy OCR only after extraction.
Always verify extraction quality before summarizing:
pdftotext -layout -enc UTF-8 input.pdf extracted.txt
wc -w extracted.txt
If the extracted text has near-zero words, mostly page breaks, or obviously missing chapters, treat the PDF as scanned/image-only even if pdfinfo reports normal pages.
OCR fallback order:
ocrmypdf is available, create a searchable copy, then run text extraction again.tesseract plus PDF image rendering tools are available, render pages and OCR them with page markers.swift available, use the bundled Vision OCR helper:
swift "$SKILL_DIR/scripts/ocr-pdf-macos.swift" input.pdf ocr.txt
If the host does not expose SKILL_DIR, resolve scripts/ocr-pdf-macos.swift relative to this skill directory.For OCR output, preserve page markers such as === Page 12 ===. Treat OCR as lower-confidence than embedded text and spot-check early, middle, and late pages before using it for recommendations.
Create a compact inventory before interpreting the material:
For long sources, chunk by semantic boundaries before token size:
ch03-sec02-p45-52.For each chunk, extract a short learning card:
### [chunk id]
- Core concepts:
- Procedures or workflows:
- Decision rules:
- Failure modes:
- Examples or exercises:
- Terms of art:
- Possible agent behavior:
- Evidence:
Do not compress directly into skill ideas yet. First gather the reusable behaviors the source teaches.
Summarize upward in layers:
At each layer, keep:
Keep the rollup lossy enough to fit in context, but never drop provenance ids.
Convert concepts into candidate skills only when they imply reusable agent behavior.
Score each candidate from 1-5:
| Criterion | Question | | --- | --- | | Reusable | Would this help across multiple future tasks? | | Procedural | Does it teach a workflow, decision rule, checklist, or validation loop? | | Non-obvious | Would a general model likely miss or mishandle this without guidance? | | Testable | Can success be evaluated on realistic examples? | | Bounded | Is it narrow enough to fit one skill without becoming a course summary? | | Public-safe | Can it be written without private names, URLs, secrets, customer facts, or project-only context? |
Prefer skill improvements over new skills when the material deepens an existing workflow. Inspect existing skill names and descriptions when working in a skill repository.
For every promising candidate:
This re-read step is mandatory for large sources. Do not trust the first rollup alone.
Recommend a skill only if it passes at least one gate:
Do not recommend a skill for:
When running inside a skill repository:
find . -maxdepth 5 -path '*/SKILL.md' -print
Suggest short gerund-form names:
canon2skillreviewing-documentsdebugging-tracesAvoid vague nouns such as assistant, helper, toolkit, playbook, or knowledge.
Use these checks before finalizing recommendations:
Which recommendations are just summaries rather than reusable agent workflows?
What important source lessons are missing from the proposed skills?
Which recommendations should be improvements to existing skills instead of new skills?
What source areas were weakly extracted or not inspected?
tools
Use when planning or synthesizing user tests for prototypes, mockups, clickable demos, product concepts, design flows, landing pages, or early product specs. Triggers on "test this prototype", "prototype testing", "user test plan", "validate this product idea", "test with users".
development
Use when creating a polished self-contained HTML page would help the user understand, compare, review, present, tune, or interact with information better than Markdown. Triggers on "make an HTML page", "HTML artifact", "nice HTML", "visual report", "interactive explainer", "one-page dashboard", "shareable page".
development
Run an extremely strict maintainability review for abstraction quality, giant files, and spaghetti-condition growth. Use for large prs, new features/architectures, a deep code quality audit, or especially harsh maintainability review.
testing
Commit, push, open, and land a pull request through CI handoff. Use when work is complete and the user wants an agent to create or update a PR, open it as a draft, monitor GitHub checks with `gh pr checks`, fix failed checks, retry up to three remediation pushes, mark the PR ready for review once green, and move a linked Linear ticket to Human Review when one exists. Trigger on requests like 'land this PR', 'open and monitor a PR', 'commit push and watch CI', 'get this ready for review', or 'finish the PR workflow'.