nx/skills/pdf-processing/SKILL.md
Use when PDF documents need to be indexed into nx store for semantic search
npx skillsauth add hellblazer/nexus pdf-processingInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
For a single PDF, run directly without spawning an agent:
# Index a PDF into T3 (docs__ collection)
nx index pdf /path/to/file.pdf --corpus {corpus-name} --monitor
# Index into knowledge__ collection (for reference material)
nx index pdf /path/to/file.pdf --collection knowledge__{name} --monitor
# Dry-run preview (extract and embed locally without storing)
nx index pdf /path/to/file.pdf --corpus {corpus-name} --dry-run
# Force re-index (bypass staleness check)
nx index pdf /path/to/file.pdf --corpus {corpus-name} --force --monitor
# Verify indexing — use search tool: query="representative query", corpus="docs__{corpus-name}", limit=3
Corpus naming: Use author-year-short-title pattern. The --corpus flag auto-prepends docs__ — do NOT include the prefix.
Delegates to the pdf-chromadb-processor agent (haiku) for:
## Relay: pdf-chromadb-processor
**Task**: [what needs to be done]
**Bead**: [ID] or 'none'
### Input Artifacts
- Files: [PDF paths or URLs]
### Deliverable
Indexed PDF content in T3 via `nx index pdf`, with chunk counts and searchability verified
### Quality Criteria
- [ ] All PDFs indexed via `nx index pdf`
- [ ] Content searchable via search tool
- [ ] Processing report with chunk counts
For full relay structure and optional fields, see RELAY_TEMPLATE.md.
nx index pdf (not manual store_put tool)author-year-short-title conventionnx index pdf native pipeline)Outputs generated by the pdf-chromadb-processor agent:
nx index pdf pipeline (Docling extraction, context-safe chunking, Voyage embeddings, atomic storage)Session Scratch (T1): Agent uses scratch tool for ephemeral working notes during the session. Flagged items auto-promote to T2 at session end.
development
Use when critiquing / auditing / reviewing a change set against decision history — tries the review plan library first (catalog lookup → decision-evolution traversal → extract → compare), falls through to /nx:query if nothing matches
documentation
Use when doing design / architecture / planning work that walks from prose (RDRs, docs, knowledge) into the modules implementing a concept
development
Use when surveying the plan library's runtime metrics to propose plans for promotion to a higher scope — advisory-only; dispatches the plan-promote-propose meta-seed (no lifecycle ops — those ship in RDR-079)
business
Use when inspecting plan runtime metrics or enumerating dimension-registry usage — dispatches plan_match with dimensions={verb:plan-inspect}; strategy:default reports per-plan metrics, strategy:dimensions reports registry usage counts