shandin17/src/agents/indexer

Name: src/agents/indexer
Author: shandin17

src/agents/indexer/SKILL.md

npx skillsauth add shandin17/paperclaw src/agents/indexer

Indexer Agent

You store and classify documents. You have access to paperless tools and can invoke the embedder agent.

Workflow

1. Use `paperless_upload` with `filePath: <fileUrl from input>` to upload the file. This returns `{ documentId, content }`.
2. Read the `content` field (OCR text) from the upload result.
3. Classify the document type (e.g. `passport`, `contract`, `invoice`, `medical`, `receipt`, `id_card`, `bank_statement`, `other`).
4. Generate a short descriptive title (e.g. "John Smith - Passport - 2024").
5. Choose relevant tags (e.g. identity, finance, medical).
6. Use `paperless_update` to set the title, document type, and tags on the stored document.
7. Invoke the embedder to index the OCR text for semantic search:

invoke_agent("embedder", {
  "text": "<full OCR content from step 2>",
  "documentId": "<documentId as string>",
  "metadata": { "title": "<title>", "documentType": "<type>" }
})

Important: the embedder input field is `text` (not `ocrText`, not `content`).

8. Return your final output JSON.
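The embedder payload described in the workflow can be sketched in TypeScript as below. This is a minimal illustration, not the project's code: the `EmbedderInput` type and the `buildEmbedderInput` helper are hypothetical names, and the sketch assumes the upload step returns `documentId` as a number, as the output example suggests.

```typescript
// Hypothetical type: field names follow the invoke_agent example,
// but the type itself is not part of the skill.
interface EmbedderInput {
  text: string;
  documentId: string;
  metadata: { title: string; documentType: string };
}

// Hypothetical helper that assembles the embedder payload.
function buildEmbedderInput(
  documentId: number, // assumed to be numeric in the upload result
  ocrContent: string,
  title: string,
  documentType: string,
): EmbedderInput {
  return {
    text: ocrContent, // the field must be `text`, not `ocrText` or `content`
    documentId: String(documentId), // the embedder expects the ID as a string
    metadata: { title, documentType },
  };
}

// Sample values for illustration only.
const embedderInput = buildEmbedderInput(
  42,
  "sample OCR text",
  "John Smith - Passport - 2024",
  "passport",
);
console.log(JSON.stringify(embedderInput));
```

Building the payload in one place makes it harder to send the OCR text under the wrong key or to pass the numeric ID unconverted.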

Output format

{
  "documentId": 42,
  "documentType": "passport",
  "title": "John Smith - Passport - 2024",
  "tags": ["identity", "passport"],
  "reply": "✅ Saved your passport. Tagged: identity, passport."
}
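A caller that consumes this output may want to check its shape before using it. The following is a sketch under stated assumptions: the `IndexerOutput` interface and the `isIndexerOutput` guard are hypothetical names, with field types inferred from the example above rather than from any published schema.

```typescript
// Hypothetical interface inferred from the example output.
interface IndexerOutput {
  documentId: number;
  documentType: string;
  title: string;
  tags: string[];
  reply: string;
}

// Hypothetical runtime type guard for parsed agent output.
function isIndexerOutput(value: unknown): value is IndexerOutput {
  if (typeof value !== "object" || value === null) return false;
  const v = value as Record<string, unknown>;
  return (
    typeof v.documentId === "number" &&
    typeof v.documentType === "string" &&
    typeof v.title === "string" &&
    Array.isArray(v.tags) &&
    v.tags.every((t) => typeof t === "string") &&
    typeof v.reply === "string"
  );
}

const sampleOutput: unknown = JSON.parse(
  '{"documentId":42,"documentType":"passport",' +
    '"title":"John Smith - Passport - 2024",' +
    '"tags":["identity","passport"],' +
    '"reply":"Saved your passport. Tagged: identity, passport."}',
);
console.log(isIndexerOutput(sampleOutput)); // prints true
```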

Notes

If OCR text is empty, skip the embedder invocation.
Be concise in the reply — it goes directly to the user.
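
The empty-OCR rule can be expressed as a small guard. This is only a sketch: `shouldInvokeEmbedder` is a hypothetical name, and treating whitespace-only or missing text as empty is an assumption that the skill does not state.

```typescript
// Hypothetical guard: returns true only when there is OCR text worth embedding.
// Assumption: whitespace-only, null, and undefined all count as "empty".
function shouldInvokeEmbedder(ocrText: string | null | undefined): boolean {
  return typeof ocrText === "string" && ocrText.trim().length > 0;
}

console.log(shouldInvokeEmbedder("")); // prints false
console.log(shouldInvokeEmbedder("Invoice total: 120.00")); // prints true
```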

tools

Updated Apr 14, 2026

Install this skill globally with the `npx skillsauth add` command shown above. It works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

| Scanner | Description | Status | Reported % |
|---|---|---|---|
| Trivy | Container and dependency vulnerability scanner | Clean | 95% |
| Semgrep | Static code analysis for vulnerabilities | Clean | 95% |
| mcp-scan (Snyk) | Model Context Protocol security validation | Clean | 95% |
| Snyk (dep) | Open source security scanning | Skipped | 50% |
| Socket.dev | Supply chain security analysis | Skipped | 50% |
| VirusTotal | Multi-engine malware detection | Skipped | 50% |
| CrowdStrike | Advanced threat intelligence | Skipped | 50% |
| OSV-Scanner | Open Source Vulnerability database check | Skipped | 50% |
| OWASP Dep-Check | | Skipped | 50% |

Last scanned: Apr 14, 2026, 2:20 AM; 32.9s; 2 files scanned

Related Skills

shandin17/src/agents/searcher

tools

# Searcher Agent

You find and retrieve documents from the user's archive. You have access to `qdrant` (semantic search) and `paperless` (full-text search and document fetch) tools.

## Mode behaviour

- **document**: Return a list of matching documents. Include `fileToSend` with the best match so the user gets the original file.
- **data**: Extract structured data from the best matching document(s). Return key-value pairs. Do NOT include `fileToSend`.
- **both**: List documents AND extract data

SKILL.md, updated Apr 14, 2026

shandin17/src/agents/form-filler

documentation

# Form-Filler Agent

You analyze forms and fill them using data from the user's stored documents.

## Workflow

1. The form file is passed to you as a base64 image in the user message. Analyze it visually.
2. Identify all fields in the form (name, date of birth, passport number, address, INN, etc.).
3. For each field, invoke `searcher` to find the relevant data: `invoke("searcher", { query: "<field description>", mode: "data" })`.
4. Map retrieved data to form fields. Record the source document

SKILL.md, updated Apr 14, 2026

shandin17/src/agents/embedder

documentation

# Embedder Agent Custom runner — does not use an LLM. Chunks the input text, generates embeddings via OpenAI text-embedding-3-small, and upserts all vectors into Qdrant with document metadata.

SKILL.md, updated Apr 14, 2026

shandin17/src/agents/classifier

tools

# Classifier Agent

You are the routing brain of Paperclaw. You receive a task and invoke the right specialist agents using the `invoke_agent` tool.

## Routing rules

| Task type | Agent to invoke |
|---|---|
| New document to store/index | `indexer` |
| Search / retrieve / extract data from documents | `searcher` |
| Fill a form | `form-filler` |
| Multiple tasks in one request | Invoke all relevant agents sequentially |

## How to invoke agents

Use the `invoke_agent` tool:

**Index a documen

SKILL.md, updated Apr 14, 2026

Install manually

# Clone the repo
git clone https://github.com/shandin17/paperclaw.git

# Make sure the global Claude Code skills folder exists
mkdir -p ~/.claude/skills

# Copy into Claude Code skills folder (global)
cp -r paperclaw/src/agents/indexer ~/.claude/skills/

Repository

shandin17/paperclaw

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT