Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

alphaonedev/nano-pdf

Name: nano-pdf
Author: alphaonedev

skills/community/nano-pdf/SKILL.md

npx skillsauth add alphaonedev/openclaw-graph nano-pdf

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

nano-pdf

Purpose

This skill provides tools for PDF processing, including text extraction, mining, form filling, manipulation, and OCR integration, to handle document workflows efficiently.

When to Use

Use this skill for tasks involving PDF data extraction (e.g., from scanned documents), text analysis in reports, automating form submissions, merging/splitting files, or applying OCR to non-text PDFs. Apply it in data pipelines, document automation scripts, or when integrating with OCR services for unstructured data.

Key Capabilities

Text extraction: Pulls plain text or structured data from PDFs, supporting encrypted files with passwords; uses OCR via Tesseract integration for image-based PDFs.
Text mining: Analyzes extracted text for keywords, sentiment, or patterns; e.g., counts occurrences of phrases in a document.
Form filling: Populates interactive PDF forms with JSON data; supports flattening forms to static PDFs.
Manipulation: Merges, splits, rotates, or watermarks PDFs; handles up to 500-page documents efficiently.
OCR integration: Converts scanned PDFs to searchable text using external APIs; requires Tesseract or similar engine configuration.

Usage Patterns

Invoke via CLI for quick scripts or API for server-side integration. For batch processing, chain commands in a shell script; for web apps, use API calls in loops. Always specify input/output paths explicitly. Pattern: Extract text first, then mine or manipulate as needed. For OCR-heavy tasks, preprocess images before PDF operations.

Common Commands/API

CLI commands use nano-pdf binary; API endpoints are under https://api.opencclaw.com/nano-pdf/. Authentication requires $NANO_PDF_API_KEY environment variable.

Extract text: nano-pdf extract --file input.pdf --output text.txt --ocr true (adds OCR if text is not selectable).
Mine text: nano-pdf mine --input text.txt --keywords "AI,robot" --output results.json (outputs keyword frequencies).
Fill form: nano-pdf fill --template form.pdf --data '{"field1": "value"}' --output filled.pdf.
Manipulate PDF: nano-pdf merge --files file1.pdf file2.pdf --output combined.pdf.
API endpoint for extraction: POST /extract with body {"file": "base64encoded_content", "ocr": true} and header Authorization: Bearer $NANO_PDF_API_KEY.

Code snippet (Python):

import requests
response = requests.post('https://api.opencclaw.com/nano-pdf/extract', headers={'Authorization': f'Bearer {os.environ["NANO_PDF_API_KEY"]}'}, json={'file': 'base64data'})
print(response.json()['text'])

Config format: JSON for API bodies, e.g., {"file": "path", "options": {"ocr_engine": "tesseract", "language": "en"}}; CLI uses flag-based configs like --config config.json.

Integration Notes

Integrate by setting $NANO_PDF_API_KEY for authenticated requests; for local use, install via pip install nano-pdf and import as a module. Combine with other tools: pipe CLI output to NLP libraries for mining, or use in Node.js via HTTP requests. For OCR, ensure Tesseract is installed and configured in your environment path. Test integrations in a sandbox to verify API rate limits (e.g., 100 requests/min).

Error Handling

Check for common errors like file not found (exit code 404), invalid API keys (401), or OCR failures (e.g., no Tesseract installed). Use try-except in code:

try:
    result = nano_pdf.extract('input.pdf')
except FileNotFoundError:
    print("Error: File does not exist.")
except Exception as e:
    print(f"API Error: {e} - Check $NANO_PDF_API_KEY.")

For CLI, parse stderr output; retry transient errors (e.g., network issues) with exponential backoff. Always validate inputs, like ensuring PDFs are not corrupted before processing.

Example 1: Extract and Mine Text from a PDF

To extract text from a scanned invoice PDF and mine for product names:

Run: nano-pdf extract --file invoice.pdf --output invoice_text.txt --ocr true
Then: nano-pdf mine --input invoice_text.txt --keywords "product" --output analysis.json This produces a JSON with keyword occurrences for further processing.

Example 2: Fill and Manipulate a Form PDF

To fill a job application form and merge it with a cover letter:

Prepare data in JSON: {"name": "John Doe", "position": "Engineer"}
Execute: nano-pdf fill --template application.pdf --data application_data.json --output filled_app.pdf
Merge: nano-pdf merge --files filled_app.pdf cover_letter.pdf --output final_packet.pdf Output is a single PDF ready for submission.

Graph Relationships

Related to: "ocr-tool" (for enhanced OCR capabilities), "document-parser" (for broader file type support), "text-analyzer" (for advanced mining integrations).
Clusters: Connected via "community" cluster to skills like "data-extraction" and "automation-utils".

alphaonedev/nano-pdf

skills/community/nano-pdf/SKILL.md

PDF processing: extraction, text mining, form filling, manipulation, OCR integration

2 stars

content-media

Updated Apr 3, 2026

$ install --global

skillsauth

npx skillsauth add alphaonedev/openclaw-graph nano-pdf

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 20, 2026, 3:21 PM1.8s1 file scanned

SKILL.md

name:: nano-pdf
cluster:: community
description:: PDF processing: extraction, text mining, form filling, manipulation, OCR integration
tags:: ["pdf","document","extraction","ocr"]
dependencies:: []
composes:: []
similar_to:: []
called_by:: []
authorization_required:: false
scope:: general
model_hint:: claude-sonnet
embedding_hint:: pdf extract text mine form fill manipulate ocr

nano-pdf

Purpose

This skill provides tools for PDF processing, including text extraction, mining, form filling, manipulation, and OCR integration, to handle document workflows efficiently.

When to Use

Key Capabilities

Text extraction: Pulls plain text or structured data from PDFs, supporting encrypted files with passwords; uses OCR via Tesseract integration for image-based PDFs.
Text mining: Analyzes extracted text for keywords, sentiment, or patterns; e.g., counts occurrences of phrases in a document.
Form filling: Populates interactive PDF forms with JSON data; supports flattening forms to static PDFs.
Manipulation: Merges, splits, rotates, or watermarks PDFs; handles up to 500-page documents efficiently.
OCR integration: Converts scanned PDFs to searchable text using external APIs; requires Tesseract or similar engine configuration.

Usage Patterns

Common Commands/API

CLI commands use nano-pdf binary; API endpoints are under https://api.opencclaw.com/nano-pdf/. Authentication requires $NANO_PDF_API_KEY environment variable.

Extract text: nano-pdf extract --file input.pdf --output text.txt --ocr true (adds OCR if text is not selectable).
Mine text: nano-pdf mine --input text.txt --keywords "AI,robot" --output results.json (outputs keyword frequencies).
Fill form: nano-pdf fill --template form.pdf --data '{"field1": "value"}' --output filled.pdf.
Manipulate PDF: nano-pdf merge --files file1.pdf file2.pdf --output combined.pdf.
API endpoint for extraction: POST /extract with body {"file": "base64encoded_content", "ocr": true} and header Authorization: Bearer $NANO_PDF_API_KEY.

Code snippet (Python):

import requests
response = requests.post('https://api.opencclaw.com/nano-pdf/extract', headers={'Authorization': f'Bearer {os.environ["NANO_PDF_API_KEY"]}'}, json={'file': 'base64data'})
print(response.json()['text'])

Config format: JSON for API bodies, e.g., {"file": "path", "options": {"ocr_engine": "tesseract", "language": "en"}}; CLI uses flag-based configs like --config config.json.

Integration Notes

Error Handling

Check for common errors like file not found (exit code 404), invalid API keys (401), or OCR failures (e.g., no Tesseract installed). Use try-except in code:

try:
    result = nano_pdf.extract('input.pdf')
except FileNotFoundError:
    print("Error: File does not exist.")
except Exception as e:
    print(f"API Error: {e} - Check $NANO_PDF_API_KEY.")

For CLI, parse stderr output; retry transient errors (e.g., network issues) with exponential backoff. Always validate inputs, like ensuring PDFs are not corrupted before processing.

Example 1: Extract and Mine Text from a PDF

To extract text from a scanned invoice PDF and mine for product names:

Run: nano-pdf extract --file invoice.pdf --output invoice_text.txt --ocr true
Then: nano-pdf mine --input invoice_text.txt --keywords "product" --output analysis.json This produces a JSON with keyword occurrences for further processing.

Example 2: Fill and Manipulate a Form PDF

To fill a job application form and merge it with a cover letter:

Prepare data in JSON: {"name": "John Doe", "position": "Engineer"}
Execute: nano-pdf fill --template application.pdf --data application_data.json --output filled_app.pdf
Merge: nano-pdf merge --files filled_app.pdf cover_letter.pdf --output final_packet.pdf Output is a single PDF ready for submission.

Graph Relationships

Related to: "ocr-tool" (for enhanced OCR capabilities), "document-parser" (for broader file type support), "text-analyzer" (for advanced mining integrations).
Clusters: Connected via "community" cluster to skills like "data-extraction" and "automation-utils".

Related Skills

alphaonedev/web

tools

VerifiedTrustedCommunity

Root web development: project structure, tooling selection, deployment decisions

2SKILL.mdUpdated Apr 3, 2026

alphaonedev/web-wasm

development

VerifiedTrustedCommunity

WebAssembly: Rust/Go/C to WASM, wasm-bindgen, Emscripten, WASM Component Model

2SKILL.mdUpdated Apr 3, 2026

alphaonedev/web-vue

development

VerifiedTrustedCommunity

Vue 3: Composition API script setup, Pinia, Vue Router 4, SFCs, Vite, Nuxt 3

2SKILL.mdUpdated Apr 3, 2026

alphaonedev/web-tailwind

tools

VerifiedTrustedCommunity

Tailwind CSS 4: utility classes, config, JIT, arbitrary values, darkMode, plugins, shadcn/ui

2SKILL.mdUpdated Apr 3, 2026

alphaonedev/web-tailwind

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/alphaonedev/openclaw-graph.git

# Copy into Claude Code skills folder (global)
cp -r openclaw-graph/skills/community/nano-pdf ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

alphaonedev/openclaw-graph

2 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT