Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

alphaonedev/pdf-2

Name: pdf-2
Author: alphaonedev

skills/community/pdf-2/SKILL.md

npx skillsauth add alphaonedev/openclaw-graph pdf-2

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

pdf-2

Purpose

This skill enables advanced PDF processing, including OCR for text extraction from images, form data extraction, table parsing into structured formats, digital signature verification and addition, PDF merging/splitting, and annotation handling. It's designed for automating document workflows in OpenClaw.

When to Use

Use this skill for tasks involving scanned or complex PDFs, such as extracting data from invoices with tables and forms, verifying legal documents with signatures, or preparing reports by merging files. Apply it when dealing with non-searchable PDFs (e.g., via OCR) or when integration with other tools is needed for document pipelines.

Key Capabilities

OCR: Uses Tesseract engine to extract text; supports languages via --lang flag (e.g., --lang eng for English).
Form Extraction: Parses PDF forms into JSON; extracts fields like text boxes or checkboxes using --fields flag.
Table Parsing: Detects and converts tables to CSV; specify layout with --layout auto or --layout grid.
Digital Signatures: Verifies signatures with --verify flag; adds new ones using --sign with a certificate path.
Merge/Split: Merges multiple PDFs via --inputs flag; splits by page range, e.g., --pages 1-5.
Annotation: Extracts or adds annotations (e.g., highlights) using --extract-annotations or --add-annotation type=highlight text="Note".

Usage Patterns

Invoke via CLI for one-off tasks or API for scripted workflows. Chain commands with pipes, e.g., OCR output to a text processor. For batch processing, use loops in scripts. Always specify input/output paths explicitly. If handling large files, set --timeout 300 for extended operations. Configure defaults in a JSON config file, e.g., {"default_lang": "eng"}.

Common Commands/API

CLI Commands:

OCR: openclaw pdf-2 ocr --input path/to/file.pdf --lang eng --dpi 300 --output extracted.txt
Form Extraction: openclaw pdf-2 extract-form --file form.pdf --fields name,address --output data.json
Table Parsing: openclaw pdf-2 parse-table --input table.pdf --page 2 --output tables.csv
Signature Verification: openclaw pdf-2 verify-signature --file signed.pdf --cert path/to/cert.pem
Merge PDFs: openclaw pdf-2 merge --inputs file1.pdf file2.pdf --output merged.pdf
Split PDF: openclaw pdf-2 split --input original.pdf --pages 1-3 --output split_folder/

API Endpoints:

OCR: POST /api/pdf-2/ocr with JSON body: {"file": "base64encoded_string", "lang": "eng", "dpi": 300}
Form Extraction: POST /api/pdf-2/extract-form with body: {"file": "base64encoded", "fields": ["name", "address"]}
Table Parsing: POST /api/pdf-2/parse-table with body: {"file": "base64encoded", "page": 2}

Code Snippets:

Python API call for OCR:

import requests; import os; import base64
api_key = os.environ['OPENCLAW_API_KEY']
response = requests.post('https://api.openclaw.ai/api/pdf-2/ocr', headers={'Authorization': f'Bearer {api_key}'}, json={'file': base64.b64encode(open('file.pdf', 'rb').read()).decode()})

CLI in a shell script:

export OPENCLAW_API_KEY=your_key
openclaw pdf-2 ocr --input input.pdf --output output.txt

Config Formats: Use JSON for API requests or CLI configs, e.g., {"lang": "eng", "timeout": 60}. Save as .json file and pass with --config path/to/config.json.

Integration Notes

Require authentication via environment variable: set OPENCLAW_API_KEY=your_api_key for API calls. For CLI, ensure OpenClaw CLI is installed (e.g., via npm install openclaw) and authenticated. Integrate with other services by piping outputs, e.g., send OCR results to a NLP tool. Handle file uploads by encoding to base64 in API bodies. For webhooks, use the /api/pdf-2/webhook endpoint with a callback URL in requests.

Error Handling

Check HTTP status codes for API errors (e.g., 400 for bad input, 401 for unauthorized); parse JSON response for details like {"error": "File not found"}. For CLI, errors output to stderr with codes (e.g., exit code 1 for failures). Handle common issues: use try-except for API timeouts, e.g., in Python: try: requests.post(...) except requests.exceptions.Timeout: print("Timeout occurred"). Validate inputs before commands, e.g., check if file exists with os.path.exists(). Retry transient errors with exponential backoff.

Concrete Usage Examples

Extracting and parsing tables from a scanned invoice PDF:
- First, perform OCR: openclaw pdf-2 ocr --input invoice_scanned.pdf --lang eng --output ocr_output.txt
- Then, parse tables: openclaw pdf-2 parse-table --input invoice_scanned.pdf --page 1 --output invoice_tables.csv
- This produces a CSV for further analysis, e.g., import into a database.
Merging PDFs and verifying signatures for a report:
- Merge files: openclaw pdf-2 merge --inputs report_part1.pdf report_part2.pdf --output final_report.pdf
- Verify signature: openclaw pdf-2 verify-signature --file final_report.pdf --cert company_cert.pem
- If valid, proceed; otherwise, handle with error logging.

Graph Relationships

Depends on: ocr-1 (provides core OCR functionality)
Integrates with: document-1 (for general document storage and retrieval)
Conflicts with: none
Related to: pdf-1 (basic PDF handling, as this is an advanced version)
Clusters with: community (shared cluster for collaborative skills)

alphaonedev/pdf-2

skills/community/pdf-2/SKILL.md

Advanced PDF v2: OCR, form extraction, table parsing, digital signatures, merge/split, annotation

2 stars

content-media

Updated Apr 3, 2026

$ install --global

skillsauth

npx skillsauth add alphaonedev/openclaw-graph pdf-2

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 3, 2026, 8:43 PM4.2s1 file scanned

SKILL.md

name:: pdf-2
cluster:: community
description:: Advanced PDF v2: OCR, form extraction, table parsing, digital signatures, merge/split, annotation
tags:: ["pdf","ocr","tables","documents"]
dependencies:: []
composes:: []
similar_to:: []
called_by:: []
authorization_required:: false
scope:: general
model_hint:: claude-sonnet
embedding_hint:: pdf ocr form extract table parse signature merge split annotate

pdf-2

Purpose

When to Use

Key Capabilities

OCR: Uses Tesseract engine to extract text; supports languages via --lang flag (e.g., --lang eng for English).
Form Extraction: Parses PDF forms into JSON; extracts fields like text boxes or checkboxes using --fields flag.
Table Parsing: Detects and converts tables to CSV; specify layout with --layout auto or --layout grid.
Digital Signatures: Verifies signatures with --verify flag; adds new ones using --sign with a certificate path.
Merge/Split: Merges multiple PDFs via --inputs flag; splits by page range, e.g., --pages 1-5.
Annotation: Extracts or adds annotations (e.g., highlights) using --extract-annotations or --add-annotation type=highlight text="Note".

Usage Patterns

Common Commands/API

CLI Commands:

OCR: openclaw pdf-2 ocr --input path/to/file.pdf --lang eng --dpi 300 --output extracted.txt
Form Extraction: openclaw pdf-2 extract-form --file form.pdf --fields name,address --output data.json
Table Parsing: openclaw pdf-2 parse-table --input table.pdf --page 2 --output tables.csv
Signature Verification: openclaw pdf-2 verify-signature --file signed.pdf --cert path/to/cert.pem
Merge PDFs: openclaw pdf-2 merge --inputs file1.pdf file2.pdf --output merged.pdf
Split PDF: openclaw pdf-2 split --input original.pdf --pages 1-3 --output split_folder/

API Endpoints:

OCR: POST /api/pdf-2/ocr with JSON body: {"file": "base64encoded_string", "lang": "eng", "dpi": 300}
Form Extraction: POST /api/pdf-2/extract-form with body: {"file": "base64encoded", "fields": ["name", "address"]}
Table Parsing: POST /api/pdf-2/parse-table with body: {"file": "base64encoded", "page": 2}

Code Snippets:

Python API call for OCR:

import requests; import os; import base64
api_key = os.environ['OPENCLAW_API_KEY']
response = requests.post('https://api.openclaw.ai/api/pdf-2/ocr', headers={'Authorization': f'Bearer {api_key}'}, json={'file': base64.b64encode(open('file.pdf', 'rb').read()).decode()})

CLI in a shell script:

export OPENCLAW_API_KEY=your_key
openclaw pdf-2 ocr --input input.pdf --output output.txt

Config Formats: Use JSON for API requests or CLI configs, e.g., {"lang": "eng", "timeout": 60}. Save as .json file and pass with --config path/to/config.json.

Integration Notes

Error Handling

Concrete Usage Examples

Extracting and parsing tables from a scanned invoice PDF:
- First, perform OCR: openclaw pdf-2 ocr --input invoice_scanned.pdf --lang eng --output ocr_output.txt
- Then, parse tables: openclaw pdf-2 parse-table --input invoice_scanned.pdf --page 1 --output invoice_tables.csv
- This produces a CSV for further analysis, e.g., import into a database.
Merging PDFs and verifying signatures for a report:
- Merge files: openclaw pdf-2 merge --inputs report_part1.pdf report_part2.pdf --output final_report.pdf
- Verify signature: openclaw pdf-2 verify-signature --file final_report.pdf --cert company_cert.pem
- If valid, proceed; otherwise, handle with error logging.

Graph Relationships

Depends on: ocr-1 (provides core OCR functionality)
Integrates with: document-1 (for general document storage and retrieval)
Conflicts with: none
Related to: pdf-1 (basic PDF handling, as this is an advanced version)
Clusters with: community (shared cluster for collaborative skills)

Related Skills

alphaonedev/web

tools

VerifiedTrustedCommunity

Root web development: project structure, tooling selection, deployment decisions

2SKILL.mdUpdated Apr 3, 2026

alphaonedev/web-wasm

development

VerifiedTrustedCommunity

WebAssembly: Rust/Go/C to WASM, wasm-bindgen, Emscripten, WASM Component Model

2SKILL.mdUpdated Apr 3, 2026

alphaonedev/web-vue

development

VerifiedTrustedCommunity

Vue 3: Composition API script setup, Pinia, Vue Router 4, SFCs, Vite, Nuxt 3

2SKILL.mdUpdated Apr 3, 2026

alphaonedev/web-tailwind

tools

VerifiedTrustedCommunity

Tailwind CSS 4: utility classes, config, JIT, arbitrary values, darkMode, plugins, shadcn/ui

2SKILL.mdUpdated Apr 3, 2026

alphaonedev/web-tailwind

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/alphaonedev/openclaw-graph.git

# Copy into Claude Code skills folder (global)
cp -r openclaw-graph/skills/community/pdf-2 ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

alphaonedev/openclaw-graph

2 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT