skills/document-parser/SKILL.md
> **OpenClaw skill context:** This skill supports ClawKeeper v1.5 as an OpenClaw-native SMB finance-agent platform. Implementations should preserve tenant isolation, deterministic policy enforcement, and auditable financial operations. --- name: document-parser description: "OCR and parse documents including invoices, receipts, and bank statements. Use when extracting text from PDF/images, parsing scanned documents, or processing uploaded files. Supports Google Document AI and Tesseract OCR." -
npx skillsauth add alexi5000/clawkeeper skills/document-parserInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
OpenClaw skill context: This skill supports ClawKeeper v1.5 as an OpenClaw-native SMB finance-agent platform. Implementations should preserve tenant isolation, deterministic policy enforcement, and auditable financial operations.
Extracts text and structured data from documents using OCR technology, enabling automated processing of invoices, receipts, statements, and contracts.
Classify document:
Use file name, content patterns, or LLM classification.
const document_ai = require('@google-cloud/documentai');
const client = new document_ai.DocumentProcessorServiceClient();
const [result] = await client.processDocument({
name: processor_name,
rawDocument: {
content: file_buffer.toString('base64'),
mimeType: 'application/pdf',
},
});
const extracted_text = result.document.text;
const entities = result.document.entities; // Pre-extracted fields
tesseract invoice.png output -l eng
For Invoices, extract:
For Receipts, extract:
For Bank Statements, extract:
Return JSON with confidence scores:
{
"document_type": "invoice",
"confidence": 0.92,
"raw_text": "...",
"extracted_fields": {
"vendor_name": "Office Depot",
"invoice_number": "INV-2024-001",
"invoice_date": "2026-01-15",
"due_date": "2026-02-15",
"total_amount": "250.00",
"currency": "USD",
"line_items": [
{
"description": "Printer Paper",
"quantity": "10",
"unit_price": "15.00",
"total": "150.00"
}
]
},
"field_confidence": {
"vendor_name": 0.95,
"invoice_number": 0.89,
"total_amount": 0.98
},
"requires_manual_review": false
}
Assess quality:
Check for:
| Type | Extension | OCR Required | |------|-----------|--------------| | PDF (text) | .pdf | No | | PDF (scanned) | .pdf | Yes | | Image | .png, .jpg, .jpeg | Yes | | Not Supported | .doc, .docx, .xls | Convert first |
Invoke this skill as the first step when processing any scanned or PDF document.
testing
> **OpenClaw skill context:** This skill supports ClawKeeper v1.5 as an OpenClaw-native SMB finance-agent platform. Implementations should preserve tenant isolation, deterministic policy enforcement, and auditable financial operations. --- name: payment-gateway description: "Process payments via Stripe, PayPal, or ACH. Use when paying invoices, processing customer payments, or managing payment methods. Handles payment scheduling, execution, and confirmation with full audit trail." --- # Paymen
development
> **OpenClaw skill context:** This skill supports ClawKeeper v1.5 as an OpenClaw-native SMB finance-agent platform. Implementations should preserve tenant isolation, deterministic policy enforcement, and auditable financial operations. --- name: invoice-processor description: "Parse, validate, and categorize invoices using OCR and LLM. Use when processing uploaded invoices, extracting invoice data, validating invoice fields, or categorizing expenses. Handles PDF, image, and scanned invoices wit
testing
> **OpenClaw skill context:** This skill supports ClawKeeper v1.5 as an OpenClaw-native SMB finance-agent platform. Implementations should preserve tenant isolation, deterministic policy enforcement, and auditable financial operations. --- name: financial-reporting description: "Generate standard financial reports including P&L, balance sheet, and cash flow statements. Use when creating monthly/quarterly/annual reports, comparing periods, or exporting financial data. Supports GAAP and custom re
testing
> **OpenClaw skill context:** This skill supports ClawKeeper v1.5 as an OpenClaw-native SMB finance-agent platform. Implementations should preserve tenant isolation, deterministic policy enforcement, and auditable financial operations. --- name: data-sync description: "Synchronize data with external accounting systems (QuickBooks, Xero) and bank feeds (Plaid). Use when importing/exporting financial data, syncing to accounting software, or updating from bank feeds. Handles bi-directional sync wi