skills/document-parser/SKILL.md
# document-parser 高精度文档解析技能,从 PDF、图片、Word 文档中提取结构化数据。 ## 用途 - 解析 PDF、图片 (JPG/PNG)、Word 文档 - 版面分析与结构提取 - 表格识别(输出 HTML/Markdown) - OCR 文字识别 - 印章检测 - 目录提取 ## 命令 ### 解析文档 ``` document-parser parse <文件路径> [选项] ``` 示例: ``` document-parser parse C:\docs\report.pdf document-parser parse C:\docs\scan.jpg --layout --table document-parser parse C:\docs\contract.docx --output markdown ``` ### 查询任务状态 ``` document-parser status <任务 ID> ``` ## 参数说明 | 参数 | 说明 | 示例 | |------|------|------| | 文件路径 | PDF/图
npx skillsauth add leoyeai/openclaw-master-skills skills/document-parserInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
高精度文档解析技能,从 PDF、图片、Word 文档中提取结构化数据。
document-parser parse <文件路径> [选项]
示例:
document-parser parse C:\docs\report.pdf
document-parser parse C:\docs\scan.jpg --layout --table
document-parser parse C:\docs\contract.docx --output markdown
document-parser status <任务 ID>
| 参数 | 说明 | 示例 |
|------|------|------|
| 文件路径 | PDF/图片/Word 文件路径 | C:\docs\report.pdf |
| --layout | 启用版面分析 | --layout |
| --table | 启用表格识别 | --table |
| --seal | 启用印章检测 | --seal |
| --output | 输出格式 (json/markdown/both) | --output markdown |
| --pages | 页码范围 | --pages 1-5,8,10-12 |
DOCUMENT_PARSER_API_KEY=your_api_key
DOCUMENT_PARSER_BASE_URL=http://47.111.146.164:8088/taidp/v1/idp/general_parse
在技能目录创建 config.json:
{
"api_key": "your_api_key",
"base_url": "http://47.111.146.164:8088/taidp/v1/idp/general_parse"
}
返回结构化 JSON 包含:
| 错误码 | 消息 | 说明 | |--------|------|------| | 10000 | Success | 识别成功 | | 10001 | Missing parameter | 参数缺失 | | 10002 | Invalid parameter | 非法参数 | | 10003 | Invalid file | 文件格式非法 | | 10004 | Failed to recognize | 识别失败 | | 10005 | Internal error | 内部错误 |
testing
AI-powered diary generation for agents - creates rich, reflective journal entries (400-600 words) with Quote Hall of Fame, Curiosity Backlog, Decision Archaeology, Relationship Evolution, mood analytics, weekly digests, "On This Day" resurfacing, and cron auto-generation. Works best with Claude models (Haiku, Sonnet, Opus).
development
Multi-agent UX for OpenClaw Control UI — agent selector, per-agent sessions, session history viewer with search, agent-filtered Sessions tab with friendly names, Create Agent wizard, emoji picker, and backend agent CRUD.
tools
Browser automation CLI for AI agents. Use when the user needs to interact with websites, including navigating pages, filling forms, clicking buttons, taking screenshots, extracting data, testing web apps, or automating any browser task. Triggers include requests to "open a website", "fill out a form", "click a button", "take a screenshot", "scrape data from a page", "test this web app", "login to a site", "automate browser actions", or any task requiring programmatic web interaction.
tools
Browser automation CLI for AI agents. Use when the user needs to interact with websites, including navigating pages, filling forms, clicking buttons, taking screenshots, extracting data, testing web apps, or automating any browser task. Triggers include requests to "open a website", "fill out a form", "click a button", "take a screenshot", "scrape data from a page", "test this web app", "login to a site", "automate browser actions", or any task requiring programmatic web interaction.