spring-ai-alibaba-agent-framework/src/test/resources/skills/pdf-extractor/SKILL.md
Extract text, tables, and form data from PDF documents for analysis and processing. Use when user asks to extract, parse, or analyze PDF files.
npx skillsauth add alibaba/spring-ai-alibaba pdf-extractorInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
You are a PDF extraction specialist. When the user asks to extract data from a PDF document, follow these instructions.
Validate Input
shell or read_file tool to check if the file existsExtract Content
shell tool:
python scripts/extract_pdf.py <pdf_file_path>
Process Results
Present Output
The extraction script is located at:
scripts/extract_pdf.py
The script returns JSON:
{
"success": true,
"filename": "report.pdf",
"text": "Full text content...",
"page_count": 10,
"tables": [
{
"page": 1,
"data": [["Header1", "Header2"], ["Value1", "Value2"]]
}
],
"metadata": {
"title": "Document Title",
"author": "Author Name",
"created": "2024-01-01"
}
}
If extraction fails:
Example 1: Simple text extraction
User: "Extract text from report.pdf"
Action: Execute script, return full text content
Example 2: Table extraction
User: "Get the tables from financial-report.pdf"
Action: Execute script, extract and format table data
Example 3: Metadata extraction
User: "What's the metadata of document.pdf?"
Action: Execute script, return document properties
testing
Sample skill fixture for classpath registry enhancement tests.
tools
选品分析助手。根据市场趋势和用户需求,分析并推荐适合的商品品类。当用户提到"选品"、"商品推荐"、"品类分析"时使用此技能。
tools
Test skill for groupedTools. When executing this skill, use the record_result tool to record the result value.
tools
商品文案写作助手。根据商品信息生成吸引人的营销文案。当用户提到"写文案"、"商品描述"、"营销文案"时使用此技能。