Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

openclaw/pdf-figure-extractor

Name: pdf-figure-extractor
Author: openclaw

438061781/pdf-figure-extractor/SKILL.md

npx skillsauth add openclaw/skills pdf-figure-extractor

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

PDF Figure提取技能

使用场景

从学术论文PDF提取Figure插入Word文档
需要干净、无caption、无正文的纯图形图片
批量提取多个Figure

标准工作流程

步骤1: 分析PDF结构

import fitz

doc = fitz.open(pdf_path)
page = doc[page_num]

# 获取所有文本块
blocks = page.get_text("blocks")
for block in blocks:
    x0, y0, x1, y1, text, block_no, block_type = block
    if "Fig." in text or "Figure" in text:
        print(f"Figure相关: y={y0:.0f}-{y1:.0f}, {text[:50]}...")

步骤2: 定位Caption位置

# 搜索Fig. X的精确位置
text_instances = page.search_for(f"Fig. {fig_num}")
for inst in text_instances:
    print(f"Fig.{fig_num}位置: y={inst.y0:.0f}-{inst.y1:.0f}")

步骤3: 确定裁剪区域

根据caption位置判断图形区域：

| Caption位置 | 图形区域 | |------------|---------| | y=400 (页面中部) | y=100-395 (caption上方) | | y=666 (页面底部) | y=350-660 (caption上方) | | y=326 (页面底部) | y=100-320 (caption上方) |

步骤4: 精确裁剪

rect = fitz.Rect(50, y_start, page.rect.width - 50, y_end)
pix = page.get_pixmap(matrix=fitz.Matrix(2, 2), clip=rect)
pix.save(f"fig{fig_num}.png")

步骤5: 验证图片质量

检查清单：

[ ] 包含所有子图(a,b,c,d...)
[ ] 没有混入"Fig. X"开头的caption文字
[ ] 没有混入正文段落
[ ] 坐标轴和标签完整

常见PDF布局模板

Nature/Science论文

Fig.1: 通常caption在底部，图形y=350-660
Fig.2+: caption位置不固定，需要先分析

会议论文

单栏布局: caption通常在图形下方
双栏布局: caption可能在图形上方或下方

错误处理

问题: 图片混入正文

原因: 裁剪范围太大解决: 缩小y_end，确保在caption之前结束

问题: 子图缺失

原因: 裁剪范围太小解决: 扩大y_start/y_end，包含完整图形

问题: caption未去除

原因: 裁剪范围包含了caption区域解决: 根据caption的y坐标精确调整裁剪边界

最佳实践

永远不要凭感觉估计坐标
始终先分析PDF文本块结构
高分辨率渲染: 使用matrix=fitz.Matrix(2, 2)
验证每张图片: 确保干净无杂质
记录坐标: 为常见PDF类型建立坐标模板

触发关键词

"提取PDF图片", "从PDF提取Figure", "PDF图片裁剪", "学术论文图片提取"

openclaw/pdf-figure-extractor

438061781/pdf-figure-extractor/SKILL.md

从PDF论文中精确提取Figure图片，自动分析PDF结构、定位caption位置、裁剪干净图形，并验证图片质量。支持学术新闻稿、论文写作等场景的自动化图片处理。

3,729 stars

content-media

Updated Apr 2, 2026

$ install --global

skillsauth

npx skillsauth add openclaw/skills pdf-figure-extractor

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 3, 2026, 7:02 AM19.7s1 file scanned

SKILL.md

PDF Figure提取技能

使用场景

从学术论文PDF提取Figure插入Word文档
需要干净、无caption、无正文的纯图形图片
批量提取多个Figure

标准工作流程

步骤1: 分析PDF结构

import fitz

doc = fitz.open(pdf_path)
page = doc[page_num]

# 获取所有文本块
blocks = page.get_text("blocks")
for block in blocks:
    x0, y0, x1, y1, text, block_no, block_type = block
    if "Fig." in text or "Figure" in text:
        print(f"Figure相关: y={y0:.0f}-{y1:.0f}, {text[:50]}...")

步骤2: 定位Caption位置

# 搜索Fig. X的精确位置
text_instances = page.search_for(f"Fig. {fig_num}")
for inst in text_instances:
    print(f"Fig.{fig_num}位置: y={inst.y0:.0f}-{inst.y1:.0f}")

步骤3: 确定裁剪区域

根据caption位置判断图形区域：

步骤4: 精确裁剪

rect = fitz.Rect(50, y_start, page.rect.width - 50, y_end)
pix = page.get_pixmap(matrix=fitz.Matrix(2, 2), clip=rect)
pix.save(f"fig{fig_num}.png")

步骤5: 验证图片质量

检查清单：

[ ] 包含所有子图(a,b,c,d...)
[ ] 没有混入"Fig. X"开头的caption文字
[ ] 没有混入正文段落
[ ] 坐标轴和标签完整

常见PDF布局模板

Nature/Science论文

Fig.1: 通常caption在底部，图形y=350-660
Fig.2+: caption位置不固定，需要先分析

会议论文

单栏布局: caption通常在图形下方
双栏布局: caption可能在图形上方或下方

错误处理

问题: 图片混入正文

原因: 裁剪范围太大解决: 缩小y_end，确保在caption之前结束

问题: 子图缺失

原因: 裁剪范围太小解决: 扩大y_start/y_end，包含完整图形

问题: caption未去除

原因: 裁剪范围包含了caption区域解决: 根据caption的y坐标精确调整裁剪边界

最佳实践

永远不要凭感觉估计坐标
始终先分析PDF文本块结构
高分辨率渲染: 使用matrix=fitz.Matrix(2, 2)
验证每张图片: 确保干净无杂质
记录坐标: 为常见PDF类型建立坐标模板

触发关键词

"提取PDF图片", "从PDF提取Figure", "PDF图片裁剪", "学术论文图片提取"

Related Skills

openclaw/mcdonalds-skill

tools

VerifiedTrustedCommunity

Use when the user wants to connect to, test, or use the McDonalds service at mcp.mcd.cn, including checking authentication, probing MCP endpoints, listing tools, or calling McDonalds MCP tools through a reusable local CLI.

3,962SKILL.mdUpdated Apr 10, 2026

openclaw/mcdonalds-skill

openclaw/scrapebadger

development

VerifiedTrustedCommunity

Web scraping platform — Twitter/X data, Vinted marketplace, and general web scraping API

3,962SKILL.mdUpdated Apr 10, 2026

openclaw/scrapebadger

openclaw/slowmist-security-cc

development

VerifiedTrustedCommunity

SlowMist AI Agent Security Review — comprehensive security framework for skills, repositories, URLs, on-chain addresses, and products (Claude Code version)

3,962SKILL.mdUpdated Apr 10, 2026

openclaw/slowmist-security-cc

openclaw/humanizer-cn

data-ai

VerifiedTrustedCommunity

去除中文文本中的 AI 写作痕迹，使其读起来自然。基于维基百科 AI 写作特征指南，检测 24 种 AI 模式。触发词：humanizer-cn、去除 AI 痕迹、去除 AI 写作痕迹、中文文本人性化。

3,962SKILL.mdUpdated Apr 10, 2026

openclaw/humanizer-cn

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/openclaw/skills.git

# Copy into Claude Code skills folder (global)
cp -r skills/438061781/pdf-figure-extractor ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

openclaw/skills

3,729 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT