Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

aaaaqwq/mineru-extract

Name: mineru-extract
Author: aaaaqwq

skills/.trash/openclaw-search-skills/mineru-extract/SKILL.md

npx skillsauth add aaaaqwq/agi-super-skills mineru-extract

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Error

VirusTotalMulti-engine malware detection

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

MinerU Extract (official API)

Use MinerU as an upstream “content normalizer”: submit a URL to MinerU, poll for completion, download the result zip, and extract the main Markdown.

Quick start (MCP-aligned)

We align to the MinerU MCP mental model, but we do not run an MCP server.

Primary script (MCP-style): scripts/mineru_parse_documents.py
- Input: --file-sources (comma/newline-separated)
- Output: JSON contract on stdout: { ok, items, errors }
Low-level script (single URL): scripts/mineru_extract.py

Auth:

Set MINERU_TOKEN (Bearer token from mineru.net)

Default model heuristic:

URLs ending with .pdf/.doc/.ppt/.png/.jpg → pipeline
Otherwise → MinerU-HTML (best for HTML pages like WeChat articles)

1) Configure token (skill-local)

Put secrets in skill root .env (do not paste into chat outputs):

# In the mineru-extract skill directory: .env
MINERU_TOKEN=your_token_here
MINERU_API_BASE=https://mineru.net

2) Parse URL(s) → Markdown (recommended)

MCP-style wrapper (returns JSON, optionally includes markdown text):

python3 mineru-extract/scripts/mineru_parse_documents.py \
  --file-sources "<URL1>\n<URL2>" \
  --language ch \
  --enable-ocr \
  --model-version MinerU-HTML

If you want the markdown content inline in the JSON (can be large):

python3 mineru-extract/scripts/mineru_parse_documents.py \
  --file-sources "<URL>" \
  --model-version MinerU-HTML \
  --emit-markdown --max-chars 20000

Low-level (single URL, print markdown to stdout):

python3 mineru-extract/scripts/mineru_extract.py "<URL>" --model MinerU-HTML --print > /tmp/out.md

Output

The script always downloads + extracts the MinerU result zip to:

~/.openclaw/workspace/mineru/<task_id>/

It writes:

result.zip
extracted files (Markdown + JSON + assets)

It prints a JSON summary to stderr with paths:

task_id, full_zip_url, out_dir, markdown_path

Parameters (common)

--model: pipeline | vlm | MinerU-HTML (HTML requires MinerU-HTML)
--ocr/--no-ocr: enable OCR (effective for pipeline/vlm)
--table/--no-table: table recognition
--formula/--no-formula: formula recognition
--language ch|en|...
--page-ranges "2,4-6" (non-HTML)
--timeout 600 / --poll-interval 2

Failure modes & fallbacks

MinerU may fail to fetch some URLs (anti-bot / geo / login).
- Fallback: provide an HTML file or a PDF/long screenshot; then implement “upload + parse” flow with MinerU batch upload endpoints.
- Always report the failing URL + MinerU err_msg and keep an original-source link in outputs.

References

MinerU API docs: https://mineru.net/apiManage/docs
MinerU output files: https://opendatalab.github.io/MinerU/reference/output_files/

aaaaqwq/mineru-extract

skills/.trash/openclaw-search-skills/mineru-extract/SKILL.md

Use the official MinerU (mineru.net) parsing API to convert a URL (HTML pages like WeChat articles, or direct PDF/Office/image links) into clean Markdown + structured outputs. Use when web_fetch/browser can’t access or extracts messy content, and you want higher-fidelity parsing (layout/table/formula/OCR).

22 stars

development

Updated Mar 25, 2026

$ install --global

skillsauth

npx skillsauth add aaaaqwq/agi-super-skills mineru-extract

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Error

VirusTotalMulti-engine malware detection

70%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Mar 26, 2026, 4:52 AM46.0s3 files scanned

SKILL.md

name:: mineru-extract
description:: Use the official MinerU (mineru.net) parsing API to convert a URL (HTML pages like WeChat articles, or direct PDF/Office/image links) into clean Markdown + structured outputs. Use when web_fetch/browser can’t access or extracts messy content, and you want higher-fidelity parsing (layout/table/formula/OCR).

MinerU Extract (official API)

Use MinerU as an upstream “content normalizer”: submit a URL to MinerU, poll for completion, download the result zip, and extract the main Markdown.

Quick start (MCP-aligned)

We align to the MinerU MCP mental model, but we do not run an MCP server.

Primary script (MCP-style): scripts/mineru_parse_documents.py
- Input: --file-sources (comma/newline-separated)
- Output: JSON contract on stdout: { ok, items, errors }
Low-level script (single URL): scripts/mineru_extract.py

Auth:

Set MINERU_TOKEN (Bearer token from mineru.net)

Default model heuristic:

URLs ending with .pdf/.doc/.ppt/.png/.jpg → pipeline
Otherwise → MinerU-HTML (best for HTML pages like WeChat articles)

1) Configure token (skill-local)

Put secrets in skill root .env (do not paste into chat outputs):

# In the mineru-extract skill directory: .env
MINERU_TOKEN=your_token_here
MINERU_API_BASE=https://mineru.net

2) Parse URL(s) → Markdown (recommended)

MCP-style wrapper (returns JSON, optionally includes markdown text):

python3 mineru-extract/scripts/mineru_parse_documents.py \
  --file-sources "<URL1>\n<URL2>" \
  --language ch \
  --enable-ocr \
  --model-version MinerU-HTML

If you want the markdown content inline in the JSON (can be large):

python3 mineru-extract/scripts/mineru_parse_documents.py \
  --file-sources "<URL>" \
  --model-version MinerU-HTML \
  --emit-markdown --max-chars 20000

Low-level (single URL, print markdown to stdout):

python3 mineru-extract/scripts/mineru_extract.py "<URL>" --model MinerU-HTML --print > /tmp/out.md

Output

The script always downloads + extracts the MinerU result zip to:

~/.openclaw/workspace/mineru/<task_id>/

It writes:

result.zip
extracted files (Markdown + JSON + assets)

It prints a JSON summary to stderr with paths:

task_id, full_zip_url, out_dir, markdown_path

Parameters (common)

--model: pipeline | vlm | MinerU-HTML (HTML requires MinerU-HTML)
--ocr/--no-ocr: enable OCR (effective for pipeline/vlm)
--table/--no-table: table recognition
--formula/--no-formula: formula recognition
--language ch|en|...
--page-ranges "2,4-6" (non-HTML)
--timeout 600 / --poll-interval 2

Failure modes & fallbacks

MinerU may fail to fetch some URLs (anti-bot / geo / login).
- Fallback: provide an HTML file or a PDF/long screenshot; then implement “upload + parse” flow with MinerU batch upload endpoints.
- Always report the failing URL + MinerU err_msg and keep an original-source link in outputs.

References

MinerU API docs: https://mineru.net/apiManage/docs
MinerU output files: https://opendatalab.github.io/MinerU/reference/output_files/

Related Skills

aaaaqwq/browser-use

testing

VerifiedTrustedCommunity

AI驱动的智能浏览器自动化工具。使用LLM理解页面并自动执行任务，比传统Playwright更智能、更省token。适用于复杂交互、动态页面、需要智能决策的浏览器操作。Chrome浏览器优先。

37SKILL.mdUpdated Mar 25, 2026

aaaaqwq/auth-manager

tools

VerifiedTrustedCommunity

网页登录态管理。使用 fast-browser-use (fbu) 管理各平台登录状态，定期检查可用性，新平台授权时自动保存 profile。

37SKILL.mdUpdated Mar 25, 2026

aaaaqwq/api-quota-monitor

development

VerifiedTrustedCommunity

Monitor and report on API provider quotas, balances, and usage. Query official providers (Moonshot, DeepSeek, xAI, Google AI Studio) and relay/proxy providers (Xingjiabiapi, Aixn, WoW) via their billing APIs. Also checks subscription services (Brave Search, OpenRouter). Generates quota reports. Triggers on "查额度", "API余额", "quota check", "billing report", "api balance", "供应商额度", "中转站余额", "费用报告", "check balance", "how much credit".

37SKILL.mdUpdated Mar 25, 2026

aaaaqwq/api-quota-monitor

aaaaqwq/skills/a-fund-monitor

development

VerifiedTrustedCommunity

# A股基金监控 Skill A股基金净值监控，支持实时估值和盘后净值，自动判断交易日/节假日。 ## 用法 ### 快速监控（命令行） ```bash # 默认配置，输出到控制台 bash ~/clawd/skills/a-fund-monitor/scripts/monitor.sh # 推送到群（使用--push参数） bash ~/clawd/skills/a-fund-monitor/scripts/monitor.sh --push # 监控指定基金 bash ~/clawd/skills/a-fund-monitor/scripts/monitor.sh --codes "000979 002943" ``` ### Agent调用 ``` 执行A股基金监控任务。 1. 读取配置文件： ~/clawd/skills/a-fund-monitor/config.json 2. 获取实时净值数据 3. 非交易日自动切换为简短报告配置文件格式： { "funds": [ {"code": "000979", "name": "景顺长城沪港深精选股票

37SKILL.mdUpdated Mar 25, 2026

aaaaqwq/skills/a-fund-monitor

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/aaaaqwq/agi-super-skills.git

# Copy into Claude Code skills folder (global)
cp -r agi-super-skills/skills/.trash/openclaw-search-skills/mineru-extract ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

aaaaqwq/agi-super-skills

22 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT