skills/glmv-doc-based-writing/SKILL.md
Write a textual content based on given document(s) and requirements, using ZhiPu GLM-V multimodal model. Read and comprehend one or multiple documents (PDF/DOCX), write a content in Markdown format according to the specified requirements. Use when the user wants to draft a paper/article/essay/report/review/post/brief/proposal/plan, etc.
npx skillsauth add zai-org/GLM-V glmv-doc-based-writingInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Comprehend the given document(s) and write a textual content (paper/article/essay/report/review/post/brief/proposal/plan) according to your requirements using the ZhiPu GLM-V multimodal model.
| Type | Formats | Max Count | Source | | ---------------- | --------- | ---------------- | ---------- | | Document (URL) | pdf, docx | 50 | URL | | Document (Local) | pdf only | pages ≤ 50 total | Local path |
Local PDF / 本地 PDF: Local PDF files are converted page-by-page into images (base64) before sending to the model.
PyMuPDFis required (pip install PyMuPDF). URL files support full formats including pdf/docx/txt. 本地 PDF 会自动逐页转为图片(base64)传给模型,需要安装PyMuPDF(pip install PyMuPDF)。URL 文件支持 pdf/docx/txt 等全格式。
After running the script, you must display the complete content (Markdown format) exactly as returned. Do not summarize, truncate, translate, comment, or only say "Writing Completed!".
| Resource | Link | | --------------- | --------------------------------------------------------------------------------------------------------------------------------- | | Get API Key | https://bigmodel.cn/usercenter/proj-mgmt/apikeys | | API Docs | Chat Completions / 对话补全 |
This script reads the key from the ZHIPU_API_KEY environment variable and shares it with other Zhipu skills.
脚本通过 ZHIPU_API_KEY 环境变量获取密钥,与其他智谱技能共用同一个 key。
Get Key / 获取 Key: Visit Zhipu Open Platform API Keys / 智谱开放平台 API Keys to create or copy your key.
Setup options / 配置方式(任选一种):
OpenClaw config (recommended) / OpenClaw 配置(推荐): Set in openclaw.json under skills.entries.glmv-doc-based-writing.env:
"glmv-doc-based-writing": { "enabled": true, "env": { "ZHIPU_API_KEY": "你的密钥" } }
Shell environment variable / Shell 环境变量: Add to ~/.zshrc:
export ZHIPU_API_KEY="你的密钥"
💡 If you already configured another Zhipu skill (for example
zhipu-toolsorglmv-caption), they share the sameZHIPU_API_KEY, so no extra setup is needed. 💡 如果你已为其他智谱 skill(如zhipu-tools、glmv-caption)配置过 key,它们共享同一个ZHIPU_API_KEY,无需重复配置。
python scripts/doc_based_writing.py \
--files "https://example.com/doucment1.pdf" "https://example.com/doucment2.docx" \
--requirements "基于这篇论文撰写公众号文章,要求偏技术风格"
python scripts/doc_based_writing.py \
--files "https://example.com/doucment1.pdf" "https://example.com/doucment2.docx" \
--requirements "总结文档主要内容和核心观点" \
--output result.md
python scripts/doc_based_writing.py \
--files "https://example.com/doucment1.pdf" "https://example.com/doucment2.docx" \
--criteria "撰写新闻稿" \
--output result.json --pretty
python scripts/doc_based_writing.py \
--files "https://example.com/doucment1.pdf" \
--criteria "为这本书撰写书评" \
--system-prompt "你是一位拥有20年跨领域写作经验的资深写作专家,擅长撰写书评"
The model outputs a Markdown content like this:
XXX
python scripts/doc_based_writing.py --files FILE [FILE...] --requirements REQUIREMENTS [OPTIONS]
| Parameter | Required | Description |
| ----------------------- | -------- | ---------------------------------------------------------- |
| --files, -f | ✅ | Document file URLs (pdf/docx, URL only, max 50) |
| --requirements, -c | ✅ | Writing requirements text |
| --model, -m | No | Model name (default: glm-4.6v) |
| --system-prompt, -s | No | Custom system prompt (default: professional HR assistant) |
| --temperature, -t | No | Sampling temperature 0-1 (default: 0.6) |
| --max-tokens | No | Max output tokens (default: 10000) |
| --output, -o | No | Save result to file (.md for markdown, .json for JSON) |
| --pretty | No | Pretty-print JSON output |
API key not configured: → Guide user to configure ZHIPU_API_KEY
Authentication failed (401/403): → API key invalid/expired → reconfigure
Rate limit (429): → Quota exhausted → wait and retry
Local path provided: → Error: only URLs supported
Content filtered: → warning field present → content blocked by safety review
Timeout: → Documents too large or too many → reduce file count
tools
Frontend visual replication skill. Explores a target website’s publicly visible pages via Playwright MCP or agent-browser, captures screenshots and layout information, then generates a static or client-side frontend replica that approximates the original’s visual appearance and page structure. This skill replicates FRONTEND PRESENTATION ONLY — it does not reproduce backend logic, server-side behavior, databases, or any non-public content. The user is responsible for ensuring they have proper authorization (ownership, license, or explicit permission) before replicating any website. ⚠️ Authorization gate: Before starting, the agent MUST confirm with the user that they have the legal right to replicate the target site. If the user cannot confirm, the skill MUST refuse to proceed.
tools
股票分析与涨跌预测分析。 在用户表达分析、判断或预测意图时触发,如“分析一下腾讯”、“0700最近走势如何”、“XX能不能买”、“预测一下后续走势”、“生成一份分析报告”等; 对于简单查询类需求(如“腾讯当前价格是多少”、“茅台代码是什么”)不触发本 Skill。 支持港股、A股、美股,整合多源数据(包括新闻、基本面、技术面、资金流及宏观信息)进行多维综合分析,输出图文结合、包含可视化图表的结构化分析报告。 ⚠️ 需要多模态主模型支持(如 glm-5v-turbo),主模型需能读取图片。
documentation
Screen and evaluate resumes against criteria using ZhiPu GLM-V multimodal model. Reads multiple resume files (PDF/DOCX/TXT), compares against user-defined screening criteria, and outputs a Markdown table with pass/fail analysis. Use when the user wants to filter resumes, compare candidates, or batch-evaluate job applications.
tools
Analyze images/videos and generate professional prompts for text-to-image and text-to-video AI tools (Midjourney, Stable Diffusion, DALL-E, Sora, Runway, Kling, Pika). Use when the user wants to generate prompts from reference images/videos, create AI art prompts, or get prompt engineering suggestions from visual content.