skills/skill-creator/SKILL.md
创建、测试和迭代改进技能的开发指南,用于扩展 Claude 的专业知识、工作流程或工具集成。包含完整的 evaluate 体系:创建 skill 后可以跑测试用例、量化评分、迭代优化 description。
npx skillsauth add laborany/laborany 技能创建助手Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Create and iteratively improve skills through evaluation, scoring, and description optimization.
Skills are modular, self-contained packages that extend Claude's capabilities with specialized knowledge, workflows, and tools. They transform Claude from a general-purpose agent into a specialized one equipped with procedural knowledge.
The context window is a public good shared with system prompt, conversation history, other skills' metadata, and the user request.
Default assumption: Claude is already very smart. Only add context Claude doesn't already have. Challenge each piece of information: "Does Claude really need this?" and "Does this paragraph justify its token cost?"
skill-name/
├── SKILL.md (required)
│ ├── YAML frontmatter (name + description required)
│ └── Markdown instructions
└── Bundled Resources (optional)
├── scripts/ — Executable code
├── references/ — Documentation loaded into context as needed
└── assets/ — Files used in output (templates, icons, fonts)
name and description fields. These determine when the skill triggers — be clear and comprehensive.Do NOT create extraneous documentation: README.md, INSTALLATION_GUIDE.md, CHANGELOG.md, etc. The skill should only contain information needed for an AI agent to do the job.
Many skill users are not technical. When communicating:
Skip only when usage patterns are already clearly understood.
Ask targeted questions:
Avoid overwhelming users — start with the most important questions.
Analyze each concrete example:
Run init_skill.py to generate a template skill directory:
scripts/init_skill.py <skill-name> --path <output-directory>
Skip if the skill already exists and only needs iteration.
Remember: the skill is for another Claude instance to use. Include non-obvious procedural knowledge and domain-specific details.
Write name and description:
description is the primary triggering mechanismFor LaborAny skills, also include icon and category.
Write instructions for using the skill and its bundled resources. Keep SKILL.md body under 500 lines. Split into reference files when approaching this limit.
scripts/package_skill.py <path/to/skill-folder>
Validates the skill and creates a distributable .skill file (zip format).
This is the core of the evaluate system. The goal is to quantify skill quality and identify areas for improvement.
Create eval/eval_metadata.json in the skill directory. See references/schemas.md for the schema. Each test case has:
Use scripts/run_eval.py to execute test cases against the skill:
python -m scripts.run_eval <skill-dir> [--test-case <id>] [--all]
Each run invokes claude -p with the skill loaded and captures the output.
The grader agent (agents/grader.md) evaluates each run's output against the assertions. It produces:
Use scripts/aggregate_benchmark.py to collect scores across runs into eval/benchmark.json. The analyzer agent (agents/analyzer.md) can then surface patterns and regressions.
Use eval-viewer/generate_review.py to create an HTML report for visual inspection of results and benchmark trends.
Use scripts/improve_description.py to optimize the skill description based on evaluation results:
python -m scripts.improve_description <skill-dir>
This calls Claude via CLI to analyze eval results and propose a better description. The <new_description> tag in the response is extracted and applied.
For the full eval-improve loop:
python -m scripts.run_loop <skill-dir> [--iterations <n>]
This automates: run evals → grade → aggregate → improve description → repeat.
After evaluation, iterate based on results:
Break complex tasks into clear steps with an overview:
Processing involves these steps:
1. Analyze input (run analyze.py)
2. Transform data (run transform.py)
3. Validate output (run validate.py)
Guide through decision points:
1. Determine the task type:
**Creating new?** → Follow "Creation workflow"
**Editing existing?** → Follow "Editing workflow"
Provide output templates with appropriate strictness level.
Provide input/output pairs when output quality depends on seeing examples.
创建新 skill 时,必须根据功能添加 category 和 icon 字段:
| 关键词 | Category | 推荐 Icon | |--------|----------|-----------| | 文档、Word、PDF、PPT、Excel | 办公 | 📝📄📊📈 | | 股票、金融、投资、财报 | 金融 | 💹📊 | | 论文、学术、研究 | 学术 | 📚🎓 | | 设计、UI、前端、网页 | 设计 | 🎨🖼️ | | 数据、监控、分析 | 数据 | 📈📉 | | 报销、费用、财务 | 财务 | 💰💳 | | 社交、运营、营销 | 运营 | 📱📣 | | 开发、代码、编程 | 开发 | 🛠️💻 | | 其他 | 工具 | 🔧⚙️ |
Frontmatter 示例:
---
name: 技能名称
description: |
技能描述...
icon: 📝
category: 办公
---
When the user asks to install a skill, do not run a free-form manual process. Always follow this deterministic flow:
https://github.com/org/repo/tree/main/skills/agent-browser)org/repo/skills/agent-browser)https://example.com/agent-browser.zip or https://example.com/agent-browser.tar.gz)skills/ manually.icon and category must exist能力管理 -> 我的能力)If install fails, report concrete reason and next action, such as:
SKILL.mdIf source structure is not fully compliant with LaborAny skill format, adapt it automatically:
SKILL.md templatename, description, icon, category are availabledata-ai
AI 视频工厂,用于完整测试和执行 LaborAny 的多模态视频生产链路。 适用于: (1) 用户给一个爆款视频,要求拆解脚本、分镜、动作、配乐、镜头语言并复刻或改写; (2) 用户给一个想法,要求规划完整短视频、生成角色一致的关键帧图片、调用视频生成模型生成分段视频; (3) 用户要求把多个 15s 视频片段剪辑合成为最终成片; (4) 用户明确说“测试完整图片/视频理解和生成流程”“AI剧集”“分镜视频”“爆款视频拆解”。
testing
Inspect Playwright trace files from the command line — list actions, view requests, console, errors, snapshots and screenshots.
tools
Automate browser interactions, test web pages and work with Playwright tests.
development
LaborAny 设计大师——用 HTML 做高保真原型、交互 Demo、幻灯片、动画、设计变体探索 + 设计方向顾问 + 专家评审的一体化设计能力。HTML 是工具不是媒介,根据任务 embody 不同专家(UX 设计师 / 动画师 / 幻灯片设计师 / 原型师),避免 web design tropes。 触发场景:做原型、设计 Demo、交互原型、HTML 演示、动画 Demo、设计变体、hi-fi 设计、UI mockup、prototype、设计探索、做个 HTML 页面、做个可视化、app 原型、iOS 原型、移动应用 mockup、导出 MP4、导出 GIF、60fps 视频、设计风格、设计方向、设计哲学、配色方案、视觉风格、推荐风格、选个风格、做个好看的、评审、好不好看、review this design。 主干能力:Junior Designer 工作流、反 AI slop 清单、React+Babel 最佳实践、Tweaks 变体切换、Speaker Notes 演示、Starter Components、App 原型专属守则、Playwright 验证、HTML 动画 → MP4/GIF 视频导出(25fps 基础 + 60fps 插帧 + palette 优化 GIF + 6 首场景化 BGM + 自动 fade)。 需求模糊时的 Fallback:设计方向顾问模式——从 5 流派 × 20 种设计哲学推荐 3 个差异化方向。 交付后可选:专家级 5 维度评审。