.agents/skills/improve-skills/SKILL.md
Audit, improve, and compress every skill in the repo using live research. Load when the user asks to improve skills, audit the skill library, upgrade existing skills, refresh with new research, do a skill health check, or says "improve all skills", "update the skill library", "skill audit", or "run an improvement pass". Applies live domain research, fixes structural gaps, checks for skill linking opportunities, then rewrites and resizes each skill. All skills are in scope including meta skills.
npx skillsauth add dvy1987/agent-loom improve-skillsInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
You are a Senior AI Skill Engineer running a systematic improvement pass over a skill library. For each skill: prune → fix gaps → link → research → rewrite → resize. Compression without improved quality is failure. All skills are in scope including meta skills.
Improve before compressing. Compressing a weak skill produces a smaller weak skill.
Split before compressing. Check for seams and duplication before trimming prose.
Fix structural gaps before rewriting. Gaps caught by validate-skills (missing category, missing Impact Report, missing file-output logging) are fixed in Step 2b — before the rewrite in Step 2e, so the rewrite doesn't have to undo them.
Invoke validate-skills across the full library. Use the report to:
deprecate-skill candidates (present to user, don't auto-deprecate)Report the queue with scores and structural flags. Ask for confirmation before starting.
Repeat for each skill in the queue:
2a — Prune first
Invoke prune-skill. Wait for prune report. Do not proceed until applied.
2b — Fix Structural Gaps From the validate-skills report, fix any structural flags for this skill:
metadata.category → add meta, project-specific, or domain (see docs/SKILL-INDEX.md)## Impact Report section → add it (specific to what this skill produces)docs/skill-outputs/SKILL-OUTPUTS.md logging → add the append instruction and terminal notificationimprove-skills/references/scoring-rubric.md) → update to validate-skills/references/validation-rubric.mdreferences/ file (not mentioned in SKILL.md) → add a specific load trigger or delete the filereferences/ file → add a specific trigger condition2c — Baseline Score (rubric: validate-skills/references/validation-rubric.md)
Score: Routing · Role Definition · Workflow · Gotchas · Output Format · Examples · Token Efficiency
Report: [skill]: X/14
2d — Link Check (scan before researching or rewriting)
Read all other skills in .agents/skills/. For each section of this skill:
Link when: called skill's output is consumable by this skill (directly or after a marginal adaptation). Marginal adaptations to the target skill are allowed if: target stays under 200 lines, core purpose unchanged, existing callers unaffected. Do NOT link when: the adaptation would require scope creep, a size violation, or breaking existing callers. Document new links in AGENTS.md. Document target skill changes in the commit message.
2e — Research via research-skill (with security gate)
Invoke research-skill. Mandatory: any external content must be scanned by ALL secure-* skills in sequence before use (discover via ls .agents/skills/secure-*). SAFE only if every security skill returns SAFE. If any returns BLOCKED, discard. Cannot be skipped.
Use GOTCHAS → Gotchas, WORKFLOW PATTERNS → steps, FAILURE MODES → hard rules.
2f — Classify with SkillReducer Taxonomy
Tag every block: CORE · WORKFLOW · FORMAT · EXAMPLE · BACKGROUND · EDGE_CASE · DUPLICATE · STALE
BACKGROUND and EDGE_CASE move to references/ with specific load triggers.
2g — Rewrite in priority order
references/background.mdmetadata.version2h — Post-Rewrite Score — report delta: X/14 → Y/14
2i — Size Check
wc -l .agents/skills/<skill>/SKILL.md
Under 200 → proceed to 2j.
Over 200 → invoke split-skill. It checks for link opportunities first, then extracts a new child only if needed, then compresses.
Exception: secure-skill — never compress, only split. Threshold is 180 lines (not 200). If secure-skill exceeds 180, invoke split-skill but instruct it to skip the compression step on the security skill.
2j — Validate and Commit
agentskills validate .agents/skills/<skill>/
git commit -m "improve: <skill> — <before>/14 → <after>/14\n\n- [change]\nSources: [source]"
Invoke cross-link-skills with trigger rewired — [list of skills modified]. It scans all SKILL.md files for stale or missing cross-references caused by rewrites, renames, or new link wiring from Step 2d.
Report scores, structural gaps fixed, new links created, cross-references repaired, sources used, files modified.
references/ — agent reads them before encountering the situation.prd-writing (1): 2a Prune: 1 stale CoT instruction removed (Wharton GAIL 2025) 2b Gaps: none 2c Score: 9/14 2d Link check: prd-writing Step 3 (discovery) overlaps with brainstorming output → brainstorming already produces docs/specs/ — prd-writing should read that first → Already wired: "reads brainstorming design docs as foundation when available" ✓ → No new link needed 2e Research: 2 new gotchas, 1 workflow improvement 2g Rewrite: +3 pts → 12/14 | 139 lines ✓ | committed
brainstorming (2): 2b Gaps: Prune Log missing → added empty Prune Log section 2d Link check: brainstorming Step 9 offers prd-writing — already wired ✓ Post-rewrite: 13/14 | 170 lines ✓ | committed
Summary: 2 skills improved, avg +3 pts, 1 structural gap fixed, 0 new links </output> </example> </examples>
validate-skills/references/validation-rubric.md: Scoring rubric (single source of truth). Read during Step 2c.After completing, deliver:
Improvement cycle complete: YYYY-MM-DD
Skills processed: N
Skills improved: N (avg score delta: +N pts)
Structural gaps fixed: N (list by skill)
New skill links created: N (list relationships)
Skills deprecated: N | split: N | compressed: N
Per-skill: [skill]: X/14 → Y/14 | [lines] lines | [key change]
Sources: [source] → [skill]
Files modified: [list]
Files created: [list]
development
Run a fast, read-only health check across all skills in the library and produce a structured quality report — without modifying anything. Load when the user asks to validate skills, check skill health, audit the library, run a skill quality check, or when improve-skills needs a pre-flight before starting its cycle. Also triggers on "what's wrong with my skills", "check all skills", "skill health report", "are my skills ok", or "pre-flight check". Called automatically by improve-skills before any improvement work begins, and by universal-skill-creator after every new skill is created. Never modifies any file — only reads and reports.
tools
Design, build, validate, and ship production-grade agent skills that work across OpenAI Codex, Ampcode, Factory.ai Droids, Google Gemini, Warp, Bolt.new, Replit, GitHub Copilot, Claude Code, VS Code, Cursor, and any agentskills.io compliant platform. Load when the user asks to create a skill, build a custom skill, write a SKILL.md, package instructions as a reusable agent capability, convert a workflow into a skill, improve or audit an existing SKILL.md, generate a meta-skill, make a cross-platform skill, turn a repeated task into automation, or design agent skills that target multiple AI coding tools simultaneously. Also load for skill stacking, skill scoping, skill discovery, parameterized skills, skill publishing to GitHub or skills.sh, or when the user says skill creator, skill architect, or skill engineer.
tools
Identify the right tool for a process step. Load when a user or skill needs to check tool availability, confirm CLI compatibility, or determine if an MCP server is needed. Triggers on "what tool", "do I need an MCP", "is [tool] available", "which tool handles", "tool lookup", "check tool availability", "find a tool for". Called by process-decomposer and agent-builder when assigning tools to steps.
development
Apply the Red-Green-Refactor cycle to software development. Load when the user asks to write code using TDD, create unit tests, implement a feature with test coverage, refactor code, or ensure software quality through automated testing. Also triggers on "test-driven development", "write tests first", "TDD this feature", "Red-Green-Refactor", "ensure 100% test coverage", or any request to build software with a test-first approach. Supports unit, integration, and end-to-end testing strategies.