Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

sanurb/skill-optimizer

Name: skill-optimizer
Author: sanurb

skill-optimizer/SKILL.md

npx skillsauth add sanurb/skills skill-optimizer

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Skill Optimizer

Restrict what the agent can do so what it does, it does well. This skill tightens the harness on any AI skill — measure, diagnose, fix.

Target Skill

Identify the skill directory to optimize. If not obvious from context, ask the user which skill to optimize before proceeding.

Quick Audit

Run the structural validator against the target skill before the full loop:

bash scripts/validate.sh <skill-path>

If validation fails, fix structural issues first. Do not start the benchmark loop on a broken skill.

Workflow

Copy this checklist and track progress:

Optimization Progress:
- [ ] Step 0: Build eval scenarios (skip if evals exist)
- [ ] Step 1: Measure baseline and skill-on scores
- [ ] Step 2: Diagnose failure pattern
- [ ] Step 3: Apply fix from matched reference
- [ ] Step 4: Re-measure — confirm improvement, no regressions
- [ ] Step 5: Pass release gates before shipping

Step 0: Build evals first (low freedom — follow exactly)

Before touching skill text, create ≥3 eval scenarios:

One per core capability the skill claims
One stressing omission-prone sections (footers, checklists)
One with noisy context (long conversation, irrelevant files loaded)

Run evals WITHOUT the skill. Record baseline scores. This is the ground truth.

Step 1: Measure (low freedom)

Read references/benchmark-loop.md. Produce the benchmark matrix. Do not skip this.

Step 2: Diagnose (use decision tree)

What does the data show?
├─ Skill is ignored by models         → read references/activation-design.md
├─ Scores dropped after skill change  → read references/regression-triage.md
├─ Skill text too large / context rot → read references/context-budget.md
└─ All clear, ready to ship           → read references/release-gates.md

Step 3: Apply fix (follow the matched reference — each is atomic, one job)

Step 4: Re-measure (low freedom — feedback loop)

Re-run the SAME eval scenarios from Step 1:

If improvement AND no regressions → proceed to Step 5
If regression on ANY scenario → read references/regression-triage.md, fix, re-measure
Max 3 iterations before escalating

Step 5: Release gate (low freedom)

Read references/release-gates.md. Ship only if all MUST-PASS gates clear.

Non-Negotiable Acceptance Criteria

Deliver nothing if any criterion fails.

Evals exist — ≥3 scenarios with baseline scores before any edit
Baseline recorded — with-skill vs without-skill scores for ≥2 models/agents
No regressions — zero negative deltas on any scenario after the edit
Imperative wording only — zero instances of "consider", "you may want", "optionally"
Integrated example present — ≥1 realistic before/after example per core capability
Output format defined — every procedure produces a documented artifact
Description front-loaded — key use case in first 250 chars; includes negative triggers

Output

Every optimization produces exactly this artifact. Copy the template from assets/report-template.md and fill it in.

In This Reference

| File | One Job | |------|---------| | benchmark-loop.md | Produce the measurement matrix | | activation-design.md | Fix skill text so models follow it | | context-budget.md | Shrink token cost without losing behavior | | regression-triage.md | Isolate and eliminate negative deltas | | release-gates.md | Go/no-go checklist before shipping |

sanurb/skill-optimizer

skill-optimizer/SKILL.md

Measure and fix AI skill activation, clarity, and cross-agent reliability. Use when skill uptake is weak, scores regress, or context is bloated. Not for writing new skills from scratch or general prompt engineering.

data-ai

Updated Apr 26, 2026

$ install --global

skillsauth

npx skillsauth add sanurb/skills skill-optimizer

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 26, 2026, 9:13 AM44.0s8 files scanned

SKILL.md

name:: skill-optimizer
description:: Measure and fix AI skill activation, clarity, and cross-agent reliability. Use when skill uptake is weak, scores regress, or context is bloated. Not for writing new skills from scratch or general prompt engineering.

Skill Optimizer

Restrict what the agent can do so what it does, it does well. This skill tightens the harness on any AI skill — measure, diagnose, fix.

Target Skill

Identify the skill directory to optimize. If not obvious from context, ask the user which skill to optimize before proceeding.

Quick Audit

Run the structural validator against the target skill before the full loop:

bash scripts/validate.sh <skill-path>

If validation fails, fix structural issues first. Do not start the benchmark loop on a broken skill.

Workflow

Copy this checklist and track progress:

Optimization Progress:
- [ ] Step 0: Build eval scenarios (skip if evals exist)
- [ ] Step 1: Measure baseline and skill-on scores
- [ ] Step 2: Diagnose failure pattern
- [ ] Step 3: Apply fix from matched reference
- [ ] Step 4: Re-measure — confirm improvement, no regressions
- [ ] Step 5: Pass release gates before shipping

Step 0: Build evals first (low freedom — follow exactly)

Before touching skill text, create ≥3 eval scenarios:

One per core capability the skill claims
One stressing omission-prone sections (footers, checklists)
One with noisy context (long conversation, irrelevant files loaded)

Run evals WITHOUT the skill. Record baseline scores. This is the ground truth.

Step 1: Measure (low freedom)

Read references/benchmark-loop.md. Produce the benchmark matrix. Do not skip this.

Step 2: Diagnose (use decision tree)

What does the data show?
├─ Skill is ignored by models         → read references/activation-design.md
├─ Scores dropped after skill change  → read references/regression-triage.md
├─ Skill text too large / context rot → read references/context-budget.md
└─ All clear, ready to ship           → read references/release-gates.md

Step 3: Apply fix (follow the matched reference — each is atomic, one job)

Step 4: Re-measure (low freedom — feedback loop)

Re-run the SAME eval scenarios from Step 1:

If improvement AND no regressions → proceed to Step 5
If regression on ANY scenario → read references/regression-triage.md, fix, re-measure
Max 3 iterations before escalating

Step 5: Release gate (low freedom)

Read references/release-gates.md. Ship only if all MUST-PASS gates clear.

Non-Negotiable Acceptance Criteria

Deliver nothing if any criterion fails.

Evals exist — ≥3 scenarios with baseline scores before any edit
Baseline recorded — with-skill vs without-skill scores for ≥2 models/agents
No regressions — zero negative deltas on any scenario after the edit
Imperative wording only — zero instances of "consider", "you may want", "optionally"
Integrated example present — ≥1 realistic before/after example per core capability
Output format defined — every procedure produces a documented artifact
Description front-loaded — key use case in first 250 chars; includes negative triggers

Output

Every optimization produces exactly this artifact. Copy the template from assets/report-template.md and fill it in.

In This Reference

Related Skills

sanurb/setup-sanurb-skills

development

VerifiedTrustedCommunity

Sets up an `## Agent skills` block in AGENTS.md/CLAUDE.md and `docs/agents/` so the engineering skills know this repo's issue tracker (GitHub, GitLab, fp, or local markdown), triage label vocabulary, and domain doc layout. Run before first use of `fp-plan`, `fp-implement`, `fp-review`, `to-issues`, `to-prd`, `triage`, `diagnose`, `tdd`, `improve-codebase-architecture`, or `zoom-out` — or if those skills appear to be missing context about the issue tracker, triage labels, or domain docs.

SKILL.mdUpdated Jun 4, 2026

sanurb/setup-sanurb-skills

sanurb/prototype

development

VerifiedTrustedCommunity

Build a throwaway prototype to flush out a design before committing to it. Routes between two branches — a runnable terminal app for state/business-logic questions, or several radically different UI variations toggleable from one route. Use when the user wants to prototype, sanity-check a data model or state machine, mock up a UI, explore design options, or says "prototype this", "let me play with it", "try a few designs".

SKILL.mdUpdated Jun 4, 2026

sanurb/herdr

tools

VerifiedTrustedCommunity

Control herdr (a terminal-native agent multiplexer) from inside it. Manage workspaces and tabs, split panes, spawn sibling agents, read pane output, and wait for state changes — all via CLI commands that talk to the running herdr instance over a local unix socket. Use when running inside herdr (HERDR_ENV=1). Do not use outside herdr.

SKILL.mdUpdated Jun 4, 2026

sanurb/handoff

documentation

VerifiedTrustedCommunity

Compact the current conversation into a handoff document for another agent to pick up.

SKILL.mdUpdated Jun 4, 2026

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/sanurb/skills.git

# Copy into Claude Code skills folder (global)
cp -r skills/skill-optimizer ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

sanurb/skills

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT