Adoption

Agent Skills are supported by leading AI development tools.

VS Code, Gemini CLI, GitHub, Goose, Amp, Cursor, Claude Code, Letta, OpenCode, Claude, OpenAI Codex, Factory

laurigates/evaluate-plugin-batch

Name: evaluate-plugin-batch
Author: laurigates

evaluate-plugin/skills/evaluate-plugin-batch/SKILL.md

Batch evaluate every skill in a plugin and produce a plugin-level report. Use when auditing an entire plugin's quality or validating before a release.

46 stars

tools

Updated Jul 19, 2026


npx skillsauth add laurigates/claude-plugins evaluate-plugin-batch

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

| Scanner | Description | Status | Percentage |
|---------|-------------|--------|------------|
| Trivy | Container and dependency vulnerability scanner | Clean | 95% |
| Semgrep | Static code analysis for vulnerabilities | Clean | 95% |
| mcp-scan (Snyk) | Model Context Protocol security validation | Clean | 95% |
| Snyk (dep) | Open source security scanning | Skipped | 50% |
| Socket.dev | Supply chain security analysis | Skipped | 50% |
| VirusTotal | Multi-engine malware detection | Skipped | 50% |
| CrowdStrike | Advanced threat intelligence | Skipped | 50% |
| OSV-Scanner | Open Source Vulnerability database check | Skipped | 50% |
| OWASP Dep-Check | | Skipped | 50% |

Last scanned: Jul 19, 2026, 7:01 AM · 115.6s · 1 file scanned

SKILL.md

name: evaluate-plugin-batch
description: Batch evaluate every skill in a plugin and produce a plugin-level report. Use when auditing an entire plugin's quality or validating before a release.
args: <plugin-name> [--create-missing-evals] [--parallel N]
allowed-tools: Task, Read, Write, Glob, Grep, Bash(bash *), SlashCommand, TodoWrite
argument-hint: git-plugin [--create-missing-evals]
agent: general-purpose
created: 2026-03-04
modified: 2026-07-18
compatibility: claude-code
reviewed: 2026-03-04

/evaluate:plugin

Batch evaluate all skills in a plugin. Runs /evaluate:skill for each skill, then produces a plugin-level quality report.

When to Use This Skill

| Use this skill when... | Use alternative when... |
|------------------------|------------------------|
| Auditing all skills in a plugin before release | Evaluating a single skill -> /evaluate:skill |
| Establishing quality baselines across a plugin | Viewing past results -> /evaluate:report |
| Checking overall plugin quality after refactoring | Need structural compliance -> plugin-compliance-check.sh |

Context

Available plugins: !find . -maxdepth 2 -type d -name '*-plugin' -not -name '.claude-plugin'

Parameters

Parse these from $ARGUMENTS:

| Parameter | Default | Description |
|-----------|---------|-------------|
| <plugin-name> | required | Name of the plugin to evaluate |
| --create-missing-evals | false | Generate evals for skills that lack them |
| --parallel N | 1 | Max concurrent skill evaluations |
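The parameter table can be read as a small command-line grammar. The following is a minimal sketch of that parsing in Python with `argparse`, not how the skill itself processes $ARGUMENTS; the function name `parse_args` and the rejection of `--parallel` values below 1 are assumptions added for illustration.

```python
import argparse


def parse_args(argv):
    """Parse a plugin name plus the two optional flags from the table."""
    parser = argparse.ArgumentParser(prog="/evaluate:plugin")
    parser.add_argument("plugin_name", help="Name of the plugin to evaluate")
    parser.add_argument("--create-missing-evals", action="store_true",
                        help="Generate evals for skills that lack them")
    parser.add_argument("--parallel", type=int, default=1, metavar="N",
                        help="Max concurrent skill evaluations")
    args = parser.parse_args(argv)
    # Assumption: values below 1 are treated as an error (not stated in the source).
    if args.parallel < 1:
        parser.error("--parallel must be at least 1")
    return args


args = parse_args("git-plugin --create-missing-evals --parallel 3".split())
print(args.plugin_name, args.create_missing_evals, args.parallel)
```

With only the plugin name supplied, the defaults from the table apply: `create_missing_evals` is `False` and `parallel` is `1`.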

Execution

Step 1: Discover skills

Find all skills in the plugin:

<plugin-name>/skills/*/SKILL.md

List them and count the total.
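As an illustration of the discovery step, the glob pattern above could be applied like this. This is a sketch only; `discover_skills` is a hypothetical helper, and the demo builds a throwaway directory tree with made-up skill names.

```python
import tempfile
from pathlib import Path


def discover_skills(plugin_dir):
    """Return the directories under <plugin_dir>/skills that contain a SKILL.md."""
    return sorted(p.parent for p in Path(plugin_dir).glob("skills/*/SKILL.md"))


# Demo against a temporary tree; "commit" and "branch" are made-up skill names.
with tempfile.TemporaryDirectory() as tmp:
    for name in ("commit", "branch"):
        skill_dir = Path(tmp) / "skills" / name
        skill_dir.mkdir(parents=True)
        (skill_dir / "SKILL.md").write_text("placeholder")
    skills = discover_skills(tmp)
    print([s.name for s in skills], "total:", len(skills))
```

Directories under `skills/` without a SKILL.md are ignored, which matches the pattern as written.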

Step 2: Filter and prepare

For each skill, check if evals.json exists:

Has evals: include in evaluation
No evals + --create-missing-evals: include, will create evals during evaluation
No evals, no flag: skip with a note

Report the breakdown:

Found N skills in <plugin-name>:
  - M with eval cases
  - K without eval cases (skipped | will create)
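The three filtering rules and the breakdown message can be sketched as follows. The helper names are hypothetical, and the location of evals.json (directly inside each skill directory) is an assumption, since the source does not say where the file lives.

```python
import tempfile
from pathlib import Path


def split_by_evals(skill_dirs):
    """Partition skill directories by whether evals.json is present."""
    with_evals, without_evals = [], []
    for skill in map(Path, skill_dirs):
        # Assumption: evals.json sits directly inside the skill directory.
        target = with_evals if (skill / "evals.json").is_file() else without_evals
        target.append(skill)
    return with_evals, without_evals


def select(with_evals, without_evals, create_missing):
    """Skills lacking evals are included only when the flag is set."""
    return with_evals + (without_evals if create_missing else [])


def breakdown(plugin_name, with_evals, without_evals, create_missing):
    """Format the breakdown message shown above."""
    action = "will create" if create_missing else "skipped"
    total = len(with_evals) + len(without_evals)
    return (f"Found {total} skills in {plugin_name}:\n"
            f"  - {len(with_evals)} with eval cases\n"
            f"  - {len(without_evals)} without eval cases ({action})")


# Demo: one skill with evals and one without (made-up names).
with tempfile.TemporaryDirectory() as tmp:
    has, lacks = Path(tmp) / "has-evals", Path(tmp) / "no-evals"
    has.mkdir()
    lacks.mkdir()
    (has / "evals.json").write_text("[]")
    w, wo = split_by_evals([has, lacks])
    print(breakdown("git-plugin", w, wo, create_missing=False))
    print([s.name for s in select(w, wo, create_missing=False)])
```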

Step 3: Run evaluations

For each included skill, invoke /evaluate:skill via the SlashCommand tool:

SlashCommand: /evaluate:skill <plugin-name>/<skill-name> [--create-evals]

If --parallel N is set and N > 1, batch evaluations into groups of N. Otherwise, run sequentially.
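Grouping the included skills into batches of N is a plain chunking operation. A minimal sketch, assuming the skills are held in a list; `batches` is a hypothetical helper and the skill names are placeholders:

```python
def batches(items, size):
    """Split items into consecutive groups of at most `size` elements."""
    if size < 1:
        raise ValueError("size must be at least 1")
    return [items[i:i + size] for i in range(0, len(items), size)]


skills = ["skill-a", "skill-b", "skill-c", "skill-d", "skill-e"]
for group in batches(skills, 2):
    print(group)
```

With a size of 1 every group holds a single skill, which is the sequential case.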

Track progress with TodoWrite — mark each skill as it completes.

Step 4: Aggregate plugin report

After all skill evaluations complete, read each skill's benchmark.json and aggregate:

bash evaluate-plugin/scripts/aggregate_benchmark.sh <plugin-name>

Write aggregated results to <plugin-name>/eval-results/plugin-benchmark.json.

Step 5: Report

Print a plugin-level summary table:

## Plugin Evaluation: <plugin-name>

| Skill | Evals | Pass Rate | Status |
|-------|-------|-----------|--------|
| skill-a | 4 | 100% | PASS |
| skill-b | 3 | 67% | PARTIAL |
| skill-c | 5 | 80% | PASS |

**Overall**: 82% pass rate across N eval cases

Rank skills by pass rate. Flag any below 50% as needing attention.
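The ranking and the below-50% flag could be computed along these lines. This is a sketch under assumptions: the input shape (skill name, eval count, passed count) is hypothetical, since the benchmark.json schema is not shown here, and the overall figure is computed as the unweighted mean of per-skill pass rates, which happens to match the 82% in the sample table but is not confirmed as the script's method.

```python
def summarize(results):
    """results: list of (skill, total_evals, passed_evals) tuples (assumed shape)."""
    rows = [(name, total, 100 * passed / total)
            for name, total, passed in results if total]
    # Highest pass rate first.
    rows.sort(key=lambda row: row[2], reverse=True)
    needs_attention = [name for name, _, rate in rows if rate < 50]
    # Assumption: overall is the unweighted mean of the per-skill rates.
    overall = sum(rate for _, _, rate in rows) / len(rows) if rows else 0.0
    return rows, needs_attention, overall


# Counts taken from the sample table: 4/4, 2/3 and 4/5 passing.
rows, flagged, overall = summarize(
    [("skill-a", 4, 4), ("skill-b", 3, 2), ("skill-c", 5, 4)]
)
for name, total, rate in rows:
    print(f"| {name} | {total} | {rate:.0f}% |")
print(f"Overall: {overall:.0f}%", "| needs attention:", flagged)
```

Pooling all cases instead (10 of 12 passing) would give 83%, so the choice of averaging method is visible in the reported number.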

Agentic Optimizations

| Context | Command |
|---------|---------|
| Inventory plugin skills + evals | bash evaluate-plugin/scripts/inspect_eval.sh --plugin-dir <plugin> |
| Inspect a single skill's evals | bash evaluate-plugin/scripts/inspect_eval.sh --plugin <plugin> --skill <skill> |
| Aggregate results | bash evaluate-plugin/scripts/aggregate_benchmark.sh <plugin> |

Quick Reference

| Flag | Description |
|------|-------------|
| --create-missing-evals | Generate eval cases for skills without them |
| --parallel N | Max concurrent evaluations (default: 1) |



Install manually


# Clone the repo
git clone https://github.com/laurigates/claude-plugins.git

# Copy into Claude Code skills folder (global)
cp -r claude-plugins/evaluate-plugin/skills/evaluate-plugin-batch ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

laurigates/claude-plugins

46 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT
