plugins/agentic-engineering/skills/agent-plugin-review/SKILL.md
Use when reviewing an AI plugin pull request, auditing plugin quality before release, or when asked to "review a plugin PR", "review skills in this PR", "check plugin quality", or "review workflow architecture". Covers skill quality, structural linting, and workflow architecture review.
npx skillsauth add entityprocess/agentv agent-plugin-reviewInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Review AI plugin PRs by running deterministic structural checks first, then applying LLM judgment for skill quality and workflow architecture. Post findings as inline PR comments.
Run scripts/lint_plugin.py against the plugin directory:
python scripts/lint_plugin.py <plugin-dir> --evals-dir <evals-dir> --json
The script checks:
skills/*/SKILL.md has a corresponding eval filename and descriptionreferences/*.md) existReport findings grouped by severity (error > warning > info).
If the PR includes eval files, invoke agentv-eval-review for AgentV-specific eval quality checks.
Additionally, check each eval YAML for these structural patterns:
type: file input value MUST start with a leading / (workspace-root-relative). Paths like plugins/foo/SKILL.md are wrong — correct form is /plugins/foo/SKILL.md. Scan every type: file entry and flag any missing leading slash, showing the corrected path.type: file + value) appears identically in every test case, recommend extracting it to the top-level input field. AgentV eval files support a top-level input section that applies to all tests, eliminating per-test duplication.For each SKILL.md, check against references/skill-quality-checklist.md:
references/ files@ force-load syntaxFor plugins with multi-phase workflows, check against references/workflow-checklist.md:
Hard gate detection recipe — For each phase skill after the first:
deploy-plan.md) and stops with a clear message telling the user which skill to run first if the artifact is missingPost findings as inline PR comments at specific line numbers. Group by severity:
Use a PR review (not individual comments) to batch all findings.
scripts/lint_plugin.py — Deterministic plugin linter (Python 3.11+, stdlib only)references/skill-quality-checklist.md — Skill quality checklist (CSO, descriptions, content, discipline skills)references/workflow-checklist.md — Workflow architecture checklist (OpenSpec, hard gates, artifacts)For deeper research on challenging reviews, consult these resources via web fetch, deepwiki, or clone the repo locally:
<HARD-GATE> pattern, brainstorming workflow, skill-based development phasesdevelopment
Write, edit, review, and validate AgentV EVAL.yaml / .eval.yaml evaluation files. Use when asked to create new eval files, update or fix existing ones, add or remove test cases, configure graders (`llm-grader`, `code-grader`, `rubrics`), review whether an eval is correct or complete, convert between EVAL.yaml and evals.json using `agentv convert`, or generate eval test cases from chat transcripts (markdown conversation or JSON messages). Do NOT use for creating SKILL.md files, writing skill definitions, or running evals — running and benchmarking belongs to agentv-bench.
tools
Analyze AgentV evaluation traces and result JSONL files using `agentv inspect` and `agentv compare` CLI commands. Use when asked to inspect AgentV eval results, find regressions between AgentV evaluation runs, identify failure patterns in AgentV trace data, analyze tool trajectories, or compute cost/latency/score statistics from AgentV result files. Do NOT use for benchmarking skill trigger accuracy, analyzing skill-creator eval performance, or measuring skill description quality — those tasks belong to the skill-creator skill.
development
Author, edit, and lint `governance:` blocks in `*.eval.yaml` files. Use when creating or updating evaluation suites that carry AI-governance metadata (OWASP LLM Top 10, OWASP Agentic Top 10, MITRE ATLAS, EU AI Act, ISO 42001). Also use non-interactively (e.g., from a GitHub Action) to lint changed eval files and report violations against the rules in `references/lint-rules.md`. Do NOT use for running evals or benchmarking — that belongs to agentv-bench.
development
Use when reviewing eval YAML files for quality issues, linting eval files before committing, checking eval schema compliance, or when asked to "review these evals", "check eval quality", "lint eval files", or "validate eval structure". Do NOT use for writing evals (use agentv-eval-writer) or running evals (use agentv-bench).