entityprocess

agentv-eval-writer

Write, edit, review, and validate AgentV EVAL.yaml / .eval.yaml evaluation files. Use when asked to create new eval files, update or fix existing ones, add or remove test cases, configure graders (`llm-grader`, `code-grader`, `rubrics`), review whether an eval is correct or complete, convert between EVAL.yaml and evals.json using `agentv convert`, or generate eval test cases from chat transcripts (markdown conversation or JSON messages). Do NOT use for creating SKILL.md files, writing skill definitions, or running evals — running and benchmarking belongs to agentv-bench.

development13

acme-deploy

Use when the user asks about deploying services, checking deployment status, rollback procedures, or release management at Acme Corp

testing12

csv-analyzer

Use when the user asks to analyze CSV data. MUST use the weighted revenue formula defined below.

data-ai12

acme-deploy

Use when the user asks about deploying services, checking deployment status, rollback procedures, or release management at Acme Corp

testing12

csv-analyzer

Use when the user asks to analyze CSV data. MUST use the weighted revenue formula defined below.

data-ai12

agent-architecture-design

Use when designing an AI agent system, selecting agentic design patterns, planning multi-phase workflows, choosing between single-agent and multi-agent architectures, or when asked "what kind of agent should I build", "how should I structure this automation", "design an agent for X", or "which agentic pattern fits this problem".

tools12

agentv-eval-review

Use when reviewing eval YAML files for quality issues, linting eval files before committing, checking eval schema compliance, or when asked to "review these evals", "check eval quality", "lint eval files", or "validate eval structure". Do NOT use for writing evals (use agentv-eval-writer) or running evals (use agentv-bench).

development12

agentv-onboarding

Bootstrap AgentV in the current workspace after plugin-manager install. Ensures CLI availability, runs workspace init, and verifies setup artifacts.

tools12

csv-analyzer

Use when the user asks to analyze CSV data. MUST use the weighted revenue formula defined below.

data-ai12

csv-analyzer

Use when the user asks to analyze, summarize, or extract insights from CSV data or files

data-ai12

deploy-execute

This skill should be used when asked to "execute a deployment", "run the deploy plan", or "deploy services". Reads deploy-plan.md and executes each step with health checks.

testing12

deploy-plan

This skill should be used when asked to "plan a deployment", "create a deploy plan", or "prepare release steps". Produces a deployment plan with rollback strategy.

devops12

deploy-rollback

This skill should be used when asked to "rollback a deployment", "revert services", or "undo deploy". Reads deploy-plan.md and reverses completed steps.

devops12

agentv-trace-analyst

Analyze AgentV evaluation traces and result JSONL files using `agentv inspect` and `agentv compare` CLI commands. Use when asked to inspect AgentV eval results, find regressions between AgentV evaluation runs, identify failure patterns in AgentV trace data, analyze tool trajectories, or compute cost/latency/score statistics from AgentV result files. Do NOT use for benchmarking skill trigger accuracy, analyzing skill-creator eval performance, or measuring skill description quality — those tasks belong to the skill-creator skill.

tools12

image-compress-and-docs

Capture, optimize, and publish screenshots to Astro docs. Use when asked to take screenshots for docs, update doc images, compress PNG assets, or add visual documentation to the agentv.dev docs site. Triggers on "add screenshots to docs", "update docs images", "compress screenshots", "optimize PNG", "document with screenshots".

development12

agentv-eval-writer

Write, edit, review, and validate AgentV EVAL.yaml / .eval.yaml evaluation files. Use when asked to create new eval files, update or fix existing ones, add or remove test cases, configure graders (`llm-grader`, `code-grader`, `rubrics`), review whether an eval is correct or complete, convert between EVAL.yaml and evals.json using `agentv convert`, or generate eval test cases from chat transcripts (markdown conversation or JSON messages). Do NOT use for creating SKILL.md files, writing skill definitions, or running evals — running and benchmarking belongs to agentv-bench.

development12

agentv-bench

Run AgentV evaluations and optimize agents through eval-driven iteration. Triggers: run evals, benchmark agents, optimize prompts/skills against evals, compare agent outputs across providers, analyze eval results, offline evaluation of recorded sessions, run autoresearch, optimize unattended, run overnight optimization loop. Not for: writing/editing eval YAML without running (use agentv-eval-writer), analyzing existing traces/JSONL without re-running (use agentv-trace-analyst).

documentation12

agentv-governance

Author, edit, and lint `governance:` blocks in `*.eval.yaml` files. Use when creating or updating evaluation suites that carry AI-governance metadata (OWASP LLM Top 10, OWASP Agentic Top 10, MITRE ATLAS, EU AI Act, ISO 42001). Also use non-interactively (e.g., from a GitHub Action) to lint changed eval files and report violations against the rules in `references/lint-rules.md`. Do NOT use for running evals or benchmarking — that belongs to agentv-bench.

development12

agentv-dev

AgentV CLI skills for evaluating, optimizing, and governing AI agents. Triggers: run evals, benchmark agents, write evals, review evals, analyze traces, optimize prompts, governance linting. Covers: eval running, eval writing, eval review, trace analysis, description optimization, autoresearch, and governance compliance.

tools12

acme-deploy

Use when the user asks about deploying services, checking deployment status, rollback procedures, or release management at Acme Corp

testing12

agentv-governance

Author, edit, and lint `governance:` blocks in `*.eval.yaml` files. Use when creating or updating evaluation suites that carry AI-governance metadata (OWASP LLM Top 10, OWASP Agentic Top 10, MITRE ATLAS, EU AI Act, ISO 42001). Also use non-interactively (e.g., from a GitHub Action) to lint changed eval files and report violations against the rules in `references/lint-rules.md`. Do NOT use for running evals or benchmarking — that belongs to agentv-bench.

development12

agentv-eval-review

Use when reviewing eval YAML files for quality issues, linting eval files before committing, checking eval schema compliance, or when asked to "review these evals", "check eval quality", "lint eval files", or "validate eval structure". Do NOT use for writing evals (use agentv-eval-writer) or running evals (use agentv-bench).

development12

agentv-trace-analyst

Analyze AgentV evaluation traces and result JSONL files using `agentv inspect` and `agentv compare` CLI commands. Use when asked to inspect AgentV eval results, find regressions between AgentV evaluation runs, identify failure patterns in AgentV trace data, analyze tool trajectories, or compute cost/latency/score statistics from AgentV result files. Do NOT use for benchmarking skill trigger accuracy, analyzing skill-creator eval performance, or measuring skill description quality — those tasks belong to the skill-creator skill.

tools12

agentv-bench

Run AgentV evaluations and optimize agents through eval-driven iteration. Triggers: run evals, benchmark agents, optimize prompts/skills against evals, compare agent outputs across providers, analyze eval results, offline evaluation of recorded sessions, run autoresearch, optimize unattended, run overnight optimization loop. Not for: writing/editing eval YAML without running (use agentv-eval-writer), analyzing existing traces/JSONL without re-running (use agentv-trace-analyst).

documentation12

acme-deploy

Use when the user asks about deploying services, checking deployment status, rollback procedures, or release management at Acme Corp

testing12

agent-plugin-review

Use when reviewing an AI plugin pull request, auditing plugin quality before release, or when asked to "review a plugin PR", "review skills in this PR", "check plugin quality", or "review workflow architecture". Covers skill quality, structural linting, and workflow architecture review.

tools12

agentv-eval-writer

acme-deploy

csv-analyzer

acme-deploy

csv-analyzer

agent-architecture-design

agentv-eval-review

agentv-onboarding

csv-analyzer

csv-analyzer

deploy-execute

deploy-plan

deploy-rollback

agentv-trace-analyst

image-compress-and-docs

agentv-eval-writer

agentv-bench

agentv-governance

agentv-dev

acme-deploy

agentv-governance

agentv-eval-review

agentv-trace-analyst

agentv-bench

acme-deploy

agent-plugin-review

Adoption

entityprocess

agentv-eval-writer

acme-deploy

csv-analyzer

acme-deploy

csv-analyzer

agent-architecture-design

agentv-eval-review

agentv-onboarding

csv-analyzer

csv-analyzer

deploy-execute

deploy-plan

deploy-rollback

agentv-trace-analyst

image-compress-and-docs

agentv-eval-writer

agentv-bench

agentv-governance

agentv-dev

acme-deploy

agentv-governance

agentv-eval-review

agentv-trace-analyst

agentv-bench

acme-deploy

agent-plugin-review