ai-ml/dev-signal/.agent/skills/gcp-agent-eval-metric-configurator/SKILL.md
Provides templates for configuring Vertex AI Gen AI Evaluation metrics like GROUNDING, TOOL_USE_QUALITY, and ResponseMatch for specific agent domains.
npx skillsauth add googlecloudplatform/devrel-demos gcp-agent-eval-metric-configurator
Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
This skill helps you configure sophisticated automated evaluation metrics. Grounded in evaluation_blog.md, it supports computation-based, rubric-based, and managed Vertex AI metrics.
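To make the "computation-based" category concrete, here is a minimal sketch of two reference-based scores computed without any model call. The function names `exact_match` and `token_f1` are illustrative assumptions, not the Vertex AI implementation or part of this skill's resources.

```python
from collections import Counter


def _normalize(text: str) -> list[str]:
    """Lowercase and split on whitespace so trivial formatting differences are ignored."""
    return text.lower().split()


def exact_match(response: str, reference: str) -> float:
    """Hypothetical metric: 1.0 if the normalized token sequences are identical, else 0.0."""
    return 1.0 if _normalize(response) == _normalize(reference) else 0.0


def token_f1(response: str, reference: str) -> float:
    """Hypothetical metric: harmonic mean of token precision and recall."""
    resp_tokens = _normalize(response)
    ref_tokens = _normalize(reference)
    if not resp_tokens or not ref_tokens:
        return 0.0
    # Multiset intersection counts each shared token at most as often as it
    # appears in both strings.
    overlap = sum((Counter(resp_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(resp_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

Scores like these need a reference answer for every example, which is the main practical difference from rubric-based or managed metrics that judge a response on its own.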
Ask Antigravity to:
GROUNDING: Ensures responses are fully supported by context (RAG).
TOOL_USE_QUALITY: Checks if the right tool was called with correct parameters (no reference needed).
Refer to resources/metric_templates.json for standard definitions.
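One way to pick metrics per agent domain is to filter templates by what the evaluation dataset actually contains. The sketch below is an assumption throughout: the JSON layout, the `needs_context` and `needs_reference` fields, and the `select_metrics` helper are hypothetical, and the real resources/metric_templates.json may be structured differently.

```python
import json

# Hypothetical template layout, inlined so the example is self-contained.
# The requirements mirror the descriptions above: GROUNDING needs retrieved
# context, and TOOL_USE_QUALITY needs no reference answer.
TEMPLATES_JSON = """
{
  "GROUNDING": {"needs_context": true, "needs_reference": false},
  "TOOL_USE_QUALITY": {"needs_context": false, "needs_reference": false}
}
"""


def select_metrics(templates: dict, has_context: bool, has_reference: bool) -> list[str]:
    """Return the names of metrics whose data requirements the dataset satisfies."""
    selected = []
    for name, spec in templates.items():
        if spec["needs_context"] and not has_context:
            continue  # metric needs retrieved context that the dataset lacks
        if spec["needs_reference"] and not has_reference:
            continue  # metric needs a reference answer that the dataset lacks
        selected.append(name)
    return sorted(selected)


templates = json.loads(TEMPLATES_JSON)
rag_agent_metrics = select_metrics(templates, has_context=True, has_reference=False)
tool_agent_metrics = select_metrics(templates, has_context=False, has_reference=False)
```

Under these assumptions a RAG agent's dataset qualifies for both metrics, while a tool-calling agent without retrieved context qualifies only for TOOL_USE_QUALITY.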