plugins/utils/skills/prompt-eval-builder/SKILL.md
Domain knowledge for extracting eval assertions and generating test invocation prompts from command/agent specs. Used for building promptfoo evaluation configs.
npx skillsauth add rp1-run/rp1 prompt-eval-builderInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Domain knowledge for building evaluation artifacts from prompt specifications. Provides extraction patterns, output templates, validation logic, and invocation prompt generation rules.
LLM Rubrics (Default): Generated for most behavioral requirements. Rubrics contain REQUIRED/PROHIBITED/EDGE CASES sections and evaluate behavior holistically by checking output text AND Metadata JSON.
Programmatic Assertions (Complex Cases): Generated when requirements need exact counting, strict sequencing, or complex conditional logic that cannot be expressed in natural language rubrics. Uses type: javascript with file:// references to shared assertion functions.
| File | Purpose | When to Load | |------|---------|--------------| | PATTERNS.md | Extraction categories, tool mappings, selection rules, invocation generation | Always - core knowledge | | TEMPLATES.md | promptfoo YAML output templates, assertion formats | When generating YAML output | | VALIDATION.md | YAML validation loop, error handling | When validating/writing output |
Agents using this skill:
scripts/validate-yaml.ts for YAML validation| Script | Purpose | Usage |
|--------|---------|-------|
| scripts/validate-yaml.ts | Validate YAML syntax | bun {skill_path}/scripts/validate-yaml.ts {output_file} |
Output format: { "valid": true } or { "valid": false, "error": "message" }
Prompt Text -> Pattern Analysis -> Requirement Categorization -> LLM Rubric Generation -> YAML Validation
\-> Programmatic Assertion (if complex)
Command/Agent Spec -> Metadata Extraction -> Variable Mapping -> Invocation Prompt
Key Concept: Test prompts are USER INPUTS that invoke the command, not distilled versions of the prompt. Example:
/rp1-dev:build-fast "{{REQUEST}}" --git-commit={{GIT_COMMIT}} --afk={{AFK_MODE}}
Both flows share PATTERNS.md for domain knowledge.
tools
Plan and execute splitting a large PR or branch into a reviewable stacked PR sequence.
documentation
Ask about rp1 capabilities, discover skills, and get workflow guidance.
tools
Generate an evidence-grounded markdown walkthrough for a pull request.
development
Run a bounded, evidence-driven two-agent debate into a separate rp1 debate artifact with backend locks only.