Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

rp1-run/build-prompt-evals

Name: build-prompt-evals
Author: rp1-run

plugins/utils/skills/build-prompt-evals/SKILL.md

npx skillsauth add rp1-run/rp1 build-prompt-evals

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Build Prompt Evals

Generate eval assertions (YAML) and test invocation prompt from source prompt. Extracts assertions, then runs assertion specialist to resolve placeholders, consolidate scenarios, and document unresolved assertions.

Modes

File Mode (when INPUT is a valid file path):

Read the file content
Use basename for output naming
Spawn extractor agent, then assertion specialist

Inline Mode (when INPUT is prompt text):

Use prompt directly
Use "extracted" as basename
Spawn extractor agent, then assertion specialist

Workflow

Step 1: Parse Arguments

Check for --output flag in arguments:

If present: extract output directory path
If not: use input file directory (file mode) or cwd (inline mode)

Step 2: Detect Mode

Check if first non-flag argument is a file path:

Use Bash: test -f "{INPUT}" && echo "file" || echo "inline"

Step 2.5: Dependency Analysis (File Mode Only)

If file mode:

Spawn dependency-chain-analyzer to discover sub-agent and skill dependencies:

{% dispatch_agent "rp1-utils:dependency-chain-analyzer" %} FILE_PATH: {INPUT file path} {% enddispatch_agent %}

Capture JSON output as DEPENDENCY_CHAIN variable.

If inline mode:

Set DEPENDENCY_CHAIN to empty string (no file to analyze for dependencies).

Step 3: Prepare Input

If file mode:

Read the file using Read tool
Extract basename (without extension) for output naming
Set SOURCE_NAME to filename
Set OUTPUT_DIR to file's directory (unless --output specified)

If inline mode:

Use INPUT directly as PROMPT_TEXT
Set SOURCE_NAME to "inline"
Set basename to "extracted"
Set OUTPUT_DIR to cwd (unless --output specified)

Step 4: Determine Output Paths

OUTPUT_YAML = {OUTPUT_DIR}/{basename}-evals.yaml
OUTPUT_PROMPT = {OUTPUT_DIR}/{basename}-eval-prompt.txt

Step 5: Spawn Extractor Agent

Single agent generates both YAML assertions and test prompt:

{% dispatch_agent "rp1-utils:prompt-eval-extractor" %} PROMPT_TEXT: {PROMPT_TEXT content} SOURCE_NAME: {SOURCE_NAME} OUTPUT_FILE: {OUTPUT_YAML} DEPENDENCY_CHAIN: {DEPENDENCY_CHAIN JSON or empty string} OUTPUT_PROMPT: {OUTPUT_PROMPT} {% enddispatch_agent %}

Step 6: Extraction Complete (Intermediate)

Log extraction completion:

Extraction complete. Running assertion optimization...

Step 7: Spawn Assertion Specialist

Invoke assertion specialist to optimize the generated eval config:

{% dispatch_agent "rp1-utils:prompt-assertion-specialist" %} CONFIG_PATH: {OUTPUT_YAML} {% enddispatch_agent %}

Capture JSON output as ASSERTION_RESULT variable.

Step 8: Report Completion

Display output locations and optimization summary:

Eval files generated:
  Assertions: {OUTPUT_YAML}
  Test prompt: {OUTPUT_PROMPT}

Assertion optimization:
  Resolved: {ASSERTION_RESULT.resolved_count} ({ASSERTION_RESULT.resolved_builtin} built-in, {ASSERTION_RESULT.resolved_shared} shared)
  Unresolved: {ASSERTION_RESULT.unresolved_count}
  Consolidated scenarios: {ASSERTION_RESULT.consolidated_scenarios}

{If ASSERTION_RESULT.unresolved_count > 0:}
  See: {ASSERTION_RESULT.output_files[1]} for assertions requiring implementation.

Review the assertions file for any remaining TODO placeholders.

Error Handling

Empty input (INPUT not provided):

Usage: /build-prompt-evals <file-or-prompt> [--output <dir>]

  <file-or-prompt>  Path to command/agent prompt file OR raw prompt text
  [--output <dir>]  Optional output directory (default: input file dir or cwd)

Outputs:
  {basename}-evals.yaml       Eval assertions in promptfoo format
  {basename}-eval-prompt.txt  Test invocation prompt (user input to test the command)

Examples:
  /build-prompt-evals plugins/dev/skills/build-fast/SKILL.md
  /build-prompt-evals plugins/dev/skills/build-fast/SKILL.md --output evals/suites/rp1-dev/
  /build-prompt-evals "Create a branch and commit changes"

Invalid file path (file mode detected but read fails):

Error: Could not read file: {path}

Output directory does not exist:

Error: Output directory does not exist: {path}

Examples

File mode with auto output location:

/build-prompt-evals plugins/dev/skills/build-fast/SKILL.md

Creates in same directory:

plugins/dev/skills/build-fast/build-fast-evals.yaml
plugins/dev/skills/build-fast/build-fast-eval-prompt.txt

File mode with explicit output directory:

/build-prompt-evals plugins/dev/skills/build-fast/SKILL.md --output evals/suites/rp1-dev/

Creates in specified directory:

evals/suites/rp1-dev/build-fast-evals.yaml
evals/suites/rp1-dev/build-fast-eval-prompt.txt

Inline mode:

/build-prompt-evals "Create a new branch, make changes, then commit"

Creates in current directory:

extracted-evals.yaml
extracted-eval-prompt.txt

rp1-run/build-prompt-evals

plugins/utils/skills/build-prompt-evals/SKILL.md

Builds eval assertions and minimal test prompt from prompt text, then optimizes assertions via specialist agent.

21 stars

development

Updated Apr 16, 2026

$ install --global

skillsauth

npx skillsauth add rp1-run/rp1 build-prompt-evals

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 24, 2026, 8:51 PM1.8s1 file scanned

SKILL.md

name:: build-prompt-evals
description:: Output directory for generated files (default: input file dir or cwd)
category:: prompt
is_workflow:: false
version:: 1.1.0
created:: 2026-01-19
updated:: 2026-02-26
author:: cloud-on-prem/rp1
- name:: OUTPUT_DIR
type:: string
required:: false
- "rp1-utils:: prompt-assertion-specialist

Build Prompt Evals

Modes

File Mode (when INPUT is a valid file path):

Read the file content
Use basename for output naming
Spawn extractor agent, then assertion specialist

Inline Mode (when INPUT is prompt text):

Use prompt directly
Use "extracted" as basename
Spawn extractor agent, then assertion specialist

Workflow

Step 1: Parse Arguments

Check for --output flag in arguments:

If present: extract output directory path
If not: use input file directory (file mode) or cwd (inline mode)

Step 2: Detect Mode

Check if first non-flag argument is a file path:

Use Bash: test -f "{INPUT}" && echo "file" || echo "inline"

Step 2.5: Dependency Analysis (File Mode Only)

If file mode:

Spawn dependency-chain-analyzer to discover sub-agent and skill dependencies:

{% dispatch_agent "rp1-utils:dependency-chain-analyzer" %} FILE_PATH: {INPUT file path} {% enddispatch_agent %}

Capture JSON output as DEPENDENCY_CHAIN variable.

If inline mode:

Set DEPENDENCY_CHAIN to empty string (no file to analyze for dependencies).

Step 3: Prepare Input

If file mode:

Read the file using Read tool
Extract basename (without extension) for output naming
Set SOURCE_NAME to filename
Set OUTPUT_DIR to file's directory (unless --output specified)

If inline mode:

Use INPUT directly as PROMPT_TEXT
Set SOURCE_NAME to "inline"
Set basename to "extracted"
Set OUTPUT_DIR to cwd (unless --output specified)

Step 4: Determine Output Paths

OUTPUT_YAML = {OUTPUT_DIR}/{basename}-evals.yaml
OUTPUT_PROMPT = {OUTPUT_DIR}/{basename}-eval-prompt.txt

Step 5: Spawn Extractor Agent

Single agent generates both YAML assertions and test prompt:

Step 6: Extraction Complete (Intermediate)

Log extraction completion:

Extraction complete. Running assertion optimization...

Step 7: Spawn Assertion Specialist

Invoke assertion specialist to optimize the generated eval config:

{% dispatch_agent "rp1-utils:prompt-assertion-specialist" %} CONFIG_PATH: {OUTPUT_YAML} {% enddispatch_agent %}

Capture JSON output as ASSERTION_RESULT variable.

Step 8: Report Completion

Display output locations and optimization summary:

Eval files generated:
  Assertions: {OUTPUT_YAML}
  Test prompt: {OUTPUT_PROMPT}

Assertion optimization:
  Resolved: {ASSERTION_RESULT.resolved_count} ({ASSERTION_RESULT.resolved_builtin} built-in, {ASSERTION_RESULT.resolved_shared} shared)
  Unresolved: {ASSERTION_RESULT.unresolved_count}
  Consolidated scenarios: {ASSERTION_RESULT.consolidated_scenarios}

{If ASSERTION_RESULT.unresolved_count > 0:}
  See: {ASSERTION_RESULT.output_files[1]} for assertions requiring implementation.

Review the assertions file for any remaining TODO placeholders.

Error Handling

Empty input (INPUT not provided):

Usage: /build-prompt-evals <file-or-prompt> [--output <dir>]

  <file-or-prompt>  Path to command/agent prompt file OR raw prompt text
  [--output <dir>]  Optional output directory (default: input file dir or cwd)

Outputs:
  {basename}-evals.yaml       Eval assertions in promptfoo format
  {basename}-eval-prompt.txt  Test invocation prompt (user input to test the command)

Examples:
  /build-prompt-evals plugins/dev/skills/build-fast/SKILL.md
  /build-prompt-evals plugins/dev/skills/build-fast/SKILL.md --output evals/suites/rp1-dev/
  /build-prompt-evals "Create a branch and commit changes"

Invalid file path (file mode detected but read fails):

Error: Could not read file: {path}

Output directory does not exist:

Error: Output directory does not exist: {path}

Examples

File mode with auto output location:

/build-prompt-evals plugins/dev/skills/build-fast/SKILL.md

Creates in same directory:

plugins/dev/skills/build-fast/build-fast-evals.yaml
plugins/dev/skills/build-fast/build-fast-eval-prompt.txt

File mode with explicit output directory:

/build-prompt-evals plugins/dev/skills/build-fast/SKILL.md --output evals/suites/rp1-dev/

Creates in specified directory:

evals/suites/rp1-dev/build-fast-evals.yaml
evals/suites/rp1-dev/build-fast-eval-prompt.txt

Inline mode:

/build-prompt-evals "Create a new branch, make changes, then commit"

Creates in current directory:

extracted-evals.yaml
extracted-eval-prompt.txt

Related Skills

rp1-run/note

data-ai

VerifiedTrustedCommunity

Capture session context as a structured, frontmatter-rich markdown note under .rp1/work/notes/ with auto-maintained index and log.

31SKILL.mdUpdated Jun 11, 2026

rp1-run/pr-stack

tools

VerifiedTrustedCommunity

Plan and execute splitting a large PR or branch into a reviewable stacked PR sequence.

31SKILL.mdUpdated Jun 4, 2026

rp1-run/prompt-writer

development

VerifiedTrustedCommunity

Write maximally terse agent prompts from scratch. Use when creating new agent specs, command prompts, or instruction sets. Teaches structure-first composition with compression-by-default patterns. Extended with constitutional governance, epistemic stance selection, and a six-stage prompt pipeline.

31SKILL.mdUpdated Apr 21, 2026

rp1-run/prompt-writer

rp1-run/speedrun

development

VerifiedTrustedCommunity

Interactive speedrun loop for small, low-risk changes. Delegates each request to a general sub-agent. Redirects larger work to /build-fast or /build.

31SKILL.mdUpdated Apr 16, 2026

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/rp1-run/rp1.git

# Copy into Claude Code skills folder (global)
cp -r rp1/plugins/utils/skills/build-prompt-evals ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

rp1-run/rp1

21 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT