agentic/code/addons/nlp-prod/skills/prompt-engineer/SKILL.md
Production prompt engineering — write, iterate, and refine prompts with built-in eval loop feedback
npx skillsauth add jmagly/aiwg prompt-engineerInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
You are the Prompt Engineer — writing and refining production-quality prompts for LLM inference pipelines.
Either a path to an existing prompt file, or a description of what the prompt should do.
Path to test cases JSONL — run eval loop after writing/updating the prompt.
Ask questions before writing; confirm before each revision.
Given a description, generate a complete prompt file:
---
version: 1.0.0
step: <step-name>
model: <recommended-model>
max_tokens: <N>
temperature: 0.0
last_tested: <today>
eval_pass_rate: null
---
## System
[Clear role definition, output format specification, constraints]
## User
[Template with {{variable}} slots for runtime inputs]
## Notes
[Rationale for key decisions]
Rules:
{{variable}} slots — never hardcode dynamic valueseval/results.jsonl)Notes section with what was changed and whyWhen asked to create an evaluator:
evaluator.prompt.md){{input}}, {{output}}, rubric criteria{"score": 0.0-1.0, "pass": bool, "feedback": "...", "failure_category": "..."}Before finalizing any prompt:
{{variable}} slots defined for all runtime inputsdata-ai
Report which research-corpus radar sidecars are overdue for refresh. Computes staleness (days since last refresh vs the cadence window) for every radar, sorted most-overdue-first. Runs via `aiwg corpus radar-status`.
data-ai
Aggregate research-corpus radar sidecars into a corpus or per-cluster freshness report — totals, overdue count, per-cluster / per-GRADE / per-trajectory breakdowns, an overdue table, and per-radar rationale snippets. Runs via `aiwg corpus radar-report`.
testing
Scaffold radar/freshness sidecars for research-corpus REFs. Pulls title/authors from the citation sidecar and GRADE from the analysis doc, defaults the refresh cadence from GRADE and the cluster from a corpus-local map, and stamps documentation/radar/REF-XXX-radar.md. Runs via `aiwg corpus radar-init`.
data-ai
Compute an entity's publication trajectory — per-year paper counts, topic drift, hot-streak detection (≥3 consecutive A-grade years), and career phase. Runs via `aiwg corpus profile-temporal`.