Adoption

Agent Skills are supported by leading AI development tools.

VS Code, Gemini CLI, GitHub, Goose, Amp, Cursor, Claude Code, Letta, OpenCode, Claude, OpenAI Codex, Factory

chelch5/agent-prompt-engineering

Name: agent-prompt-engineering
Author: chelch5

01-package-scaffolding/agent-prompt-engineering/SKILL.md



Agent Prompt Engineering

Use this skill when prompt wording controls how agents coordinate, route work, and use tools.

Procedure

1. Analyze existing prompts

Read the existing prompt, command, or process doc and identify:

Authority: what the agent owns
Scope: what the agent does NOT own
Tool surface: what tools it can use
Stage transitions: what stages it moves between
Delegation boundaries: who it can delegate to
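
The five items above can be recorded in a small structure so that gaps are visible before any rewriting starts. The sketch below is illustrative only: `PromptAudit`, its field names, and `missing_items` are hypothetical and not part of this skill or of any tool.

```python
from dataclasses import dataclass, field, fields


@dataclass
class PromptAudit:
    """Hypothetical record of what an existing prompt declares."""
    authority: list[str] = field(default_factory=list)          # what the agent owns
    out_of_scope: list[str] = field(default_factory=list)       # what it does NOT own
    tools: list[str] = field(default_factory=list)              # tool surface
    stage_transitions: list[str] = field(default_factory=list)  # e.g. "plan -> implement"
    delegates: list[str] = field(default_factory=list)          # who it may delegate to


def missing_items(audit: PromptAudit) -> list[str]:
    """Return the names of audit fields the prompt leaves unstated."""
    return [f.name for f in fields(audit) if not getattr(audit, f.name)]


audit = PromptAudit(
    authority=["implement approved tickets"],
    tools=["read_file", "write_file"],
)
print(missing_items(audit))  # ['out_of_scope', 'stage_transitions', 'delegates']
```

Any name returned by `missing_items` marks something the prompt leaves implicit and is a candidate for an explicit statement in the rewrite.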

2. Apply prompt contracts by role

Each agent type has specific prompt requirements:

| Role | Key contract |
|------|-------------|
| Orchestrator / Team Leader | Resolve state from tools first, verify artifacts before routing |
| Planner | Decision-complete plans for one ticket only |
| Implementer | Follow the approved plan, stop on missing requirements |
| Reviewer / QA | Stay read-only, return findings first; never praise before findings |
| Utility | Stay narrow and bounded, single-purpose |

3. Remove anti-patterns

Eliminate these common prompt failures:

Status-over-evidence routing — Routing based on labels instead of actual artifacts. Fix: require tool-read proof before stage transitions.
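
One way to picture the "tool-read proof" fix is a gate that ignores the status label and looks only at what tools actually confirmed. This is a minimal sketch under assumptions: `Evidence`, `can_advance`, and the artifact names are hypothetical and do not refer to a real workflow tool.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Evidence:
    """Hypothetical result of a tool read, e.g. reading a plan artifact."""
    tool: str
    artifact: str
    exists: bool


def can_advance(status_label: str, evidence: list[Evidence], required: set[str]) -> bool:
    """Allow a stage transition only when every required artifact was
    confirmed by a tool read. The status label is deliberately ignored."""
    confirmed = {e.artifact for e in evidence if e.exists}
    return required <= confirmed


required = {"plan.md", "test-report.txt"}
reads = [Evidence("read_file", "plan.md", True)]
print(can_advance("ready-for-review", reads, required))  # False: label alone is not proof

reads.append(Evidence("read_file", "test-report.txt", True))
print(can_advance("ready-for-review", reads, required))  # True: both artifacts confirmed
```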

Raw-file stage control — Editing state files directly instead of using tools. Fix: route all state changes through workflow tools.

Impossible read-only delegation — Telling read-only agents to write files. Fix: verify agent capabilities match task requirements before delegating.

```yaml
# BAD: Read-only agent told to write
agents:
  researcher:
    permissions: [read]
    task: "Read the code and update the docs"  # IMPOSSIBLE
---
# GOOD: Capability-matched delegation
agents:
  researcher:
    permissions: [read]
    task: "Read the code and report findings"
  implementer:
    permissions: [read, write]
    task: "Update the docs based on researcher findings"
```
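
The capability check behind the GOOD configuration can be sketched as a subset test. The agent names mirror the example above; `AGENTS` and `pick_agent` are hypothetical helpers, not an existing API.

```python
from typing import Optional

# Permissions per agent, mirroring the GOOD configuration above.
AGENTS: dict[str, set[str]] = {
    "researcher": {"read"},
    "implementer": {"read", "write"},
}


def pick_agent(task_needs: set[str]) -> Optional[str]:
    """Return the first agent whose permissions cover the task, else None.

    Returning None is the signal to report a blocker instead of
    delegating to an agent that cannot do the work.
    """
    for name, permissions in AGENTS.items():
        if task_needs <= permissions:
            return name
    return None


print(pick_agent({"read"}))           # researcher
print(pick_agent({"read", "write"}))  # implementer
print(pick_agent({"deploy"}))         # None
```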

Broad command follow-on — Commands that silently continue the whole workflow. Fix: each command should have a clear stop point.

Context amnesia — Agent forgets earlier decisions. Fix: load key constraints at task start, reference source-of-truth files.

4. Apply model-specific techniques

Different models have different prompting best practices:

For capable models (Claude Sonnet 4+, GPT-4+):

Can handle nuanced instructions and self-correct
Benefit from "think step by step" and structured reasoning
Use XML tags for complex prompt structure

```yaml
system_prompt: |
  <role>You are an implementer for project-name.</role>
  <context>Stack: TypeScript, Node.js, Vitest</context>
  <instructions>
  1. Read the ticket fully
  2. Check for existing patterns in src/
  3. Write tests first, then implement
  4. Run full test suite before committing
  </instructions>
  <constraints>
  - No `any` types
  - All exports must be typed
  - Test coverage > 80%
  </constraints>
```

For less capable models (Haiku, GPT-3.5, smaller models):

Need explicit step-by-step sequences — do not combine steps
Benefit from few-shot examples over abstract rules
Need guardrails against hallucination
Keep outputs short but highly structured

```yaml
system_prompt: |
  Follow this exact sequence:

  STEP 1: Read file
  Command: cat [filename]

  STEP 2: Identify change location
  Output: "Line [N]: [current content]"

  STEP 3: Make edit
  Change: [old] → [new]

  Do not skip steps. Do not combine steps.
  If stuck after 3 attempts: STOP and report blocker.
```

When hardening for a specific model:

Check for local model documentation (if maintained in repo)
If none exists, web-search for "[model name] prompting best practices"
Apply discovered techniques
Document findings for future reference

5. Apply weak-model hardening

Ensure all prompts are safe for weaker models:

Outputs are short but highly structured
Exact required sections are stated
Blocker returns are preferred over hidden guesswork
Proof is required before stage transitions
Next specialist or action is named explicitly
Stable procedure lives in skills/tools, not long prose

6. Add self-verification

Every agent prompt should include a verification step:

```yaml
system_prompt: |
  Before completing any task, self-verify:
  □ Did I follow the stack standards?
  □ Did I write tests for new code?
  □ Did I run the linter?
  □ Does the output match the expected format?

  If any check fails, fix before proceeding.
```
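
The same checklist can also be enforced outside the prompt, so that a failed check blocks completion rather than relying on the model to notice. The sketch below assumes each check is a zero-argument callable returning `True` on success; `failed_checks` and the check names are hypothetical, and the lambdas stand in for real tool results.

```python
from typing import Callable

Check = Callable[[], bool]


def failed_checks(checks: dict[str, Check]) -> list[str]:
    """Run every check and return the names of those that did not pass."""
    return [name for name, check in checks.items() if not check()]


# Placeholder checks; in practice each would wrap a tool result
# (test runner exit code, linter output, schema validation, ...).
checks: dict[str, Check] = {
    "tests written": lambda: True,
    "linter run": lambda: False,
    "output format matches": lambda: True,
}

failures = failed_checks(checks)
print(failures)  # ['linter run']
```

An empty list means the task may complete; anything else should be fixed or returned as a blocker.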

7. Final verification

Re-read the final prompt and ask:

Could a weaker model execute this without inventing hidden state?
Are all tool permissions explicit?
Are all delegation boundaries named?
Is there a clear stop condition?
Does the prompt say what TO do (not just what NOT to do)?

Rules

Prefer tool-backed state over raw file choreography
Keep ticket status coarse and queue-oriented
Require verification before stage changes
Require explicit blocker return paths when material ambiguity remains
Do not let prompts terminate with a summary when the workflow still has another stage
Never instruct read-only agents to mutate repo-tracked files
Never claim a file changed unless a write-capable tool actually wrote it
Put transient approval state in workflow state or explicit artifacts, not ticket status

Output contract

Improved agent definitions with:

Clear role statement
Explicit capabilities and limitations
Step-by-step operating procedure
Output format specification
Self-verification checklist
Stop conditions and blocker paths
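
As a rough illustration of the output contract, the sketch below renders an agent definition into XML-tagged prompt text and refuses to render when a part is empty. `AgentDefinition`, `render`, and the tag names are assumptions made for this example; the skill does not prescribe them.

```python
from dataclasses import dataclass


@dataclass
class AgentDefinition:
    """Hypothetical container for the parts of the output contract."""
    role: str
    capabilities: list[str]
    limitations: list[str]
    procedure: list[str]
    output_format: str
    checklist: list[str]
    stop_conditions: list[str]


def render(defn: AgentDefinition) -> str:
    """Render the definition as XML-tagged prompt text; reject empty parts."""
    def bullets(items: list[str]) -> str:
        return "\n".join(f"- {item}" for item in items)

    def numbered(items: list[str]) -> str:
        return "\n".join(f"{n}. {item}" for n, item in enumerate(items, start=1))

    sections = {
        "role": defn.role,
        "capabilities": bullets(defn.capabilities),
        "limitations": bullets(defn.limitations),
        "procedure": numbered(defn.procedure),
        "output_format": defn.output_format,
        "self_verification": bullets(defn.checklist),
        "stop_conditions": bullets(defn.stop_conditions),
    }
    empty = [tag for tag, body in sections.items() if not body.strip()]
    if empty:
        raise ValueError(f"agent definition is missing: {', '.join(empty)}")
    return "\n".join(f"<{tag}>\n{body}\n</{tag}>" for tag, body in sections.items())


reviewer = AgentDefinition(
    role="You are a read-only reviewer.",
    capabilities=["read files", "run tests"],
    limitations=["never write to repo-tracked files"],
    procedure=["Read the diff", "Run the test suite", "Report findings first"],
    output_format="Findings list, then verdict: PASS or BLOCKED",
    checklist=["Every finding cites a file and line"],
    stop_conditions=["Stop and return a blocker if the plan artifact is missing"],
)
print(render(reviewer))
```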

Failure handling

Agent still loops: Add explicit loop counter and hard stop (max 3-5 attempts)
Agent ignores instructions: Move critical instructions to top of prompt; use XML tags for structure
Model too weak for task: Simplify task decomposition or upgrade model
Inconsistent behavior: Add more few-shot examples of desired behavior
Status-over-evidence persists: Add explicit "prove it" gates requiring tool output before state changes
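
The "explicit loop counter and hard stop" remedy can be sketched as a bounded retry that raises a blocker instead of looping. This is an assumption-laden example: `Blocked`, `run_with_hard_stop`, and the callable signature are hypothetical, and the default of 3 attempts is taken from the low end of the range above.

```python
from typing import Callable, Optional


class Blocked(Exception):
    """Raised when the attempt budget is exhausted; the caller reports a blocker."""


def run_with_hard_stop(attempt: Callable[[int], Optional[str]], max_attempts: int = 3) -> str:
    """Call `attempt` with the attempt number (1-based) until it returns a
    result, and raise Blocked once `max_attempts` is reached."""
    for n in range(1, max_attempts + 1):
        result = attempt(n)
        if result is not None:
            return result
    raise Blocked(f"no result after {max_attempts} attempts")


# Succeeds on the second attempt.
print(run_with_hard_stop(lambda n: "done" if n == 2 else None))  # done

# Never succeeds: the loop stops and surfaces a blocker.
try:
    run_with_hard_stop(lambda n: None)
except Blocked as exc:
    print(f"BLOCKED: {exc}")  # BLOCKED: no result after 3 attempts
```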

References

Anthropic prompting best practices (XML tags for structure, examples for steering, explicit over implicit): https://docs.anthropic.com/en/docs/build-with-claude/prompt-engineering/claude-prompting-best-practices
OpenAI prompt engineering guide: https://developers.openai.com/docs/guides/prompt-engineering


Design and harden agent, command, workflow, and tool prompts for reliable execution across different AI models. Use when creating or revising repo-local agents to apply model-specific prompting techniques, tighten scope, and prevent common agent failure modes like doom loops, status-over-evidence routing, and impossible read-only delegation. Do not use for one-off prompts or when agents are already working reliably.

tools

Updated Apr 20, 2026

Install

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

```shell
npx skillsauth add chelch5/skilllibrary agent-prompt-engineering
```

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners in report:

Trivy (container and dependency vulnerability scanner): Clean, 95%
Semgrep (static code analysis for vulnerabilities): Clean, 95%
mcp-scan (Snyk) (Model Context Protocol security validation): Clean, 95%
Snyk (dep) (open source security scanning): Skipped, 50%
Socket.dev (supply chain security analysis): Skipped, 50%
VirusTotal (multi-engine malware detection): Skipped, 50%
CrowdStrike (advanced threat intelligence): Skipped, 50%
OSV-Scanner (Open Source Vulnerability database check): Skipped, 50%
OWASP Dep-Check: Skipped, 50%

Last scanned: Apr 20, 2026, 3:18 AM (10.5s, 5 files scanned)


Related Skills

chelch5/context-intelligence (testing)
Manages context window budgets, loading strategies, and compaction techniques for AI-assisted coding sessions. Trigger on 'context window', 'what to load', 'context management', 'context overflow', 'token budget'. DO NOT USE for loading specific project docs into agent context (use project-context) or prompt wording and optimization (use prompt-crafting).
SKILL.md, updated Apr 20, 2026

chelch5/auth-patterns (development)
Implements authentication, session, token, and authorization patterns for the current stack. Trigger on 'add auth', 'JWT', 'OAuth', 'login endpoint', 'session management', 'API key auth'. DO NOT USE for OWASP hardening checklists (use security-hardening), threat modeling (use security-threat-model), or secret rotation/storage (use security-best-practices).
SKILL.md, updated Apr 20, 2026

chelch5/api-schema (tools)
Defines request/response shapes, versioning, validation, and compatibility rules for API-first work. Trigger on 'design API', 'OpenAPI spec', 'REST schema', 'API versioning', 'generate client SDK'. DO NOT USE for GraphQL schemas, gRPC/protobuf definitions (use stack-standards), auth endpoint logic (use auth-patterns), or external API client wrappers (use external-api-client).
SKILL.md, updated Apr 20, 2026

chelch5/ticket-pack-builder (development)
Create a repo-local ticket system with an index, machine-readable manifest, board, and individual ticket files. Use when a repo needs task decomposition that autonomous agents can follow without re-planning the whole project each session. Do not use for executing tickets (use ticket-execution) or quick fixes that don't warrant formal tickets.
SKILL.md, updated Apr 20, 2026

Download

For Claude Desktop: download once, then upload the file in the app. No terminal needed.

Install manually

```shell
# Clone the repo
git clone https://github.com/chelch5/skilllibrary.git

# Make sure the global Claude Code skills folder exists, then copy the skill into it
mkdir -p ~/.claude/skills
cp -r skilllibrary/01-package-scaffolding/agent-prompt-engineering ~/.claude/skills/
```

Repository: chelch5/skilllibrary

Compatible with: Claude Code, OpenAI Codex CLI, ChatGPT