.claude/skills/reflection-validation/SKILL.md
Meta-cognitive validation protocol (System 2). Detects proxy optimization, undeclared assumptions, and broken causal chains.
npx skillsauth add gonzalezpazmonica/pm-workspace reflection-validationInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Thinking fast catches the obvious. Thinking slow catches the real.
Structured meta-cognition cycle — "wait, does that actually work?" — that humans do naturally but LLMs typically skip. Based on Kahneman's dual-process theory: System 1 (fast, heuristic) vs. System 2 (slow, deliberate).
Re-read the question. Distinguish between:
| Layer | Question | Example | |---|---|---| | Literal | What was asked | "Walk or drive to the car wash 50m away?" | | Real | What needs to happen | "The car must end up washed" | | Implicit | Unstated constraints | "The car must be AT the wash" |
Key question: Does the literal objective match the real one?
List ALL implicit assumptions in the response:
Minimum 3 assumptions per response. Mark each valid/invalid.
Walk through the recommendation step by step:
Template: "User does X → Y happens → Z → ... → objective achieved?" If ANY step produces "???" → the chain is broken.
Identify where the chain fails:
| Gap Type | Description | |---|---| | Missing prerequisite | Something must exist/happen first | | Wrong optimization | Correct metric, wrong variable | | Ignored constraint | Real-world limitation missed | | Anchoring bias | Fixated on one detail | | Satisficing | First plausible answer, unchecked | | Narrow framing | Only one dimension considered |
If gap detected:
Thinking: [initial reasoning and why it seemed correct]
But: [what was missed and why it matters]
Because: [the real objective requires X, not Y]
Corrected: [the right answer with full reasoning]
If NO gap detected:
Validation: Response passes System 2 check.
Objective alignment: [confirmed] | Assumptions: [all valid] | Chain: [complete]
| Bias / Error | Detection Step | Signal | |---|---|---| | Proxy optimization | Step 1 | Literal ≠ real objective | | Undeclared assumption | Step 2 | "Obviously..." or implicit context | | Broken causal chain | Step 3 | Step produces "???" | | Anchoring | Step 1 | One detail dominates reasoning | | Satisficing | Step 4 | No active search for flaws | | Narrow framing | Step 2 | Only one dimension considered | | Confirmation bias | Step 4 | Evidence only supports, never challenges | | Sunk cost reasoning | Step 1 | Past effort justifies current path |
Agents can include this block to activate internal reflection without
invoking the external reflection-validator agent:
## Post-Response Reflection (System 2)
Before delivering your final response, run internally:
1. What is the REAL objective, not the literal one?
2. What did I assume implicitly? List 3 assumptions.
3. If the user follows my advice step by step, do they reach the goal?
4. Is there a broken link in the chain?
If you find a gap → correct and show the reasoning change.
The reflection-validator agent produces a structured report:
═══════════════════════════════════════════════
REFLECTION VALIDATOR — System 2 Analysis
═══════════════════════════════════════════════
Question .............. [original question]
Response evaluated .... [summary]
── Step 1: Real Objective ─────────────────
Literal / Real / Match? YES|NO
── Step 2: Assumptions ────────────────────
1-3 assumptions — valid / invalid
── Step 3: Simulation ─────────────────────
Causal chain — reaches objective? YES|NO
── Step 4: Gaps ───────────────────────────
Gaps or "No gaps detected"
── Step 5: Verdict ────────────────────────
VALIDATED / CORRECTED / REQUIRES_RETHINKING
═══════════════════════════════════════════════
reflection-validator agent: applies this protocol to any input; other agents reference the Embeddable Pattern via @testing
Create new skills, modify and improve existing skills, and measure skill performance. Use when users want to create a skill from scratch, edit, or optimize an existing skill, run evals to test a skill, benchmark skill performance with variance analysis, or optimize a skill's description for better triggering accuracy.
tools
Guide for creating high-quality MCP (Model Context Protocol) servers that enable LLMs to interact with external services through well-designed tools. Use when building MCP servers to integrate external APIs or services, whether in Python (FastMCP) or Node/TypeScript (MCP SDK).
tools
Sistema proactivo de bienestar individual
development
Search the web to resolve context gaps — documentation, versions, CVEs, best practices. Auto-starts SearxNG Docker if available, falls back to WebSearch.