harness/claude/skills/self-reflection/SKILL.md
Capture short, actionable reflections when instructions, workflows, or tools caused friction. Use to improve future agent runs.
npx skillsauth add popoffvg/dotfiles self-reflectionInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
When something didn't go smoothly — unclear instructions, unexpected behavior, wasted effort, a better path discovered — write a reflection.
Reflect when the experience is likely to recur or has broader implications:
Think beyond the immediate problem. Ask: what systemic issue does this reveal? What would a different agent or future session benefit from knowing?
Reflections are stored one file per day:
agent/shared/reflections/YYYY-MM-DD.md
2026-04-07.md).Append an entry to agent/shared/reflections/YYYY-MM-DD.md (today's date) using edit (preferred) or write.
Template (adapt freely — structure matters less than insight):
### HH:MM - <Agent/Skill/Workflow> - <Topic>
**What happened:** <the situation and what went wrong or was surprising>
**Root cause:** <why it happened — dig deeper than the surface symptom>
**What would help:** <concrete improvement — to instructions, tools, skills, or workflow>
Eval checklist:
Test inputs:
Can change: reflection template wording, trigger guidance, phrasing examples, concision rules Cannot change: daily file storage path, root-cause orientation, no-secrets rule Min sessions before eval: 5 Runs per experiment: 3
testing
Use when the user asks to create test sets, enumerate scenarios, generate edge cases, or draft a coverage matrix before implementation.
testing
Use when the user asks to review, audit, score, or validate test sets for missed cases before execution or merge.
tools
Test harness plugins in isolation using tmux panes. Runs MCP servers, unit tests, typecheck, and Claude plugin loading. Use when user says "test plugin", "check plugin", "run plugin tests", "validate plugin", or names a specific plugin to test.
development
Guide for designing integration and e2e tests using BDD (Behavior-Driven Development) methodology with Cucumber-style Given/When/Then scenarios. Use when writing or reviewing tests for any service, API, or component. Language-agnostic — covers scenario structure, step notation, assertion principles, async patterns, and common anti-patterns.