plugins/spec-plugin/skills/run-retrospective/SKILL.md
Post-version retrospective that captures lessons learned, fixes documentation drift, and proposes skill improvements. Analyzes PROGRESS.md, story logs, QA results, and commit/change history to identify struggle patterns and knowledge worth preserving. Run after a version is shipped.
npx skillsauth add jaisonerick/spec-plugin run-retrospectiveInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Your output: documentation fixes, topic-based lesson files in docs/, and proposed skill/CLAUDE.md improvements.
Gather all sources of truth about what happened during this version:
lessons.md — specs/<version>/lessons.md is the PO's curated session log; it's your richest single source of what the team struggled with and learned.specs/<version>/logs/engineer-*.md for the raw learnings feed behind lessons.md.specs/<version>/ for ## Execution Log sections ("Issues Encountered", "Struggled With", "Decisions Made")specs/<version>/qa/ for run records, the DoD audit (dod-audit.md), setup/DX findings, and recurring patternsspecs/<version>/setup-playbook.md: what setup friction got discovered and codified (and whether it should be promoted to a reusable script)specs/<version>/context.md and specs/<version>/architecture.md; note any drift between them and realitygit log --oneline for the version's timeframe; look for fix commits, reverts, red-button halts, DoD re-gate loops.claude/skills/*/SKILL.md (or the plugin source) and project conventions for context on what agents are told to doCategorize findings into four buckets:
Architecture doc says X, but the actual code does Y. Examples:
Things that took multiple attempts. Sources:
Places where agents couldn't proceed without help:
Recurring issues that skills should prevent:
After listing ALL findings across all four categories:
Fix factual discrepancies between documentation and reality. These are corrections, not opinions:
specs/<version>/architecture.md) — fix incorrect ports, endpoints, data models, configuration values that don't match the actual codespecs/<version>/stories.md if any stories have incorrect statusspecs/roadmap.md if the version's status or shipped state changedKeep changes minimal and factual. Don't rewrite sections — fix specific inaccuracies.
For each category below, present findings to the user via AskUserQuestion and only proceed with approved changes.
docs/ FilesCreate or update topic-based files in docs/. Each file covers one topic and can contain multiple findings.
File format:
# <Topic>
## <Finding title>
<What went wrong, 1-2 sentences>
<What works and how to prevent it, 1-2 sentences>
## <Finding title>
...
Rules:
docs/<topic>.md fileBefore writing, ask the user (via AskUserQuestion):
"I found these lessons worth documenting. Which should I save?"
Group proposed changes by skill file. For each:
AskUserQuestion whether to applyIMPORTANT: Never edit cached plugin files. Skills from plugins live in .claude/plugins/cache/ — these are read-only copies that get overwritten on plugin updates. If a proposed change targets a plugin skill, edit the source instead:
.claude/plugins/known_marketplaces.json for the marketplace install location<installLocation>/plugins/<plugin-name>/skills/<skill-name>/SKILL.mddocs/ and flag it for manual upstreamingOnly for truly universal conventions that apply across ALL versions. These should be rare.
Present each proposed addition and ask via AskUserQuestion before adding.
docs/ (Phase 4A)Retrospective: <version> — N lessons, M doc fixes, K skill updates
SendMessage:
tools
Assess how the LATEST spec-plugin version is performing across every previous session that invoked it — aggregate run efficiency (thinking%, compactions, exploration-vs-skills, preload firing, fresh-per-story), process adherence, and recurring spec-quality issues — then propose concrete, evidence-backed improvements for the NEXT version (plugin skills/agents/hooks, and spec/process patterns). Read-only: proposes, never self-modifies. Not tied to a single run.
development
Confirm whether a code symbol (method/class/field/endpoint/flag) actually exists and return its REAL signature + definition location — or the nearest match. Uses LSP/introspection, never grep-spelunking. Cheap and fast.
development
Walk one value or action end-to-end across every layer/hop — go-to-definition by go-to-definition, or with a debugger breakpoint — and report the real state transitions and where the contract/shape diverges. The workhorse for architecture sketches and cross-layer debugging.
testing
Bring a fresh worktree/checkout to a runnable state — verify base HEAD, copy gitignored files (.env), allocate per-agent DB/test env, install deps, run the smoke gate. Deterministic, mechanical. Reports a single ready/blocked verdict.