plugins/spec-plugin/skills/probe-contract/SKILL.md
Find out how code ACTUALLY behaves by executing the real classes in a REPL — the real request/response shape, the real return value — without relying on a live staging environment. Beats writing a test for 'how does this behave?'.
npx skillsauth add jaisonerick/spec-plugin probe-contractInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Run this to answer "what does this actually do / what's the real shape?" by running the real code — not by reading it and guessing, and not by assuming a live staging environment exists (it usually doesn't). Past runs shipped contract bugs that 100+ green unit tests hid (a missing _live_ env infix, expires_in vs expires_at, an illegal cookie write) — caught only by exercising the real thing. Exercise the real thing.
This skill runs in an isolated forked Explore child and returns only its tight conclusion — so the caller's context stays clean.
find ~/.claude/plugins -type f -path '*spec-plugin/references/<lang>.md' 2>/dev/null | head -1
Read the matching card (ruby.md / typescript.md / python.md) — it has the exact REPL / tmux / debugger commands.Tool versions resolve via asdf automatically — call bin/rails/node/python directly; interactive sessions go through tmux per the card.
QUESTION: <what behavior/shape was probed>
OBSERVED: <the real value/shape — paste the actual output: the JSON, the return, the stack>
HOW: <one-shot REPL | tmux console | debugger> — <the command used>
CONTRACT NOTES: <any divergence from what the spec/types claimed — or "matches">
Observe and report; don't fix. If the app won't boot or a real dependency is missing, report exactly what's blocking.
tools
Assess how the LATEST spec-plugin version is performing across every previous session that invoked it — aggregate run efficiency (thinking%, compactions, exploration-vs-skills, preload firing, fresh-per-story), process adherence, and recurring spec-quality issues — then propose concrete, evidence-backed improvements for the NEXT version (plugin skills/agents/hooks, and spec/process patterns). Read-only: proposes, never self-modifies. Not tied to a single run.
development
Confirm whether a code symbol (method/class/field/endpoint/flag) actually exists and return its REAL signature + definition location — or the nearest match. Uses LSP/introspection, never grep-spelunking. Cheap and fast.
development
Walk one value or action end-to-end across every layer/hop — go-to-definition by go-to-definition, or with a debugger breakpoint — and report the real state transitions and where the contract/shape diverges. The workhorse for architecture sketches and cross-layer debugging.
testing
Bring a fresh worktree/checkout to a runnable state — verify base HEAD, copy gitignored files (.env), allocate per-agent DB/test env, install deps, run the smoke gate. Deterministic, mechanical. Reports a single ready/blocked verdict.