toolkit/packages/plugins/reflect/skills/recall/SKILL.md
Retrieve relevant prior learnings from the global knowledge base. Hybrid vector + graph search over 170+ indexed learnings, reranked by confidence, recency, and tag overlap. Use when starting work, debugging a recurring problem, or before implementing a feature that may have prior art.
npx skillsauth add stevengonsalvez/agents-in-a-box reflect:recallInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Queries the global learnings KB (GraphRAG + vector) and surfaces the top-N most relevant learnings for the current work, reranked by confidence, recency, and tag overlap.
Also fires automatically via the SessionStart hook (see
hooks/session_start_recall.py) with a 3-result cap, any confidence
(reranked by confidence/recency/tag-overlap). This skill is the
explicit, higher-limit path.
| Invocation | Behavior |
|---|---|
| /reflect:recall <query> | Default — 10 results, any confidence, markdown out |
| /reflect:recall <query> --limit 5 --confidence HIGH | Tight filter |
| /reflect:recall <query> --mode local | Graph-neighborhood search (finds related concepts) |
| /reflect:recall <query> --mode global | Community-based (broad patterns) |
| /reflect:recall <query> --format json | Structured output for programmatic use |
| /reflect:recall <query> --no-cache | Skip cache, force fresh query |
{{HOME_TOOL_DIR}}/skills/recall/scripts/recall.py:
uv run {{HOME_TOOL_DIR}}/skills/recall/scripts/recall.py "$QUERY" --limit 10 --format markdown
[lrn-id], key insight, and how-to-apply.learnings search <id> or check ~/.learnings/documents/learnings/.--tags a,b,c for reranking boost.~/.learnings/cli/learnings search as subprocess.confidence × recency × (1 + tag_overlap_bonus).
~/.reflect/recall_cache/, 1h TTL.~/.reflect/recall_log.jsonl for
future helpfulness analysis (Phase 6 of the retrieval plan)./reflect:ingest — populate the KB/reflect-status — KB health, coverage, pending reviewshooks/settings-snippet.json)documentation
Report reflect drain spend over a time window — tokens split by cached (cache_read), uncached writes (cache_creation), and io (input+output), with a $ estimate, grouped by day / outcome / model / transcript. Reads the drainer's cost log and surfaces outlier runs and cache-reuse health (the 41.5M-token failure mode = low cache reuse + high cache writes). Use to answer "what is reflection costing me" for the last day / week.
development
Show fleet status — every claude session running on the host, merged across ainb + claude-peers broker + background jobs. Use when you need to enumerate sessions before composing an action, see which sessions have a peer registered (broker-routable) vs tmux-only, check the `summary` of each session, or pipe the list into jq for filtering. Default output: text table. Pass --format json for LLM consumption.
testing
Ordered multi-step prompts to fleet targets, ack-gated between steps via JSONL assistant-turn-end detection. Use for cycles like disconnect→reconnect→verify, or any flow where step N+1 requires step N to have completed first. The skill BLOCKS until each target's transcript shows the next assistant turn finishing OR per-step timeout fires (default 300s).
development
Center control panel — enumerate every claude session that is blocked waiting on something: a user answer (AskUserQuestion fired), an API error retry, an idle assistant turn-end with no follow-up, or an explicit WAITING: marker. Returns rich JSON with signal kind + context per session. Use this when you've stepped away from the fleet and want one place to see everything that wants your attention and answer it.