skills/layered-recall/SKILL.md
4-layer memory recall system. Layer 1 (identity) always loaded, Layer 2 (critical facts) per-project, Layer 3 (room recall) on-demand, Layer 4 (deep search) when needed. Progressive context loading for token efficiency.
npx skillsauth add rubicanjr/FinCognis layered-recallInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
4-layer progressive memory system. Each layer adds more context only when needed, saving tokens while ensuring nothing important is missed.
Layer 1: Identity (always loaded, ~200 tokens)
Who is the user? What are their preferences?
Layer 2: Critical Facts (per-project, ~500 tokens)
Hard constraints, active decisions, blockers
Layer 3: Room Recall (on-demand, ~1-2K tokens)
Relevant memories for current task domain
Layer 4: Deep Search (when needed, ~2-5K tokens)
Full semantic search across all memories
Loaded at every session start. Contains:
Source: ~/.claude/projects/*/memory/user_*.md
Loaded when entering a project directory. Contains:
Source: ~/.claude/projects/*/memory/project_*.md + thoughts/CONTEXT.md
Loaded when task domain is detected (auth, database, deploy, etc.). Contains:
Source: Memory palace rooms + mature-instincts.json filtered by domain
Trigger: Intent classifier detects domain (e.g., "fix the login bug" -> room: authentication)
Only loaded when explicitly needed or when Layers 1-3 don't have enough context. Contains:
Source: PostgreSQL vector search + palace cross-wing search
Trigger: Agent explicitly queries, or user asks "have we done this before?"
Session Start
-> Load Layer 1 (identity)
-> Detect project -> Load Layer 2 (facts)
-> User sends prompt
-> Classify intent/domain -> Load Layer 3 (room)
-> If insufficient context -> Load Layer 4 (deep)
| Layer | Tokens | When | |-------|--------|------| | L1 | ~200 | Always | | L2 | ~500 | Per project | | L3 | ~1-2K | Per task domain | | L4 | ~2-5K | On demand | | Total max | ~8K | Worst case |
vs. loading everything: ~30-50K tokens
Savings: 4-6x token reduction
instinct-loader -> feeds Layer 2 and Layer 3smart-memory-recall -> implements Layer 3 scoringintent-classifier -> triggers Layer 3 room selectiongraph-indexer -> powers Layer 4 deep searchdevelopment
Goal-based workflow orchestration - routes tasks to specialist agents based on user goals
tools
Wiring Verification
development
Connection management, room patterns, reconnection strategies, message buffering, and binary protocol design.
development
Screenshot comparison QA for frontend development. Takes a screenshot of the current implementation, scores it across multiple visual dimensions, and returns a structured PASS/REVISE/FAIL verdict with concrete fixes. Use when implementing UI from a design reference or verifying visual correctness.