plugins/memory-palace/skills/memory-clarity-probe/SKILL.md
Probe memory/summary clarity via dual anchor questions: task progress, info gaps. Use when verifying session state or summary before handoff or compression.
npx skillsauth add athola/claude-night-market memory-clarity-probeInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Assess whether a memory, summary, or session state retains enough task information to guide future reasoning.
A quality gate for any memory or summary, based on the dual-probe pattern from MMPO (arXiv:2605.30159, Liu et al. 2026). The probe asks two anchor questions against the current memory and evaluates whether the answers are confident and complete:
A clear memory answers the progress probe with specific, verifiable state (not vague placeholders) and enumerates bounded, concrete unknowns on the gap probe. An ambiguous memory produces hedging on the progress probe and open-ended uncertainty on the gap probe.
The two probes target different failure modes:
The MMPO paper's ablation (Table 4) shows progress+gap outperforms
gap-only across all context lengths. Use both probes.
This skill implements a qualitative clarity assessment. It does not compute the token-level predictive entropy (Belief Entropy, Eq. 5 in MMPO) that the paper uses for RL training. Night-market has no access to the model's internal log-probabilities.
The paper's Table 6 shows that qualitative probing (labeled "direct-answer entropy", r=0.54) is weaker than true entropy (r=0.68), and can encourage premature confidence. Use this probe as a necessary quality check, not a sufficient one.
conserve:clear-context hands off to a continuation agentmemory-palace:session-palace-buildermemory-palace:knowledge-intakeimbue:proof-of-work declares work completeAccept the memory or summary as input. Sources:
Evaluate the memory against:
Based on the memory below, what is the current task progress?
Describe specifically what has been completed and what state
the task is in right now.
<memory>
{memory_content}
</memory>
Score the answer:
Evaluate the memory against:
Based on the memory below, what information is still needed
to complete the task? List specific open questions or missing
facts, not generic categories.
<memory>
{memory_content}
</memory>
Score the answer:
| Progress | Gap | Composite | Action | |----------|-----|-----------|--------| | Clear | Bounded | Clear | Proceed | | Clear | Expanding | Ambiguous | Consider expanding memory | | Clear | Overconfident | Suspect | Re-read task requirements | | Ambiguous | Bounded | Ambiguous | Expand memory or ask user | | Ambiguous | Expanding | Unclear | Regenerate or expand memory | | Unclear | Any | Unclear | Memory must be regenerated |
Produce the output in the format below and take the recommended action if invoked as an autonomous gate.
When evaluating N candidate summaries (e.g., from multiple summarization attempts):
To generate N candidates, invoke a summarization skill N times with varied prompts or temperatures, then pass all results to this probe. Typical N=3 gives a useful signal; N=5 matches the paper's Best-of-5 finding (Figure 3c).
## Clarity Assessment
**Progress probe**: [Clear | Ambiguous | Unclear]
> {exact answer the model produced}
**Gap probe**: [Bounded | Expanding | Overconfident]
> {exact answer the model produced}
**Composite**: [Clear | Ambiguous | Suspect | Unclear]
**Recommendation**: [Proceed | Expand memory | Regenerate]
**Specific issues** (if composite is not Clear):
- {issue 1}
- {issue 2}
As a pre-handoff gate (conserve:clear-context):
Before saving session-state.md, invoke memory-clarity-probe
on the draft state. If composite is Unclear, expand the state
with explicit answers to both probes before saving.
As a session checkpoint (memory-palace:session-palace-builder):
At major task transitions (design complete, implementation
started, tests passing), invoke memory-clarity-probe on the
current palace state. Log the composite score.
As a completion check (imbue:proof-of-work):
Before declaring work complete, invoke memory-clarity-probe.
The progress probe should return Clear with all deliverables
named. The gap probe should return Bounded with zero open items.
tools
Detect friction signals; graduate patterns into rules. Use for session retrospectives.
testing
Use when you need a diff-derived test plan for an MR — reads the diff, groups changes by area, runs targeted verifications, and proves revert-tests are genuine guards, not dead assertions.
development
Curate the web-capture index. Use when the capture backlog grows, captures sit unprocessed at seedling/pending, or to surface stored research during work.
testing
Runs parallel prose and craft review agents against a voice profile. Use when checking generated content for AI patterns and voice drift before publishing.