plugins/claude-code-expert/skills/context-budgeting/SKILL.md
--- name: context-budgeting description: Manage Claude Code's context window — token arithmetic, /compact strategy, anchor preservation, progressive loading, session analytics. Use this skill whenever a session gets long, context approaches limits, after /compact, when deciding what to load into CLAUDE.md vs leave in references, or when analyzing session cost/token usage. Triggers on: "context full", "compact", "too many tokens", "budget", "session analytics", "save tokens", "context window", "/
npx skillsauth add markus41/claude plugins/claude-code-expert/skills/context-budgetingInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Every token in context is a cost you pay every turn. This skill is the discipline that keeps long sessions cheap and coherent.
Rough budget on a 200K-context model:
| Consumer | Typical | Budget | |---|---|---| | System prompt + CLAUDE.md | 3–8 K | ≤ 10 K | | Skills frontmatter (all active) | 2–6 K | ≤ 8 K | | MCP tool schemas (passive) | 5–25 K | ≤ 15 K | | Hook definitions | <1 K | <1 K | | Conversation history | variable | ~120 K | | Working headroom | — | ≥ 20 K |
If passive context (everything before conversation history) exceeds 30 K, optimize.
| Tier | What | Size | Loaded |
|---|---|---|---|
| 1 — Frontmatter | name + description | ~50 tokens | always |
| 2 — Body | SKILL.md after frontmatter | ≤ 500 lines | on activation |
| 3 — References | references/*.md, MCP KB artifacts | unlimited | on demand |
Rule: if content doesn't change Claude's behavior every session, it belongs in tier 3.
/compact condenses conversation history when it's filling up. It does NOT touch system prompt, CLAUDE.md, or skills.
When to /compact:
When NOT to /compact:
/clear (nothing to compact).Anchor preservation: before /compact, save anything you'd lose to engram via mem_save. After compact, the post-compact-context-restoration hook re-loads memory rules and recent context.
| Command | What it does |
|---|---|
| /compact | Summarize history, keep system prompt + CLAUDE.md |
| /clear | Wipe everything including CLAUDE.md — fresh session |
Use /clear between unrelated tasks. Use /compact mid-task.
When a single subtask would consume too much context (e.g. scanning a huge codebase), delegate to a subagent. The agent gets a fresh window; you only see its report.
Agent({
description: "Inventory all API endpoints",
prompt: "Scan src/api/ recursively, list every route with method, auth requirement, and response schema. Report under 500 words."
})
Net: the scan consumes agent tokens; your session only pays for the 500-word summary.
Track cost patterns:
| Metric | Target | |---|---| | Passive context (pre-conversation) | ≤ 30 K | | Cost per turn | ≤ $0.05 (Sonnet), ≤ $0.15 (Opus) | | Tools loaded but never called per session | 0 (prune unused MCPs) | | /compact frequency | every 20–30 exchanges on long sessions | | Subagent delegation rate | ~1 per 5 substantive turns |
Session analytics isn't automatic — periodically review what's consuming tokens and trim.
| Need | Tool |
|---|---|
| Settings schema for compact config | cc_docs_settings_schema |
| Compact/budget troubleshooting | cc_docs_troubleshoot("context") |
| Model cost for budgeting | cc_docs_model_recommend(task, budget) |
development
Enhanced plan-authoring skill with Pre-Writing context gathering, task metadata, non-TDD templates, Red Flags, telemetry, and an automated plan linter. Use when you have a spec or requirements for a multi-step task, before touching code.
tools
Documentation intelligence engine with graph-based API docs, algorithm library, and drift detection
tools
Ultraplan cloud planning — kick off a plan in the cloud from your terminal, review and revise in the browser, then execute remotely or send back to CLI
tools
--- name: mcp description: Configure MCP servers for Claude Code — stdio vs HTTP, authentication, Tools/Resources/Prompts distinction, channels (CI webhook, mobile relay, Discord bridge, fakechat), and cost of always-loaded tools. Use this skill whenever adding an MCP server, debugging connection issues, choosing between MCP Tools vs Prompts vs Resources, installing channel servers, or managing .mcp.json. Triggers on: "MCP server", "mcp config", "add Obsidian MCP", "install context7", "channels"