Adoption

Agent Skills are supported by leading AI development tools.

VS Code, Gemini CLI, GitHub, Goose, Amp, Cursor, Claude Code, Letta, OpenCode, Claude, OpenAI Codex, Factory

ahundt/cache

Name: cache
Author: ahundt

plugins/autorun/skills/cache/SKILL.md

npx skillsauth add ahundt/autorun cache


/ar:cache — Cache Miss / Compaction Protection Skill

A PreToolUse gate that blocks commands when the prompt cache is effectively cold — saving tokens by preventing full-rate re-reads of context that has just fallen out of the 5-minute prompt-cache TTL. Off by default.

When to use this skill

Invoke when the user asks about any of:

- setting a cache-miss / cache-hit-ratio threshold
- protecting against compaction-triggered context loss
- /ar:cache, /ar:cache on, /ar:cache ok 5m, /ar:cache set …
- "how do I stop Claude from burning tokens after idle time"
- "what is the cache gate", "cache protection"

1. Philosophy

- Off by default. No block fires unless the user runs /ar:cache on.
- Fail-open. If usage data is unavailable (old Claude Code version, Gemini CLI without cache-token fields, malformed JSONL), the gate allows.
- Reuse, don't reinvent. The gate uses the same ScopedAllow grants, parse_scope_args grammar, session_state filelock backend, and check_blocked_commands block UX that /ar:ok / /ar:no already use.
- Stay out of the statusline. The gate does not install or modify the user's statusLine setting. An optional one-line tap exists for users who want to feed their statusline's rich JSON into the gate themselves.
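The fail-open principle can be illustrated with a minimal sketch of a transcript tail reader. This is not the code in cache_guard.py: the function names and the record layout (a `message` object carrying `role` and `usage`) are assumptions made for illustration, and the 64 KB tail size is borrowed from the decision flow in section 4. Any failure to find usage data yields `None`, which the caller treats as "allow".

```python
import json
import os
from typing import Optional

TAIL_BYTES = 64 * 1024  # tail size taken from the 64 KB reverse scan in section 4


def read_last_usage(transcript_path: str) -> Optional[dict]:
    """Return the newest assistant usage dict, or None when nothing usable is found."""
    try:
        with open(transcript_path, "rb") as handle:
            handle.seek(0, os.SEEK_END)
            size = handle.tell()
            handle.seek(max(0, size - TAIL_BYTES))
            tail = handle.read().decode("utf-8", errors="replace")
    except OSError:
        return None  # missing or unreadable transcript: no data
    for line in reversed(tail.splitlines()):
        try:
            record = json.loads(line)
        except ValueError:
            continue  # malformed or truncated line: skip it
        if not isinstance(record, dict):
            continue
        message = record.get("message")  # assumed record layout
        if not isinstance(message, dict) or message.get("role") != "assistant":
            continue
        usage = message.get("usage")
        if isinstance(usage, dict):
            return usage
    return None


def gate_step(transcript_path: str) -> str:
    """Fail-open: without usage data the gate allows instead of blocking."""
    usage = read_last_usage(transcript_path)
    return "ALLOW" if usage is None else "CHECK_THRESHOLDS"
```

Reading only the tail keeps the hook cheap on long transcripts. A partial first line at the tail boundary fails to parse and is skipped like any other malformed line.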

2. Commands

/ar:cache                       # show status
/ar:cache on [5m|1h|perm]       # enable
/ar:cache off [5m|1h|perm]      # disable (temporarily or until re-enabled)

/ar:cache set ratio 0.3         # block when cache hit ratio < 30%
/ar:cache set read 50k          # block when cache_read_tokens < 50,000
/ar:cache set age 10m           # block when last assistant message is older than 10 min
/ar:cache set full 0.9          # block when total_input / window > 90%

/ar:cache ok 5m                 # override for 5 minutes
/ar:cache ok 3                  # override next 3 tool uses
/ar:cache ok perm               # override until axis clears / session end
/ar:cache no                    # clear outstanding overrides

Units: tokens accept 50000, 50,000, 50_000, 50k, .5M, 1.5M. Percent: 85% or bare decimal 0.85. Durations: 5m, 1h, 2h30m, perm.
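The accepted unit spellings can be made concrete with a small parsing sketch. The helper names below (`parse_tokens`, `parse_fraction`, `parse_duration`) are hypothetical and are not the plugin's parse_scope_args grammar. They only show one way the listed forms could be normalized, under the assumption that `k` means 1,000 and `M` means 1,000,000.

```python
import re
from typing import Optional


def parse_tokens(text: str) -> int:
    """Parse token counts such as 50000, 50,000, 50_000, 50k, .5M, 1.5M."""
    cleaned = text.strip().replace(",", "").replace("_", "")
    match = re.fullmatch(r"(\d*\.?\d+)([kKmM]?)", cleaned)
    if not match:
        raise ValueError(f"not a token count: {text!r}")
    number, suffix = match.groups()
    scale = {"": 1, "k": 1_000, "m": 1_000_000}[suffix.lower()]
    return round(float(number) * scale)


def parse_fraction(text: str) -> float:
    """Parse '85%' or a bare decimal such as '0.85' into a fraction in [0, 1]."""
    cleaned = text.strip()
    value = float(cleaned[:-1]) / 100.0 if cleaned.endswith("%") else float(cleaned)
    if not 0.0 <= value <= 1.0:
        raise ValueError(f"fraction out of range: {text!r}")
    return value


def parse_duration(text: str) -> Optional[int]:
    """Parse 5m, 1h, 2h30m into seconds; 'perm' maps to None (no expiry)."""
    cleaned = text.strip().lower()
    if cleaned == "perm":
        return None
    if not re.fullmatch(r"(\d+[hm])+", cleaned):
        raise ValueError(f"not a duration: {text!r}")
    seconds = {"h": 3600, "m": 60}
    return sum(int(n) * seconds[u] for n, u in re.findall(r"(\d+)([hm])", cleaned))
```

A bare integer scope such as the `3` in /ar:cache ok 3 is a tool-use count, not a duration, so this sketch deliberately rejects it in `parse_duration`.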

3. Grammar decision: `/ar:cache ok` not `/ar:ok cache`

The two grammars are semantically distinct. /ar:ok <pattern> allows a command-string pattern; /ar:cache ok <scope> overrides the feature gate. Overloading /ar:ok cache would silently collide with any user command named cache. Plan §6.5.1 therefore decides against an /ar:ok cache alias in v1. A sigil-based alias (e.g. /ar:ok @cache) is backwards-compatible to add later if users ask.

4. How the decision flows (PreToolUse)

 tool use
     │
     ▼
 FeatureToggle("cache").is_enabled()  ──► false → ALLOW (default)
     │ true
     ▼
 CacheGuard._read_usage:
   1) cache/last_usage memo (2 s TTL) — reuse on burst calls
   2) cache/statusline_snapshot       — opt-in fast path
   3) JSONL tail (64 KB reverse scan) — find last assistant message.usage
     │   none available → ALLOW (fail-open)
     ▼
 CacheThreshold axes:
   - cache_hit_ratio_min     (cache_read / (input + cache_read + cache_creation))
   - cache_read_tokens_min   (raw cache_read_input_tokens floor)
   - cache_age_max_seconds   (age of last assistant timestamp)
   - compaction_used_max     (total_input / context_window)
   - rate_limit_5h_max       (from statusline snapshot; rate-limit wording)
   - rate_limit_7d_max       (from statusline snapshot; rate-limit wording)
     │ none tripped → ALLOW
     ▼
 ScopedAllow override active? (/ar:cache ok …)
   yes → consume + ALLOW
   no  → BLOCK with copy-paste override hints
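The flow above can be sketched in a few lines of Python. This is a simplified illustration, not the CacheGuard implementation: the `Thresholds` dataclass, the function names, and the `input_tokens` / `cache_creation_input_tokens` field names are assumptions, `total_input` is assumed to be the same sum used in the ratio's denominator, and the age and rate-limit axes are left out for brevity.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Thresholds:
    """Configured axes; None means the axis is not set (hypothetical container)."""
    cache_hit_ratio_min: Optional[float] = None
    cache_read_tokens_min: Optional[int] = None
    compaction_used_max: Optional[float] = None


def total_input(usage: dict) -> int:
    # Assumed field names, following input + cache_read + cache_creation.
    return (usage.get("input_tokens", 0)
            + usage.get("cache_read_input_tokens", 0)
            + usage.get("cache_creation_input_tokens", 0))


def tripped_axes(usage: dict, limits: Thresholds, context_window: int) -> List[str]:
    """Return the names of every configured axis whose threshold is violated."""
    total = total_input(usage)
    read = usage.get("cache_read_input_tokens", 0)
    tripped = []
    if limits.cache_hit_ratio_min is not None and total > 0:
        if read / total < limits.cache_hit_ratio_min:
            tripped.append("cache_hit_ratio_min")
    if limits.cache_read_tokens_min is not None and read < limits.cache_read_tokens_min:
        tripped.append("cache_read_tokens_min")
    if limits.compaction_used_max is not None and context_window > 0:
        if total / context_window > limits.compaction_used_max:
            tripped.append("compaction_used_max")
    return tripped


def decide(enabled: bool, usage: Optional[dict], limits: Thresholds,
           context_window: int, override_active: bool) -> str:
    if not enabled:
        return "ALLOW"  # feature is off by default
    if usage is None:
        return "ALLOW"  # fail-open: no usage data
    tripped = tripped_axes(usage, limits, context_window)
    if not tripped:
        return "ALLOW"
    if override_active:
        return "ALLOW"  # a real gate would also consume the grant here
    return "BLOCK (" + ", ".join(tripped) + ")"
```

Every early return is an ALLOW, so a block needs three things at once: the feature enabled, at least one axis tripped, and no active override.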

5. Cross-CLI matrix

| Capability | Claude Code | Gemini CLI |
|-------------------------------------|-------------------|----------------------|
| hook stdin transcript_path | yes | yes |
| message.usage.cache_read_* in JSONL | yes | unknown schema; probe |
| Compaction event name | PreCompact / PostCompact / SessionStart(compact) | PreCompress (advisory) / SessionStart |
| cache_hit_ratio_min axis | active | fail-open (no data) |
| cache_read_tokens_min axis | active | fail-open |
| cache_age_max_seconds axis | active | active |
| compaction_used_max axis | active (with snapshot) | active when total tokens available |
| rate_limit_*_max axes | only with statusline tap | unavailable |

/ar:cache status prints which axes are inactive for the current CLI.

6. Multiprocess safety

The gate uses session_state() — a single shared JSON file with prefixed keys and filelock + atomic tempfile rename. All writes are serialised; reads re-load inside the lock. Parallel hook invocations (e.g. rtk-spawned) inherit the same _PARALLEL_GRACE_SECONDS = 1.0 s window that ScopedAllow already uses. No new state file, no new lock.
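The lock-then-rename pattern described here can be sketched with the standard library alone. This is an assumption-laden illustration, not the session_state() backend: it uses `fcntl.flock` (POSIX only) as a stand-in for the filelock package, and the `update_state` name and the `.lock` sidecar path are hypothetical.

```python
import fcntl
import json
import os
import tempfile


def update_state(path: str, key: str, value) -> dict:
    """Set one prefixed key in a shared JSON file under an exclusive lock."""
    with open(path + ".lock", "w") as lock_file:  # hypothetical sidecar lock file
        fcntl.flock(lock_file, fcntl.LOCK_EX)  # blocks until we hold the lock
        try:
            # Re-load inside the lock so we never overwrite a concurrent write.
            try:
                with open(path, "r", encoding="utf-8") as handle:
                    state = json.load(handle)
            except (FileNotFoundError, ValueError):
                state = {}
            state[key] = value
            # Write to a temp file in the same directory, then rename atomically.
            directory = os.path.dirname(os.path.abspath(path))
            fd, tmp_path = tempfile.mkstemp(dir=directory, suffix=".tmp")
            try:
                with os.fdopen(fd, "w", encoding="utf-8") as tmp:
                    json.dump(state, tmp)
                    tmp.flush()
                    os.fsync(tmp.fileno())
                os.replace(tmp_path, path)
            except BaseException:
                os.unlink(tmp_path)  # do not leave a stray temp file behind
                raise
            return state
        finally:
            fcntl.flock(lock_file, fcntl.LOCK_UN)
```

Because the temp file lives in the same directory as the target, `os.replace` stays on one filesystem. The lock serialises writers that follow this protocol; the atomic rename ensures that anyone opening the file, locked or not, sees either the old or the new contents and never a partial write.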

7. Files

| Path | Role |
|----------------------------------------------------------|---------------------------------------|
| plugins/autorun/src/autorun/cache_guard.py | Single-file feature (~500 LOC). |
| plugins/autorun/tests/test_cache_guard.py | Full test matrix (parser → multiprocess). |
| plugins/autorun/commands/cache.md | /ar:cache command definition. |
| plugins/autorun/skills/cache/SKILL.md | This file. |

8. Troubleshooting

- "Gate doesn't fire." Run /ar:cache status. Is it enabled? Are any axes configured? Set at least one axis (/ar:cache set ratio 0.5) after enabling.
- "Blocked when I didn't want it." Run /ar:cache ok 5m for a temporary override, or /ar:cache off to disable the gate.
- "Gemini says axes unavailable." Correct: Gemini CLI hooks do not surface cache-token fields. Use the age axis instead; it works on both CLIs.
- "I want the rich statusline numbers." Add one line to your own statusline script: `printf '%s' "$INPUT" | autorun --cache-snapshot >/dev/null 2>&1 &`

9. Plan reference

Full design: ~/.claude/plans/make-a-plan-to-sunny-sparkle.md. Key sections: §6.1 (reused magic classes), §6.5 (usage detection), §6.5.1 (grammar decision), §6.7 (block message), §6.8 (multiprocess), §7 (ASCII diagrams).

Use this skill when the user asks to "enable cache protection", "block on cache miss", "avoid cache expiration cost", "/ar:cache", "protect against compaction", "cache pressure guard", "cache hit ratio threshold", or "set cache-miss threshold". Covers the /ar:cache slash command family for blocking tool use when the Claude Code prompt cache is cold or context is near compaction.

9 stars

tools

Updated Apr 26, 2026

$ install --global

skillsauth

npx skillsauth add ahundt/autorun cache

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

| Scanner | Description | Status | % |
|---|---|---|---|
| Trivy | Container and dependency vulnerability scanner | Clean | 95% |
| Semgrep | Static code analysis for vulnerabilities | Clean | 95% |
| mcp-scan (Snyk) | Model Context Protocol security validation | Clean | 95% |
| Snyk (dep) | Open source security scanning | Skipped | 50% |
| Socket.dev | Supply chain security analysis | Skipped | 50% |
| VirusTotal | Multi-engine malware detection | Skipped | 50% |
| CrowdStrike | Advanced threat intelligence | Skipped | 50% |
| OSV-Scanner | Open Source Vulnerability database check | Skipped | 50% |
| OWASP Dep-Check | | Skipped | 50% |

Last scanned: Apr 26, 2026, 6:26 AM · 49.6s · 1 file scanned


Related Skills

ahundt/claude-session-tools

tools

Verified · Trusted · Community

(Renamed) Search, recover, and analyze sessions from Claude Code, AI Studio, and Gemini CLI; use /ar:ai-session-tools (this is an alias that redirects there).

9 · SKILL.md · Updated Mar 26, 2026

ahundt/autorun-maintainer

tools

Verified · Trusted · Community

Expertise in maintaining, debugging, and deploying the autorun hook system for Claude Code and Gemini CLI. Use when the user asks to "fix hooks", "deploy autorun", "debug hook errors", "update autorun version", or when troubleshooting "invisible failures" where safety guards appear inactive, piped commands are blocked, or work appears to have "reverted" after a session.

9 · SKILL.md · Updated Mar 26, 2026

ahundt/ai-session-tools

tools

Verified · Trusted · Community

Search, recover, and analyze AI session histories across Claude Code, AI Studio, and Gemini CLI. Use when user asks to "find that file from last week", "search sessions", "recover context after compaction", "what did the AI do", "export session to markdown", "find corrections", "analyze session quality", "improve CLAUDE.md from past mistakes", or "turn AI mistakes into rules". Contains session search, file recovery, correction detection, self-improvement workflow.

9 · SKILL.md · Updated Mar 26, 2026

ahundt/pdf-extractor

development

Verified · Trusted · Community

This skill should be used when the user asks to "extract text from PDF", "convert PDF to text", "parse PDF", "read PDF contents", "extract data from documents", "batch PDF extraction", "PDF to markdown", "OCR PDF", "get text from PDF files", "I have a PDF", "can you read this PDF", "what's in this PDF", "summarize this PDF", "open PDF file", "extract from [filename].pdf", or needs to process PDF documents for data extraction. Handles single-file extraction, batch processing, and OCR for scanned documents with automatic backend selection.

6 · SKILL.md · Updated Mar 26, 2026


Install manually

# Clone the repo
git clone https://github.com/ahundt/autorun.git

# Copy into Claude Code skills folder (global); create the folder first if it is missing
mkdir -p ~/.claude/skills
cp -r autorun/plugins/autorun/skills/cache ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

ahundt/autorun

9 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT
