Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

jaykim88/caching-strategy

Name: caching-strategy
Author: jaykim88

plugins/backend-toolkit/skills/caching-strategy/SKILL.md

npx skillsauth add jaykim88/claude-ai-engineering caching-strategy

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Caching Strategy

Purpose

Add caching deliberately — with a clear read/write/invalidate flow, stampede protection, and an invalidation plan — so it reduces load without serving stale or inconsistent data or collapsing under a hot-key herd.

Universal — cache-aside flow, TTL+jitter, stampede prevention, and invalidation strategy are caching principles independent of the cache store; Redis is the default implementation.

Procedure

Optimize the query FIRST, cache second
- Caching a slow query hides the problem and adds staleness risk
- Run query-optimization before adding a cache layer
Use cache-aside (lazy loading) as the default pattern
- Read: check cache → miss → read DB → populate cache → return
- Write: write DB → invalidate (delete) the cache key (don't write-through unless justified)
- Delete-on-write > update-on-write: avoids cache/DB races
Set TTL with jitter
- Every cached key gets a TTL (no infinite caches without an invalidation plan)
- Add random jitter to TTLs so keys don't all expire simultaneously (a synchronized expiry = mass stampede)

3b. Bound the cache: memory budget + eviction policy

Set maxmemory and choose an eviction policy deliberately (allkeys-lru for general read-through caches; volatile-lru if you mix persistent state into the same instance — but ideally don't)
Without a bound, a runaway key generator (per-user, per-query-fingerprint) eats memory until OOM
Cache-key cardinality: unbounded distinct keys = unbounded memory; cap or hash high-cardinality identifiers

Prevent cache stampede on hot keys
- Single-flight lock: first request acquires an atomic compare-and-set lock, recomputes, others wait/serve-stale
- Probabilistic early refresh: recompute slightly before expiry with rising probability
- Both must be executed atomically (store-specific mechanism in Implementation)
- Without this, a popular key expiring under load → thundering herd hammers the DB
- Negative caching: cache "this key doesn't exist" (a sentinel value, short TTL) for queries that miss — otherwise the same non-existent key hits the DB on every request (silent thundering herd from 404s)
Plan invalidation explicitly — the hard part
- Know exactly which writes invalidate which keys
- Use key naming conventions (user:{id}:profile) so invalidation is targeted
- Tag-based / versioned keys for "invalidate everything related to X"

5b. Cache is an optimization, not a source of truth

The app MUST keep working with an empty / unavailable cache — degraded latency, not broken behavior
Never store authoritative state only in the cache (auth tokens, balances). On Redis restart you lose it
Use resilience-patterns circuit breaker around the cache client so a cache outage doesn't cascade

Validate (validation loop)
- Load-test with a hot key expiring under concurrency → verify no DB spike (stampede prevented)
- After a write, verify the cache returns fresh data (invalidation works)
- If stale data served → invalidation gap; fix the write→invalidate wiring

Anti-patterns

| ❌ Anti-pattern | ✅ Correct | |---|---| | Caching before optimizing the query | Optimize first, cache second | | No TTL (infinite cache, no invalidation plan) | TTL + explicit invalidation strategy | | All keys same TTL | TTL + jitter to desynchronize expiry | | Update-cache-on-write (race-prone) | Delete-cache-on-write (cache-aside) | | No stampede protection on hot keys | Single-flight lock or probabilistic refresh | | No maxmemory / eviction policy (OOM under load) | maxmemory + allkeys-lru (or chosen policy) | | Unbounded distinct cache keys (memory leak) | Cap or hash high-cardinality keys | | 404s hitting the DB on every retry (negative-cache miss) | Cache "not found" sentinel with a short TTL | | Auth/session state stored only in cache | Cache is optimization, not source of truth |

Severity tiers

| Tier | Examples | Action SLA | |---|---|---| | Critical | Hot key with no stampede protection causing DB overload; cache serving stale auth/permission data | Fix immediately | | Major | Infinite-TTL cache with no invalidation plan; update-on-write races | Fix this sprint | | Minor | Uniform TTLs (no jitter); cache key naming inconsistency | Schedule within 2 sprints |

Completion Criteria

[ ] Underlying query optimized before caching
[ ] Cache-aside with delete-on-write applied
[ ] Every key has a TTL + jitter
[ ] Hot keys have stampede protection (verified under load)
[ ] Invalidation mapping documented (which write → which key)

Output

Cache layer code: cache-aside helpers + invalidation hooks
Invalidation map: docs/cache-invalidation.md — write → invalidated keys
Commit format: perf(cache): add cache-aside for <query> / fix(cache): single-flight lock on <hot key>

Implementation

TypeScript + Redis (default)

Cache-aside helper around Redis GET/SETEX/DEL
Stampede: single-flight via SET key val NX PX ttl lock, or ioredis + Lua EVAL for atomicity
Probabilistic refresh: store (value, computed_at, ttl) and recompute when now - computed_at > ttl * random_threshold
Supabase: pair with Postgres; Redis via Upstash/managed

Other stacks

Python: redis-py + aiocache; same cache-aside + Lua patterns
Go: go-redis + singleflight package (stdlib-adjacent) for stampede
Universal: cache-aside, TTL+jitter, and stampede prevention are store-agnostic; Memcached works for simple cases (no Lua → use add-based locks)

Related skills

query-optimization — cache only after the query itself is optimized
resilience-patterns — cache as a fallback when a dependency is down
transaction-management — invalidate cache after the write commits, not before

Reference

Key insight encoded: A popular key expiring under concurrency triggers a stampede (thundering herd); prevent it with a single-flight mutex lock or probabilistic early refresh, both executed atomically via Lua. Delete-on-write (not update) avoids cache/DB races.

jaykim88/caching-strategy

plugins/backend-toolkit/skills/caching-strategy/SKILL.md

Design a cache layer — cache-aside read/write/invalidate, TTL + jitter, stampede prevention (single-flight / probabilistic refresh), and explicit invalidation. Use when read latency is high, the DB is read-bound, or a hot key causes thundering-herd load. Not for fixing the slow query at its source (use query-optimization first) or HTTP/browser caching (a frontend concern).

development

Updated Jun 9, 2026

$ install --global

skillsauth

npx skillsauth add jaykim88/claude-ai-engineering caching-strategy

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Jun 9, 2026, 8:25 AM147.5s1 file scanned

SKILL.md

name:: caching-strategy
description:: Design a cache layer — cache-aside read/write/invalidate, TTL + jitter, stampede prevention (single-flight / probabilistic refresh), and explicit invalidation. Use when read latency is high, the DB is read-bound, or a hot key causes thundering-herd load. Not for fixing the slow query at its source (use query-optimization first) or HTTP/browser caching (a frontend concern).
license:: MIT

Caching Strategy

Purpose

Universal — cache-aside flow, TTL+jitter, stampede prevention, and invalidation strategy are caching principles independent of the cache store; Redis is the default implementation.

Procedure

Optimize the query FIRST, cache second
- Caching a slow query hides the problem and adds staleness risk
- Run query-optimization before adding a cache layer
Use cache-aside (lazy loading) as the default pattern
- Read: check cache → miss → read DB → populate cache → return
- Write: write DB → invalidate (delete) the cache key (don't write-through unless justified)
- Delete-on-write > update-on-write: avoids cache/DB races
Set TTL with jitter
- Every cached key gets a TTL (no infinite caches without an invalidation plan)
- Add random jitter to TTLs so keys don't all expire simultaneously (a synchronized expiry = mass stampede)

3b. Bound the cache: memory budget + eviction policy

Set maxmemory and choose an eviction policy deliberately (allkeys-lru for general read-through caches; volatile-lru if you mix persistent state into the same instance — but ideally don't)
Without a bound, a runaway key generator (per-user, per-query-fingerprint) eats memory until OOM
Cache-key cardinality: unbounded distinct keys = unbounded memory; cap or hash high-cardinality identifiers

Prevent cache stampede on hot keys
- Single-flight lock: first request acquires an atomic compare-and-set lock, recomputes, others wait/serve-stale
- Probabilistic early refresh: recompute slightly before expiry with rising probability
- Both must be executed atomically (store-specific mechanism in Implementation)
- Without this, a popular key expiring under load → thundering herd hammers the DB
- Negative caching: cache "this key doesn't exist" (a sentinel value, short TTL) for queries that miss — otherwise the same non-existent key hits the DB on every request (silent thundering herd from 404s)
Plan invalidation explicitly — the hard part
- Know exactly which writes invalidate which keys
- Use key naming conventions (user:{id}:profile) so invalidation is targeted
- Tag-based / versioned keys for "invalidate everything related to X"

5b. Cache is an optimization, not a source of truth

The app MUST keep working with an empty / unavailable cache — degraded latency, not broken behavior
Never store authoritative state only in the cache (auth tokens, balances). On Redis restart you lose it
Use resilience-patterns circuit breaker around the cache client so a cache outage doesn't cascade

Validate (validation loop)
- Load-test with a hot key expiring under concurrency → verify no DB spike (stampede prevented)
- After a write, verify the cache returns fresh data (invalidation works)
- If stale data served → invalidation gap; fix the write→invalidate wiring

Anti-patterns

Severity tiers

Completion Criteria

[ ] Underlying query optimized before caching
[ ] Cache-aside with delete-on-write applied
[ ] Every key has a TTL + jitter
[ ] Hot keys have stampede protection (verified under load)
[ ] Invalidation mapping documented (which write → which key)

Output

Cache layer code: cache-aside helpers + invalidation hooks
Invalidation map: docs/cache-invalidation.md — write → invalidated keys
Commit format: perf(cache): add cache-aside for <query> / fix(cache): single-flight lock on <hot key>

Implementation

TypeScript + Redis (default)

Cache-aside helper around Redis GET/SETEX/DEL
Stampede: single-flight via SET key val NX PX ttl lock, or ioredis + Lua EVAL for atomicity
Probabilistic refresh: store (value, computed_at, ttl) and recompute when now - computed_at > ttl * random_threshold
Supabase: pair with Postgres; Redis via Upstash/managed

Other stacks

Python: redis-py + aiocache; same cache-aside + Lua patterns
Go: go-redis + singleflight package (stdlib-adjacent) for stampede
Universal: cache-aside, TTL+jitter, and stampede prevention are store-agnostic; Memcached works for simple cases (no Lua → use add-based locks)

Related skills

query-optimization — cache only after the query itself is optimized
resilience-patterns — cache as a fallback when a dependency is down
transaction-management — invalidate cache after the write commits, not before

Reference

Key insight encoded: A popular key expiring under concurrency triggers a stampede (thundering herd); prevent it with a single-flight mutex lock or probabilistic early refresh, both executed atomically via Lua. Delete-on-write (not update) avoids cache/DB races.

Related Skills

jaykim88/webhook-design

development

VerifiedTrustedCommunity

Design webhooks correctly on both sides — sending (HMAC signing, retries with backoff, at-least-once) and receiving (verify signature on raw body, enqueue + 200 fast, dedupe on event id). Use when adding webhook delivery or consuming a provider's webhooks. Not for internal service-to-service events (use async-messaging) or general outbound-call retry policy (use resilience-patterns).

SKILL.mdUpdated Jun 9, 2026

jaykim88/webhook-design

jaykim88/transaction-management

testing

VerifiedTrustedCommunity

Use transactions and isolation levels correctly — keep them short, no network calls inside, explicit isolation, retry on serialization conflicts, and choose optimistic vs pessimistic locking. Use when a write spans multiple tables, when concurrent updates corrupt data, or when designing money/inventory flows. Not for cross-service event delivery (use async-messaging Outbox) or schema-level constraints (use schema-design).

SKILL.mdUpdated Jun 9, 2026

jaykim88/transaction-management

jaykim88/test-strategy

development

VerifiedTrustedCommunity

Backend testing pyramid — unit for pure logic, integration against a real DB (Testcontainers), and consumer-driven contract testing (Pact) for service boundaries. Use before a feature, after a bug fix, or when services break each other on deploy. Not for load testing (use performance-profiling) or security testing (use backend-security-audit).

SKILL.mdUpdated Jun 9, 2026

jaykim88/test-strategy

jaykim88/schema-design

data-ai

VerifiedTrustedCommunity

Design a relational schema — normalize to 3NF then denormalize with justification, choose the right Postgres index type per data shape, enforce constraints at the DB. Use when modeling a new domain, when queries are slow, or before a migration. Not for diagnosing slow queries (use query-optimization) or shipping the change without downtime (use migration-strategy).

SKILL.mdUpdated Jun 9, 2026

jaykim88/schema-design

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/jaykim88/claude-ai-engineering.git

# Copy into Claude Code skills folder (global)
cp -r claude-ai-engineering/plugins/backend-toolkit/skills/caching-strategy ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

jaykim88/claude-ai-engineering

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT