jaykim88/query-optimization

Name: query-optimization
Author: jaykim88

plugins/backend-toolkit/skills/query-optimization/SKILL.md

npx skillsauth add jaykim88/claude-ai-engineering query-optimization

Query Optimization

Purpose

Find the queries that actually cost the most, understand why they're slow from the plan, and fix the root cause (missing index, N+1, bad estimate) — measure-first, never guess.

Universal — the find-rank-diagnose-fix workflow (statement stats → query plan → index/rewrite) applies to any SQL DB; tool names differ.

Procedure

1. Rank queries by total cost (measure first)
   - Use pg_stat_statements ordered by total_exec_time, not single-call time
   - The biggest win is usually a moderately slow query called millions of times, not the single slowest query
   - Identify the top 5 offenders
2. Diagnose each offender with EXPLAIN (ANALYZE, BUFFERS)
   - Look for: Seq Scan on large tables, bad row estimates (estimated vs actual rows far apart), nested loops over big sets, high buffer reads (IO)
   - ANALYZE actually runs the query and reports real timings; BUFFERS adds IO counts. You need both to see what really happened (wrap data-modifying statements in BEGIN ... ROLLBACK, because they do execute)
   - Compare planning time vs execution time; if planning dominates, suspect many partitions, complex inheritance, or stale prepared-statement plans
   - A "slow query" can really be connection-pool exhaustion (queries queue waiting for a connection, not on their own execution); confirm via pg_stat_activity waits before chasing the plan
3. Detect N+1 at the ORM layer first
   - Symptom: one query, then one follow-up query per returned row (1 + N in total)
   - Fix in the ORM (include / findMany batching / DataLoader) before reaching for raw SQL
   - N+1 is the #1 backend perf bug and it hides in the application, not the DB
4. Fix by root cause
   - Missing index → add the right type (see schema-design)
   - Bad estimate → ANALYZE the table / increase statistics target. Multi-column correlations (e.g. WHERE country='KR' AND city='Seoul') need CREATE STATISTICS (extended stats): with only single-column stats the planner treats the predicates as independent and underestimates the matching rows
   - Index used to work, now slow → suspect index bloat from heavy UPDATE/DELETE; REINDEX INDEX CONCURRENTLY and tune autovacuum
   - Hot-row contention (many writers contending on one row) → not an index problem; use SELECT ... FOR UPDATE SKIP LOCKED for queue patterns, or partition the hot key (cross-ref transaction-management)
   - N+1 → batch / eager-load
   - Genuinely expensive aggregate → materialized view or cache (see caching-strategy)
5. Validate (validation loop)
   - Re-run EXPLAIN ANALYZE; if still Seq Scan / still slow → the index isn't being used (check column order, type mismatch, function-wrapping) → adjust and re-run
   - Re-check pg_stat_statements after deploy and confirm the query dropped in the ranking
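
The diagnosis step can be partly automated. The following is a minimal sketch, assuming the plan was captured with EXPLAIN (ANALYZE, BUFFERS, FORMAT JSON) and parsed into an object. The key names ("Node Type", "Plan Rows", "Actual Rows", "Rows Removed by Filter", "Plans") are the ones Postgres emits in JSON plans; the function name, the Finding type, the two thresholds, and the sample plan are hypothetical and exist only for illustration.

```typescript
// Subset of the node shape emitted by EXPLAIN (ANALYZE, BUFFERS, FORMAT JSON).
// Only the keys read below are declared.
interface PlanNode {
  "Node Type": string;
  "Relation Name"?: string;
  "Plan Rows": number; // planner's estimate
  "Actual Rows": number; // rows returned, averaged per loop
  "Rows Removed by Filter"?: number;
  Plans?: PlanNode[];
}

interface Finding {
  kind: "seq-scan" | "bad-estimate";
  nodeType: string;
  relation: string | null;
  detail: string;
}

// Hypothetical thresholds, chosen only for this illustration.
const LARGE_SCAN_ROWS = 100000;
const ESTIMATE_RATIO = 10;

// Walk the plan tree depth-first and collect suspicious nodes.
function inspectPlan(node: PlanNode, findings: Finding[] = []): Finding[] {
  const relation = node["Relation Name"] ?? null;
  const returned = node["Actual Rows"];
  // Rows a scan had to read = rows it returned + rows its filter discarded.
  const scanned = returned + (node["Rows Removed by Filter"] ?? 0);

  if (node["Node Type"] === "Seq Scan" && scanned >= LARGE_SCAN_ROWS) {
    findings.push({
      kind: "seq-scan",
      nodeType: node["Node Type"],
      relation,
      detail: `read about ${scanned} rows to return ${returned}`,
    });
  }

  // Clamp to 1 so a zero-row estimate or result does not divide by zero.
  const planned = Math.max(node["Plan Rows"], 1);
  const actual = Math.max(returned, 1);
  const ratio = Math.max(planned / actual, actual / planned);
  if (ratio >= ESTIMATE_RATIO) {
    findings.push({
      kind: "bad-estimate",
      nodeType: node["Node Type"],
      relation,
      detail: `estimated ${node["Plan Rows"]} rows, got ${returned}`,
    });
  }

  for (const child of node.Plans ?? []) {
    inspectPlan(child, findings);
  }
  return findings;
}

// Hand-written plan fragment for illustration, not captured from a real database.
const examplePlan: PlanNode = {
  "Node Type": "Nested Loop",
  "Plan Rows": 50,
  "Actual Rows": 120000,
  Plans: [
    {
      "Node Type": "Seq Scan",
      "Relation Name": "orders",
      "Plan Rows": 50,
      "Actual Rows": 120000,
      "Rows Removed by Filter": 880000,
    },
    {
      "Node Type": "Index Scan",
      "Relation Name": "users",
      "Plan Rows": 1,
      "Actual Rows": 1,
    },
  ],
};

const planFindings = inspectPlan(examplePlan);
for (const f of planFindings) {
  console.log(`${f.kind}: ${f.nodeType} on ${f.relation ?? "(no relation)"}: ${f.detail}`);
}
```

A checker like this only narrows down where to look. Whether a flagged Seq Scan needs an index, or a flagged estimate needs ANALYZE or extended statistics, still depends on reading the full plan.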

Anti-patterns

| ❌ Anti-pattern | ✅ Correct |
|---|---|
| Adding indexes by guessing | pg_stat_statements → EXPLAIN ANALYZE → targeted index |
| Optimizing the single slowest query | Optimize highest total_exec_time (frequency × cost) |
| Fixing N+1 with raw SQL | Fix at ORM layer (batching/eager-load) first |
| EXPLAIN without ANALYZE | EXPLAIN (ANALYZE, BUFFERS) for real timings + IO |
| Index on WHERE lower(email) but querying email | Match index expression to query expression |
| Diagnosing as "slow query" when it's connection-pool saturation | Confirm via pg_stat_activity waits before tuning the plan |
| Bloated index after long heavy writes (silently slow) | REINDEX INDEX CONCURRENTLY; tune autovacuum so it doesn't reach this state |
| Single-column stats for correlated predicates | CREATE STATISTICS (extended stats) for multi-column correlations |
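
To make the "frequency × cost" row concrete, here is a small sketch that contrasts ranking by total time with picking the slowest single call. The SQL uses the pg_stat_statements column names from PostgreSQL 13 and later; the StatementStat type, both helper functions, and the sample numbers are made up for illustration and do not come from a real workload.

```typescript
// Columns as named in pg_stat_statements on PostgreSQL 13+
// (older releases call them total_time / mean_time).
const TOP_OFFENDERS_SQL = `
  SELECT query, calls, total_exec_time, mean_exec_time
  FROM pg_stat_statements
  ORDER BY total_exec_time DESC
  LIMIT 5;
`;

interface StatementStat {
  query: string;
  calls: number;
  totalExecTimeMs: number;
  meanExecTimeMs: number;
}

// Rank by cumulative time spent, which is what the database actually paid.
function rankByTotalTime(stats: StatementStat[], topN: number = 5): StatementStat[] {
  return stats
    .slice()
    .sort((a, b) => b.totalExecTimeMs - a.totalExecTimeMs)
    .slice(0, topN);
}

// The tempting but misleading choice: the statement with the worst single call.
function slowestSingleCall(stats: StatementStat[]): StatementStat {
  return stats.reduce((worst, s) => (s.meanExecTimeMs > worst.meanExecTimeMs ? s : worst));
}

// Invented sample rows, shaped like the result of the query above.
const sampleStats: StatementStat[] = [
  { query: "SELECT ... FROM monthly_report", calls: 12, totalExecTimeMs: 48000, meanExecTimeMs: 4000 },
  { query: "SELECT ... FROM sessions WHERE token = $1", calls: 2000000, totalExecTimeMs: 1600000, meanExecTimeMs: 0.8 },
  { query: "SELECT ... FROM users WHERE id = $1", calls: 500000, totalExecTimeMs: 100000, meanExecTimeMs: 0.2 },
];

const rankedStats = rankByTotalTime(sampleStats);
const slowestStat = slowestSingleCall(sampleStats);

console.log(TOP_OFFENDERS_SQL.trim());
console.log("Top by total time:", rankedStats[0].query);
console.log("Slowest single call:", slowestStat.query);
```

In this invented data the 4-second report query looks worst per call, yet the sub-millisecond session lookup accounts for far more total database time, so it is the one to optimize first.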

Severity tiers

| Tier | Examples | Action SLA |
|---|---|---|
| Critical | N+1 on a hot endpoint causing timeouts; full table scan on a multi-million row table per request | Fix immediately |
| Major | Missing index on a frequent query (high total_exec_time); unbounded query (no LIMIT) | Fix this sprint |
| Minor | Suboptimal plan on a rare query; slightly stale table statistics | Schedule within 2 sprints |

Completion Criteria

[ ] Top 5 pg_stat_statements offenders diagnosed
[ ] No Seq Scan on large tables in hot paths
[ ] N+1 eliminated at ORM layer
[ ] Each fix verified with before/after EXPLAIN ANALYZE
[ ] All Critical findings fixed; all Major scheduled

Output

Query audit report: docs/query-audit-YYYY-MM-DD.md — top offenders, plans, fixes, before/after timings
Index migrations + ORM eager-load changes
Commit format: perf(db): eliminate N+1 in <endpoint> / perf(db): add index for <query>

Implementation

TypeScript + Prisma + Postgres (default)

- Stats: SELECT * FROM pg_stat_statements ORDER BY total_exec_time DESC LIMIT 10;
- N+1: Prisma include / select with relations; avoid per-row findUnique in a loop
- Plan: EXPLAIN (ANALYZE, BUFFERS) <query> in psql / Supabase SQL editor
- Prisma query logging (log: ['query']) in dev to spot N+1
- PgBouncer transaction-pooling caveat: server-side prepared statements are not kept across transactions. Either run PgBouncer in session pooling for prepared-statement workloads, or disable Prisma's prepared-statement cache (?statement_cache_size=0); otherwise you hit a silent perf cliff
- Bloat / autovacuum: pg_stat_user_indexes for unused indexes; pgstattuple for bloat; REINDEX INDEX CONCURRENTLY to rebuild without blocking writes
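
The N+1 shape and its batched fix can be illustrated without a database. The sketch below does not use the Prisma client: FakeDb is a hypothetical in-memory stand-in that only counts how many "queries" each loading strategy issues, and its calls are synchronous for brevity, whereas real database calls would be async. With Prisma, the batched version would roughly correspond to an include on the relation, or to a single findMany filtered with `in` on the collected ids.

```typescript
interface User { id: number; name: string; }
interface Post { id: number; authorId: number; title: string; }
interface PostWithAuthor { title: string; author: string; }

// Hypothetical in-memory stand-in for a database; each method call counts as one query.
class FakeDb {
  queryCount = 0;
  private users: User[];
  private posts: Post[];

  constructor(users: User[], posts: Post[]) {
    this.users = users;
    this.posts = posts;
  }

  findPosts(): Post[] {
    this.queryCount += 1;
    return this.posts.slice();
  }

  findUserById(id: number): User | undefined {
    this.queryCount += 1;
    return this.users.find((u) => u.id === id);
  }

  findUsersByIds(ids: number[]): User[] {
    this.queryCount += 1;
    return this.users.filter((u) => ids.indexOf(u.id) !== -1);
  }
}

// N+1: one query for the posts, then one more per post for its author.
function loadNPlusOne(db: FakeDb): PostWithAuthor[] {
  return db.findPosts().map((post) => {
    const author = db.findUserById(post.authorId);
    return { title: post.title, author: author ? author.name : "unknown" };
  });
}

// Batched: one query for the posts, one query for all distinct authors.
function loadBatched(db: FakeDb): PostWithAuthor[] {
  const posts = db.findPosts();
  const authorIds = Array.from(new Set(posts.map((p) => p.authorId)));
  const byId = new Map<number, User>();
  for (const user of db.findUsersByIds(authorIds)) {
    byId.set(user.id, user);
  }
  return posts.map((post) => {
    const author = byId.get(post.authorId);
    return { title: post.title, author: author ? author.name : "unknown" };
  });
}

// Invented sample data.
const sampleUsers: User[] = [{ id: 1, name: "Ada" }, { id: 2, name: "Lin" }];
const samplePosts: Post[] = [
  { id: 10, authorId: 1, title: "Indexes" },
  { id: 11, authorId: 2, title: "Vacuum" },
  { id: 12, authorId: 1, title: "Plans" },
];

const naiveDb = new FakeDb(sampleUsers, samplePosts);
const naiveResult = loadNPlusOne(naiveDb);

const batchedDb = new FakeDb(sampleUsers, samplePosts);
const batchedResult = loadBatched(batchedDb);

console.log(`N+1 queries: ${naiveDb.queryCount}, batched queries: ${batchedDb.queryCount}`);
```

Both strategies return the same rows. The naive count grows with the number of posts, while the batched count stays at two regardless of result size, which is the pattern to look for in the query log.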

Other stacks

- Python: SQLAlchemy selectinload/joinedload for N+1; pg_stat_statements same
- Go: sqlc / GORM Preload; same Postgres tooling
- Universal: pg_stat_statements + EXPLAIN ANALYZE are Postgres; MySQL uses performance_schema + EXPLAIN ANALYZE (8.0.18+)

Related skills

schema-design — the fix is often the right index type
performance-profiling — query time is usually the top backend bottleneck
caching-strategy — cache the query result when optimization hits its limit

Reference

Key insight encoded: Use pg_stat_statements to rank by total time, then EXPLAIN (ANALYZE, BUFFERS) the offenders, looking for Seq Scans, bad row estimates, and N+1 before adding indexes. Three problems that present as a "slow query" but need a different diagnosis: connection-pool exhaustion, index bloat after heavy writes (fix with REINDEX INDEX CONCURRENTLY), and correlated-predicate misestimation (needs extended statistics via CREATE STATISTICS). PgBouncer transaction pooling silently breaks server-side prepared statements, so match the pooling mode to what your driver assumes.

Find and fix slow Postgres queries — rank by pg_stat_statements, diagnose with EXPLAIN (ANALYZE, BUFFERS), kill N+1 at the ORM layer, add the right index. Use when an endpoint is slow, DB CPU is high, or before scaling traffic. Not for schema/index design from scratch (use schema-design) or result-level caching (use caching-strategy).

data-ai

Updated Jun 9, 2026

npx skillsauth add jaykim88/claude-ai-engineering query-optimization

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

| Scanner | Description | Status | % |
|---|---|---|---|
| Trivy | Container and dependency vulnerability scanner | Clean | 95% |
| Semgrep | Static code analysis for vulnerabilities | Clean | 95% |
| mcp-scan (Snyk) | Model Context Protocol security validation | Clean | 95% |
| Snyk (dep) | Open source security scanning | Skipped | 50% |
| Socket.dev | Supply chain security analysis | Skipped | 50% |
| VirusTotal | Multi-engine malware detection | Skipped | 50% |
| CrowdStrike | Advanced threat intelligence | Skipped | 50% |
| OSV-Scanner | Open Source Vulnerability database check | Skipped | 50% |
| OWASP Dep-Check | | Skipped | 50% |

Last scanned: Jun 9, 2026, 8:32 AM · 393.9s · 1 file scanned

SKILL.md

name: query-optimization
description: Find and fix slow Postgres queries: rank by pg_stat_statements, diagnose with EXPLAIN (ANALYZE, BUFFERS), kill N+1 at the ORM layer, add the right index. Use when an endpoint is slow, DB CPU is high, or before scaling traffic. Not for schema/index design from scratch (use schema-design) or result-level caching (use caching-strategy).
license: MIT

Install manually

# Clone the repo
git clone https://github.com/jaykim88/claude-ai-engineering.git

# Copy into Claude Code skills folder (global)
cp -r claude-ai-engineering/plugins/backend-toolkit/skills/query-optimization ~/.claude/skills/

Repository

jaykim88/claude-ai-engineering

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT