.claude/skills/ts-capacity-planner/SKILL.md
Translates performance test results into infrastructure scaling recommendations with cost estimates. Use when someone has load test data and needs to know what infrastructure changes are required to handle target traffic. Trigger words: capacity planning, scaling plan, infrastructure sizing, how many pods, RDS sizing, handle more traffic, scale for launch.
npx skillsauth add eliferjunior/Claude capacity-plannerInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
This skill takes performance test results (breaking points, latency data, error patterns) and produces actionable infrastructure scaling recommendations. It identifies bottlenecks, recommends specific resource changes, estimates costs, and generates verification criteria.
Ask or determine:
From load test error patterns, determine the primary bottleneck:
| Error Pattern | Likely Bottleneck | |---|---| | Timeouts increasing gradually | CPU saturation on application pods | | Sudden spike in 5xx errors | Database connection pool exhaustion | | Connection refused errors | Pod/instance limit reached | | Slow first byte, fast transfer | Database query latency | | Memory-related crashes | Application memory leak or undersized instances | | Intermittent 503s | Load balancer target limits |
Use linear scaling as baseline, then apply correction factors:
Required pods = current_pods × (target_RPS / current_max_RPS) × 1.3 safety factor
Required DB connections = current_pool × (target_RPS / current_max_RPS) × 1.2
Adjust for:
Use these AWS reference prices (us-east-1, on-demand):
| Resource | Spec | Monthly Cost | |---|---|---| | EKS pod (500m/1Gi) | ~0.25 vCPU equiv | ~$35/mo | | RDS db.r5.large | 2 vCPU, 16 GiB | $274/mo | | RDS db.r5.xlarge | 4 vCPU, 32 GiB | $548/mo | | ElastiCache r6g.large | 2 vCPU, 13 GiB | $172/mo | | RDS Read Replica | Same as primary instance | Same cost | | ALB | Base + LCU | ~$25-50/mo |
Note: actual costs vary by region, reserved instances, and savings plans.
Format as actionable items with:
User prompt:
We break at 400 RPS with 3 pods and RDS db.r5.large. Need to handle 1200 RPS for launch.
Agent output:
Capacity Plan: 400 → 1,200 RPS
───────────────────────────────
Step Change Cost Delta Addresses
1 DB pool_size: 20 → 80 $0 Connection exhaustion (primary bottleneck)
2 RDS: db.r5.large → db.r5.2xlarge +$822/mo Write throughput + connections
3 Add RDS read replica +$1,096/mo Read query offloading
4 API pods: 3 → 10 (HPA min=7) +$245/mo Request processing capacity
5 Redis: r6g.medium → r6g.xlarge +$258/mo Session/cache throughput
Total additional cost: $2,421/month
Implementation order: 1 → 2 → 3 → 4 → 5 (each step unblocks the next)
Verification: Re-run load test after each step. Expected milestones:
After step 1-2: Breaking point moves to ~700 RPS
After step 3: Breaking point moves to ~1,000 RPS
After step 4-5: Target 1,200 RPS with p95 < 500ms
User prompt:
We need to handle 2x traffic but leadership wants minimal cost increase.
Agent output:
Cost-Optimized Scaling Plan (2x traffic)
────────────────────────────────────────
FREE WINS (configuration only)
1. Increase DB connection pool: 20 → 40 +0$/mo Impact: +30% throughput
2. Enable gzip compression on API responses +0$/mo Impact: -40% bandwidth
3. Add Redis caching for GET /api/products (TTL 60s) +0$/mo Impact: -50% DB reads
LOW COST
4. Add 2 API pods (3 → 5) +$70/mo
5. Enable RDS query cache +$0/mo
Expected result: 1.8-2.2x capacity at +$70/month
Tradeoff: Less headroom than full scaling. If traffic exceeds 2.2x, latency degrades.
development
Expert guidance for Fireworks AI, the platform for running open-source LLMs (Llama, Mixtral, Qwen, etc.) with enterprise-grade speed and reliability. Helps developers integrate Fireworks' inference API, fine-tune models, and deploy custom model endpoints with function calling and structured output support.
development
Convert any website into clean, structured data with Firecrawl — API-first web scraping service. Use when someone asks to "turn a website into markdown", "scrape website for LLM", "Firecrawl", "extract website content as clean text", "crawl and convert to structured data", or "scrape website for RAG". Covers single-page scraping, full-site crawling, structured extraction, and LLM-ready output.
tools
Expert guidance for Firebase, Google's platform for building and scaling web and mobile applications. Helps developers set up authentication, Firestore/Realtime Database, Cloud Functions, hosting, storage, and analytics using Firebase's SDK and CLI.
development
When the user needs to build file upload functionality for a web application. Use when the user mentions "file upload," "image upload," "upload endpoint," "multipart upload," "presigned URL," "S3 upload," "file validation," "upload to cloud storage," or "accept user files." Handles upload endpoints, file validation (type, size, magic bytes), cloud storage integration, and upload status tracking. For image/video processing after upload, see media-transcoder.