skills/runbook/SKILL.md
Generate operational runbooks for services, procedures, or incident response with step-by-step procedures, troubleshooting guides, and escalation paths
npx skillsauth add thoreinstein/agents runbookInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Generate operational runbooks for services, procedures, or incident response. Investigates the codebase and infrastructure to produce accurate, actionable procedures.
Launch parallel investigation tracks to gather comprehensive information:
Generate the runbook document using the template at references/templates/runbook.md.
The runbook should include:
Input: "Generate runbook for the payment-service"
Investigation:
- Found deployment at k8s/payment-service/
- Found health endpoints: /health, /ready
- Dependencies: PostgreSQL (critical), Redis (cache), Stripe API
- Scaling: HPA configured, min 3, max 10 replicas
- Alerts: Prometheus rules in monitoring/
Generated Runbook: payment-service-runbook.md
## Overview
- Service: payment-service
- Owner: payments-team
- Criticality: P1
## Dependencies
| Dependency | Type | Criticality | Failure Impact |
|------------|------|-------------|----------------|
| PostgreSQL | Database | Critical | Full outage |
| Redis | Cache | High | Degraded latency |
| Stripe API | External | Critical | Payment failures |
## Procedures
### Deployment
1. Verify no active transactions
```bash
kubectl exec -it payment-service-0 -- curl localhost:8080/metrics | grep active_transactions
kubectl apply -f k8s/payment-service/deployment.yaml
kubectl rollout status deployment/payment-service
kubectl scale deployment payment-service --replicas=5
Symptoms: p99 latency > 500ms Diagnosis:
kubectl top pods -l app=payment-service
kubectl logs -l app=payment-service --tail=100 | grep -i slow
Resolution: Check Redis connection, scale if CPU > 80%
Begin by identifying the service or operation to document and launching investigation tracks.
testing
Consult the whizz-mind knowledge base for documentation and answers. Use when the user asks questions that might be answered by stored documentation or when explicitly asked to check whizz-mind.
development
Comprehensive web quality audit covering performance, accessibility, SEO, and best practices. Use when asked to "audit my site", "review web quality", "run lighthouse audit", "check page quality", or "optimize my website".
testing
Ultra-deep multi-perspective analysis for complex architectural and strategic decisions requiring systematic reasoning across technical, business, user, and system perspectives
data-ai
Optimize for search engine visibility and ranking. Use when asked to "improve SEO", "optimize for search", "fix meta tags", "add structured data", "sitemap optimization", or "search engine optimization".