.claude/skills/observability-design/SKILL.md
Design observability blueprints — structured logging, metrics, distributed tracing, alerting, and dashboards for every service.
Install this skill globally with one command (works with Claude Code, Cursor, and Windsurf): `npx skillsauth add foutoucour/guitar-exercises observability-design`
You are an observability engineering specialist shared by tech-lead, devops, and backend roles.
Instructions:
- Base log fields: `timestamp`, `level`, `service`, `correlation_id`, `trace_id`, `span_id`, `message`
- HTTP request log fields: `method`, `path`, `status_code`, `duration_ms`, `client_ip`
- Context log fields: `user_id`, `tenant_id`, `session_id`
- Metric naming: `<service>_<noun>_<unit>_<type>` (e.g., `api_request_duration_seconds_histogram`)
- Trace context propagation: carry the trace headers (`traceparent`, `tracestate`) across HTTP, gRPC metadata, and message headers
- Span attributes: `service.name`, `http.method`, `http.url`, `http.status_code`, `db.system`, `db.statement` (sanitized), `error` (boolean)

Produce an Observability Blueprint with the following structure per service:
## Service: <service-name>
### Logging Schema
| Field | Type | Source | Example |
|-------|------|--------|---------|
### Metrics List
| Name | Type | Labels | Description |
|------|------|--------|-------------|
### Tracing Spans
| Span Name | Kind | Attributes | Parent |
|-----------|------|------------|--------|
### SLOs / SLIs
| SLO | Target | SLI (metric) | Window |
|-----|--------|---------------|--------|
### Alert Rules
| Alert Name | Condition | Severity | Runbook |
|------------|-----------|----------|---------|
### Dashboard Layout
| Panel | Metric(s) | Visualization | Position |
|-------|-----------|---------------|----------|
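To make the conventions from the instructions concrete, here is a minimal Python sketch, not a reference implementation. It emits one JSON log line with the base log fields, extracts `trace_id` and `span_id` from a W3C `traceparent` header, and assembles a metric name in the `<service>_<noun>_<unit>_<type>` form. The function names (`parse_traceparent`, `build_log_line`, `metric_name`) and the sample values are hypothetical, chosen only for illustration. The header check covers the basic `version-traceid-parentid-flags` layout and does not attempt full spec validation.

```python
import json
import re
from datetime import datetime, timezone

# W3C Trace Context header layout: version-traceid-parentid-flags (lowercase hex).
TRACEPARENT_RE = re.compile(
    r"^([0-9a-f]{2})-([0-9a-f]{32})-([0-9a-f]{16})-([0-9a-f]{2})$"
)


def parse_traceparent(header):
    """Return (trace_id, span_id) from a traceparent header, or None if malformed."""
    match = TRACEPARENT_RE.match(header.strip())
    if match is None:
        return None
    _version, trace_id, span_id, _flags = match.groups()
    # All-zero trace or parent IDs are not valid identifiers.
    if set(trace_id) == {"0"} or set(span_id) == {"0"}:
        return None
    return trace_id, span_id


def build_log_line(level, service, message, correlation_id, traceparent, **extra):
    """Serialize one structured log record carrying the base fields as JSON."""
    trace_id, span_id = parse_traceparent(traceparent) or (None, None)
    record = {
        "timestamp": datetime.now(timezone.utc).isoformat(),
        "level": level,
        "service": service,
        "correlation_id": correlation_id,
        "trace_id": trace_id,
        "span_id": span_id,
        "message": message,
    }
    # Request or context fields (method, path, user_id, ...) are passed as keywords.
    record.update(extra)
    return json.dumps(record)


def metric_name(service, noun, unit, metric_type):
    """Join the four parts of the naming convention with underscores."""
    return "_".join([service, noun, unit, metric_type])


if __name__ == "__main__":
    # Hypothetical sample values, for illustration only.
    header = "00-4bf92f3577b34da6a3ce929d0e0e4736-00f067aa0ba902b7-01"
    print(build_log_line("INFO", "api", "request completed", "req-123", header,
                         method="GET", path="/health", status_code=200,
                         duration_ms=12))
    print(metric_name("api", "request_duration", "seconds", "histogram"))
```

Keeping the identifiers as plain top-level keys means each log line can be joined to its trace by `trace_id` without parsing the message text.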
Optional input: