devops/implementing-observability/SKILL.md
Instrument the application with Logging, Metrics, and Tracing (OpenTelemetry) to understand system behavior and debug production issues.
npx skillsauth add 7a336e6e/skills Implementing ObservabilityInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
4 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Make the system's internal state inferable from its external outputs. Answer "Why is it slow?" and "Why did it fail?" without SSH-ing into a server.
Text logs are hard to query. Use JSON.
trace_id, request_id, user_id.INFO for normal ops, WARN for handled issues, ERROR for unhandled crashes.{"level": "info", "msg": "User logged in", "user_id": 123, "trace_id": "abc-123"}
Trace a request across boundaries (Frontend -> API -> DB).
Track the four key metrics for every service:
Alert on symptoms (High Error Rate), not causes (High CPU).
Error Rate > 1% for 5 minutes.Disk Usage > 80%.docker-compose.yml with Prometheus/Grafana/Jaeger (for dev).tracing.py).backend/managing-flask-middleware/SKILL.md (where instrumentation lives)shared/debugging/SKILL.mddevelopment
Implement features using the Red-Green-Refactor cycle to ensure testability and correctness from the start.
data-ai
Manage the `tasks.md` ledger with strict locking and collision avoidance protocols to allow multiple agents to work in parallel safely.
development
The git-workflow skill defines branching conventions, commit message formats, and pull request standards that all agents must follow for consistent version control.
development
The environment-config skill standardizes how agents manage environment variables, secrets, and application configuration across local development and deployed environments.