skills/robusta-dev/SKILL.md
Robusta Kubernetes observability and alert automation platform. USE WHEN installing Robusta OR configuring playbooks OR setting up notification sinks OR troubleshooting Kubernetes alerts OR creating custom actions OR integrating with Prometheus/AlertManager OR automating incident remediation.
Comprehensive guide for Robusta - the SRE agent that transforms Kubernetes alerts into actionable insights using playbooks, AI investigation, and automated remediation.

| Component | Purpose |
|-----------|---------|
| Playbooks | Rules engine defining alert responses |
| Triggers | Events that activate playbook execution |
| Actions | Remediation steps and enrichments |
| Sinks | Notification destinations (Slack, Teams, PagerDuty, etc.) |

| Workflow | Trigger | File |
|----------|---------|------|
| Install | "install robusta", "deploy robusta" | Workflows/Install.md |
| ConfigurePlaybooks | "create playbook", "configure playbook" | Workflows/ConfigurePlaybooks.md |
| ConfigureSinks | "setup slack", "configure notifications" | Workflows/ConfigureSinks.md |
| Troubleshoot | "robusta not working", "alerts not firing" | Workflows/Troubleshoot.md |
# Generate configuration
pipx run robusta-cli gen-config --enable-prometheus-stack
# Or use the Docker-based wrapper script instead of pipx
curl -fsSL -o robusta https://docs.robusta.dev/master/_static/robusta
chmod +x robusta
./robusta gen-config --enable-prometheus-stack
# Install via Helm
helm repo add robusta https://robusta-charts.storage.googleapis.com
helm repo update
helm install robusta robusta/robusta \
  -f ./generated_values.yaml \
  --set clusterName=<YOUR_CLUSTER_NAME>
# Verify installation
kubectl get pods -A | grep robusta
# With an existing Prometheus: generate config without the bundled stack
pipx run robusta-cli gen-config
helm install robusta robusta/robusta -f ./generated_values.yaml
# Example playbook in generated_values.yaml
customPlaybooks:
- triggers:
  - on_prometheus_alert:
      alert_name: KubePodCrashLooping
  actions:
  - logs_enricher: {}
  - pod_events_enricher: {}
  sinks:
  - slack  # must match the name of a sink defined in sinksConfig
| Trigger | Description |
|---------|-------------|
| on_prometheus_alert | Fires on Prometheus/AlertManager alerts |
| on_pod_create | When pod is created |
| on_pod_update | When pod is updated |
| on_deployment_update | When deployment changes |
| on_schedule | Cron-based scheduled execution |
| on_kubernetes_warning_event | On K8s warning events |
| Action | Purpose |
|--------|---------|
| logs_enricher | Add pod logs to alert |
| pod_events_enricher | Add K8s events |
| node_cpu_enricher | Add CPU metrics |
| node_memory_enricher | Add memory metrics |
| deployment_status_enricher | Add deployment info |
| delete_pod | Auto-remediate by deleting pod |
| node_bash_enricher | Run bash commands on node |
# Slack
sinksConfig:
- slack_sink:
    name: main_slack
    slack_channel: alerts
    api_key: xoxb-your-token

# Microsoft Teams
sinksConfig:
- ms_teams_sink:
    name: teams_alerts
    webhook_url: https://outlook.office.com/webhook/...

# PagerDuty
sinksConfig:
- pagerduty_sink:
    name: pagerduty
    api_key: your-integration-key

# Generic webhook
sinksConfig:
- webhook_sink:
    name: custom_webhook
    url: https://your-endpoint.com/alerts
Example 1: Install Robusta with Prometheus
User: "Install Robusta on my AKS cluster"
-> Generate config with gen-config --enable-prometheus-stack
-> Add Helm repo and install with cluster name
-> Verify pods are running
Example 2: Create crash loop enrichment playbook
User: "Add pod logs to CrashLoopBackOff alerts"
-> Create playbook with on_prometheus_alert trigger
-> Add logs_enricher and pod_events_enricher actions
-> Configure Slack sink for notifications
Example 3: Configure Slack notifications
User: "Send Robusta alerts to #k8s-alerts Slack channel"
-> Add slack_sink to sinksConfig
-> Set channel name and API key
-> Optionally filter by severity or namespace
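
The optional filtering step in Example 3 can live on the sink itself. The fragment below is a sketch, assuming a Robusta release that supports the scope key on sinks (older releases used a match key instead). The sink name, namespace, and severity values are placeholders, not values taken from this guide.

```yaml
sinksConfig:
- slack_sink:
    name: k8s_alerts_slack        # hypothetical sink name
    slack_channel: k8s-alerts
    api_key: xoxb-your-token      # bot token, not a webhook URL
    scope:                        # assumption: needs a release with sink scoping
      include:
      - namespace: [production]   # placeholder namespace
        severity: [HIGH]          # placeholder severity level
```

The intent is that only high-severity findings from the production namespace reach this channel. Check the sink documentation for your version to confirm how attributes within one include entry are combined.
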
Example 4: Debug missing alerts
User: "Robusta isn't sending alerts to Slack"
-> Check robusta-runner pod logs
-> Verify sink configuration in generated_values.yaml
-> Test with manual trigger: robusta playbooks trigger
Prometheus Alert -> AlertManager -> Robusta -> Playbook -> Actions -> Sinks
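
When Prometheus and AlertManager are managed outside Robusta, the AlertManager -> Robusta hop in this flow is a webhook receiver. Below is a minimal sketch of the AlertManager side, assuming the Helm release is named robusta and runs in the default namespace (which yields the service name robusta-runner). The receiver name and repeat interval are illustrative, and a complete AlertManager config also needs its own top-level default receiver.

```yaml
# alertmanager.yaml fragment (assumptions: release "robusta", namespace "default")
receivers:
- name: robusta
  webhook_configs:
  - url: http://robusta-runner.default.svc.cluster.local/api/alerts
    send_resolved: true

route:
  routes:
  - receiver: robusta
    matchers:
    - severity =~ ".*"      # forward every alert
    repeat_interval: 4h     # illustrative value
    continue: true          # keep evaluating any later routes
```

Installs generated with gen-config --enable-prometheus-stack should not need this fragment, since the bundled stack is wired to Robusta already.
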
Robusta groups related alerts using Slack threads to reduce notification spam.
Optional AI-powered root cause analysis available with Robusta Pro or self-hosted HolmesGPT.
- references/Installation.md - Detailed installation guide
- references/Playbooks.md - Complete playbook reference
- references/Triggers.md - All trigger types
- references/Actions.md - Available actions
- references/Sinks.md - Sink configuration
- references/Troubleshooting.md - Common issues and fixes

- customPlaybooks order matters, and first match wins for the same trigger: A catch-all on_prometheus_alert without an alert_name filter at the top swallows every alert, and the specific-alert playbooks below never fire. Always put specific-alert playbooks first, generic last.
- api_key in slack_sink is the bot token (xoxb-), not the webhook URL: Pasting a webhook URL there silently fails; Robusta logs "auth failed" once and stops. Use slack_channel + api_key for bot mode, or switch to a different sink type for webhook delivery.
- delete_pod auto-remediation has no rate limit: A misconfigured CrashLoopBackOff playbook with a delete_pod action against a flapping deployment will hot-loop pod deletions and exhaust pod IPs in the subnet. Always pair with on_prometheus_alert + a firing filter, not pod-state triggers.
- gen-config --enable-prometheus-stack installs a full kube-prometheus-stack: Running it on a cluster that already has Prometheus produces two Prometheus instances scraping each other. Use plain gen-config (no flag) when integrating with existing monitoring.
- ms_teams_sink webhook URLs were deprecated in 2024 as connectors retire: Existing webhook URLs keep working briefly, then return 410. Migrate to Power Automate workflows or the new Teams sink format before the deprecation cliff hits.
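
The ordering and delete_pod advice above can be combined in one sketch. The alert name is reused from the earlier playbook example. The status field is an assumption about the filter options on on_prometheus_alert, so confirm it against the trigger reference for your version before relying on it.

```yaml
customPlaybooks:
# Specific playbook first: remediate one named alert, only while it is firing
- triggers:
  - on_prometheus_alert:
      alert_name: KubePodCrashLooping
      status: firing          # assumption: explicit firing filter
  actions:
  - logs_enricher: {}         # capture logs before the pod is removed
  - delete_pod: {}
# Generic catch-all last: enrich everything else
- triggers:
  - on_prometheus_alert: {}
  actions:
  - pod_events_enricher: {}
```

Because delete_pod is described above as having no rate limit, it is worth trying a playbook like this in a non-production namespace first.
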