skills/kubeblocks-observability-bootstrap-stack/SKILL.md
Bootstrap a Prometheus / Grafana monitoring stack for KubeBlocks database metrics when no usable observability base exists yet.
npx skillsauth add apecloud/kubeblocks-skills kubeblocks-observability-bootstrap-stackInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Use this path only when the environment does not already have a usable Prometheus / Grafana base.
If the cluster already has monitoring infrastructure, route to kubeblocks-observability-existing-stack instead.
- [ ] Step 1: Confirm no existing stack should be reused
- [ ] Step 2: Install kube-prometheus-stack
- [ ] Step 3: Add scrape config for KubeBlocks clusters
- [ ] Step 4: Confirm readiness level
kubectl get pods -A | grep -E 'prometheus|grafana' || true
If Prometheus / Grafana already exists and is the team's system of record, stop and use the existing-stack path instead.
helm repo add prometheus-community https://prometheus-community.github.io/helm-charts
helm repo update
helm install prometheus prometheus-community/kube-prometheus-stack \
-n monitoring --create-namespace \
--set prometheus.prometheusSpec.podMonitorSelectorNilUsesHelmValues=false \
--set prometheus.prometheusSpec.serviceMonitorSelectorNilUsesHelmValues=false
Wait for operator, Prometheus, and Grafana pods to become ready.
After the stack exists, add PodMonitor / ServiceMonitor for KubeBlocks-managed database pods.
See the compatibility shim kubeblocks-setup-monitoring and the original reference file at ../kubeblocks-setup-monitoring/references/reference.md for addon-specific exporter notes.
At minimum, state whether the result is:
metrics-readyscrape-readydashboard-readyalerting-readyDo not collapse all of these into "monitoring is ready".
devops
Expand persistent volume storage for KubeBlocks database clusters via OpsRequest. Requires the StorageClass to support volume expansion (allowVolumeExpansion=true). Use when the user needs more disk space, wants to increase storage, expand volumes, or resize PVCs. NOT for changing CPU/memory (see vertical-scaling) or adding more replicas (see horizontal-scaling). Note that volume shrinking is not supported by Kubernetes.
data-ai
Scale CPU and memory resources for KubeBlocks database clusters via OpsRequest (vertical scaling). Supports in-place updates when the feature gate is enabled. Use when the user wants to change, increase, decrease, resize, or adjust CPU or memory resources of a database cluster. NOT for adding/removing replicas or shards (see horizontal-scaling) or expanding disk storage (see volume-expansion).
data-ai
Upgrade the KubeBlocks operator itself via Helm. Covers update operator, upgrade to v1.0, update kubeblocks version, and CRD updates. Use when the user wants to upgrade KubeBlocks, update the operator, or upgrade to a new KubeBlocks release. NOT for upgrading database engine versions (see minor-version-upgrade).
development
Diagnostic guide for KubeBlocks-managed database clusters. Use when the user reports troubleshoot, debug, diagnose, not working, error, failed, stuck, CrashLoopBackOff, cluster exception, or similar problems with their database cluster. This skill guides the agent through diagnostic steps — it does NOT perform actions.