skills/gke-compute-class-creator/SKILL.md
Guide for creating GKE ComputeClass resources. Use this skill when users want to define custom node configurations, autoscaling priorities, or hardware requirements (e.g., Spot VMs, GPUs, specific machine families) for their GKE workloads.
npx skillsauth add googlecloudplatform/gke-mcp gke-compute-class-creatorInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
This skill helps you construct ComputeClass resources for Google Kubernetes Engine (GKE). ComputeClasses allow for declarative node configuration and sophisticated autoscaling behaviors like fallback priorities and active migration.
spot: true as a high priority, with spot: false as a fallback.machineFamily (e.g., c3, c4) or machineType.gpu or tpu fields.ComputeClass manifest.kubectl apply -f <filename>.yaml command.ComputeClass CRD fields (priorities, machineFamily, gpu, etc.).spec.activeMigration.optimizeRulePriority is set to true.nodeSelector:
cloud.google.com/compute-class: "<class-name>"
ComputeClass selection with other hard node selectors (like cloud.google.com/gke-spot) as this can lead to scheduling conflicts.data-ai
Systematically diagnose GKE JobSet interruptions, restarts, and preemptions for AI/ML training workloads. Identifies preemption events, maintenance interruptions, bad host VMs, unhealthy pods, and coordinator worker failures.
development
Diagnose and prevent `vbar_control_agent` segfaults and OOMs caused by race conditions during TPU device resets and frequent metrics collection (e.g. every 3s). Use when TPU slice initialization fails or `vbar_control_agent` crashes on TPU v6e nodes.
development
Expert instructions for building high-quality GKE troubleshooting skills. Codifies Step 0 context rules, zero-hallucination signatures, and explicit LQL/PromQL query requirements.
tools
Assists in preparing applications and clusters on GKE for production.