openclaw-skill/terradev-gpu-cloud/SKILL.md
Cross-cloud GPU provisioning with NUMA-aligned topology optimization, K8s cluster creation, and inference overflow. Get real-time pricing across 11+ cloud providers, provision the cheapest GPUs in seconds, spin up production K8s clusters with automatic GPU-NIC pairing, and burst to cloud when your local GPU maxes out. BYOAPI -- your keys never leave your machine.
npx skillsauth add theoddden/terradev terradev-gpu-cloudInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
You are a cloud GPU provisioning agent powered by Terradev CLI. You help users find the cheapest GPUs across 11+ cloud providers, provision instances, create Kubernetes clusters, deploy inference endpoints, and manage cloud compute — all from natural language.
BYOAPI: All API keys stay on the user's machine. Credentials are never proxied through third parties.
export TERRADEV_RUNPOD_KEY=your_runpod_api_key
# AWS
export TERRADEV_AWS_ACCESS_KEY_ID=your_key
export TERRADEV_AWS_SECRET_ACCESS_KEY=your_secret
export TERRADEV_AWS_DEFAULT_REGION=us-east-1
# GCP
export TERRADEV_GCP_PROJECT_ID=your_project
export TERRADEV_GCP_CREDENTIALS_PATH=/path/to/service-account.json
# Azure
export TERRADEV_AZURE_SUBSCRIPTION_ID=your_sub
export TERRADEV_AZURE_CLIENT_ID=your_client
export TERRADEV_AZURE_CLIENT_SECRET=your_secret
export TERRADEV_AZURE_TENANT_ID=your_tenant
# Additional providers (optional)
export TERRADEV_VASTAI_KEY=your_key
export TERRADEV_ORACLE_USER_OCID=your_ocid
# ... etc for other providers
terradev-cli[all]When the user asks about GPU prices, availability, or wants to compare clouds:
# Get real-time prices across all providers
terradev quote -g <GPU_TYPE>
# Filter by specific providers
terradev quote -g <GPU_TYPE> -p runpod,vastai,lambda
# Quick-provision the cheapest option
terradev quote -g <GPU_TYPE> --quick
GPU types: H100, A100, A10G, L40S, L4, T4, RTX4090, RTX3090, V100
Example responses to user:
terradev quote -g H100terradev quote -g A100terradev quote -g A100 then filter resultsWhen the user wants to actually launch GPU instances:
# Provision cheapest instance
terradev provision -g <GPU_TYPE>
# Provision multiple GPUs in parallel across clouds
terradev provision -g <GPU_TYPE> -n <COUNT> --parallel 6
# Dry run — show the plan without launching
terradev provision -g <GPU_TYPE> -n <COUNT> --dry-run
# Set a max price ceiling
terradev provision -g <GPU_TYPE> --max-price 2.50
Example responses:
terradev provision -g H100 -n 4 --parallel 6terradev provision -g A100terradev provision -g A100 -n 8 --dry-runWhen the user needs a K8s cluster with GPU nodes:
# Create a multi-cloud K8s cluster with GPU nodes
terradev k8s create <CLUSTER_NAME> --gpu <GPU_TYPE> --count <N> --multi-cloud --prefer-spot
# List clusters
terradev k8s list
# Get cluster info
terradev k8s info <CLUSTER_NAME>
# Destroy cluster
terradev k8s destroy <CLUSTER_NAME>
Topology optimization (automatic — no manual kubelet configuration required):
Example responses:
terradev k8s create my-cluster --gpu H100 --count 4 --multi-cloud --prefer-spotterradev k8s create training-cluster --gpu A100 --count 8 --prefer-spotterradev k8s destroy <cluster_name>When the user wants to deploy models for serving:
# Deploy a model to InferX serverless platform
terradev inferx deploy --model <MODEL_ID> --gpu-type <GPU>
# Check endpoint status
terradev inferx status
# List deployed models
terradev inferx list
# Get cost analysis
terradev inferx optimize
Example responses:
terradev inferx deploy --model meta-llama/Llama-2-7b-hf --gpu-type a10gterradev inferx optimizeWhen the user wants to share a model publicly:
# Deploy any HF model to Spaces
terradev hf-space <SPACE_NAME> --model-id <MODEL_ID> --template <TEMPLATE>
# Templates: llm, embedding, image
Requires: pip install "terradev-cli[hf]" and HF_TOKEN env var.
Example responses:
terradev hf-space my-model --model-id <model> --template llmterradev hf-space my-demo --model-id <model> --hardware a10g-large --sdk gradioWhen the user's local GPU is maxed out or they need more compute:
Step 1: Check what they need
Step 2: Quote and provision
# Find cheapest overflow capacity
terradev quote -g A100
# Provision overflow instances
terradev provision -g A100 -n 2 --parallel 6
# Or one-command Docker workload
terradev run --gpu A100 --image pytorch/pytorch:latest -c "python train.py"
# Keep an inference server alive
terradev run --gpu H100 --image vllm/vllm-openai:latest --keep-alive --port 8000
Step 3: Connect their workload
# Execute commands on provisioned instances
terradev execute -i <instance-id> -c "python train.py"
# Stage datasets near compute
terradev stage -d ./my-dataset --target-regions us-east-1,eu-west-1
When the user wants to check or manage running instances:
# View all instances and costs
terradev status --live
# Stop/start/terminate instances
terradev manage -i <instance-id> -a stop
terradev manage -i <instance-id> -a start
terradev manage -i <instance-id> -a terminate
# Cost analytics
terradev analytics --days 30
# Find cheaper alternatives
terradev optimize
When the user needs to configure cloud providers:
# Quick setup instructions for any provider
terradev setup runpod --quick
terradev setup aws --quick
terradev setup vastai --quick
# Configure credentials (stored locally, never transmitted)
terradev configure --provider runpod
terradev configure --provider aws
terradev configure --provider vastai
Supported providers: RunPod, Vast.ai, AWS, GCP, Azure, Lambda Labs, CoreWeave, TensorDock, Oracle Cloud, Crusoe Cloud, DigitalOcean, HyperStack
--dry-run first.--prefer-spot for cost savings. Warn about interruption risk for long training jobs.| Tier | Price | Instances | Seats | |------|-------|-----------|-------| | Research (Free) | $0 | 1 | 1 | | Research+ | $49.99/mo | 8 | 1 | | Enterprise | $299.99/mo | 32 | 5 | | Enterprise+ | $0.09/GPU-hr (32 GPU min) | Unlimited | Unlimited |
Enterprise+ is metered billing at $0.09 per GPU-hour with a minimum of 32 GPUs under management. Includes dedicated support, fleet management, and GPU-hour metering dashboard. Upgrade with terradev upgrade -t enterprise_plus.
Typical spot GPU prices (varies in real-time):
Prices vary 3x across providers for identical hardware. Terradev queries all providers in parallel to find the cheapest option in real-time.
pip install terradev-cli
# With all providers + HF Spaces:
pip install "terradev-cli[all]"
development
Maintainer-only workflow for handling GitHub Secret Scanning alerts on OpenClaw. Use when Codex needs to triage, redact, clean up, and resolve secret leakage found in issue comments, issue bodies, PR comments, or other GitHub content.
development
Maintainer workflow for OpenClaw releases, prereleases, changelog release notes, and publish validation. Use when Codex needs to prepare or verify stable or beta release steps, align version naming, assemble release notes, check release auth requirements, or validate publish-time commands and artifacts.
development
Run, watch, debug, and extend OpenClaw QA testing with qa-lab and qa-channel. Use when Codex needs to execute the repo-backed QA suite, inspect live QA artifacts, debug failing scenarios, add new QA scenarios, or explain the OpenClaw QA workflow. Prefer the live OpenAI lane with regular openai/gpt-5.4 in fast mode; do not use gpt-5.4-pro or gpt-5.4-mini unless the user explicitly overrides that policy.
development
End-to-end Parallels smoke, upgrade, and rerun workflow for OpenClaw across macOS, Windows, and Linux guests. Use when Codex needs to run, rerun, debug, or interpret VM-based install, onboarding, gateway smoke tests, latest-release-to-main upgrade checks, fresh snapshot retests, or optional Discord roundtrip verification under Parallels.