Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

alphaonedev/llm-ops

Name: llm-ops
Author: alphaonedev

skills/aimlops/llm-ops/SKILL.md

npx skillsauth add alphaonedev/openclaw-graph llm-ops

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

llm-ops

Purpose

This skill automates the deployment, scaling, and monitoring of large language models (LLMs) in AI/ML operations, handling infrastructure for models like GPT or BERT variants to ensure efficient runtime management.

When to Use

Use this skill when deploying LLMs in production environments, such as scaling a chatbot backend during peak traffic, monitoring model performance in real-time, or updating models in Kubernetes-based ML ops setups. Apply it in scenarios involving resource-constrained environments or when integrating LLMs with CI/CD pipelines for automated deployments.

Key Capabilities

Deploy LLMs to cloud providers (e.g., AWS, GCP) with automatic containerization.
Scale instances dynamically based on metrics like CPU usage or request volume.
Monitor key metrics including latency, throughput, and error rates via integrated dashboards.
Handle model versioning and rollbacks for safe updates.
Integrate with logging tools like ELK stack for detailed tracing.

Usage Patterns

To deploy an LLM, first set the environment variable for authentication: export OPENCLAW_API_KEY=your_api_key. Then, use the CLI to initiate deployment with specific flags. For scaling, monitor metrics and trigger adjustments programmatically. Always specify the model ID and target environment in commands to avoid conflicts. For API-based usage, include the API key in headers and handle responses for asynchronous operations.

Common Commands/API

Use the OpenClaw CLI for quick operations; prefix commands with openclaw llm. For API calls, target the base endpoint https://api.openclaw.ai/llm and include the header Authorization: Bearer $OPENCLAW_API_KEY.

Deploy Command: openclaw llm deploy --model-id my-llm-123 --env production --replicas 3 --config-path ./config.json
- Example config.json: {"image": "my-llm-image:v1", "resources": {"cpu": "2", "memory": "4Gi"}}
- Code snippet (Python):
```
import requests
response = requests.post('https://api.openclaw.ai/llm/deploy', json={'model_id': 'my-llm-123', 'replicas': 3}, headers={'Authorization': f'Bearer {os.environ["OPENCLAW_API_KEY"]}'})
print(response.json())
```
Scale Command: openclaw llm scale --model-id my-llm-123 --scale-to 5 --metric cpu_utilization
- This adjusts replicas based on the specified metric threshold (e.g., >80% CPU).
- API Endpoint: POST /api/llm/scale with body: {"model_id": "my-llm-123", "scale_to": 5}
Monitor Command: openclaw llm monitor --model-id my-llm-123 --duration 60 --output json
- Outputs metrics to stdout or file; use --alert-threshold 0.9 for CPU alerts.
- API Endpoint: GET /api/llm/metrics?model_id=my-llm-123&duration=60
Rollback Command: openclaw llm rollback --model-id my-llm-123 --version v1.0
- Reverts to a previous model version; requires versioning enabled in config.

Config formats are JSON-based, e.g.,:

{
  "model_id": "my-llm-123",
  "deployment": {
    "type": "kubernetes",
    "namespace": "aiml"
  }
}

Integration Notes

Integrate this skill with existing ML ops tools by exporting metrics to Prometheus or using webhooks for CI/CD. For Kubernetes, apply manifests generated by openclaw llm generate-k8s --model-id my-llm-123. When combining with other OpenClaw skills, chain commands like openclaw llm deploy && openclaw monitoring setup. Use environment variables for secrets, e.g., set $OPENCLAW_API_KEY in your .env file and load it via dotenv in Python scripts. Ensure network accessibility to API endpoints; configure firewalls to allow traffic to api.openclaw.ai.

Error Handling

Check command exit codes; for example, if openclaw llm deploy fails with code 1, parse the error message for details like "Model not found". In API responses, handle HTTP status codes: 401 for authentication issues (retry with export OPENCLAW_API_KEY=new_key), 404 for missing models, or 500 for server errors (wait and retry with exponential backoff). Include try-except blocks in code snippets:

try:
    response = requests.post('https://api.openclaw.ai/llm/deploy', ...)
    response.raise_for_status()
except requests.exceptions.HTTPError as e:
    print(f"Error: {e.response.status_code} - {e.response.text}")
    sys.exit(1)

Log errors to files using --log-file errors.log in CLI commands and monitor for common issues like resource limits.

Concrete Usage Examples

Deploy and Scale an LLM: First, export your API key: export OPENCLAW_API_KEY=abc123. Deploy a model with: openclaw llm deploy --model-id gpt-finetuned --env staging --replicas 2. Then, scale it based on load: openclaw llm scale --model-id gpt-finetuned --scale-to 10 --metric request_rate.
Monitor and Rollback: Run monitoring: openclaw llm monitor --model-id gpt-finetuned --duration 300. If issues arise, rollback: openclaw llm rollback --model-id gpt-finetuned --version v2.1.

Graph Relationships

Related to: aimlops (cluster), llm (tag), mlops (tag)
Depends on: authentication services for API access
Integrates with: monitoring tools, deployment orchestrators like Kubernetes
Conflicts with: none specified; ensure no overlapping model IDs in multi-skill environments

alphaonedev/llm-ops

skills/aimlops/llm-ops/SKILL.md

Manages deployment, scaling, and monitoring of large language models in AI/ML operations environments.

2 stars

devops

Updated Apr 3, 2026

$ install --global

skillsauth

npx skillsauth add alphaonedev/openclaw-graph llm-ops

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 3, 2026, 8:41 PM8.0s1 file scanned

SKILL.md

name:: llm-ops
cluster:: aimlops
description:: Manages deployment, scaling, and monitoring of large language models in AI/ML operations environments.
tags:: ["llm","mlops","aimlops"]
dependencies:: []
composes:: []
similar_to:: []
called_by:: []
authorization_required:: false
scope:: general
model_hint:: claude-sonnet
embedding_hint:: llm operations mlops ai deployment scaling monitoring

llm-ops

Purpose

When to Use

Key Capabilities

Deploy LLMs to cloud providers (e.g., AWS, GCP) with automatic containerization.
Scale instances dynamically based on metrics like CPU usage or request volume.
Monitor key metrics including latency, throughput, and error rates via integrated dashboards.
Handle model versioning and rollbacks for safe updates.
Integrate with logging tools like ELK stack for detailed tracing.

Usage Patterns

Common Commands/API

Deploy Command: openclaw llm deploy --model-id my-llm-123 --env production --replicas 3 --config-path ./config.json
- Example config.json: {"image": "my-llm-image:v1", "resources": {"cpu": "2", "memory": "4Gi"}}
- Code snippet (Python):
```
import requests
response = requests.post('https://api.openclaw.ai/llm/deploy', json={'model_id': 'my-llm-123', 'replicas': 3}, headers={'Authorization': f'Bearer {os.environ["OPENCLAW_API_KEY"]}'})
print(response.json())
```
Scale Command: openclaw llm scale --model-id my-llm-123 --scale-to 5 --metric cpu_utilization
- This adjusts replicas based on the specified metric threshold (e.g., >80% CPU).
- API Endpoint: POST /api/llm/scale with body: {"model_id": "my-llm-123", "scale_to": 5}
Monitor Command: openclaw llm monitor --model-id my-llm-123 --duration 60 --output json
- Outputs metrics to stdout or file; use --alert-threshold 0.9 for CPU alerts.
- API Endpoint: GET /api/llm/metrics?model_id=my-llm-123&duration=60
Rollback Command: openclaw llm rollback --model-id my-llm-123 --version v1.0
- Reverts to a previous model version; requires versioning enabled in config.

Config formats are JSON-based, e.g.,:

{
  "model_id": "my-llm-123",
  "deployment": {
    "type": "kubernetes",
    "namespace": "aiml"
  }
}

Integration Notes

Error Handling

try:
    response = requests.post('https://api.openclaw.ai/llm/deploy', ...)
    response.raise_for_status()
except requests.exceptions.HTTPError as e:
    print(f"Error: {e.response.status_code} - {e.response.text}")
    sys.exit(1)

Log errors to files using --log-file errors.log in CLI commands and monitor for common issues like resource limits.

Concrete Usage Examples

Deploy and Scale an LLM: First, export your API key: export OPENCLAW_API_KEY=abc123. Deploy a model with: openclaw llm deploy --model-id gpt-finetuned --env staging --replicas 2. Then, scale it based on load: openclaw llm scale --model-id gpt-finetuned --scale-to 10 --metric request_rate.
Monitor and Rollback: Run monitoring: openclaw llm monitor --model-id gpt-finetuned --duration 300. If issues arise, rollback: openclaw llm rollback --model-id gpt-finetuned --version v2.1.

Graph Relationships

Related to: aimlops (cluster), llm (tag), mlops (tag)
Depends on: authentication services for API access
Integrates with: monitoring tools, deployment orchestrators like Kubernetes
Conflicts with: none specified; ensure no overlapping model IDs in multi-skill environments

Related Skills

alphaonedev/web

tools

VerifiedTrustedCommunity

Root web development: project structure, tooling selection, deployment decisions

2SKILL.mdUpdated Apr 3, 2026

alphaonedev/web-wasm

development

VerifiedTrustedCommunity

WebAssembly: Rust/Go/C to WASM, wasm-bindgen, Emscripten, WASM Component Model

2SKILL.mdUpdated Apr 3, 2026

alphaonedev/web-vue

development

VerifiedTrustedCommunity

Vue 3: Composition API script setup, Pinia, Vue Router 4, SFCs, Vite, Nuxt 3

2SKILL.mdUpdated Apr 3, 2026

alphaonedev/web-tailwind

tools

VerifiedTrustedCommunity

Tailwind CSS 4: utility classes, config, JIT, arbitrary values, darkMode, plugins, shadcn/ui

2SKILL.mdUpdated Apr 3, 2026

alphaonedev/web-tailwind

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/alphaonedev/openclaw-graph.git

# Copy into Claude Code skills folder (global)
cp -r openclaw-graph/skills/aimlops/llm-ops ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

alphaonedev/openclaw-graph

2 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT