bagelhole

162 verified skills2,986 total stars

model-serving-kubernetes

Deploy ML models on Kubernetes with KServe (formerly KFServing) and NVIDIA Triton Inference Server. Includes canary deployments, autoscaling, model versioning, A/B testing, and GPU resource management for production model serving.

testing28

openclaw-security-hardening

Harden OpenClaw self-hosted environments with baseline host controls, auth tightening, secret handling, network segmentation, and safe update/rollback workflows. Use when deploying OpenClaw in home labs, startups, or production-like local AI infrastructure.

testing28

sre-dashboards

Design and operationalize SRE dashboards that surface reliability, latency, error, saturation, and capacity signals across services. Use when building observability views for SLOs, incident response, and executive reliability reporting.

development28

vector-database-ops

Deploy, manage, and optimize vector databases for AI applications. Covers Qdrant, Weaviate, pgvector, and Pinecone — collection management, indexing strategies, backup, and performance tuning for production RAG and semantic search workloads.

devops28

ai-pipeline-orchestration

Orchestrate AI/ML pipelines for data ingestion, model training, batch inference, and RAG indexing using Prefect, Airflow, or Dagster. Build reliable, observable, and retriable workflows for production AI systems.

development28

llm-caching

Implement multi-layer LLM caching with exact match, semantic similarity, and provider-side prompt caching. Reduce API costs by 30–70%, cut latency, and improve throughput using Redis, GPTCache, and provider caching APIs.

development28

llm-cost-optimization

Reduce LLM API and infrastructure costs through model selection, prompt caching, batching, caching, quantization, and self-hosting strategies. Track spend by team and model, set budgets, and implement cost-aware routing.

development28

gitlab-ci

Configure GitLab CI/CD pipelines and runners for automated building, testing, and deployment. Create .gitlab-ci.yml configurations, manage runners, and implement DevOps workflows. Use when working with GitLab repositories or self-hosted GitLab instances.

bagelhole

model-serving-kubernetes

openclaw-security-hardening

sre-dashboards

vector-database-ops

ai-pipeline-orchestration

llm-caching

llm-cost-optimization

gitlab-ci

ebpf-observability

kubernetes-ops

platform-engineering

azure-aks

terraform-azure

gcp-gke

mongodb

firebase-app-platform

systemd-services

object-storage

ai-security-hardening

llm-app-security

linux-hardening

ssl-tls-management

zero-trust

sast-scanning

sops-encryption

access-review

agent-observability

ai-agent-security

ai-coding-agent-guardrails

ai-inference-service-mesh

ai-red-teaming

alerting-oncall

argocd-gitops

arm-templates

asset-inventory

audit-logging

aws-cost-optimization

aws-ec2

aws-ecs-fargate

aws-iam

aws-lambda

aws-rds

aws-s3

aws-secrets-manager

aws-vpc

azure-devops

azure-functions

azure-monitor-audit

azure-sql

azure-vms

backup-recovery

blue-green-deploy

business-continuity

cdn-setup

change-management

circleci

cloudflare-r2

cloudflare-workers

container-hardening

container-registries

container-scanning

convex-backend

dast-scanning

database-backups

dependency-scanning

devcontainers-nix

disaster-recovery

dns-management

docker-compose

docker-management

elk-stack

firewall-config

gcp-cloud-functions

gcp-compute

gcp-networking

gcp-secret-manager

github-actions

git-workflow

gpu-kubernetes-operations