Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

thesoftwarehouse/tsh-implementing-kubernetes

Name: tsh-implementing-kubernetes
Author: thesoftwarehouse

.github/skills/tsh-implementing-kubernetes/SKILL.md

npx skillsauth add thesoftwarehouse/copilot-collections tsh-implementing-kubernetes

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Kubernetes Patterns

When to Use

Deploying applications to Kubernetes
Designing Deployment, StatefulSet, or Job configurations
Implementing auto-scaling (HPA, VPA, KEDA)
Creating or modifying Helm charts
Setting up ingress, networking, and service mesh
Configuring resource requests, limits, and QoS

Project Detection

Check which Kubernetes tooling the project uses:

helm/ or Chart.yaml → Helm charts
kustomize/ or kustomization.yaml → Kustomize
k8s/ or kubernetes/ with *.yaml → Raw manifests
skaffold.yaml → Skaffold for local dev
argocd/ or Application resources → ArgoCD GitOps
flux-system/ or Kustomization CRD → Flux GitOps

Use context7 to look up Kubernetes API versions and syntax.

Workload Type Decision

| Workload Type | Use When | |---------------|----------| | Deployment | Stateless apps, web servers, APIs | | StatefulSet | Databases, stateful apps needing stable identity | | DaemonSet | Node-level agents (logging, monitoring) | | Job | One-time tasks, batch processing | | CronJob | Scheduled recurring tasks |

Deployment Configuration

Resource Management

resources:
  requests:    # Scheduler uses for placement
    memory: "256Mi"
    cpu: "100m"
  limits:      # Kubelet enforces these
    memory: "512Mi"
    cpu: "500m"

Rules:

Always set requests (required for scheduling)
Set memory limits to prevent OOM impact on node
CPU limits optional (can cause throttling)
Request:Limit ratio of 1:2 is good starting point

QoS Classes

| Class | Condition | Eviction Priority | |-------|-----------|-------------------| | Guaranteed | requests == limits (all containers) | Last to evict | | Burstable | requests < limits | Medium | | BestEffort | No requests or limits | First to evict |

Rule: Production workloads should be Guaranteed or Burstable, never BestEffort.

Probes Configuration

livenessProbe:      # Restarts container if fails
  httpGet:
    path: /healthz
    port: 8080
  initialDelaySeconds: 15
  periodSeconds: 10
  failureThreshold: 3

readinessProbe:     # Removes from Service if fails
  httpGet:
    path: /ready
    port: 8080
  initialDelaySeconds: 5
  periodSeconds: 5
  failureThreshold: 3

startupProbe:       # Delays liveness until startup complete
  httpGet:
    path: /healthz
    port: 8080
  failureThreshold: 30
  periodSeconds: 10

Rules:

Always configure readinessProbe (graceful traffic handling)
Use startupProbe for slow-starting apps (instead of long initialDelaySeconds)
livenessProbe should check app health, not dependencies

Pod Disruption Budget

apiVersion: policy/v1
kind: PodDisruptionBudget
metadata:
  name: api-pdb
spec:
  minAvailable: 2        # OR maxUnavailable: 1
  selector:
    matchLabels:
      app: api

Rule: Always create PDB for production workloads to ensure availability during node drains.

Pod Anti-Affinity

affinity:
  podAntiAffinity:
    preferredDuringSchedulingIgnoredDuringExecution:
      - weight: 100
        podAffinityTerm:
          labelSelector:
            matchLabels:
              app: api
          topologyKey: kubernetes.io/hostname

Rule: Spread replicas across nodes/zones for high availability.

Scaling Strategies

Horizontal Pod Autoscaler (HPA)

apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: api-hpa
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: api
  minReplicas: 2
  maxReplicas: 10
  metrics:
    - type: Resource
      resource:
        name: cpu
        target:
          type: Utilization
          averageUtilization: 70
  behavior:
    scaleDown:
      stabilizationWindowSeconds: 300  # Prevent flapping

Scaling Decision Matrix

| Scaling Type | Use When | Tool | |--------------|----------|------| | CPU-based | General compute workloads | HPA | | Memory-based | Memory-intensive apps | HPA | | Custom metrics | Queue depth, request rate | HPA + Prometheus Adapter | | Event-driven | Message queues, scheduled jobs | KEDA | | Vertical | Right-sizing requests/limits | VPA |

Helm Chart Structure

mychart/
├── Chart.yaml          # Chart metadata
├── values.yaml         # Default values
├── values-dev.yaml     # Environment overrides
├── values-prod.yaml
├── templates/
│   ├── _helpers.tpl    # Template helpers
│   ├── deployment.yaml
│   ├── service.yaml
│   ├── ingress.yaml
│   ├── hpa.yaml
│   ├── pdb.yaml
│   └── configmap.yaml
└── charts/             # Dependencies

Helm Best Practices

# values.yaml - use structured defaults
replicaCount: 2

image:
  repository: myapp
  tag: ""  # Override in CI, not here
  pullPolicy: IfNotPresent

resources:
  requests:
    memory: "256Mi"
    cpu: "100m"
  limits:
    memory: "512Mi"

# Enable/disable optional components
autoscaling:
  enabled: true
  minReplicas: 2
  maxReplicas: 10

Rules:

Don't hardcode image tags in values.yaml (set in CI)
Use {{ include "mychart.fullname" . }} for resource names
Provide sensible defaults, override per environment

Ingress Configuration

Ingress Class Decision

| Ingress Controller | Use When | |-------------------|----------| | nginx-ingress | General purpose, widely supported | | AWS ALB | AWS-native, integrated with WAF/ACM | | Traefik | Simple setup, automatic HTTPS | | Istio Gateway | Service mesh already in use |

Ingress Example (nginx)

apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
  name: api-ingress
  annotations:
    nginx.ingress.kubernetes.io/ssl-redirect: "true"
    cert-manager.io/cluster-issuer: "letsencrypt-prod"
spec:
  ingressClassName: nginx
  tls:
    - hosts:
        - api.example.com
      secretName: api-tls
  rules:
    - host: api.example.com
      http:
        paths:
          - path: /
            pathType: Prefix
            backend:
              service:
                name: api
                port:
                  number: 80

Security Context

securityContext:
  runAsNonRoot: true
  runAsUser: 1000
  runAsGroup: 1000
  fsGroup: 1000
  seccompProfile:
    type: RuntimeDefault

containers:
  - name: app
    securityContext:
      allowPrivilegeEscalation: false
      readOnlyRootFilesystem: true
      capabilities:
        drop:
          - ALL

Rule: Always run as non-root with minimal capabilities in production.

Process

Discover context → Check existing K8s manifests, Helm charts, Kustomize
Choose workload type → Deployment, StatefulSet, Job based on requirements
Configure resources → Set requests/limits based on profiling or estimates
Add probes → Configure readiness, liveness, and startup probes
Enable scaling → Add HPA/KEDA based on scaling requirements
Add resilience → PDB, pod anti-affinity, topology spread
Configure security → Security context, network policies
Validate → kubectl apply --dry-run=server, helm template

Checklist

[ ] Resource requests and limits defined
[ ] Readiness and liveness probes configured
[ ] PodDisruptionBudget created for production
[ ] Pod anti-affinity or topology spread configured
[ ] HPA configured for variable workloads
[ ] Security context with non-root user
[ ] Image pull policy appropriate (Never use latest in prod)
[ ] Labels consistent (app, version, environment)
[ ] Namespace isolation per environment

Anti-Patterns

| Don't | Do | |-------|-----| | Use latest image tag | Pin specific versions or SHA | | Skip resource requests | Always set requests for scheduling | | Single replica in production | Minimum 2 replicas with PDB | | Run as root | Use non-root user with minimal caps | | Missing readiness probe | Configure probes for graceful traffic | | kubectl apply in production | GitOps with ArgoCD/Flux | | Hardcode values in manifests | Use Helm values or Kustomize overlays | | Ignore pod eviction | Set PDB to maintain availability |

Related Skills

tsh-implementing-observability - For K8s monitoring and logging setup
tsh-implementing-ci-cd - For K8s deployment pipelines
tsh-managing-secrets - For K8s secret management patterns
tsh-implementing-terraform-modules - For provisioning K8s clusters

thesoftwarehouse/tsh-implementing-kubernetes

.github/skills/tsh-implementing-kubernetes/SKILL.md

Kubernetes deployment patterns, Helm charts, and cluster management. Use when deploying applications to K8s, designing workload configurations, implementing scaling strategies, or managing cluster resources.

206 stars

devops

Updated Apr 15, 2026

$ install --global

skillsauth

npx skillsauth add thesoftwarehouse/copilot-collections tsh-implementing-kubernetes

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 15, 2026, 11:58 PM9.1s1 file scanned

SKILL.md

name:: tsh-implementing-kubernetes
description:: Kubernetes deployment patterns, Helm charts, and cluster management. Use when deploying applications to K8s, designing workload configurations, implementing scaling strategies, or managing cluster resources.

Kubernetes Patterns

When to Use

Deploying applications to Kubernetes
Designing Deployment, StatefulSet, or Job configurations
Implementing auto-scaling (HPA, VPA, KEDA)
Creating or modifying Helm charts
Setting up ingress, networking, and service mesh
Configuring resource requests, limits, and QoS

Project Detection

Check which Kubernetes tooling the project uses:

helm/ or Chart.yaml → Helm charts
kustomize/ or kustomization.yaml → Kustomize
k8s/ or kubernetes/ with *.yaml → Raw manifests
skaffold.yaml → Skaffold for local dev
argocd/ or Application resources → ArgoCD GitOps
flux-system/ or Kustomization CRD → Flux GitOps

Use context7 to look up Kubernetes API versions and syntax.

Workload Type Decision

Deployment Configuration

Resource Management

resources:
  requests:    # Scheduler uses for placement
    memory: "256Mi"
    cpu: "100m"
  limits:      # Kubelet enforces these
    memory: "512Mi"
    cpu: "500m"

Rules:

Always set requests (required for scheduling)
Set memory limits to prevent OOM impact on node
CPU limits optional (can cause throttling)
Request:Limit ratio of 1:2 is good starting point

QoS Classes

Rule: Production workloads should be Guaranteed or Burstable, never BestEffort.

Probes Configuration

livenessProbe:      # Restarts container if fails
  httpGet:
    path: /healthz
    port: 8080
  initialDelaySeconds: 15
  periodSeconds: 10
  failureThreshold: 3

readinessProbe:     # Removes from Service if fails
  httpGet:
    path: /ready
    port: 8080
  initialDelaySeconds: 5
  periodSeconds: 5
  failureThreshold: 3

startupProbe:       # Delays liveness until startup complete
  httpGet:
    path: /healthz
    port: 8080
  failureThreshold: 30
  periodSeconds: 10

Rules:

Always configure readinessProbe (graceful traffic handling)
Use startupProbe for slow-starting apps (instead of long initialDelaySeconds)
livenessProbe should check app health, not dependencies

Pod Disruption Budget

apiVersion: policy/v1
kind: PodDisruptionBudget
metadata:
  name: api-pdb
spec:
  minAvailable: 2        # OR maxUnavailable: 1
  selector:
    matchLabels:
      app: api

Rule: Always create PDB for production workloads to ensure availability during node drains.

Pod Anti-Affinity

affinity:
  podAntiAffinity:
    preferredDuringSchedulingIgnoredDuringExecution:
      - weight: 100
        podAffinityTerm:
          labelSelector:
            matchLabels:
              app: api
          topologyKey: kubernetes.io/hostname

Rule: Spread replicas across nodes/zones for high availability.

Scaling Strategies

Horizontal Pod Autoscaler (HPA)

apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: api-hpa
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: api
  minReplicas: 2
  maxReplicas: 10
  metrics:
    - type: Resource
      resource:
        name: cpu
        target:
          type: Utilization
          averageUtilization: 70
  behavior:
    scaleDown:
      stabilizationWindowSeconds: 300  # Prevent flapping

Scaling Decision Matrix

Helm Chart Structure

mychart/
├── Chart.yaml          # Chart metadata
├── values.yaml         # Default values
├── values-dev.yaml     # Environment overrides
├── values-prod.yaml
├── templates/
│   ├── _helpers.tpl    # Template helpers
│   ├── deployment.yaml
│   ├── service.yaml
│   ├── ingress.yaml
│   ├── hpa.yaml
│   ├── pdb.yaml
│   └── configmap.yaml
└── charts/             # Dependencies

Helm Best Practices

# values.yaml - use structured defaults
replicaCount: 2

image:
  repository: myapp
  tag: ""  # Override in CI, not here
  pullPolicy: IfNotPresent

resources:
  requests:
    memory: "256Mi"
    cpu: "100m"
  limits:
    memory: "512Mi"

# Enable/disable optional components
autoscaling:
  enabled: true
  minReplicas: 2
  maxReplicas: 10

Rules:

Don't hardcode image tags in values.yaml (set in CI)
Use {{ include "mychart.fullname" . }} for resource names
Provide sensible defaults, override per environment

Ingress Configuration

Ingress Class Decision

Ingress Example (nginx)

apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
  name: api-ingress
  annotations:
    nginx.ingress.kubernetes.io/ssl-redirect: "true"
    cert-manager.io/cluster-issuer: "letsencrypt-prod"
spec:
  ingressClassName: nginx
  tls:
    - hosts:
        - api.example.com
      secretName: api-tls
  rules:
    - host: api.example.com
      http:
        paths:
          - path: /
            pathType: Prefix
            backend:
              service:
                name: api
                port:
                  number: 80

Security Context

securityContext:
  runAsNonRoot: true
  runAsUser: 1000
  runAsGroup: 1000
  fsGroup: 1000
  seccompProfile:
    type: RuntimeDefault

containers:
  - name: app
    securityContext:
      allowPrivilegeEscalation: false
      readOnlyRootFilesystem: true
      capabilities:
        drop:
          - ALL

Rule: Always run as non-root with minimal capabilities in production.

Process

Discover context → Check existing K8s manifests, Helm charts, Kustomize
Choose workload type → Deployment, StatefulSet, Job based on requirements
Configure resources → Set requests/limits based on profiling or estimates
Add probes → Configure readiness, liveness, and startup probes
Enable scaling → Add HPA/KEDA based on scaling requirements
Add resilience → PDB, pod anti-affinity, topology spread
Configure security → Security context, network policies
Validate → kubectl apply --dry-run=server, helm template

Checklist

[ ] Resource requests and limits defined
[ ] Readiness and liveness probes configured
[ ] PodDisruptionBudget created for production
[ ] Pod anti-affinity or topology spread configured
[ ] HPA configured for variable workloads
[ ] Security context with non-root user
[ ] Image pull policy appropriate (Never use latest in prod)
[ ] Labels consistent (app, version, environment)
[ ] Namespace isolation per environment

Anti-Patterns

Related Skills

tsh-implementing-observability - For K8s monitoring and logging setup
tsh-implementing-ci-cd - For K8s deployment pipelines
tsh-managing-secrets - For K8s secret management patterns
tsh-implementing-terraform-modules - For provisioning K8s clusters

Related Skills

thesoftwarehouse/tsh-writing-hooks

development

VerifiedTrustedCommunity

Custom hook and composable patterns — naming, composition, stable return shapes, lifecycle cleanup, and testing strategies. Use when writing reusable logic units (React hooks, Vue composables), refactoring logic into hooks, debugging hook behavior, or reviewing hook implementations.

206SKILL.mdUpdated Apr 15, 2026

thesoftwarehouse/tsh-writing-hooks

thesoftwarehouse/tsh-ui-verifying

testing

VerifiedTrustedCommunity

UI verification criteria, structure checklists, severity definitions, and tolerance rules for comparing implementations against Figma designs. Use for verifying UI matches design, understanding what to check, and determining acceptable differences.

206SKILL.mdUpdated Apr 15, 2026

thesoftwarehouse/tsh-ui-verifying

thesoftwarehouse/tsh-transcript-processing

development

VerifiedTrustedCommunity

Clean raw workshop or meeting transcripts from small talk, filler words, and off-topic tangents. Extract and structure business-relevant content into a standardized format with discussion topics, key decisions, action items, and open questions.

206SKILL.mdUpdated Apr 15, 2026

thesoftwarehouse/tsh-transcript-processing

thesoftwarehouse/tsh-technical-context-discovering

development

VerifiedTrustedCommunity

Discover and establish technical context before implementing any feature. Prioritize project instructions, existing codebase patterns, and external documentation in that order. Use for any task requiring understanding of project conventions, coding standards, architecture patterns, and established practices before writing code.

206SKILL.mdUpdated Apr 15, 2026

thesoftwarehouse/tsh-technical-context-discovering

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/thesoftwarehouse/copilot-collections.git

# Copy into Claude Code skills folder (global)
cp -r copilot-collections/.github/skills/tsh-implementing-kubernetes ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

thesoftwarehouse/copilot-collections

206 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT