Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

tokenized2027/observability-setup

Name: observability-setup
Author: tokenized2027

claude-code-framework/essential/skills/operations/observability-setup/SKILL.md

npx skillsauth add tokenized2027/claude-initilization-v7 observability-setup

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Observability Setup

Monitoring, structured logging, and alerting so your mini PC agents don't silently fail at 3 AM.

Instructions

Layer 1: Health Checks

Every service gets a /health endpoint:

// app/api/health/route.ts
import { NextResponse } from 'next/server'

export async function GET() {
  const checks = {
    api: 'ok',
    database: await checkDatabase(),
    redis: await checkRedis(),
    uptime: process.uptime(),
    memory: process.memoryUsage().heapUsed / 1024 / 1024, // MB
  }

  const healthy = checks.database === 'ok' && checks.redis === 'ok'

  return NextResponse.json(checks, {
    status: healthy ? 200 : 503,
  })
}

async function checkDatabase(): Promise<string> {
  try {
    await db.query('SELECT 1')
    return 'ok'
  } catch {
    return 'error'
  }
}

async function checkRedis(): Promise<string> {
  try {
    await redis.ping()
    return 'ok'
  } catch {
    return 'error'
  }
}

Layer 2: Structured Logging

Never use console.log with raw strings. Use structured JSON logs:

// lib/logger.ts
type LogLevel = 'debug' | 'info' | 'warn' | 'error'

interface LogEntry {
  level: LogLevel
  message: string
  service: string
  timestamp: string
  [key: string]: unknown
}

export function createLogger(service: string) {
  return {
    info: (message: string, data?: Record<string, unknown>) =>
      log('info', service, message, data),
    warn: (message: string, data?: Record<string, unknown>) =>
      log('warn', service, message, data),
    error: (message: string, data?: Record<string, unknown>) =>
      log('error', service, message, data),
  }
}

function log(level: LogLevel, service: string, message: string, data?: Record<string, unknown>) {
  const entry: LogEntry = {
    level,
    message,
    service,
    timestamp: new Date().toISOString(),
    ...data,
  }
  
  // JSON logs are parseable by any log aggregator
  console.log(JSON.stringify(entry))
}

// Usage:
// const log = createLogger('orchestrator')
// log.info('Task routed', { agent: 'frontend-developer', taskId: 'abc123' })
// log.error('Agent failed', { agent: 'backend-developer', error: err.message })

Layer 3: Cron Health Monitor

Simple bash script for mini PC monitoring:

#!/bin/bash
# ~//scripts/health-monitor.sh
# Run via cron every 5 minutes: */5 * * * * ~//scripts/health-monitor.sh

SERVICES=("http://localhost:8000/health" "http://localhost:3000/api/health")
TELEGRAM_BOT_TOKEN="${TELEGRAM_BOT_TOKEN}"
TELEGRAM_CHAT_ID="${TELEGRAM_CHAT_ID}"
LOG_FILE="/var/log/health-monitor.log"

send_alert() {
  local message="$1"
  curl -s -X POST "https://api.telegram.org/bot${TELEGRAM_BOT_TOKEN}/sendMessage" \
    -d chat_id="${TELEGRAM_CHAT_ID}" \
    -d text="🚨 MINI PC ALERT: ${message}" \
    -d parse_mode="Markdown" > /dev/null
}

for url in "${SERVICES[@]}"; do
  response=$(curl -s -o /dev/null -w "%{http_code}" --max-time 10 "$url")
  
  if [ "$response" != "200" ]; then
    echo "$(date -Iseconds) FAIL $url (HTTP $response)" >> "$LOG_FILE"
    send_alert "Service DOWN: \`$url\` returned HTTP $response"
  else
    echo "$(date -Iseconds) OK   $url" >> "$LOG_FILE"
  fi
done

# Check disk space
disk_usage=$(df / | awk 'NR==2 {print $5}' | tr -d '%')
if [ "$disk_usage" -gt 85 ]; then
  send_alert "Disk usage at ${disk_usage}% — clean up needed"
fi

# Check memory
mem_available=$(free -m | awk 'NR==2 {print $7}')
if [ "$mem_available" -lt 500 ]; then
  send_alert "Low memory: only ${mem_available}MB available"
fi

# Check Docker containers
stopped=$(docker ps -a --filter "status=exited" --format "{{.Names}}" | head -5)
if [ -n "$stopped" ]; then
  send_alert "Stopped containers: \`$stopped\`"
fi

Layer 4: Docker Compose Logging

# In docker-compose.yml — add to every service
services:
  app:
    logging:
      driver: "json-file"
      options:
        max-size: "10m"
        max-file: "3"

View logs:

# All services
docker-compose logs --tail 100 -f

# Specific service
docker-compose logs --tail 50 orchestrator

# Search for errors
docker-compose logs | grep '"level":"error"'

Layer 5: Agent Activity Dashboard (Optional)

Track agent performance over time:

-- Create a simple metrics table
CREATE TABLE agent_metrics (
  id SERIAL PRIMARY KEY,
  agent_name TEXT NOT NULL,
  task_id TEXT NOT NULL,
  status TEXT NOT NULL, -- 'started' | 'completed' | 'failed'
  tokens_used INTEGER,
  duration_ms INTEGER,
  created_at TIMESTAMPTZ DEFAULT NOW()
);

-- Useful queries
-- Agent success rate last 24h
SELECT agent_name,
  COUNT(*) FILTER (WHERE status = 'completed') AS successes,
  COUNT(*) FILTER (WHERE status = 'failed') AS failures,
  ROUND(100.0 * COUNT(*) FILTER (WHERE status = 'completed') / COUNT(*), 1) AS success_rate
FROM agent_metrics
WHERE created_at > NOW() - INTERVAL '24 hours'
GROUP BY agent_name;

-- Average task duration by agent
SELECT agent_name, AVG(duration_ms) / 1000 AS avg_seconds
FROM agent_metrics
WHERE status = 'completed'
GROUP BY agent_name;

Quick Setup Checklist

- [ ] Health endpoint on every service
- [ ] Structured JSON logging (not raw console.log)
- [ ] Docker log rotation configured
- [ ] Cron health monitor running every 5 min
- [ ] Telegram alerts connected
- [ ] Disk and memory alerts at 85% / 500MB
- [ ] Stopped container detection

When to Use This Skill

✅ Use observability-setup when:

Deploying a new service on the mini PC
Services fail silently
Need to diagnose intermittent issues
Setting up the monitoring stack for the first time

❌ Don't use for:

Application-level debugging (use systematic-debugging)
Docker container issues (use docker-debugger)
Cost monitoring (use cost-optimizer)

tokenized2027/observability-setup

claude-code-framework/essential/skills/operations/observability-setup/SKILL.md

Set up monitoring, logging, and alerting for mini PC services and autonomous agents. Use when deploying new services, setting up health checks, or diagnosing reliability issues. Triggers on "monitoring", "logging", "alerts", "health check", "uptime", "service down", "observability".

testing

Updated Apr 16, 2026

$ install --global

skillsauth

npx skillsauth add tokenized2027/claude-initilization-v7 observability-setup

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 16, 2026, 3:28 PM5.8s1 file scanned

SKILL.md

name:: observability-setup
description:: Set up monitoring, logging, and alerting for mini PC services and autonomous agents. Use when deploying new services, setting up health checks, or diagnosing reliability issues. Triggers on "monitoring", "logging", "alerts", "health check", "uptime", "service down", "observability".
author:: Mastering Claude Code (adapted from community contributions)
version:: 1.0.0
category:: operations
source:: community-contributed
license:: MIT

Observability Setup

Monitoring, structured logging, and alerting so your mini PC agents don't silently fail at 3 AM.

Instructions

Layer 1: Health Checks

Every service gets a /health endpoint:

// app/api/health/route.ts
import { NextResponse } from 'next/server'

export async function GET() {
  const checks = {
    api: 'ok',
    database: await checkDatabase(),
    redis: await checkRedis(),
    uptime: process.uptime(),
    memory: process.memoryUsage().heapUsed / 1024 / 1024, // MB
  }

  const healthy = checks.database === 'ok' && checks.redis === 'ok'

  return NextResponse.json(checks, {
    status: healthy ? 200 : 503,
  })
}

async function checkDatabase(): Promise<string> {
  try {
    await db.query('SELECT 1')
    return 'ok'
  } catch {
    return 'error'
  }
}

async function checkRedis(): Promise<string> {
  try {
    await redis.ping()
    return 'ok'
  } catch {
    return 'error'
  }
}

Layer 2: Structured Logging

Never use console.log with raw strings. Use structured JSON logs:

// lib/logger.ts
type LogLevel = 'debug' | 'info' | 'warn' | 'error'

interface LogEntry {
  level: LogLevel
  message: string
  service: string
  timestamp: string
  [key: string]: unknown
}

export function createLogger(service: string) {
  return {
    info: (message: string, data?: Record<string, unknown>) =>
      log('info', service, message, data),
    warn: (message: string, data?: Record<string, unknown>) =>
      log('warn', service, message, data),
    error: (message: string, data?: Record<string, unknown>) =>
      log('error', service, message, data),
  }
}

function log(level: LogLevel, service: string, message: string, data?: Record<string, unknown>) {
  const entry: LogEntry = {
    level,
    message,
    service,
    timestamp: new Date().toISOString(),
    ...data,
  }
  
  // JSON logs are parseable by any log aggregator
  console.log(JSON.stringify(entry))
}

// Usage:
// const log = createLogger('orchestrator')
// log.info('Task routed', { agent: 'frontend-developer', taskId: 'abc123' })
// log.error('Agent failed', { agent: 'backend-developer', error: err.message })

Layer 3: Cron Health Monitor

Simple bash script for mini PC monitoring:

#!/bin/bash
# ~//scripts/health-monitor.sh
# Run via cron every 5 minutes: */5 * * * * ~//scripts/health-monitor.sh

SERVICES=("http://localhost:8000/health" "http://localhost:3000/api/health")
TELEGRAM_BOT_TOKEN="${TELEGRAM_BOT_TOKEN}"
TELEGRAM_CHAT_ID="${TELEGRAM_CHAT_ID}"
LOG_FILE="/var/log/health-monitor.log"

send_alert() {
  local message="$1"
  curl -s -X POST "https://api.telegram.org/bot${TELEGRAM_BOT_TOKEN}/sendMessage" \
    -d chat_id="${TELEGRAM_CHAT_ID}" \
    -d text="🚨 MINI PC ALERT: ${message}" \
    -d parse_mode="Markdown" > /dev/null
}

for url in "${SERVICES[@]}"; do
  response=$(curl -s -o /dev/null -w "%{http_code}" --max-time 10 "$url")
  
  if [ "$response" != "200" ]; then
    echo "$(date -Iseconds) FAIL $url (HTTP $response)" >> "$LOG_FILE"
    send_alert "Service DOWN: \`$url\` returned HTTP $response"
  else
    echo "$(date -Iseconds) OK   $url" >> "$LOG_FILE"
  fi
done

# Check disk space
disk_usage=$(df / | awk 'NR==2 {print $5}' | tr -d '%')
if [ "$disk_usage" -gt 85 ]; then
  send_alert "Disk usage at ${disk_usage}% — clean up needed"
fi

# Check memory
mem_available=$(free -m | awk 'NR==2 {print $7}')
if [ "$mem_available" -lt 500 ]; then
  send_alert "Low memory: only ${mem_available}MB available"
fi

# Check Docker containers
stopped=$(docker ps -a --filter "status=exited" --format "{{.Names}}" | head -5)
if [ -n "$stopped" ]; then
  send_alert "Stopped containers: \`$stopped\`"
fi

Layer 4: Docker Compose Logging

# In docker-compose.yml — add to every service
services:
  app:
    logging:
      driver: "json-file"
      options:
        max-size: "10m"
        max-file: "3"

View logs:

# All services
docker-compose logs --tail 100 -f

# Specific service
docker-compose logs --tail 50 orchestrator

# Search for errors
docker-compose logs | grep '"level":"error"'

Layer 5: Agent Activity Dashboard (Optional)

Track agent performance over time:

-- Create a simple metrics table
CREATE TABLE agent_metrics (
  id SERIAL PRIMARY KEY,
  agent_name TEXT NOT NULL,
  task_id TEXT NOT NULL,
  status TEXT NOT NULL, -- 'started' | 'completed' | 'failed'
  tokens_used INTEGER,
  duration_ms INTEGER,
  created_at TIMESTAMPTZ DEFAULT NOW()
);

-- Useful queries
-- Agent success rate last 24h
SELECT agent_name,
  COUNT(*) FILTER (WHERE status = 'completed') AS successes,
  COUNT(*) FILTER (WHERE status = 'failed') AS failures,
  ROUND(100.0 * COUNT(*) FILTER (WHERE status = 'completed') / COUNT(*), 1) AS success_rate
FROM agent_metrics
WHERE created_at > NOW() - INTERVAL '24 hours'
GROUP BY agent_name;

-- Average task duration by agent
SELECT agent_name, AVG(duration_ms) / 1000 AS avg_seconds
FROM agent_metrics
WHERE status = 'completed'
GROUP BY agent_name;

Quick Setup Checklist

- [ ] Health endpoint on every service
- [ ] Structured JSON logging (not raw console.log)
- [ ] Docker log rotation configured
- [ ] Cron health monitor running every 5 min
- [ ] Telegram alerts connected
- [ ] Disk and memory alerts at 85% / 500MB
- [ ] Stopped container detection

When to Use This Skill

✅ Use observability-setup when:

Deploying a new service on the mini PC
Services fail silently
Need to diagnose intermittent issues
Setting up the monitoring stack for the first time

❌ Don't use for:

Application-level debugging (use systematic-debugging)
Docker container issues (use docker-debugger)
Cost monitoring (use cost-optimizer)

Related Skills

tokenized2027/systematic-debugging

development

VerifiedTrustedCommunity

Methodical debugging using reproducible steps, instrumentation, and root-cause analysis. Use when something is broken and you don't know why. Triggers on "bug", "broken", "not working", "error", "fails intermittently", "regression", "unexpected behavior".

SKILL.mdUpdated Apr 16, 2026

tokenized2027/systematic-debugging

tokenized2027/prompt-engineering

development

VerifiedTrustedCommunity

Optimize prompts for Claude Code agents, API calls, and multi-agent orchestration. Use when writing system prompts, agent instructions, or refining LLM interactions. Triggers on "improve prompt", "write a prompt", "agent instructions", "system prompt", "prompt not working", "LLM output quality".

SKILL.mdUpdated Apr 16, 2026

tokenized2027/prompt-engineering

tokenized2027/brainstorming

tools

VerifiedTrustedCommunity

Structured ideation and design review before any creative or constructive work. Use before building features, components, architecture, dashboards, or automation workflows. Triggers on "plan this", "design this", "brainstorm", "think through", "what should we build", "how should I approach".

SKILL.mdUpdated Apr 16, 2026

tokenized2027/brainstorming

tokenized2027/test-scaffold

testing

VerifiedTrustedCommunity

Generates test files for components and functions with setup, basic tests, and mocks. Use when user says "add tests", "create test", "test this component", or mentions testing.

SKILL.mdUpdated Apr 16, 2026

tokenized2027/test-scaffold

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/tokenized2027/claude-initilization-v7.git

# Copy into Claude Code skills folder (global)
cp -r claude-initilization-v7/claude-code-framework/essential/skills/operations/observability-setup ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

tokenized2027/claude-initilization-v7

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT