skills/platform/health/SKILL.md
System health overview from discovered observability sources. Aggregates errors, performance metrics, and SLO status across services. Correlates with deployments and code changes. Use for proactive health monitoring and post-deployment validation. Use when: checking aggregated system health, validating a post-deployment state, or correlating production status with recent changes.
npx skillsauth add mikeparcewski/wicked-garden healthInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Aggregate system health from discovered observability sources with deployment correlation.
Use capability-based discovery to find available integrations:
# Discover available integrations via capability detection
# Scan for capabilities by analyzing server descriptions and resources:
# - error-tracking capability: Exception/error tracking and reporting
# - apm capability: Application performance monitoring and metrics
# - logging capability: Log aggregation, search, and analysis
# - tracing capability: Distributed tracing and service mapping
# - telemetry capability: Metrics collection and custom instrumentation
For each discovered source, collect:
HEALTHY: All metrics within SLO, no active alerts, stable trends DEGRADED: Some metrics elevated, minor alerts, or negative trends CRITICAL: SLO violations, critical alerts, or severe degradation
Check for recent changes that might impact health:
Based on health status:
This skill discovers integrations at runtime based on capability:
| Capability | What to Look For | Provides | |------------|------------------|----------| | error-tracking | Exception tracking, error reporting, crash analytics | Error rates, stack traces, user impact | | apm | Performance monitoring, service metrics, observability | Latency, throughput, service health | | logging | Log aggregation, log search, log analysis | Log aggregation, search, patterns | | tracing | Distributed tracing, request tracing, trace analysis | Distributed traces, dependencies | | telemetry | Metrics collection, custom instrumentation, time-series data | Custom metrics, instrumentation |
Fallback: If no integrations found, perform local analysis via wicked-garden:search for error patterns in code.
See refs/sources.md for detailed capability discovery patterns.
## System Health Report
**Overall Status**: [HEALTHY | DEGRADED | CRITICAL]
**Assessment Time**: {timestamp}
**Data Sources**: {list of integrations used}
### Health Summary
| Service | Status | Error Rate | Latency (p95) | SLO Status |
|---------|--------|------------|---------------|------------|
| {service} | {status} | {rate} | {latency} | {✓ or ✗} |
### Issues Detected
[For each issue]
**{Service}: {Issue Description}**
- Severity: [CRITICAL | HIGH | MEDIUM | LOW]
- Started: {timestamp}
- Metric: {specific metric and values}
- Pattern: {error pattern or behavior}
- Correlation: {deployment or change if found}
- Blast Radius: {impact scope}
### Trends (24h)
- Error Rates: {trend with percentage}
- Latency: {trend with percentage}
- Traffic: {trend with percentage}
### Recommendations
**Immediate**:
{critical actions needed now}
**Short-term**:
{optimizations and improvements}
**Capacity**:
{capacity planning insights}
Error rates or latency increase after deployment. Correlate metrics with deployment time and consider rollback.
Metrics slowly degrading over hours/days. Investigate memory leaks, growing data, cache efficiency.
Performance degrades with traffic spikes. Check capacity utilization and scaling policies.
Single service failure causes downstream issues. Use traces to identify root cause and implement circuit breakers.
When crew enters build phase:
Emit events:
observe:health:checked:successobserve:health:degraded:warningobserve:health:critical:failureWhen debugging issues, provide observability context:
development
--- name: large-scale-migration description: How to execute a LARGE MECHANICAL change across any codebase with LEVERAGE instead of an agent-grind or hand-edits — a cross-cutting migration, refactor, rename, dialect/framework/DB port, library adoption, or bulk transform. The map→transform→gate pattern: a deterministic transform driven by a source-of-truth map, proven by a differential-equivalence gate. Use when the work is "migrate all X to Y", "rename Z everywhere", "port to a new DB/dialect/fra
testing
v11 LLM-based work-shape classifier. Replaces the regex archetype detector with the model's own reasoning. Reads the user's prompt, picks the right archetype(s) from the catalog, identifies signals (blast_radius, novelty, reversibility, etc.), and persists to SessionState so subsequent turns steer correctly. Use when: the prompt_submit hook emitted a `<wg classify-due />` directive, OR explicitly invoked at session start, OR when re-classifying after the user changes scope mid-session.
tools
v11 work-shape archetype runner. When a prompt has been routed to one of the 9 archetypes (triage, explore, specify, decide, ship, review, incident, build, migrate), this skill is the entry point. It picks the right per-archetype playbook from refs/ and executes the phase shape declared in `.claude-plugin/archetypes.json`. Use when: a `<wg archetype="X">` or `<wg archetypes>` system-reminder tag appears, an explicit "let's run the X archetype" request, or when one of the per-archetype slash commands resolves to this skill.
development
Show or set the session intent variable. Intent gates how loud the framework is — simple-edit (silent), feature/research (synthesis directive), rigor (full crew context). Auto-detected on turn 1; this skill overrides explicitly. Sticky for the session. Use when: "set intent", "intent override", "/wicked-garden:intent", "make the framework quiet", "force rigor", "what's my intent".