skills/dashboard/SKILL.md
Real-time harness observability dashboard. Reads campaigns, fleet sessions, telemetry, and pending queues to present a snapshot of harness state at a glance. Invoked by /dashboard, /do status, or phrases like "what's happening" and "show activity".
npx skillsauth add SethGammon/Citadel dashboardInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
/dashboardNone required. Works with whatever state exists on disk.
Run the local dashboard implementation from the project root:
node scripts/dashboard.js
If the package scripts are available, this equivalent command is also valid:
npm run dashboard
The script is read-only. It renders a user-facing control-plane snapshot from
.planning/, telemetry, hook config, coordination state, worktrees, and cost
data. Use the manual collection protocol below only as a fallback if the script
is missing or fails in the current project.
Read the following sources. Each is optional — if a file or directory doesn't exist, treat it as empty. Never crash on missing state.
Campaigns:
.planning/campaigns/*.mdStatus: fieldDirection: field (truncate to 60 chars)Phase N of M or ## Phase headings)- [ from the Decision Logneeds-completion and show:
node scripts/campaign.js complete <slug> --archive.planning/campaigns/,
report needs-archive and show the same archive command.review-package Exit
Evidence row is still pending, missing, or points at a missing local package,
report a repair before campaign completion:
node scripts/package-delivery.js <slug>Cost Data (two sources, prefer real):
node scripts/session-tokens.js --today and --all — reads Claude Code's native session JSONL for exact token counts.planning/telemetry/session-costs.jsonl; cost priority real_cost > override_cost > estimated_cost; group by campaign_slug, sum cost/agents/minutes, compute grand total.planning/telemetry/cost-tracker-state.json for burn rateFleet Sessions:
.planning/fleet/session-*.mdstatus: fieldwave: or wave numberagents: or agent countRecent Telemetry:
.planning/telemetry/hook-timing.jsonl (if it exists).planning/telemetry/audit.jsonl (if it exists)ts (or timestamp), hook (or event), and a
short description field. Format as relative time.Recent Hook Activity (separate from general telemetry):
.planning/telemetry/hook-timing.jsonlevent: "timing" entries: extract hook, duration_ms, timestamp (relative), and outcome (pass if no matching error in hook-errors.jsonl within 1s; block if a block entry exists)event: "counter" entries: extract metric name as the "event" column with count contextHook Overhead (timing percentiles):
.planning/telemetry/hook-timing.jsonl (if it exists)duration_ms; group by hookRoutine Quota (account-wide 15 runs / 24h cap):
.planning/telemetry/routine-runs.jsonl (if it exists){"ts": "<ISO timestamp>", "kind": "RemoteTrigger" | "CronCreate" | "ScheduleWakeup"}ts inside the last 24 hours; compare against the cap of 15docs/ROUTINE-QUOTA.md)local-watch.js, local-daemon.js, local-schedule.js) never consume quota
and must not be counted.Pending Queues:
.planning/telemetry/doc-sync-queue.jsonl where status is pending or needs-review (or 0 if missing).planning/telemetry/merge-check-queue.jsonl (or 0 if missing).planning/intake/ (or 0 if missing)Hook Value Data (for HOOKS VALUE section):
.planning/telemetry/hook-errors.jsonl (if it exists, last 200 lines)
hook = "protect-files" (blocked file access)hook = "external-action-gate" (gated external actions)hook = "quality-gate" (quality violations).planning/telemetry/hook-timing.jsonl (if it exists, last 200 lines)
hook = "circuit-breaker" and metric = "trips".planning/telemetry/audit.jsonl (if it exists, last 200 lines)
Hook Problem Taxonomy:
.planning/telemetry/hook-errors.jsonl.protect-files blocks and hard external-action-gate blocks as
safety-block with info severity; they prove protection fired and do not
create a repair action by themselves.error and parse-fail actions as hook-failure with high
severity; these are actionable.blocked-restricted as restricted-scope-block with high
severity; this is actionable.first-encounter and consent-block from external-action-gate as
approval-needed with medium severity; this is actionable.external-action-gate approval entry has a later matching
tool-call entry in audit.jsonl, classify it as resolved-approval with
info severity; it should not create a repair action. Treat git push -u
and git push as equivalent for the same branch, and allow a small
near-simultaneous timestamp skew between hook and tool-call entries.stale-approval with low severity; it should not create a current
repair action.stale with low severity and do
not create a repair action from stale entries./telemetry repair action should appear only when actionable entries are
present. Safety blocks remain visible in PROBLEMS and HOOKS VALUE.Health:
.planning/telemetry/audit.jsonl written todayhooks array of .claude/hooks-template.json (or
.claude/hooks.json if template not present); use 0 if neither exists.claude/harness.json → trust object:
sessions_completed, campaigns_completed counterstrust.override is set, use that and note "(override)"Convert ISO timestamps: <60s → "just now" | <60min → "{N} min ago" | <24h → "{N} hr ago" | else → "{N} days ago". Display unparseable timestamps as-is.
Output verbatim, substituting real values. Always show section headers even when content is "(none active)".
=== Citadel Dashboard ===
As of: {relative timestamp of most recent event, or "now"}
NEXT ACTION
Command: {exact command}
Why: {why this is next}
Confidence: {low | medium | high}
Repair available: {yes | no}
Runbook: {docs or skill path}
REPAIR CONSOLE
{repair|review} | {confidence} | {label}
command: {exact command}
why: {short reason}
runbook: {docs or skill path}
CAMPAIGNS
{slug}: Phase {N}/{total} — {direction, max 60 chars, ellipsis if truncated}
Last event: {most recent telemetry entry for this campaign, or "no telemetry"}
(none active)
COSTS
This session: ${cost} | {duration} min | ${rate}/min | {messages} msgs | {agents} agents
Today: ${today_total} across {today_sessions} sessions
All time: ${all_time_total} across {all_time_sessions} sessions ({data_source})
By campaign:
{slug}: ${total_cost} across {sessions} sessions ({agents} agents, {minutes} min)
_unattached: ${total_cost} across {sessions} sessions
(no cost data recorded yet)
ROUTINE QUOTA
Runs (last 24h): {N}/15
WARNING: {N} of 15 routine runs used in the last 24h. Hitting the cap pauses every routine on the account. See docs/ROUTINE-QUOTA.md.
(remote-run logging populates .planning/telemetry/routine-runs.jsonl - local runners do not consume quota)
HOOKS VALUE
Circuit breaker: {N} trips (prevented token spirals)
Quality gate: {N} violations caught pre-commit
Protect-files: {N} blocks (path traversal, secrets)
External gate: {N} actions gated
Total hook fires today: {N}
(raw facts only -- no inflated savings claims)
FLEET SESSIONS
{slug}: Wave {N} — {agent count} agents — {status}
(none active)
RECENT ACTIVITY (last 10 events)
{relative time} | {hook/event name} | {description}
(no telemetry recorded yet)
HOOK ACTIVITY (last 10 hook fires)
{relative time} | {hook name} | {duration_ms}ms | {outcome: pass/block/warn}
(no hook timing recorded yet — set CITADEL_DEBUG=true in settings.json for verbose output)
HOOK OVERHEAD (sorted by p95 descending)
hook count p50 p95 max
{hook name} {N} {N}ms {N}ms {N}ms
(no hook timing data recorded yet)
PROBLEMS
Actionable: {N} | Safety blocks: {N} | Resolved approvals: {N} | Stale: {N}
{relative time} | {severity} | {category} | {hook name} | {description}
(none recorded)
PENDING
Doc sync: {N} items queued
Merge reviews: {N} items queued
Intake items: {N} in .planning/intake/
HEALTH
Circuit breaker trips this session: {N}
Audit entries today: {N}
Hooks installed: {N}
Operator tier: {novice | familiar | trusted} ({N} sessions, {N} campaigns)
QUICK COMMANDS
/do continue — resume active campaign
/do rollback — restore last checkpoint
/telemetry — cost breakdown, hook activity, telemetry settings
/triage prs — review open PRs
/pr-watch — watch PR CI
/learn — extract patterns from last completed campaign
.planning/ missing: All zeros, "(none active)"; add "Run /do setup --express to initialize."
harness.json missing or malformed: Show "not configured" for hooks count; do not crash.
Malformed campaign file: Skip it; note (N campaign file(s) skipped — malformed).
Large telemetry files: Read last 50 lines only.
Missing timestamps: Fall back to file modification time; display entry without timestamp if unavailable.
All campaigns completed: Note "No active campaigns" at top of CAMPAIGNS section.
Completed campaign still active: Show the exact node scripts/campaign.js complete <slug> --archive repair command; suggesting /do continue here is wrong because the campaign is already finished.
Campaign ready for review package: Show the exact node scripts/package-delivery.js <slug> repair command before showing campaign completion.
All fleet sessions idle: Note "No active fleet sessions" under FLEET SESSIONS.
routine-runs.jsonl missing or no runs in window: Show Runs (last 24h): 0/15 plus the one-line population hint; only show the WARNING line when more than 12 runs are counted.
Mixed state: Proceed with whatever state exists; note each missing directory inline.
Doc-sync backlog: Surface /learn --doc-sync as a repair action with skills/learn/SKILL.md as runbook.
Dirty worktree: Surface git status --short as a review action; do not suggest destructive cleanup.
Only safety blocks recorded: Show them in PROBLEMS and HOOKS VALUE, but do not surface /telemetry as NEXT ACTION.
Actionable hook problem recorded: Surface /telemetry as repair action with skills/telemetry/SKILL.md as runbook.
Disclosure: "Displaying harness dashboard. No files modified." Reversibility: green — read-only; no files modified Trust gates:
/dashboard does not produce a HANDOFF block. It is a read-only observability tool. After displaying the dashboard, wait for the next user command.
tools
Bounded foreground repetition for the current session. Creates a loop contract, runs or coordinates an action plus verifier up to a declared attempt limit, and records evidence under .planning/loops/. Use for repeat-until-pass work that is too small for daemon and not time-based scheduling.
testing
Remove Citadel from a project. Exports valuable state (campaigns, postmortems, research, backlog, discoveries) to docs/citadel/ as human-readable markdown, then removes all harness files and hooks. The archive is detected by /do setup on re-install and offered for restore.
development
Research-driven multi-cycle improvement director. Forms causal hypotheses about why scores are low, validates them with scout agents before attacking, dispatches axis-parallel fleet attacks, extracts transferable patterns, and runs indefinitely within a budget envelope. Accumulates a persistent belief model and pattern library across sessions.
data-ai
Multi-repo campaign coordinator. Same lifecycle as fleet -- scope claims, discovery relay, wave-based execution -- but the unit of work is a repo, not a file. Coordinates campaigns across repositories with shared context.