plugins/minister/skills/dora-metrics/SKILL.md
Computes DORA delivery-performance metrics from git and GitHub API. Use when assessing deployment frequency, lead time, or change failure rate.
npx skillsauth add athola/claude-night-market dora-metricsInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Compute the four DORA delivery-performance metrics (Deployment Frequency, Lead Time for Changes, Change Failure Rate, and Time to Restore Service) from local git history and the GitHub API. Classify each metric into Elite, High, Medium, or Low using thresholds from DORA's State of DevOps research, and surface the single weakest dimension as the next improvement target.
minister:release-health-gates.Run the helper script with the desired window:
python3 -m minister.dora_metrics --window 30 --branch main
Read the output: per-metric value, tier classification, and the bottleneck pointer.
For agentic-workflow audits, run the same window twice. Once
filtering to AI-authored PRs (e.g., --failure-label ai-bug),
once across all PRs. Compare the CFR delta. See
modules/agentic-workflow-signals.md.
Optionally pipe --json into the tracker so trend data persists
alongside release-health-gates snapshots.
Optionally render trend charts with kuva when reviewing multiple windows or comparing before/after an agentic-workflow change:
# Collect weekly snapshots into a TSV, then plot all four metrics
# week<TAB>metric<TAB>value
kuva line trends.tsv --x week --y value --color-by metric \
--title "DORA trends (30-day windows)" -o dora-trends.svg
# Quick terminal preview without writing a file
kuva line trends.tsv --x week --y value --color-by metric --terminal
kuva reads TSV/CSV from stdin or a file path. Install once:
cargo install kuva --features cli. No project source changes
required. See kuva for the
full plot-type reference.
| Flag | Default | Meaning |
|------|---------|---------|
| --window | 30 | Measurement window in days |
| --branch | HEAD | Production branch |
| --failure-label | bug | GitHub label marking prod failures |
| --json | off | Emit JSON instead of human-readable |
| --repo-path | cwd | Repository directory |
A short text report or JSON payload with:
4.2/day, 2.1 hours, 8%).See modules/thresholds.md for the complete table. Brief summary:
| Metric | Elite | High | Medium | Low | |--------|-------|------|--------|-----| | DF | >= 1/day | >= 1/week | >= 1/month | < 1/month | | LT | <= 1 day | <= 1 week | <= 1 month | > 1 month | | CFR | <= 15% | <= 30% | <= 45% | > 45% | | TRS | < 1 hour | < 1 day | < 1 week | >= 1 week |
Confirm a DORA report is real by re-running the script over a
narrower window and checking that DF and LT scale predictably. For
CFR and TRS, sample two or three of the contributing GitHub issues
and verify the bug (or chosen) label is correct on each.
Unit tests live in
plugins/minister/tests/unit/test_dora_metrics.py. Each tier
boundary is exercised at the threshold, so future contributors who
adjust an inequality (> vs >=) trigger a failure rather than a
silent regression. Add new tests at the threshold when extending
classification logic.
tools
Detect friction signals; graduate patterns into rules. Use for session retrospectives.
testing
Use when you need a diff-derived test plan for an MR — reads the diff, groups changes by area, runs targeted verifications, and proves revert-tests are genuine guards, not dead assertions.
development
Curate the web-capture index. Use when the capture backlog grows, captures sit unprocessed at seedling/pending, or to surface stored research during work.
testing
Probe memory/summary clarity via dual anchor questions: task progress, info gaps. Use when verifying session state or summary before handoff or compression.