plugins/axiom-sdlc-engineering/skills/quantitative-management/SKILL.md
Use when establishing measurement programs, analyzing metrics with statistical process control, setting baselines, or implementing CMMI Level 4 quantitative management - prevents vanity metrics and measurement theater
npx skillsauth add tachyon-beep/skillpacks quantitative-managementInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Implements CMMI MA (Measurement & Analysis), QPM (Quantitative Project Management), and OPP (Organizational Process Performance) through data-driven decision making, statistical process control, and predictive analytics.
Core Principle: Measure what matters, use data to drive decisions, distinguish signal from noise.
Avoid: Measurement theater (tracking without action), vanity metrics (looks good but no value), gaming metrics (optimizing numbers vs processes).
Triggers:
Use this when:
Do NOT use for:
platform-integration skillgovernance-and-risk skill| You Want To... | Reference Sheet | Key Content | |----------------|-----------------|-------------| | Plan what to measure | measurement-planning.md | GQM methodology, cost/value analysis, anti-patterns | | Choose metrics | key-metrics-by-domain.md | Quality, velocity, stability, deployment metrics | | Implement DORA | dora-metrics.md | 4 DORA metrics, collection automation, baselines | | Analyze variation | statistical-analysis.md | Control charts, SPC, trend detection | | Establish baselines | process-baselines.md | Historical data analysis, baseline maintenance | | Make data-driven decisions | quantitative-management.md | QPM, prediction models, Level 4 practices | | Scale L2→L3→L4 | level-scaling.md | Maturity progression, requirements by level |
Focus: Capture data for visibility
Measurements:
Tools: Spreadsheets, basic dashboards, manual collection acceptable
Example: Track number of bugs found per sprint, average time to close bugs
Limitation: No statistical analysis, no baselines, reactive only
Focus: Establish norms, detect trends
Measurements:
Tools: Analytics platforms, automated collection, historical databases
Example: Know that your organization typically delivers 30 story points/sprint ± 10, and current project is trending 25 (slightly below average)
Advancement over L2: Baselines provide context, trends show direction
Focus: Predict outcomes, control variation
Measurements:
Tools: Statistical packages (R, Python), control chart software, Monte Carlo simulation
Example: Use control charts to detect when defect rate exceeds upper control limit (special cause), trigger root cause analysis. Use regression model to predict project completion date with 90% confidence interval.
Advancement over L3: Predictive, not just reactive; distinguishes noise from signal
| Anti-Pattern | Symptom | Better Approach | |--------------|---------|-----------------| | Measurement Theater | Tracking many metrics, no action taken | Use GQM to link metrics to decisions | | Vanity Metrics | Numbers look impressive but don't drive improvement | Focus on actionable metrics with clear business value | | Gaming Metrics | Teams optimize numbers instead of processes | Measure outcomes, not activities; use multiple metrics | | Lagging-Only | Only measure results after the fact | Balance lagging with leading indicators | | Analysis Paralysis | Too many metrics, can't make decisions | Focus on critical few (3-5 key metrics) | | Flying Blind | No metrics, decisions based on opinion | Start with Level 2 basics, build up | | Over-Precision | Measuring to 3 decimal places when ±20% is noise | Match precision to decision granularity |
REQM (Requirements Management):
CM (Configuration Management):
VER/VAL (Verification/Validation):
RSKM (Risk Management):
DAR (Decision Analysis & Resolution):
Cross-references:
requirements-lifecycle - Metrics for requirement qualitydesign-and-build - Metrics for development processgovernance-and-risk - Risk and compliance metricsplatform-integration - Metric collection automationGoal: Start tracking basic metrics for visibility
./measurement-planning.md - Learn GQM methodologyExample metrics for first program:
Goal: Industry-standard DevOps metrics
./dora-metrics.md - Understand the 4 metrics./measurement-planning.md - Ensure DORA aligns with goalsGoal: Detect process instability automatically
./statistical-analysis.md - Learn control chartsExample: Defect escape rate control chart with UCL=15%, LCL=2%, current value 18% → investigate
Goal: Use quantitative data for project decisions
./quantitative-management.md - Learn QPM practices./process-baselines.md - Understand baseline usageQ: How many metrics should I track? A: Level 2: 3-5 basic metrics. Level 3: 5-10 organizational baselines. Level 4: 3-5 with statistical control. Focus beats breadth.
Q: How long to establish a baseline? A: Minimum 4 weeks for initial baseline. Prefer 12 weeks (1 quarter) for statistical validity. Update quarterly or when process changes.
Q: What's the difference between MA and QPM? A: MA (Level 2+): Measurement & Analysis - defining, collecting, analyzing metrics. QPM (Level 4): Using statistical process control and prediction models to manage projects quantitatively.
Q: Do I need Level 3 before Level 4? A: Yes. Level 4 requires organizational baselines (Level 3). Can't do statistical process control without knowing what "normal" looks like.
Q: Leading vs lagging indicators - what's the difference?
A: Lagging: Measure results after the fact (defects found, deployment time). Leading: Predict future outcomes (code review coverage, test coverage). You need both. See ./key-metrics-by-domain.md for examples.
Q: How do I avoid measurement theater?
A: Use GQM methodology (see ./measurement-planning.md). Every metric must answer a question that drives a decision. If you can't explain the decision, don't track the metric.
Q: When should I use control charts?
A: When you have 20+ data points and want to distinguish normal variation from abnormal (special cause). See ./statistical-analysis.md for guidance.
../requirements-lifecycle/SKILL.md - Requirements management metrics../design-and-build/SKILL.md - Development process metrics../governance-and-risk/SKILL.md - Risk and compliance metrics../platform-integration/SKILL.md - Metric collection automation in GitHub/Azure DevOpsLast Updated: 2026-01-25
tools
Use when designing, implementing, or auditing an MCP (Model Context Protocol) server — tool API design, idempotency under agent retry, structured error envelopes agents can recover from, schema versioning across model drift, transport reliability (stdio / HTTP), output-shape and pagination discipline, and choosing between tools / resources / prompts / sampling. Also use when an MCP server's tools confuse agents, return unstructured errors, deadlock under concurrent calls, double-execute under retry, or lose state across reconnects. Do not use for general REST/GraphQL API design (use `/web-backend`), for client-side prompt engineering or tool-loop design (use `/llm-specialist`), for general in-process plugin architecture (use `/system-architect`), or for cryptographic-provenance audit trails (use `/audit-pipelines`).
development
Use when running **SQLite or DuckDB inside an application process** as the durable store — not as a development convenience but as the production database. Use when scaling an SQLite layer that worked at low concurrency and is now hitting SQLITE_BUSY, WAL bloat, lock contention, schema-migration ceremony, or correctness gaps under multi-process writers. Use when introducing DuckDB as an OLAP complement to an OLTP SQLite store, or when picking between the two for a new component. Pairs with `/web-backend` (the API surface above the DB) and `/audit-pipelines` (when the DB is also the audit trail). Do not load for server databases (Postgres, MySQL), key-value stores, or ORM choice in isolation.
development
Use when designing or critiquing the structure of a staged procedure — a wizard, configuration flow, troubleshooting tree, training curriculum, multi-stage approval pipeline, decision pipeline, or any decomposition of expert work into composable stages. Use for both producer work (build the decomposition) and critic work (audit a proposed decomposition). Use when reasoning about capacity, bottlenecks, or soundness of a procedural flow. Do not use for implementation-plan critique of code changes (use `/axiom-planning` instead), for execution-time dynamics (use `/simulation-foundations`), or for rendering an already-designed procedure as docs or UI (use `/technical-writer` or `/ux-designer`).
testing
Use when the user wants to draft fiction or creative nonfiction prose, get craft critique on prose they have written, or plan story structure, outline, or premise. Workshop-voiced. Three explicit modes (draft, critique, plan) and the router will refuse to begin work without a declared mode.