skills/by-role/qa/qa-metrics/SKILL.md
Build QA quality dashboards, calculate defect metrics, analyze escape rates, and produce quality trend reports. Use this skill whenever a QA lead or manager needs to measure and communicate quality outcomes. Trigger on phrases like "quality metrics", "defect density", "escape rate", "bug trends", "quality dashboard", "sprint quality report", "test coverage metrics", "QA KPIs", "how is quality trending", "defect leakage", "production bugs this quarter", "mean time to detect", "quality scorecard", or any request to quantify, visualize, or report on the quality of a product or testing process. Also trigger when a manager asks "how is QA performing?" or an engineering lead asks "what does our defect data tell us?".
npx skillsauth add qa-aman/claude-skills qa-metricsInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Based on "Accelerate" by Nicole Forsgren, Jez Humble & Gene Kim. The DORA research (DevOps Research and Assessment) identified four metrics that predict both software delivery performance and organizational outcomes. High-performing teams excel on all four. QA directly influences two of them - change failure rate and mean time to restore - and enables the other two through fast, reliable test automation.
Measure, analyze, and communicate quality outcomes through data.
The four metrics that differentiate elite from low-performing engineering teams:
| Metric | What it measures | Elite benchmark | QA influence | |--------|-----------------|-----------------|-------------| | Deployment Frequency | How often code ships to production | On-demand (multiple/day) | Fast, reliable automation enables frequent deploys | | Lead Time for Changes | Commit to production time | < 1 hour | Slow QA cycles lengthen lead time | | Change Failure Rate | % of releases causing production incidents | 0-15% | Test coverage directly reduces this | | Mean Time to Restore (MTTR) | Time to recover from production failure | < 1 hour | Good observability + runbooks reduce this |
QA's accountability: Own change failure rate and contribute to lead time. If change failure rate is above 15%, testing strategy needs a root cause review. If QA is a bottleneck in lead time, automate or parallelize.
Defect Density Number of defects per unit of functionality.
Defect Escape Rate (Leakage) Percentage of defects found by customers (production) vs QA (pre-release).
Defect Detection Efficiency (DDE) How early defects are caught in the development lifecycle.
Defect Aging Average time from defect open to resolution.
Test Pass Rate Percentage of test cases that passed in a given cycle.
Test Coverage Percentage of requirements or features covered by test cases.
Automation Coverage Percentage of regression cases automated.
Test Execution Velocity Test cases executed per day or per sprint.
Production Defect Rate Defects per release found in production within 30 days of ship.
Mean Time to Detect (MTTD) Average time between a defect being introduced and being found.
Regression Rate Percentage of releases where a previously passing feature breaks.
Not every metric is useful for every context. Match metrics to questions:
| Question | Metrics to Use | |----------|---------------| | Is quality improving over time? | Defect density trend, escape rate trend | | Is QA catching bugs early enough? | DDE, MTTD, defect by phase | | Are we shipping stable releases? | Regression rate, production defect rate | | Is automation keeping up? | Automation coverage, execution velocity | | Where are the quality hotspots? | Defect density by module/team |
Define your data sources:
For a sprint report, calculate at minimum:
For a quarterly report, add:
Sprint Quality Report template:
Sprint [n] Quality Report - [dates]
Team: [team name]
QA Lead: [name]
Test Execution:
Cases executed: [n]
Pass rate: [x]%
Automation coverage: [x]%
Defects:
Opened: [n] (P1: [n] | P2: [n] | P3: [n])
Closed: [n]
Escaped to production: [n]
Escape rate: [x]%
Release Quality:
Production defects (this release): [n]
Regression rate: PASS / FAIL
Sign-off: GO / NO-GO
Hotspots:
[area with disproportionate defects and why]
Trend:
[1-2 sentences: is quality improving, stable, or degrading?]
Actions:
[specific action, owner, due date]
Quarterly Quality Dashboard sections:
Metrics without action are reporting theater. For every negative trend:
Vanity metrics "We ran 500 test cases" is not a quality metric. It is a volume metric. Pass rate, escape rate, and defect density are quality metrics. Distinguish between activity (what QA did) and outcomes (what quality resulted).
Reporting in isolation A 95% pass rate means nothing without context. Is that up or down from last sprint? Is the test suite getting weaker or stronger? Always show trend, not just point-in-time.
No action items A quality report with no action items is a postcard. If metrics show a problem, someone must own fixing it. If metrics show everything is fine, note what is working and why.
Measuring only pre-release defects Escape rate requires production data. QA teams that only track what they found - not what customers found - are grading their own homework. Include production defect rate in every quarterly review.
Automation coverage without flaky test tracking An automation suite that is 80% automated but 30% flaky is worse than 60% automated with stable tests. Track flaky test rate alongside automation coverage. Flaky tests erode confidence faster than low coverage.
Before publishing any quality report:
development
Plan a webinar end-to-end using April Dunford's Obviously Awesome positioning framework to find the topic angle that makes the webinar obviously valuable to the right audience. Produces topic positioning, abstract, speaker brief, registration page, promotion sequence, day-of run-of-show, and post-webinar follow-up. Use when the user asks to plan a webinar, virtual event, online workshop, "we need a webinar on X", host a webinar, online masterclass, or any live virtual event with promotion and follow-up. Reads ICP, services, and brand voice from knowledge/.
development
Write long-form thought leadership articles, opinion pieces, industry POV essays, and CEO/founder bylines using the Made to Stick SUCCESs framework (Chip and Dan Heath). Use when the user asks for a long-form article, executive byline, opinion piece, industry POV, manifesto, "explain our point of view on X", or wants to publish an authority-building piece (1200-2500 words). Reads brand voice and positioning from knowledge/.
development
Plan a monthly content calendar across channels using the Content Marketing Matrix (Dave Chaffey, Smart Insights) - Entertain/Inspire/Educate/Convince. Every post gets a quadrant label. The monthly calendar must hit 40% Educate, 40% Inspire+Convince, 20% Entertain. Produces a week-by-week posting schedule with topics, formats, channels, and asset links. Use when the user says "content calendar", "social calendar", "plan next month's content", "what should we post", "content plan", "editorial calendar", "schedule posts for the month", or wants a structured posting plan for LinkedIn, Twitter, email, or blog. Reads brand voice, ICP, and past learnings from knowledge/.
development
Write SEO-optimized long-form articles targeting specific keywords using the They Ask You Answer Big 5 framework (Marcus Sheridan). Articles are categorized by Big 5 type (Cost, Problems, Versus, Best/Reviews, How-To) and structured accordingly. The "answer first" rule applies to every article. Use when the user asks for an SEO article, blog post for ranking, "rank for keyword X", organic content, search-optimized post, pillar page, or content for organic traffic. Includes keyword targeting, search intent matching, internal linking suggestions, and meta tags.