skills/sprint-velocity-analysis/SKILL.md
Analyze sprint velocity data and produce an engineering team health report covering delivery trends, capacity utilization, and improvement recommendations. Use when asked to analyze sprint velocity, review team delivery health, identify delivery risks, or produce a retrospective data analysis. Produces a velocity trend analysis, health diagnosis table, top improvement recommendations with implementation steps, and a next-sprint capacity forecast.
npx skillsauth add mohitagw15856/pm-claude-skills sprint-velocity-analysisInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Analyze sprint velocity data to produce an honest engineering team health report. The goal is not to generate optimistic-looking charts — it is to surface delivery patterns, identify dysfunction early, and give the team and their manager actionable recommendations. Look for: velocity trends (improving, declining, flat, erratic), story point calibration consistency, carry-over patterns that indicate chronic over-commitment, and capacity-related signals. Produce text-based trend visualizations, a health diagnosis, and specific improvement recommendations with measurable targets.
Ask for these if not already provided:
If cycle time data is not provided, omit that section and note it as a recommended data source to add.
Analysis period: Sprint [N] through Sprint [N+7] ([Date range]) Team size: [X engineers] ([note any changes during period]) Report date: [Date] Data source: [Where this data came from — Jira, Linear, spreadsheet, etc.]
| Sprint | Committed | Completed | Completion Rate | Carried Over | Notes | |--------|-----------|-----------|----------------|--------------|-------| | [Sprint N] | [X pts] | [X pts] | [X%] | [X pts / X items] | [disruption or context] | | [Sprint N+1] | [X pts] | [X pts] | [X%] | [X pts / X items] | | | [Sprint N+2] | [X pts] | [X pts] | [X%] | [X pts / X items] | | | [Sprint N+3] | [X pts] | [X pts] | [X%] | [X pts / X items] | | | [Sprint N+4] | [X pts] | [X pts] | [X%] | [X pts / X items] | | | [Sprint N+5] | [X pts] | [X pts] | [X%] | [X pts / X items] | | | [Sprint N+6] | [X pts] | [X pts] | [X%] | [X pts / X items] | | | [Sprint N+7] | [X pts] | [X pts] | [X%] | [X pts / X items] | | | Average | [X pts] | [X pts] | [X%] | [X pts] | |
Points
60 |
55 | ●
50 | ● ●
45 | ● ● ●
40 | ● ●
35 |
30 |
+--+--+--+--+--+--+--+--
N N+1 N+2 N+3 N+4 N+5 N+6 N+7
Sprint
● = Completed points — = Average ([X pts])
Generate this chart using ASCII characters based on the actual data provided. Scale the Y-axis to the data range. Plot completed (not committed) points. Mark the average as a dashed line.
| Metric | Value | Interpretation | |--------|-------|----------------| | Average velocity | [X pts/sprint] | [Baseline for planning] | | Velocity std deviation | [±X pts] | [Low < 15% of avg = stable; High > 25% = erratic] | | Trend direction | [Improving / Flat / Declining / Erratic] | [3-sprint trailing average vs. 3-sprint leading average] | | Average completion rate | [X%] | [Healthy: 80–95%; < 75% = chronic over-commitment] | | Carry-over rate | [X% of committed points carried over per sprint] | [Healthy: < 15%; > 25% = systemic issue] | | Sprints with completion rate < 75% | [X of 8 sprints] | [> 3 of 8 = structural problem, not noise] |
Story points are only useful if they are applied consistently. Look for these calibration signals in the data:
| Signal | Observed | Interpretation | |--------|----------|----------------| | High variance in velocity despite stable team size | [Yes / No] | Suggests inconsistent estimation — same effort scored differently week to week | | Consistent over-commitment (committed >> completed) | [Yes / No — by avg X pts per sprint] | Team is sandbagging estimates or ignoring historical capacity | | Consistent under-commitment (completed >> committed by > 20%) | [Yes / No] | Team is over-padding estimates or pulling in unplanned work frequently | | Frequent large items (> 13 pts) in carry-over | [Yes / No] | Items are too large to estimate reliably — need better decomposition | | Velocity cliff after team change | [Yes / No — Sprint N+X] | Team did not re-baseline capacity after composition changed |
Calibration verdict: [Well-calibrated / Needs recalibration / Severely uncalibrated — one sentence explanation tied to the signals above]
If recalibration is needed: [Specific recommendation — e.g., "Run a calibration session using the last 20 completed items, re-score them as a team, and use the resulting relative sizes to anchor future estimates."]
Carry-over is the most reliable leading indicator of commitment reliability problems.
| Sprint | Carried-Over Items | Common Themes in Carry-Over | |--------|-------------------|----------------------------| | [Sprint N] | [X items / X pts] | [Technical debt, dependency blocked, scoped wrong, etc.] | | [Sprint N+1] | [X items / X pts] | [Theme] | | [Sprint N+2] | [X items / X pts] | [Theme] |
Carry-over root causes identified:
| Sprint | Team Size | Available Capacity (pts) | Committed | Utilization % | Disruptions | |--------|-----------|--------------------------|-----------|--------------|-------------| | [Sprint N] | [X engineers] | [X pts] | [X pts] | [X%] | [Holiday / incident / none] | | [Sprint N+1] | [X engineers] | [X pts] | [X pts] | [X%] | |
Capacity calculation used: [X engineers × Y pts/person/sprint = Z pts available. Adjust: if team capacity changed during the window, note which sprints used which team size.]
Average utilization: [X%] Utilization interpretation: [< 70% = team is under-loaded or over-padding | 70–90% = healthy range | > 90% = no slack for unplanned work — fragile]
| Dimension | Score | Evidence | Priority | |-----------|-------|----------|----------| | Delivery predictability | [Green / Yellow / Red] | [Average completion rate X%, std dev Y pts] | [High / Med / Low] | | Commitment accuracy | [Green / Yellow / Red] | [Team over-commits by avg X pts/sprint] | | | Estimation consistency | [Green / Yellow / Red] | [Velocity std dev ±X pts, calibration verdict] | | | Carry-over hygiene | [Green / Yellow / Red] | [X% carry-over rate, root causes] | | | Capacity management | [Green / Yellow / Red] | [Avg utilization X%, disruption handling] | | | Trend direction | [Green / Yellow / Red] | [Trailing 3-sprint avg vs. leading 3-sprint avg] | |
Scoring guide: Green = operating within healthy range; Yellow = marginal — watch closely or single-sprint anomaly; Red = chronic issue requiring active intervention.
Overall health: [Green / Yellow / Red] — [One sentence summary: "The team delivers consistently at X pts/sprint but chronic over-commitment is eroding morale and creating a misleading picture for stakeholders."]
If blocker data was provided, complete this section. If not, note it as a recommended tracking addition.
| Blocker Category | Frequency (last 8 sprints) | Avg Days Blocked | Impact (pts delayed) | |-----------------|--------------------------|------------------|---------------------| | External dependency | [X occurrences] | [X days] | [X pts] | | Technical debt / rework | [X occurrences] | [X days] | [X pts] | | Unclear requirements | [X occurrences] | [X days] | [X pts] | | On-call interruptions | [X occurrences] | [X days] | [X pts] | | Environment / tooling | [X occurrences] | [X days] | [X pts] |
Top blocker to address: [Name the single highest-impact blocker category and what addressing it would mean for velocity.]
Provide 3 specific recommendations ordered by expected impact. Each recommendation must include a measurable success target and implementation steps.
Problem it addresses: [Which health dimension is Red or Yellow, and what the data shows]
What to do:
Who owns it: [Tech lead / Engineering manager / Whole team] When to start: [This sprint / Next sprint / Within 2 weeks]
Measurable target: [e.g., "Carry-over rate drops below 15% within 3 sprints" or "Completion rate above 80% for 4 consecutive sprints"]
How to know it's working: [Leading indicator to watch before the outcome metric improves — e.g., "Carry-over items decreasing sprint-over-sprint even before the target is hit"]
Problem it addresses: [Health dimension and evidence]
What to do:
Who owns it: [Role] When to start: [Timing]
Measurable target: [Specific metric and timeframe]
How to know it's working: [Leading indicator]
Problem it addresses: [Health dimension and evidence]
What to do:
Who owns it: [Role] When to start: [Timing]
Measurable target: [Specific metric and timeframe]
How to know it's working: [Leading indicator]
Next sprint: [Sprint N+8] Known team size: [X engineers] Known capacity reducers: [PTO: X days total, on-call rotation: ~Y pts of unplanned capacity, etc.]
| Factor | Impact | |--------|--------| | Base capacity (historical average) | [X pts] | | PTO / planned absences | −[X pts] | | On-call overhead (estimate) | −[X pts] | | Carry-over from Sprint [N+7] | +[X pts committed capacity already spoken for] | | Recommended commitment ceiling | [X pts] |
Confidence: [High — stable team and known capacity | Medium — some uncertainty in disruption level | Low — team composition uncertain]
Recommendation for planning: [One sentence — e.g., "Plan to Sprint [N+8] ceiling of X pts. Given the carry-over items, prioritize completing those before pulling in new scope."]
| Sprint | p50 Cycle Time | p90 Cycle Time | Items Completed | |--------|---------------|---------------|-----------------| | [Sprint N] | [X days] | [X days] | [X items] | | [Average] | [X days] | [X days] | |
Cycle time interpretation: [p90 > 2× p50 indicates a long-tail of stuck items that deserve investigation. p50 increasing over time indicates slowing throughput independent of story point changes.]
If cycle time data was not provided: Cycle time data was not included in this analysis. Recommend adding p50 and p90 cycle time per sprint to your tracking to detect throughput issues that story points alone cannot reveal.
development
Build a framework for creating shareable, high-reach social media content. Use when asked to plan viral content, develop a shareable content strategy, create a hook writing system, or build a repeatable process for content that gets shared. Produces a platform-specific viral content framework with hook formulas, content structures, shareability triggers, and a content testing system.
development
Generate article or newsletter thumbnail candidates using the Gemini API from inside Claude Code. Claude reads article copy, proposes composition concepts, writes image generation prompts incorporating brand specs, calls Gemini to generate the images, evaluates the results via computer vision, and returns ranked candidates with rationale. Use when asked to create thumbnails, generate cover images, or produce visual candidates for an article or newsletter.
testing
Flips Claude's default from "find reasons you're right" to "find reasons you're wrong." A genuine thinking partner, not a mirror with grammar. Use before high-stakes decisions, plans, assumptions, or pitches you haven't stress-tested.
development
Scrapes a Substack Notes page and exports engagement data (likes, comments, restacks) to a formatted .xlsx file with conditional formatting and summary stats.