.claude/skills/feature-results/SKILL.md
Post-launch analysis and results documentation. Document what shipped and what we learned.
npx skillsauth add pisithrps/yapzee feature-resultsInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Document feature results and capture learnings.
/feature-results
Then provide:
I'll generate a results doc with: executive summary, results vs targets, root cause analysis (if missed), segment breakdown, estimate calibration, stakeholder communication guidance, and concrete next steps.
Output: Saved to outputs/analyses/feature-results-[name]-[date].md
Time: ~10 min with data ready, ~20 min if we need to dig
Before generating a results analysis, verify the feature has actually shipped.
Check for launch signals:
context-library/prds/ or outputs/prds/ show status "Shipped" or "Launched"?If the feature hasn't launched yet:
It looks like [feature] hasn't shipped yet. Post-launch analysis works best with real data.
For pre-launch work, you might want:
- `/feature-metrics` - Define what you'll measure before launch
- `/impact-sizing` - Estimate expected impact
- `/experiment-decision` - Plan your A/B test or rollout
- `/launch-checklist` - Make sure you're ready to ship
Want to proceed anyway (e.g., analyzing a soft launch or pilot), or switch to one of these?
When PM types /feature-results, respond:
Let's document the results of your feature launch.
Tell me:
1. What feature shipped?
2. What were the original goals/metrics?
3. What actually happened?
I'll help you create a results doc that captures outcomes and learnings.
# Feature Results: [Feature Name]
**Ship Date:** [Date]
**Analysis Date:** [Date]
**Owner:** [PM Name]
---
## Executive Summary
[1-2 sentences: What shipped, key result, recommendation]
**Verdict:** [Ship to 100% / Iterate / Kill]
---
## Original Hypothesis
**If we** [built X],
**then** [Y would happen],
**because** [Z assumption].
**Target:** [Primary metric] would improve by [X%]
---
## Results
### Primary Metric
| Metric | Baseline | Target | Actual | Verdict |
|--------|----------|--------|--------|---------|
| [Name] | [X] | [Y] | [Z] | [Hit/Miss] |
### Statistical Significance
- Sample size: [N]
- Confidence level: [X%]
- P-value: [X]
### Guardrail Metrics
| Metric | Acceptable Range | Actual | Status |
|--------|------------------|--------|--------|
| [Metric] | [range] | [actual] | [OK/Warning] |
### Unexpected Findings
- [Finding 1]
- [Finding 2]
---
## Analysis
### Why These Results?
[Explain the underlying reasons for the results]
### What Surprised Us?
[Things we didn't expect]
### Segment Breakdown
| Segment | Result | Notes |
|---------|--------|-------|
| [Segment 1] | [+/-X%] | [insight] |
| [Segment 2] | [+/-X%] | [insight] |
---
## Learnings
### What Worked
- [Learning 1]
- [Learning 2]
### What Didn't Work
- [Learning 1]
- [Learning 2]
### Apply Elsewhere
- [Insight that applies to other features]
---
## Next Steps
**Recommendation:** [Ship / Iterate / Kill]
**If shipping:**
- [ ] Roll out to 100%
- [ ] Monitor for [X] weeks
- [ ] Update documentation
**If iterating:**
- [ ] [Specific change 1]
- [ ] [Specific change 2]
- [ ] Re-run experiment
**If killing:**
- [ ] Document why
- [ ] Rollback plan
- [ ] Communicate to stakeholders
---
## Appendix
- [Link to experiment dashboard]
- [Link to original PRD]
- [Link to detailed data]
When results miss targets, diagnose using this framework:
| Metric | Original Estimate | Actual Result | Accuracy | |--------|------------------|---------------|----------| | [Primary metric] | [From impact-sizing] | [Actual] | [Over/Under by X%] | | [Secondary metric] | [From impact-sizing] | [Actual] | [Over/Under by X%] | | [Guardrail metric] | [Expected range] | [Actual] | [Within/Outside range] |
Calibration insight: [Are we consistently over/under-estimating? By how much? What should we adjust for next time?]
Historical calibration pattern:
Always analyze results across these default segments. Segment analysis often reveals that an overall miss is actually a win for one segment and a miss for another.
| Segment Dimension | Why It Matters | |-------------------|---------------| | New vs existing users | New users may not find the feature; existing users may resist change | | Plan tier (Free/Team/Business/Enterprise) | Feature value often varies dramatically by tier | | Company size | Small teams adopt differently than large orgs | | Power users vs casual users | Power users may love it; casual users may be confused | | Geographic region (if applicable) | Cultural, language, and connectivity differences |
Segment analysis template:
| Segment | N (sample) | Primary Metric | vs Target | Insight | |---------|-----------|---------------|-----------|---------| | New users | [N] | [result] | [+/-X%] | [why] | | Existing users | [N] | [result] | [+/-X%] | [why] | | Free tier | [N] | [result] | [+/-X%] | [why] | | Paid tier | [N] | [result] | [+/-X%] | [why] | | Power users | [N] | [result] | [+/-X%] | [why] | | Casual users | [N] | [result] | [+/-X%] | [why] |
If results beat targets: Lead with the win, credit the team, connect to strategy. Share in #product-team, include in weekly status update, highlight in next board deck.
If results miss targets: Lead with what you learned, not what went wrong. Frame as: "Here's what we hypothesized, here's what actually happened, here's what we're doing about it." Never hide bad results -- own them fast.
If results are ambiguous: Be honest about uncertainty. "We saw X but can't conclusively attribute it to Y because Z. Here's our plan to get clearer signal."
| Audience | Lead With | Include | Avoid | |----------|-----------|---------|-------| | CEO/Board | Impact on key business metrics + strategic implications | Revenue impact, user growth, competitive position | Technical details, minor metrics | | CPO/Manager | Learnings + next steps + resource ask | What worked, what didn't, proposed plan | Blame, excuses | | Engineering team | What worked technically + what to improve | Performance data, technical wins, tech debt notes | Business jargon | | Sales/CS | Customer-facing impact + talking points | User quotes, feature highlights, objection handlers | Internal debates, negative framing | | Design | User behavior changes + UX insights | Usage patterns, qualitative feedback, heatmaps | Pure metric tables without context |
Scenario: Feature shipped, primary metric exceeded target by 20%. Guardrails held.
Root cause: N/A (success)
Action: Scale to 100%, plan v2 based on segment insights, draft announcement via /slack-message, update /status-update with win.
Communication: Lead with the number, credit the team, connect to quarterly goals.
Scenario: Primary metric hit target but guardrail metric degraded (e.g., page load time increased 15%).
Root cause: Quality problem -- feature added weight to core flow.
Action: Investigate quality regression, hold at current rollout %, fix guardrail before scaling. Create follow-up ticket via /create-tickets.
Communication: "Feature is working as intended for users, but we identified a performance regression we need to fix before full rollout."
Scenario: Primary metric at 60% of target after 4 weeks. Root cause: Discovery problem -- only 12% of eligible users found the feature. Among those who found it, conversion was above target. Action: Don't kill the feature -- improve discovery (add tooltip, email campaign, onboarding mention), re-measure in 4 weeks. Communication: "The feature works for users who find it, but we have a distribution problem. Plan: improve discovery and re-measure."
Scenario: Primary metric at 30% of target after 6 weeks. Root cause is relevance -- users tried it but it doesn't solve their actual problem.
Root cause: Relevance problem -- hypothesis was wrong.
Action: Document the learning, roll back, redirect engineering effort. Draft decision doc via /decision-doc.
Communication: "We tested our hypothesis that X would solve Y. Data shows it doesn't. Here's what we learned and where we're redirecting effort."
When the PM uses /feature-results, I automatically:
Source: context-library/prds/, conversation history
Source: Amplitude, Mixpanel, Posthog, Pendo (if connected)
Source: context-library/metrics/, past feature results
Source: context-library/metrics/, PRD success metrics section
Source: Cross-reference with other feature results
Routing logic:
/slack-message/experiment-decision/decision-doc explaining sunset/status-update with business impactBefore delivering the feature results doc, verify:
development
Product strategy docs using 7-component framework
tools
Review week's progress, meetings, learnings
tools
Set next week's priorities
development
Turn user interviews into actionable insights. Advanced synthesis techniques and frameworks.