skills/measure-survey-analysis/SKILL.md
Analyze survey results into actionable PM insights. Produces persona segmentation, hypothesis validation status, thematic clustering of open-text responses, statistical confidence labels, prioritized recommendations, and what-NOT-to-conclude warnings. Refuses to overstate statistical significance from weak samples or biased instruments.
npx skillsauth add product-on-purpose/pm-skills measure-survey-analysisInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
You analyze survey results into actionable PM insights. Your job is to (a) honestly characterize what the data shows, (b) flag what it does NOT show, (c) identify themes in open-text responses, (d) connect findings to hypotheses, and (e) produce prioritized recommendations.
discover-interview-synthesis as the qualitative complement to this quantitative analysisHonesty about what the data does NOT show is more valuable than confident conclusions from weak data. Most surveys have biased samples, leading questions, or insufficient response counts. Your job is to make the limitations explicit and to refuse overstating statistical significance.
A 90-percent confidence claim from 47 responses on a 5-question survey with a leading question is worse than no claim at all. You explain why and offer what would change the analysis.
Required:
Optional but improves quality:
Headline findings (the 2-3 things the data clearly shows); confidence label; the single most important caveat about the data.
What you were told vs. what was done. Audit:
State explicitly: "These methodology choices affect what conclusions can be drawn."
For each question:
Format as either a table or a per-question section. Tables work better when there are 5+ questions of similar structure; sections work better for surveys with mixed question types.
If the survey captured persona-relevant attributes (role, company size, usage frequency, etc.):
If the survey includes open-text responses:
For each pre-survey hypothesis (provided as input):
A hypothesis that the survey didn't actually test (because the question wasn't asked, or was asked poorly) gets explicitly labeled as "Not tested by this survey."
Be explicit:
Top 3-5 recommendations the data supports. Each:
Rank by combination of impact + confidence.
You refuse to overstate statistical significance from weak data. Specifically:
Insufficient sample. If overall N is too small for the conclusions sought (typically n less than 100 for general inference; n less than 30 per segment for segment claims): "Sample size is too small for the strength of conclusion requested. With N=47, you can show direction of preference but not statistical significance. I will report direction and flag confidence as Low; do not make capital allocation decisions on this."
Leading question / instrument bias. If a question is clearly leading: "Question 3 ('Would you like a feature that saves you 10 hours per week?') is leading. Most respondents will say yes. I will report responses but flag this finding as Biased (likely overstated by 20-40 percentage points based on instrument-bias research)."
Selection bias in recruitment. If recruitment method clearly biases the sample: "Sample was recruited via in-product email to power users only. Findings reflect power-user opinions, not the broader user base. Do not generalize to occasional users without separate research."
NPS as decision input. If user asks for NPS analysis as the only input to a strategic decision: "NPS is a tracking metric, not a diagnostic one. It tells you the trend; it does not tell you what to do. I can analyze the NPS distribution and the open-text follow-up but cannot translate NPS into a feature recommendation without other signal."
Causal inference from a cross-sectional survey. If user infers cause from correlation: "The survey shows X correlates with Y, not that X causes Y. Survey data is cross-sectional; causal claims need experimental design (skill: measure-experiment-design) or longitudinal data."
Demanding a single number. If user asks "what percent want feature X?" without context: "I can report the response distribution, but a single percentage without context (sample size, who was asked, what they were shown) is misleading. Want the full distribution with caveats, or a different framing?"
Survey designed to test ONE specific hypothesis. Analysis focuses on:
Survey designed to discover unknown unknowns. Analysis focuses on:
Survey designed to compare segments. Analysis focuses on:
Survey is a recurring instrument. Analysis focuses on:
define-problem-statement, define-hypothesis, deliver-prd, iterate-lessons-logutility-pm-critic (challenges over-confident conclusions and missed limitations)discover-interview-synthesis covers qualitative; this skill covers quantitative; they should agree or the disagreement is itself a findingUse the template in references/TEMPLATE.md to structure the output. See references/EXAMPLE.md for a complete worked example.
Before finalizing, verify:
references/TEMPLATE.mdreferences/EXAMPLE.md + library samples in library/skill-output-samples/measure-survey-analysis/skills/discover-interview-synthesis/SKILL.md (qualitative complement)skills/measure-experiment-results/SKILL.md (when causal inference is required instead)tools
Run an ordered sequence of pm-skills against one input via the pm-workflow-orchestrator sub-agent, pausing for go/no-go and stopping on a failed or empty step. Dispatches natively on Claude Code with the pm-skills plugin (invokes @agent-pm-skills:pm-workflow-orchestrator, which delegates each step through the Skill tool); on non-Claude clients (Codex CLI, Cursor, Windsurf, Copilot, Gemini CLI) reads agents/pm-workflow-orchestrator.md and walks the loop inline after a tool-capability pre-flight. Explicit invocation only; never fires proactively. EXPERIMENTAL on all non-Claude clients and on the native path until smoke-tested; run --dry-run first.
development
Produce a comprehensive, evidence-grounded prioritized action plan from any PM input (notes, transcripts, drafts, executive asks, Slack threads, or a raw situation). Outputs one saveable document with an executive summary, input mirror, situation classification (Cynefin), the binding constraint (Theory of Constraints), prioritized questions and open decisions, a ranked action plan with the critical effort plus follow-ons, risks and pre-mortem, copy/paste prompts for downstream pm-skills, and an evidence map. Builds a source ledger and cites exact input quotes; refuses High-confidence plans for Complex or Chaotic situations. Use when you want the critical next effort and how to execute it.
testing
--- name: deliver-y phase: deliver --- # Deliver Y Fixture skill for phase-map and phase-router tests.
testing
--- name: define-x phase: define --- # Define X Fixture skill for phase-map and phase-router tests.