skills/delivery/design/SKILL.md
Design statistically rigorous A/B tests and experiments. Formulate hypotheses, select metrics, calculate sample sizes. Discovers analytics and feature flag tools via capability detection. Use when: "design experiment", "A/B test", "hypothesis", "sample size", "what metrics", "test my feature", "should we experiment"
npx skillsauth add mikeparcewski/wicked-garden designInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Design experiments with statistical rigor.
# Design experiment from hypothesis
/wicked-garden:delivery:experiment "Blue CTA increases clicks by 10%"
# Design with context file
/wicked-garden:delivery:experiment feature-spec.md
# Discover available tools
/wicked-garden:delivery:experiment --discover
Template:
[Action] will [increase/decrease] [Metric] by [Amount] because [Reason]
Good: "Adding social proof to checkout will increase conversion by 8% because it reduces purchase anxiety"
Bad: "New design will be better" (not specific or measurable)
Hierarchy:
Example for checkout optimization:
Quick estimates (95% confidence, 80% power):
See statistics.md for detailed formulas.
Best practices:
Required tracking:
// Variant assignment
trackEvent('experiment_viewed', {
experiment: 'checkout_social_proof',
variant: 'control' | 'treatment',
user_id: '...'
})
// Primary metric
trackEvent('purchase_completed', {
experiment: 'checkout_social_proof',
variant: '...',
value: 49.99
})
Statistical:
Business:
The skill emits a markdown experiment design with sections: Hypothesis, Metrics (Primary / Secondary / Guardrail), Variants, Sample Size, Statistical Parameters, Instrumentation, Success Criteria, Risks & Mitigations.
Full template with substitutable placeholders: refs/output-template.md.
Discovers available tools automatically via capability detection:
Capabilities needed:
feature-flags: Feature toggle and flag managementanalytics: Event tracking and metrics collectionexperiment-platform: Dedicated A/B testing platformsDiscovery methods:
Asks "Do I have analytics capability?" not "Do I have Amplitude?"
With native tasks: Stores design via TaskUpdate description append on the active task With qe: QE provides test scenarios for instrumentation With wicked-brain:memory: Recalls past experiment patterns With product: Uses product context for hypothesis
/wicked-garden:delivery:report - Analyze experiment results/wicked-garden:delivery:rollout - Plan feature rolloutdevelopment
--- name: large-scale-migration description: How to execute a LARGE MECHANICAL change across any codebase with LEVERAGE instead of an agent-grind or hand-edits — a cross-cutting migration, refactor, rename, dialect/framework/DB port, library adoption, or bulk transform. The map→transform→gate pattern: a deterministic transform driven by a source-of-truth map, proven by a differential-equivalence gate. Use when the work is "migrate all X to Y", "rename Z everywhere", "port to a new DB/dialect/fra
testing
v11 LLM-based work-shape classifier. Replaces the regex archetype detector with the model's own reasoning. Reads the user's prompt, picks the right archetype(s) from the catalog, identifies signals (blast_radius, novelty, reversibility, etc.), and persists to SessionState so subsequent turns steer correctly. Use when: the prompt_submit hook emitted a `<wg classify-due />` directive, OR explicitly invoked at session start, OR when re-classifying after the user changes scope mid-session.
tools
v11 work-shape archetype runner. When a prompt has been routed to one of the 9 archetypes (triage, explore, specify, decide, ship, review, incident, build, migrate), this skill is the entry point. It picks the right per-archetype playbook from refs/ and executes the phase shape declared in `.claude-plugin/archetypes.json`. Use when: a `<wg archetype="X">` or `<wg archetypes>` system-reminder tag appears, an explicit "let's run the X archetype" request, or when one of the per-archetype slash commands resolves to this skill.
development
Show or set the session intent variable. Intent gates how loud the framework is — simple-edit (silent), feature/research (synthesis directive), rigor (full crew context). Auto-detected on turn 1; this skill overrides explicitly. Sticky for the session. Use when: "set intent", "intent override", "/wicked-garden:intent", "make the framework quiet", "force rigor", "what's my intent".