Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

posthog/configuring-experiment-rollout

Name: configuring-experiment-rollout
Author: posthog

skills/configuring-experiment-rollout/SKILL.md

npx skillsauth add posthog/ai-plugin configuring-experiment-rollout

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Configuring experiment rollout

This skill answers: Who sees what variant?

Recommended approach: equal split + adjust rollout percentage

In most cases, experiments work best with an equal split. If you want to limit exposure to the test variant, adjust the rollout percentage instead.

Why equal splits are better:

Equal splits maximize statistical power — each variant has the same sample size
Equal splits balance traffic and thus reach significance faster
Increasing user exposure throughout the experiment through increasing rollout is clean (changing split mid-experiment can cause users to switch variants, which is bad for user experience and data quality)

Always default to an equal split unless the user explicitly requests otherwise.

When an uneven split is required

Uneven splits combined with the default "Exclude multivariate users" handling can introduce bias. If the experiment observes multi-variant users (users exposed to more than one variant) then those are dropped asymmetrically — the smaller variant loses a larger fraction of its assignments. If those users behave differently from the rest, the smaller variant's metrics will be skewed.

The right mitigation depends on experiment state:

Pre-launch, or live but with few exposures so far — use an equal split and reduce the overall rollout. Achieves the same test-variant exposure without the bias and preserves statistical power. See the disambiguation question below.
Live experiment with significant exposures — switch multivariate handling to "First seen variant". Changing the split mid-run reassigns users across variants (anti-pattern; see "Changing rollout on a running experiment" below). Switching handling instead keeps everyone in their original variant and avoids the asymmetric exclusion. See configuring-experiment-analytics for how to set this. Note that "first seen" handling can introduce other biases, but it's preferable to mid-run reassignment.

The two rollout controls

There are two separate controls that determine who sees what. Both are set via parameters.

1. Variant split (`parameters.feature_flag_variants`)

How users inside the experiment are distributed across variants.

Array of {key, name, split_percent} — percentages must sum to 100
First variant must have key "control" — this is the baseline
Minimum 2 variants, maximum 20
Default: control 50% / test 50%

If the user says "A/B/C test", map the baseline to "control" and create additional variants for the others.

2. Overall rollout (`parameters.rollout_percentage`)

What percentage of all users enter the experiment at all. Default: 100%.

Users not included are excluded entirely — they don't see any variant and are not part of the analysis.

How they interact

These two controls multiply:

| Overall rollout | Variant split | % seeing test | % in analysis | | --------------- | ------------------ | ------------- | ------------- | | 100% | 50/50 | 50% | 100% | | 100% | 75/25 control/test | 25% | 100% | | 50% | 50/50 | 25% | 50% | | 25% | 50/50 | 12.5% | 25% |

The disambiguation question

CRITICAL: If the user requests an uneven variant split (e.g. "60/40", "70/20/10") or mentions a specific percentage that could refer to either the split or the rollout (e.g. "roll out to 25%"), you MUST clarify before proceeding. This covers two cases:

Case 1: Single percentage ("25%", "roll out to 40%")

The percentage is ambiguous — it could mean a variant split or a rollout change. Ask:

There are two ways to get 25% of users seeing the test variant:

Reduced rollout with equal split (recommended): reduce the overall rollout and split variants equally. Only a subset of users enter the experiment, and of those, each variant gets the same share. Equal splits maximize statistical power and avoid bias.

Asymmetric split: keep 100% rollout but give the test variant only 25%. All users enter the experiment, but the uneven split reduces power on the smaller variant and risks bias.

Which approach do you prefer?

Adjust the numbers to match whatever percentage the user requested.

Case 2: Uneven ratio ("60/40", "70/30", "80/20", etc.)

The ratio looks like an explicit variant split, but a reduced rollout with an equal split is almost always better. Explain the trade-off and recommend the alternative:

An uneven variant split works, but an equal split with reduced rollout is recommended:

Equal split + reduced rollout (recommended): reduce the overall rollout so that the same fraction of users sees the test variant, but split variants equally within the experiment. Equal splits maximize statistical power and avoid bias from asymmetric multivariate exclusion.

Uneven split. Achieves the same user-facing outcome, but reduces power on the smaller variant and risks bias.

Would you like the equal split approach, or do you have a specific reason for the uneven split?

Adjust the numbers to match the ratio. For experiments with more than two variants, "equal" means each variant gets the same share (e.g. 34/33/33 for three variants). If the user confirms they want the uneven split after seeing the trade-off, proceed — but DO NOT skip the next section.

After the user picks the uneven split

If the user proceeds with an uneven split (option 2 in either case above), you MUST surface the multivariate-handling implication BEFORE creating or updating the experiment. The user has chosen the riskier rollout path and needs to make an informed choice about how to mitigate.

Ask:

One more thing — with an uneven split, the default "Exclude multivariate users" handling drops users exposed to multiple variants asymmetrically. The smaller variant loses a larger fraction of its assignments, which can skew its metrics if those users behave differently from the rest.

Two options:

Switch multivariate handling to "First seen variant" (recommended for uneven splits) — keeps all users in the analysis and avoids asymmetric exclusion. Has its own caveats (other biases can creep in) but is preferable to the default for uneven splits.

Keep the default "Exclude" handling and accept the bias risk.

Which would you like?

See configuring-experiment-analytics for how to set the multivariate handling. Apply the choice as part of the same operation (creation or update) — do not leave the user with an uneven split under default handling without an explicit, informed decision.

Persist flag across authentication steps

This option (ensure_experience_continuity on the feature flag) is only relevant when:

The feature flag is shown to both logged-out AND logged-in users
You need the same variant assignment before and after login

This is not compatible with all setups. Learn more: https://posthog.com/docs/feature-flags/creating-feature-flags#persisting-feature-flags-across-authentication-steps

Only mention this to the user if their use case involves pre/post-authentication experiences.

Resolving experiments

Rollout changes require an experiment ID. If the user refers to an experiment by name or description (e.g. "change rollout on my signup test"), load the finding-experiments skill to resolve it to a concrete ID before proceeding.

Changing rollout on a running experiment

Any change to rollout or variant split on a running experiment affects both user experience and statistical validity. You MUST warn the user and get explicit confirmation before making the change.

Do NOT silently apply the change — even if the user asked for it directly. Present the warning covering both perspectives:

Who sees what variant? — will users switch variants or lose a feature?
Who is in my analysis? — how does this affect data quality?

Exception: Increasing rollout (without changing the split) is generally safe — no users switch variants, more users are added cleanly.

Mid-experiment fix for uneven-split bias: switching multivariate handling from "Exclude" to "First seen variant" is the recommended mitigation for already-launched experiments — no users switch variants and all collected data stays in the analysis. Changing the split to be even is an anti-pattern mid-run (typically requires resetting or ending the experiment) and is only preferred if the experiment hasn't been exposed to many users yet. See configuring-experiment-analytics for how to change the handling.

See references/changing-distribution-after-launch.md for detailed warnings, what to tell the user, and when to recommend alternatives.

posthog/configuring-experiment-rollout

skills/configuring-experiment-rollout/SKILL.md

Configures the rollout shape of a PostHog experiment — the variant split (50/50, 80/20, A/B/C ratios), the overall rollout percentage that gates how many users enter the experiment, and the disambiguation when a percentage like "roll out to 25%" could mean either. Use when the user mentions a rollout percentage, variant split, or traffic distribution; gives a ratio like 60/40, 70/30, or 80/20; asks "who sees the test variant?"; wants to increase, decrease, or change the rollout or split on a draft or running experiment; weighs equal vs uneven splits; or proposes a mid-experiment split change (often an anti-pattern that needs reset or end-and-restart).

29 stars

testing

Updated May 7, 2026

$ install --global

skillsauth

npx skillsauth add posthog/ai-plugin configuring-experiment-rollout

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: May 7, 2026, 5:37 AM123.5s2 files scanned

SKILL.md

name:: configuring-experiment-rollout
description:: Configures the rollout shape of a PostHog experiment — the variant split (50/50, 80/20, A/B/C ratios), the overall rollout percentage that gates how many users enter the experiment, and the disambiguation when a percentage like "roll out to 25%" could mean either. Use when the user mentions a rollout percentage, variant split, or traffic distribution; gives a ratio like 60/40, 70/30, or 80/20; asks "who sees the test variant?"; wants to increase, decrease, or change the rollout or split on a draft or running experiment; weighs equal vs uneven splits; or proposes a mid-experiment split change (often an anti-pattern that needs reset or end-and-restart).

Configuring experiment rollout

This skill answers: Who sees what variant?

Recommended approach: equal split + adjust rollout percentage

In most cases, experiments work best with an equal split. If you want to limit exposure to the test variant, adjust the rollout percentage instead.

Why equal splits are better:

Equal splits maximize statistical power — each variant has the same sample size
Equal splits balance traffic and thus reach significance faster
Increasing user exposure throughout the experiment through increasing rollout is clean (changing split mid-experiment can cause users to switch variants, which is bad for user experience and data quality)

Always default to an equal split unless the user explicitly requests otherwise.

When an uneven split is required

The right mitigation depends on experiment state:

Pre-launch, or live but with few exposures so far — use an equal split and reduce the overall rollout. Achieves the same test-variant exposure without the bias and preserves statistical power. See the disambiguation question below.
Live experiment with significant exposures — switch multivariate handling to "First seen variant". Changing the split mid-run reassigns users across variants (anti-pattern; see "Changing rollout on a running experiment" below). Switching handling instead keeps everyone in their original variant and avoids the asymmetric exclusion. See configuring-experiment-analytics for how to set this. Note that "first seen" handling can introduce other biases, but it's preferable to mid-run reassignment.

The two rollout controls

There are two separate controls that determine who sees what. Both are set via parameters.

1. Variant split (`parameters.feature_flag_variants`)

How users inside the experiment are distributed across variants.

Array of {key, name, split_percent} — percentages must sum to 100
First variant must have key "control" — this is the baseline
Minimum 2 variants, maximum 20
Default: control 50% / test 50%

If the user says "A/B/C test", map the baseline to "control" and create additional variants for the others.

2. Overall rollout (`parameters.rollout_percentage`)

What percentage of all users enter the experiment at all. Default: 100%.

Users not included are excluded entirely — they don't see any variant and are not part of the analysis.

How they interact

These two controls multiply:

The disambiguation question

Case 1: Single percentage ("25%", "roll out to 40%")

The percentage is ambiguous — it could mean a variant split or a rollout change. Ask:

There are two ways to get 25% of users seeing the test variant:

Reduced rollout with equal split (recommended): reduce the overall rollout and split variants equally. Only a subset of users enter the experiment, and of those, each variant gets the same share. Equal splits maximize statistical power and avoid bias.

Asymmetric split: keep 100% rollout but give the test variant only 25%. All users enter the experiment, but the uneven split reduces power on the smaller variant and risks bias.

Which approach do you prefer?

Adjust the numbers to match whatever percentage the user requested.

Case 2: Uneven ratio ("60/40", "70/30", "80/20", etc.)

The ratio looks like an explicit variant split, but a reduced rollout with an equal split is almost always better. Explain the trade-off and recommend the alternative:

An uneven variant split works, but an equal split with reduced rollout is recommended:

Equal split + reduced rollout (recommended): reduce the overall rollout so that the same fraction of users sees the test variant, but split variants equally within the experiment. Equal splits maximize statistical power and avoid bias from asymmetric multivariate exclusion.

Uneven split. Achieves the same user-facing outcome, but reduces power on the smaller variant and risks bias.

Would you like the equal split approach, or do you have a specific reason for the uneven split?

After the user picks the uneven split

Ask:

One more thing — with an uneven split, the default "Exclude multivariate users" handling drops users exposed to multiple variants asymmetrically. The smaller variant loses a larger fraction of its assignments, which can skew its metrics if those users behave differently from the rest.

Two options:

Switch multivariate handling to "First seen variant" (recommended for uneven splits) — keeps all users in the analysis and avoids asymmetric exclusion. Has its own caveats (other biases can creep in) but is preferable to the default for uneven splits.

Keep the default "Exclude" handling and accept the bias risk.

Which would you like?

Persist flag across authentication steps

This option (ensure_experience_continuity on the feature flag) is only relevant when:

The feature flag is shown to both logged-out AND logged-in users
You need the same variant assignment before and after login

This is not compatible with all setups. Learn more: https://posthog.com/docs/feature-flags/creating-feature-flags#persisting-feature-flags-across-authentication-steps

Only mention this to the user if their use case involves pre/post-authentication experiences.

Resolving experiments

Changing rollout on a running experiment

Do NOT silently apply the change — even if the user asked for it directly. Present the warning covering both perspectives:

Who sees what variant? — will users switch variants or lose a feature?
Who is in my analysis? — how does this affect data quality?

Exception: Increasing rollout (without changing the split) is generally safe — no users switch variants, more users are added cleanly.

See references/changing-distribution-after-launch.md for detailed warnings, what to tell the user, and when to recommend alternatives.

Related Skills

posthog/signals-scout-surveys

testing

VerifiedTrustedCommunity

Focused Signals scout for PostHog projects running surveys. Watches active surveys for score regressions (NPS / CSAT / rating drops), response-volume drops, abandonment spikes, and targeting drift, AND aggregates open-text responses into recurring themes the team should know about (clusters of complaints, praise, feature requests). Emits findings only when a theme or anomaly clears the confidence bar; otherwise writes durable memory and closes out empty. Self-contained peer in the signals-scout-* fleet — no dependencies on other skills. Picked uniformly at random by the coordinator alongside `signals-scout-general` and other specialists.

49SKILL.mdUpdated Jun 5, 2026

posthog/signals-scout-surveys

posthog/signals-scout-revenue-analytics

development

VerifiedTrustedCommunity

Focused Signals scout for PostHog projects using revenue analytics. Watches the derived revenue product for upstream failures (Stripe sync stalls, capture regressions), config drift (missing subscription property, currency mix surprises, broken Stripe↔person joins, deferred-revenue gaps), and goal-miss escalations. Emits findings only when they clear the confidence bar; otherwise writes durable memory and closes out empty. Self-contained peer in the signals-scout-* fleet — no dependencies on other skills. Picked uniformly at random by the coordinator alongside `signals-scout-general` and other specialists.

49SKILL.mdUpdated Jun 5, 2026

posthog/signals-scout-revenue-analytics

posthog/signals-scout-observability-gaps

testing

VerifiedTrustedCommunity

Focused Signals scout for finding observability gaps in PostHog itself — significant event volumes the team isn't tracking, custom events with no insight or dashboard coverage, insights pointing at events that have stopped firing, dashboards missing related context, critical events with no alerts. Watches the event-stream-vs-saved- inventory delta as the team's product evolves and emits findings recommending new insights, dashboard additions, or alerts when gaps clear the confidence bar. Self-contained peer in the signals-scout-* fleet — picked uniformly at random by the coordinator alongside `signals-scout-general` and other specialists.

49SKILL.mdUpdated Jun 5, 2026

posthog/signals-scout-observability-gaps

posthog/signals-scout-logs

testing

VerifiedTrustedCommunity

Focused Signals scout for PostHog projects using logs. Watches for volume bursts, severity-distribution shifts, service silence, fresh message patterns, and trace-correlated bursts via the logs ingestion pipeline. Emits findings only when they clear the confidence bar; otherwise writes durable memory and closes out empty. Self-contained peer in the signals-scout-* fleet — no dependencies on other skills. Picked uniformly at random by the coordinator alongside `signals-scout-general` and other specialists.

49SKILL.mdUpdated Jun 5, 2026

posthog/signals-scout-logs

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/posthog/ai-plugin.git

# Copy into Claude Code skills folder (global)
cp -r ai-plugin/skills/configuring-experiment-rollout ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

posthog/ai-plugin

29 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT

Adoption

posthog/configuring-experiment-rollout

$ install --global

Security Scan Results

SKILL.md

Configuring experiment rollout

Recommended approach: equal split + adjust rollout percentage

When an uneven split is required

The two rollout controls

1. Variant split (parameters.feature_flag_variants)

2. Overall rollout (parameters.rollout_percentage)

How they interact

The disambiguation question

Case 1: Single percentage ("25%", "roll out to 40%")

Case 2: Uneven ratio ("60/40", "70/30", "80/20", etc.)

After the user picks the uneven split

Persist flag across authentication steps

Resolving experiments

Changing rollout on a running experiment

Related Skills

posthog/signals-scout-surveys

posthog/signals-scout-revenue-analytics

posthog/signals-scout-observability-gaps

posthog/signals-scout-logs

posthog/configuring-experiment-rollout

$ install --global

Security Scan Results

SKILL.md

Configuring experiment rollout

Recommended approach: equal split + adjust rollout percentage

When an uneven split is required

The two rollout controls

1. Variant split (parameters.feature_flag_variants)

2. Overall rollout (parameters.rollout_percentage)

How they interact

The disambiguation question

Case 1: Single percentage ("25%", "roll out to 40%")

Case 2: Uneven ratio ("60/40", "70/30", "80/20", etc.)

After the user picks the uneven split

Persist flag across authentication steps

Resolving experiments

Changing rollout on a running experiment

Related Skills

posthog/signals-scout-surveys

posthog/signals-scout-revenue-analytics

posthog/signals-scout-observability-gaps

posthog/signals-scout-logs

1. Variant split (`parameters.feature_flag_variants`)

2. Overall rollout (`parameters.rollout_percentage`)

1. Variant split (`parameters.feature_flag_variants`)

2. Overall rollout (`parameters.rollout_percentage`)