Adoption

Agent Skills are supported by leading AI development tools.

VS Code, Gemini CLI, GitHub, Goose, Amp, Cursor, Claude Code, Letta, OpenCode, Claude, OpenAI Codex, Factory

posthog/managing-experiment-lifecycle

Name: managing-experiment-lifecycle
Author: posthog

skills/managing-experiment-lifecycle/SKILL.md

npx skillsauth add posthog/ai-plugin managing-experiment-lifecycle

Managing experiment lifecycle

This skill covers experiment state transitions — what each action does, when to use it, and how it affects variant assignment and analysis.

State diagram

draft ──launch──▶ running ──end──▶ stopped ──archive──▶ archived
                    │   ▲              │
                  pause resume    ship_variant
                    │   │         (also ends if running)
                    ▼   │
                  paused (flag inactive, still "running" status)

Any non-draft state ──reset──▶ draft
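
The diagram above can be read as a small transition table. The following is a minimal Python sketch of that reading, not PostHog's implementation: the state and action names come from the diagram, while `TRANSITIONS` and `apply_action` are hypothetical names introduced here. Transitions out of `paused` other than resume and reset are left out because the diagram does not spell them out.

```python
from typing import Dict, Tuple

# (current_state, action) -> next_state, transcribed from the diagram.
# "paused" is modeled as its own state for clarity, although the skill notes
# that the experiment's status stays "running" while the flag is inactive.
TRANSITIONS: Dict[Tuple[str, str], str] = {
    ("draft", "launch"): "running",
    ("running", "pause"): "paused",
    ("paused", "resume"): "running",
    ("running", "end"): "stopped",
    ("running", "ship_variant"): "stopped",  # shipping also ends a running experiment
    ("stopped", "ship_variant"): "stopped",
    ("stopped", "archive"): "archived",
}


def apply_action(state: str, action: str) -> str:
    """Return the next state, or raise ValueError for a disallowed action."""
    if action == "reset":
        # Any non-draft state can be reset to draft.
        if state == "draft":
            raise ValueError("Experiment is already in draft state.")
        return "draft"
    try:
        return TRANSITIONS[(state, action)]
    except KeyError:
        raise ValueError(f"cannot {action} from {state}") from None
```

A table like this makes it easy to check an action locally before calling a tool, so that a predictable 400 can be explained up front.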

Actions and their implications

For each action, the two key questions:

Who sees what variant? (user perspective)
Who is in my analysis? (statistical perspective)

Launch (`experiment-launch`)

Transitions draft → running. Activates the feature flag and sets start_date.

Preconditions: must be in draft, flag needs ≥2 variants with "control" first
Pre-launch checklist: has at least one metric? Variants correct? Flag implemented in code?
Variants: users start being bucketed into variants based on the configured split
Analysis: data collection begins from start_date

No request body needed.
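
The preconditions and checklist above can be expressed as a pre-flight check. This is an illustrative sketch only: `launch_blockers` and its parameters are hypothetical, and the metric check reflects the checklist item rather than a documented server-side rule.

```python
from typing import List


def launch_blockers(status: str, variant_keys: List[str], metric_count: int) -> List[str]:
    """List the reasons a launch should not proceed (empty list means ready)."""
    problems: List[str] = []
    if status != "draft":
        problems.append("experiment must be in draft")
    if len(variant_keys) < 2:
        problems.append("flag needs at least 2 variants")
    if not variant_keys or variant_keys[0] != "control":
        problems.append('first variant must be "control"')
    if metric_count < 1:
        # Checklist item from the skill, not a stated hard precondition.
        problems.append("add at least one metric before launching")
    return problems
```

Returning a list instead of raising on the first problem lets the agent report every blocker to the user in one message.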

Pause (`experiment-pause`)

Deactivates the feature flag. Users fall back to the default experience (typically control).

Preconditions: must be running and not already paused
Variants: flag is not returned by /decide — no new exposure events recorded
Analysis: no new data while paused, but existing data is preserved. Experiment stays "running".

No request body. Use experiment-resume to reactivate.

Resume (`experiment-resume`)

Reactivates the feature flag after a pause. Users are re-bucketed deterministically into the same variants.

Preconditions: must be paused
Variants: same assignment as before pause — deterministic bucketing
Analysis: exposure tracking resumes

No request body.
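
Deterministic bucketing is what makes pause and resume safe for assignment: the variant is a pure function of the flag and the user, so nothing needs to be remembered across the pause. The sketch below illustrates the idea with a generic hash-based scheme. It is an assumption for illustration and not necessarily the hashing PostHog uses; `assign_variant` and its parameters are hypothetical.

```python
import hashlib
from typing import Dict


def assign_variant(flag_key: str, distinct_id: str, split: Dict[str, float]) -> str:
    """Map a user to a variant with a stable hash.

    `split` maps variant key to its share of traffic; shares should sum to 1.
    """
    digest = hashlib.sha1(f"{flag_key}.{distinct_id}".encode("utf-8")).hexdigest()
    # Scale the first 15 hex digits into the interval [0, 1].
    point = int(digest[:15], 16) / float(0xFFFFFFFFFFFFFFF)
    cumulative = 0.0
    for variant, share in split.items():
        cumulative += share
        if point < cumulative:
            return variant
    return list(split)[-1]  # guard against floating-point rounding at the top edge


split = {"control": 0.5, "test": 0.5}
before_pause = assign_variant("my-flag", "user-123", split)
after_resume = assign_variant("my-flag", "user-123", split)
assert before_pause == after_resume  # same inputs, same variant
```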

End (`experiment-end`)

Sets end_date and transitions to stopped. The feature flag is NOT modified.

Preconditions: must be running (launched, not already stopped)
Variants: users continue seeing assigned variants (flag stays active)
Analysis: results frozen to data up to end_date

Optional body: conclusion ("won", "lost", "inconclusive", "stopped_early", "invalid") and conclusion_comment.

Use this when you want to freeze results without changing what users see.
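
A small helper can validate the optional body before the call is made. This is a sketch under the assumption that the body is a flat JSON object with the two fields named above; `build_end_body` is a hypothetical helper, not part of any PostHog client.

```python
from typing import Dict, Optional

# Allowed values as listed in the skill.
ALLOWED_CONCLUSIONS = {"won", "lost", "inconclusive", "stopped_early", "invalid"}


def build_end_body(conclusion: Optional[str] = None,
                   conclusion_comment: Optional[str] = None) -> Dict[str, str]:
    """Build the optional request body for experiment-end."""
    body: Dict[str, str] = {}
    if conclusion is not None:
        if conclusion not in ALLOWED_CONCLUSIONS:
            raise ValueError(f"unknown conclusion: {conclusion!r}")
        body["conclusion"] = conclusion
    if conclusion_comment:
        body["conclusion_comment"] = conclusion_comment
    return body
```

Calling `build_end_body()` with no arguments yields an empty body, which matches the case where the experiment is ended without recording a conclusion.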

Ship variant (`experiment-ship-variant`)

Rewrites the feature flag so the selected variant is served to 100% of users.

Preconditions: must be launched (running or stopped). Cannot ship from draft.
Variants: ALL users see the shipped variant. The flag is rewritten with a catch-all group.
Analysis: if still running, the experiment is also ended (end_date set)

Always confirm with the user before shipping — this permanently rewrites the feature flag.

Required: variant_key (e.g. "test"). Optional: conclusion, conclusion_comment.

Returns 409 if an approval policy requires review before the flag change.
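
Because shipping rewrites the flag, it helps to make the user's confirmation an explicit input rather than an afterthought. The sketch below assumes a flat JSON body with the field names given above; `build_ship_body`, `user_confirmed`, and `needs_approval` are hypothetical names introduced for illustration.

```python
from typing import Dict, Optional


def build_ship_body(variant_key: str, user_confirmed: bool,
                    conclusion: Optional[str] = None,
                    conclusion_comment: Optional[str] = None) -> Dict[str, str]:
    """Build the experiment-ship-variant body, refusing to proceed unconfirmed."""
    if not user_confirmed:
        raise PermissionError("Confirm with the user before shipping a variant.")
    if not variant_key:
        raise ValueError("variant_key is required")
    body = {"variant_key": variant_key}
    if conclusion is not None:
        body["conclusion"] = conclusion
    if conclusion_comment is not None:
        body["conclusion_comment"] = conclusion_comment
    return body


def needs_approval(status_code: int) -> bool:
    """True when the response indicates an approval policy blocked the change."""
    return status_code == 409
```

On a 409, the sensible next step is to tell the user that a review is required instead of retrying the call.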

Archive (`experiment-archive`)

Hides a stopped experiment from the default list view.

Preconditions: must be stopped (end_date set)
Variants: no change — flag is unaffected
Analysis: no change — results remain accessible

No request body. Can be restored by setting archived=false via experiment-update.

Reset (`experiment-reset`)

Returns an experiment to draft state. Clears start_date, end_date, conclusion, and archived.

Preconditions: must not already be in draft
Variants: flag is left unchanged — users continue seeing assigned variants
Analysis: previously collected data still exists but won't be included in results unless start_date is adjusted after re-launch

No request body.
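
The effect of a reset on analysis can be pictured as a date window over exposure events. The following is a simplified model rather than PostHog's query logic: `events_in_results` is a hypothetical function, and treating both window bounds as inclusive is an assumption.

```python
from datetime import datetime
from typing import List, Optional


def events_in_results(event_times: List[datetime],
                      start_date: Optional[datetime],
                      end_date: Optional[datetime] = None) -> List[datetime]:
    """Keep only events inside the experiment's analysis window."""
    if start_date is None:  # draft: nothing is analyzed yet
        return []
    return [t for t in event_times
            if t >= start_date and (end_date is None or t <= end_date)]


old_event = datetime(2026, 1, 10)   # collected before the reset
new_event = datetime(2026, 2, 10)   # collected after the re-launch
relaunch = datetime(2026, 2, 1)     # start_date set by the second launch

# After reset and re-launch, the old event falls outside the window.
assert events_in_results([old_event, new_event], relaunch) == [new_event]
# Moving start_date back brings the earlier data into the results again.
assert events_in_results([old_event, new_event], datetime(2026, 1, 1)) == [old_event, new_event]
```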

Duplicate (`experiment-duplicate`)

Creates a copy as a new draft with fresh dates and no results.

Important: always provide a unique feature_flag_key different from the original. If the same key is used, both experiments share a flag — changes to one affect both.

Optional: custom name (defaults to "Original Name (Copy)").
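
The shared-flag pitfall is easy to guard against before the call. In this sketch, `feature_flag_key` is the field named above; the `name` field and the helper `build_duplicate_body` are assumptions made for illustration.

```python
from typing import Dict, Optional


def build_duplicate_body(original_flag_key: str, new_flag_key: str,
                         name: Optional[str] = None) -> Dict[str, str]:
    """Build the experiment-duplicate body, rejecting a reused flag key."""
    if not new_flag_key or new_flag_key == original_flag_key:
        raise ValueError("feature_flag_key must be set and differ from the original")
    body = {"feature_flag_key": new_flag_key}
    if name:
        body["name"] = name  # assumed field name; omitted means the default copy name
    return body
```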

Decision framework

| Situation | Action | Tool |
| --- | --- | --- |
| Draft ready, flag implemented, metrics set | Launch | experiment-launch |
| Clear winner, significant results | Ship the winning variant | experiment-ship-variant |
| No significant difference after sufficient time | End as inconclusive | experiment-end |
| Something wrong, need to stop exposure temporarily | Pause | experiment-pause |
| Resume after pause | Resume | experiment-resume |
| Experiment ended, ready to clean up | Archive | experiment-archive |
| Need to start over with same config | Reset to draft | experiment-reset |
| Want a similar experiment with a fresh start | Duplicate | experiment-duplicate |
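
The state-driven rows of the decision framework can be condensed into a lookup. This is a deliberately simplified sketch: `recommend_tool` and its flags are hypothetical, and reset and duplicate are left out because they depend on what the user wants rather than on the experiment's state.

```python
from typing import Optional


def recommend_tool(state: str, *, something_wrong: bool = False,
                   clear_winner: bool = False,
                   enough_time: bool = False) -> Optional[str]:
    """Return the tool the decision framework suggests, or None if no row applies."""
    if state == "draft":
        return "experiment-launch"  # assumes flag is implemented and metrics are set
    if state == "paused":
        return "experiment-resume"
    if state == "running":
        if something_wrong:
            return "experiment-pause"
        if clear_winner:
            return "experiment-ship-variant"
        if enough_time:
            return "experiment-end"  # end as inconclusive
        return None  # no row applies yet; keep collecting data
    if state == "stopped":
        return "experiment-archive"
    return None
```

The ordering inside the `running` branch is a design choice of this sketch: a reported problem takes priority over a winner, since pausing is reversible and shipping is not.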

Resolving experiments

All lifecycle actions require an experiment ID. If you don't have one, load the finding-experiments skill to resolve the user's reference (name, description, "latest", etc.) to a concrete ID before proceeding.

Error handling

| Error message | Meaning |
| --- | --- |
| "Experiment has already been launched." | Can't launch a non-draft experiment |
| "Experiment has not been launched yet." | Can't end/pause/ship a draft |
| "Experiment has already ended." | Can't end/pause a stopped experiment |
| "Experiment is already paused." | Use resume instead |
| "Experiment is not paused." | It's already active |
| "Experiment is already in draft state." | Nothing to reset |
| "Experiment is already archived." | Already done |

When you get a 400, explain the situation to the user rather than retrying.
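
One way to follow this guidance is to translate the error message into a plain explanation and stop there. The mapping below copies the messages listed in this section; `ERROR_MEANINGS` and `explain_400` are hypothetical names, and falling back to the raw message for unknown errors is an assumption.

```python
from typing import Dict

ERROR_MEANINGS: Dict[str, str] = {
    "Experiment has already been launched.": "Can't launch a non-draft experiment",
    "Experiment has not been launched yet.": "Can't end/pause/ship a draft",
    "Experiment has already ended.": "Can't end/pause a stopped experiment",
    "Experiment is already paused.": "Use resume instead",
    "Experiment is not paused.": "It's already active",
    "Experiment is already in draft state.": "Nothing to reset",
    "Experiment is already archived.": "Already done",
}


def explain_400(message: str) -> str:
    """Turn a 400 error message into an explanation for the user (no retry)."""
    meaning = ERROR_MEANINGS.get(message)
    if meaning is None:
        return f"The request was rejected: {message}"
    return f"{message} {meaning}."
```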

Guides experiment state transitions: launching, pausing, resuming, ending, shipping variants, archiving, resetting, and duplicating. Covers preconditions, implications for variant assignment and analysis, and the decision framework for when to use each action. TRIGGER when: user asks to launch, pause, resume, end, ship, archive, reset, or duplicate an experiment. DO NOT TRIGGER when: user is creating an experiment (use creating-experiments), configuring rollout (use configuring-experiment-rollout), or setting up metrics (use configuring-experiment-analytics).

22 stars

development

Updated Apr 24, 2026

$ install --global

skillsauth

npx skillsauth add posthog/ai-plugin managing-experiment-lifecycle

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

| Scanner | Description | Status | Percentage |
| --- | --- | --- | --- |
| Trivy | Container and dependency vulnerability scanner | Clean | 95% |
| Semgrep | Static code analysis for vulnerabilities | Clean | 95% |
| mcp-scan (Snyk) | Model Context Protocol security validation | Clean | 95% |
| Snyk (dep) | Open source security scanning | Skipped | 50% |
| Socket.dev | Supply chain security analysis | Skipped | 50% |
| VirusTotal | Multi-engine malware detection | Skipped | 50% |
| CrowdStrike | Advanced threat intelligence | Skipped | 50% |
| OSV-Scanner | Open Source Vulnerability database check | Skipped | 50% |
| OWASP Dep-Check | | Skipped | 50% |

Last scanned: Apr 24, 2026, 8:34 PM · 0.5s · 1 file scanned

Related Skills

posthog/signals-scout-surveys

testing

Focused Signals scout for PostHog projects running surveys. Watches active surveys for score regressions (NPS / CSAT / rating drops), response-volume drops, abandonment spikes, and targeting drift, AND aggregates open-text responses into recurring themes the team should know about (clusters of complaints, praise, feature requests). Emits findings only when a theme or anomaly clears the confidence bar; otherwise writes durable memory and closes out empty. Self-contained peer in the signals-scout-* fleet — no dependencies on other skills. Picked uniformly at random by the coordinator alongside `signals-scout-general` and other specialists.

49 · SKILL.md · Updated Jun 5, 2026

posthog/signals-scout-revenue-analytics

development

Focused Signals scout for PostHog projects using revenue analytics. Watches the derived revenue product for upstream failures (Stripe sync stalls, capture regressions), config drift (missing subscription property, currency mix surprises, broken Stripe↔person joins, deferred-revenue gaps), and goal-miss escalations. Emits findings only when they clear the confidence bar; otherwise writes durable memory and closes out empty. Self-contained peer in the signals-scout-* fleet — no dependencies on other skills. Picked uniformly at random by the coordinator alongside `signals-scout-general` and other specialists.

49 · SKILL.md · Updated Jun 5, 2026

posthog/signals-scout-observability-gaps

testing

Focused Signals scout for finding observability gaps in PostHog itself: significant event volumes the team isn't tracking, custom events with no insight or dashboard coverage, insights pointing at events that have stopped firing, dashboards missing related context, critical events with no alerts. Watches the event-stream-vs-saved-inventory delta as the team's product evolves and emits findings recommending new insights, dashboard additions, or alerts when gaps clear the confidence bar. Self-contained peer in the signals-scout-* fleet, picked uniformly at random by the coordinator alongside `signals-scout-general` and other specialists.

49 · SKILL.md · Updated Jun 5, 2026

posthog/signals-scout-logs

testing

Focused Signals scout for PostHog projects using logs. Watches for volume bursts, severity-distribution shifts, service silence, fresh message patterns, and trace-correlated bursts via the logs ingestion pipeline. Emits findings only when they clear the confidence bar; otherwise writes durable memory and closes out empty. Self-contained peer in the signals-scout-* fleet — no dependencies on other skills. Picked uniformly at random by the coordinator alongside `signals-scout-general` and other specialists.

49 · SKILL.md · Updated Jun 5, 2026

Install manually

# Clone the repo
git clone https://github.com/posthog/ai-plugin.git

# Copy into Claude Code skills folder (global), creating it if needed
mkdir -p ~/.claude/skills
cp -r ai-plugin/skills/managing-experiment-lifecycle ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

posthog/ai-plugin

22 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT
