shenxingy/ads-test

Name: ads-test
Author: shenxingy

configs/skills/ads-test/SKILL.md

npx skillsauth add shenxingy/claude-code-kit ads-test


A/B Test Design & Experiment Planning

Process

1. Understand what the user wants to test (creative, audience, bidding, landing page)
2. Build a structured hypothesis using the framework below
3. Calculate the required sample size and estimated duration
4. Recommend a platform-specific test setup
5. Define success criteria and a measurement plan

Hypothesis Framework

Every test must start with a structured hypothesis:

IF we [change/action]
THEN [metric] will [increase/decrease] by [estimated %]
BECAUSE [reasoning based on data or insight]

Example:
IF we replace polished product shots with UGC creator videos
THEN Meta CTR will increase by 25-40%
BECAUSE Andromeda prioritizes diverse creative formats and UGC consistently outperforms polished creative in 2025-2026 benchmarks

Hypothesis Quality Checklist

[ ] Single variable being tested (isolate the change)
[ ] Specific metric defined (not "performance")
[ ] Estimated effect size stated (needed for sample size calculation)
[ ] Timeframe defined
[ ] Success/failure criteria clear before launch
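
The framework and checklist above can be sketched as a small data structure. This is only an illustration: the `Hypothesis` class, its field names, and the `problems` checks are hypothetical and cover just the checklist items that can be tested mechanically (a specific metric and a stated effect size).

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Hypothesis:
    """Hypothetical container for the IF / THEN / BECAUSE structure."""
    change: str
    metric: str
    direction: str          # "increase" or "decrease"
    estimated_effect: str   # e.g. "25-40%"
    reasoning: str

    def render(self) -> str:
        """Format the hypothesis in the three-line framework."""
        return (
            f"IF we {self.change}\n"
            f"THEN {self.metric} will {self.direction} by {self.estimated_effect}\n"
            f"BECAUSE {self.reasoning}"
        )

    def problems(self) -> List[str]:
        """Return checklist violations that can be detected automatically."""
        issues = []
        if self.metric.strip().lower() in {"", "performance", "results"}:
            issues.append("metric is not specific")
        if self.direction not in ("increase", "decrease"):
            issues.append("direction must be 'increase' or 'decrease'")
        if not any(ch.isdigit() for ch in self.estimated_effect):
            issues.append("estimated effect size has no number")
        return issues


example = Hypothesis(
    change="replace polished product shots with UGC creator videos",
    metric="Meta CTR",
    direction="increase",
    estimated_effect="25-40%",
    reasoning="UGC outperforms polished creative in recent benchmarks",
)
print(example.render())
print(example.problems())
```

The remaining checklist items (single variable, timeframe, success criteria) need human judgment and are deliberately left out of the automated checks.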

Statistical Significance Calculator

Required Sample Size (per variant):

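n = (Z_alpha + Z_beta)^2 × 2 × p × (1-p) / (p × MDE)^2

Where:
- Z_alpha = 1.96 (for 95% confidence, two-sided)
- Z_beta = 0.84 (for 80% power)
- p = baseline conversion rate (as a decimal, e.g. 0.05)
- MDE = minimum detectable effect, relative to baseline (as a decimal, e.g. 0.20 for a 20% lift)
- p × MDE = the absolute difference in conversion rate the test must detect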

Simplified lookup:

| Baseline CVR | 5% MDE | 10% MDE | 20% MDE | 30% MDE |
|-------------|---------|---------|---------|---------|
| 1% | 612,000 | 153,000 | 38,300 | 17,000 |
| 2% | 302,400 | 75,600 | 18,900 | 8,400 |
| 5% | 116,800 | 29,200 | 7,300 | 3,200 |
| 10% | 55,200 | 13,800 | 3,450 | 1,530 |
| 20% | 24,600 | 6,150 | 1,540 | 680 |

Per variant, 95% confidence, 80% power
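
For values outside the lookup table, the sample size formula can be computed directly. The sketch below assumes the relative MDE is converted to an absolute difference (baseline × relative MDE) before it is squared, which is the reading that reproduces the table; the function name `required_sample_size` is hypothetical. It derives the Z values from the normal distribution instead of hard-coding 1.96 and 0.84, so its results land within a few percent of the rounded lookup figures rather than matching them exactly.

```python
import math
from statistics import NormalDist


def required_sample_size(baseline_cvr, relative_mde, confidence=0.95, power=0.80):
    """Per-variant sample size for a two-sided test on two conversion rates.

    baseline_cvr: baseline conversion rate as a decimal (0.05 = 5%)
    relative_mde: minimum detectable effect relative to baseline (0.20 = 20%)
    """
    if not 0 < baseline_cvr < 1:
        raise ValueError("baseline_cvr must be between 0 and 1")
    if relative_mde <= 0:
        raise ValueError("relative_mde must be positive")

    z_alpha = NormalDist().inv_cdf(1 - (1 - confidence) / 2)  # about 1.96 at 95%
    z_beta = NormalDist().inv_cdf(power)                      # about 0.84 at 80%

    # Convert the relative lift into an absolute difference in conversion rate.
    absolute_mde = baseline_cvr * relative_mde

    n = (z_alpha + z_beta) ** 2 * 2 * baseline_cvr * (1 - baseline_cvr) / absolute_mde ** 2
    return math.ceil(n)


# 5% baseline CVR, 20% relative MDE: compare with the 7,300 entry in the table.
print(required_sample_size(0.05, 0.20))
```

Because the MDE is squared in the denominator, halving the detectable effect roughly quadruples the required sample, which is why the 5% MDE column is so much larger than the 20% column.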

Test Duration Estimator

Duration = Required Sample Size / Daily Traffic per Variant

Minimum duration: 7 days (capture weekly patterns)
Maximum recommended: 28 days (avoid seasonal drift)
Learning phase: Google 7-14 days, Meta 3-7 days, LinkedIn 7-14 days

Inputs needed:
- Daily impressions or clicks
- Number of variants (2 = A/B, 3+ = multivariate)
- Baseline conversion rate
- Minimum detectable effect desired

Duration Quick Estimates

| Daily Clicks (per variant) | 2% CVR, 20% MDE | 5% CVR, 20% MDE | 10% CVR, 20% MDE |
|-------------|-----------------|-----------------|-----------------|
| 100 | 189 days | 73 days | 35 days |
| 500 | 38 days | 15 days | 7 days |
| 1,000 | 19 days | 7 days | 4 days* |
| 5,000 | 4 days* | 2 days* | 1 day* |

*Minimum 7 days recommended regardless of sample sufficiency
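
The duration rule, including the 7-day floor and the 28-day recommended ceiling, can be sketched as follows. The function name `estimate_duration` and the shape of the returned dictionary are hypothetical; the traffic figure is assumed to be per variant, as in the formula above.

```python
import math

MIN_DAYS = 7               # capture weekly patterns
MAX_RECOMMENDED_DAYS = 28  # avoid seasonal drift


def estimate_duration(required_per_variant, daily_traffic_per_variant):
    """Estimate test length in days from sample size and per-variant traffic."""
    if daily_traffic_per_variant <= 0:
        raise ValueError("daily_traffic_per_variant must be positive")

    # Days needed purely to collect the sample, rounded up to whole days.
    raw_days = math.ceil(required_per_variant / daily_traffic_per_variant)

    # Never recommend fewer than 7 days, even if the sample fills sooner.
    recommended_days = max(raw_days, MIN_DAYS)

    return {
        "raw_days": raw_days,
        "recommended_days": recommended_days,
        "exceeds_max": recommended_days > MAX_RECOMMENDED_DAYS,
    }


# 2% CVR, 20% MDE (18,900 per variant) at 500 clicks per day per variant.
print(estimate_duration(18_900, 500))
```

When `exceeds_max` is true, the usual options are a larger MDE, a higher-volume metric, or more traffic per variant, not a longer test.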

Platform-Specific Test Setup

Meta Experiments

Use Ads Manager > Experiments tab (not manual ad set duplication)
Automatic audience splitting ensures no overlap
Supported test types: A/B (creative, audience, placement), Holdout, Brand Survey
Meta's Incremental Attribution (April 2025) provides AI-powered holdout testing for measuring real causal impact
Budget: split evenly across variants; minimum $100/day per variant recommended
Duration: 7-14 days typical; Meta auto-determines winner at 95% confidence

Google Experiments

Campaign Experiments (custom experiments) or Ad Variations
Create experiment from existing campaign > select experiment type
Traffic split: 50/50 recommended for fastest results
Supported: bidding strategy, ad copy, landing page, audience
Metrics: choose primary metric (conversions, CPA, ROAS) before launch
Duration: 14-30 days recommended; minimum 2 weeks for bidding tests

LinkedIn A/B Testing

Built into Campaign Manager for Sponsored Content
Duplicate ad set with single variable change
Target: same audience segment with automatic rotation
Minimum budget: $50/day per variant
Key metrics: CTR (>0.44% benchmark), CPL, Lead Form CVR (13% benchmark)
Duration: 14-21 days (LinkedIn's smaller daily volumes require longer tests)

TikTok Split Testing

Available in TikTok Ads Manager > Create A/B Test
Test types: targeting, bidding, creative
Auto-splits audience to avoid contamination
Minimum 7 days, recommended 14 days
Budget: minimum $20/day per ad group
Creative tests: isolate hook (first 2-3 seconds) as the primary variable
TikTok's enhanced split testing supports modular test variables (targeting, creative, budget, placement) via Smart+ since 2025
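
The per-platform budget minimums above translate into a floor on total test spend. The sketch below is an illustration only: the dictionary and function names are hypothetical, it assumes one TikTok ad group per variant, and it has no entry for Google because the section states no minimum for it.

```python
# Minimum daily budget per variant in USD, taken from the setup notes above.
# TikTok's figure is per ad group; one ad group per variant is assumed here.
PLATFORM_MIN_DAILY_BUDGET = {
    "meta": 100,
    "linkedin": 50,
    "tiktok": 20,
}


def minimum_test_budget(platform, variants, days):
    """Lowest total spend that satisfies the platform's per-variant minimum."""
    key = platform.lower()
    if key not in PLATFORM_MIN_DAILY_BUDGET:
        raise ValueError(f"no minimum daily budget listed for {platform!r}")
    if variants < 2:
        raise ValueError("a test needs at least two variants")
    if days < 1:
        raise ValueError("days must be at least 1")
    return PLATFORM_MIN_DAILY_BUDGET[key] * variants * days


# Two-variant Meta test over 14 days.
print(minimum_test_budget("meta", 2, 14))  # 2800
```

This is a floor, not a target: the spend actually needed depends on whether that budget buys enough clicks to reach the required sample size.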

What to Test (Priority Order)

High Impact (test first)

Creative concept (different messaging angles, not just color changes)
Hook/first 3 seconds (video opening on Meta, TikTok, YouTube)
Offer structure (pricing, discount type, free trial length)
Landing page (headline, CTA, form length)
Bidding strategy (tCPA vs tROAS vs Maximize Conversions)

Medium Impact

Audience targeting (interest vs lookalike vs broad)
Ad format (static vs video vs carousel)
CTA button (Learn More vs Sign Up vs Shop Now)
Campaign structure (CBO vs ABO, consolidated vs segmented)

Low Impact (test last)

Ad scheduling (time of day, day of week)
Device targeting (mobile vs desktop)
Minor copy variations (word substitutions without concept change)

Common Testing Mistakes to Avoid

Testing too many variables at once (no clear winner attribution)
Ending tests too early (before statistical significance)
Testing during atypical periods (holidays, launches, incidents)
Comparing unequal time periods
Not documenting learnings (build institutional knowledge)
Testing small changes when big changes are needed (optimize vs innovate)
Ignoring learning phase on automated platforms
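
To guard against ending a test before statistical significance, results exported from a platform can be checked with a standard two-proportion z-test. The sketch below uses the pooled-variance form; it is a generic textbook test, not the method any ad platform uses internally to declare a winner, and the function names are hypothetical.

```python
import math
from statistics import NormalDist


def two_proportion_z_test(conv_a, n_a, conv_b, n_b):
    """Return (z, two-sided p-value) comparing variant B against control A."""
    p_a = conv_a / n_a
    p_b = conv_b / n_b

    # Pooled conversion rate under the null hypothesis of no difference.
    pooled = (conv_a + conv_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    if se == 0:
        return 0.0, 1.0

    z = (p_b - p_a) / se
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_value


def is_significant(conv_a, n_a, conv_b, n_b, confidence=0.95):
    """True when the difference clears the chosen confidence level."""
    _, p_value = two_proportion_z_test(conv_a, n_a, conv_b, n_b)
    return p_value < 1 - confidence


# Control: 500 conversions from 10,000 clicks; variant: 600 from 10,000.
z, p = two_proportion_z_test(500, 10_000, 600, 10_000)
print(round(z, 2), round(p, 4))
```

Running this check repeatedly while the test is live inflates the false-positive rate, so it is best applied once, after the planned sample size and minimum duration are both reached.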

Output Format

## A/B Test Plan

### Hypothesis
IF [change]
THEN [metric] will [direction] by [amount]
BECAUSE [reasoning]

### Test Design
| Parameter | Value |
|-----------|-------|
| Platform | [platform] |
| Test Type | [A/B / Multivariate] |
| Variable | [what's being changed] |
| Control | [current state] |
| Variant | [proposed change] |
| Primary Metric | [KPI] |
| Traffic Split | [50/50 / other] |

### Sample Size & Duration
| Metric | Value |
|--------|-------|
| Baseline CVR | [X%] |
| MDE | [X%] |
| Required Sample | [N per variant] |
| Daily Traffic | [N clicks/day] |
| Est. Duration | [X days] |
| Min Duration | 7 days |

### Success Criteria
- Winner declared at 95% confidence
- [Primary metric] improvement of [X%]+ sustained over [Y] days
- No negative impact on [secondary metric]

### Setup Instructions
[Platform-specific step-by-step]

A/B test design and experiment planning for paid advertising. Structured hypothesis framework, statistical significance calculator, test duration estimator, sample size calculator, and platform-specific experiment setup guides (Meta Experiments, Google Experiments, LinkedIn A/B). Use when user says A/B test, split test, experiment design, test hypothesis, statistical significance, sample size, or test duration.

8 stars

development

Updated Jun 13, 2026

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

- Trivy (container and dependency vulnerability scanner): Clean, 95%
- Semgrep (static code analysis for vulnerabilities): Clean, 95%
- mcp-scan (Snyk) (Model Context Protocol security validation): Clean, 95%
- Snyk (dep) (open source security scanning): Skipped, 50%
- Socket.dev (supply chain security analysis): Skipped, 50%
- VirusTotal (multi-engine malware detection): Skipped, 50%
- CrowdStrike (advanced threat intelligence): Skipped, 50%
- OSV-Scanner (Open Source Vulnerability database check): Skipped, 50%
- OWASP Dep-Check: Skipped, 50%

Last scanned: Jun 13, 2026, 5:13 AM (14.6s, 1 file scanned)

SKILL.md

name: ads-test
description: A/B test design and experiment planning for paid advertising. Structured hypothesis framework, statistical significance calculator, test duration estimator, sample size calculator, and platform-specific experiment setup guides (Meta Experiments, Google Experiments, LinkedIn A/B). Use when user says A/B test, split test, experiment design, test hypothesis, statistical significance, sample size, or test duration.
user_invocable: false
tested_date: 2026-05-17
tested_with: claude-code v2.x

Install manually

# Clone the repo
git clone https://github.com/shenxingy/claude-code-kit.git

# Copy into Claude Code skills folder (global); create the folder first if it does not exist
mkdir -p ~/.claude/skills
cp -r claude-code-kit/configs/skills/ads-test ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

shenxingy/claude-code-kit

8 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT