Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

lyndonkl/design-of-experiments

Name: design-of-experiments
Author: lyndonkl

skills/design-of-experiments/SKILL.md

npx skillsauth add lyndonkl/claude design-of-experiments

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Design of Experiments

Workflow
Common Patterns
Guardrails
Quick Reference

Workflow

Copy this checklist and track your progress:

Design of Experiments Progress:
- [ ] Step 1: Define objectives and constraints
- [ ] Step 2: Identify factors, levels, and responses
- [ ] Step 3: Choose experimental design
- [ ] Step 4: Plan execution details
- [ ] Step 5: Create experiment plan document
- [ ] Step 6: Validate quality

Step 1: Define objectives and constraints

Clarify the experiment goal (screening vs optimization), response metric(s), experimental budget (max runs), time/cost constraints, and success criteria. See Common Patterns for typical objectives.

Step 2: Identify factors, levels, and responses

List all candidate factors (controllable inputs), specify levels for each factor (low/high or discrete values), categorize factors (control vs noise), and define response variables (measurable outputs). For screening many factors (8+), see resources/methodology.md for Plackett-Burman and fractional factorial approaches.

Step 3: Choose experimental design

Based on objective and constraints:

For screening 5+ factors with limited runs → Use resources/methodology.md for fractional factorial or Plackett-Burman
For optimizing 2-5 factors → Use resources/template.md for full or fractional factorial
For response surface mapping → Use resources/methodology.md for central composite or Box-Behnken
For robust design against noise → Use resources/methodology.md for parameter vs noise factor arrays

Step 4: Plan execution details

Specify randomization order (eliminate time trends), blocking strategy (control nuisance variables), replication plan (estimate error), sample size justification (power analysis), and measurement protocols. See Guardrails for critical requirements.

Step 5: Create experiment plan document

Create design-of-experiments.md with sections: objective, factors table, design matrix (run order with factor settings), response variables, execution protocol, and analysis plan. Use resources/template.md for structure.

Step 6: Validate quality

Self-assess using resources/evaluators/rubric_design_of_experiments.json. Check: objective clarity, factor completeness, design appropriateness, randomization plan, measurement protocol, statistical power, analysis plan, and deliverable quality. Minimum standard: Average score ≥ 3.5 before delivering.

Common Patterns

Pattern 1: Screening (many factors → vital few)

Context: 10-30 candidate factors, limited budget, want to identify 3-5 critical factors
Approach: Plackett-Burman or fractional factorial (Resolution III/IV)
Output: Pareto chart of effect sizes, shortlist for follow-up optimization
Example: Software performance tuning with 15 configuration parameters

Pattern 2: Optimization (find best settings)

Context: 2-5 factors already identified as important, want to find optimal levels
Approach: Full factorial (2^k) or fractional factorial + steepest ascent
Output: Main effects plot, interaction plots, recommended settings
Example: Manufacturing process with temperature, pressure, time factors

Pattern 3: Response Surface (map the landscape)

Context: Need to understand curvature, find maximum/minimum, quantify tradeoffs
Approach: Central Composite Design (CCD) or Box-Behnken
Output: Response surface equation, contour plots, optimal region
Example: Chemical formulation with ingredient ratios

Pattern 4: Robust Design (work despite noise)

Context: Product/process must perform well despite uncontrollable variation
Approach: Taguchi inner-outer array (control × noise factors)
Output: Settings that minimize sensitivity to noise factors
Example: Consumer product that must work across temperature/humidity ranges

Pattern 5: Sequential Experimentation (learn then refine)

Context: High uncertainty, want to learn iteratively with minimal waste
Approach: Screening → Steepest ascent → Response surface → Confirmation
Output: Progressively refined understanding and settings
Example: New product development with unknown factor relationships

Guardrails

Design requirements:

Randomize run order: Eliminates time-order bias and confounding with lurking variables. Use random number generator, not "convenient" sequences.
Replicate center points: For designs with continuous factors, replicate center point runs (3-5 times) to estimate pure error and detect curvature.
Preserve critical interactions: In fractional factorials, avoid confounding important 2-way interactions with main effects. Choose Resolution IV or higher if interactions matter.
Check design balance: Ensure orthogonality (factors are uncorrelated in design matrix). Correlation > 0.3 reduces precision and interpretability.
Define response precisely: Use objective, quantitative, repeatable measurements. Avoid subjective scoring unless calibrated with multiple raters.
Justify sample size: Run power analysis to ensure design can detect meaningful effect sizes with acceptable Type II error risk (beta at most 0.20).
Document assumptions: State expected effect magnitudes, interaction assumptions, noise variance estimates. Design validity depends on these.
Plan for analysis before running: Specify statistical tests, significance level (alpha), effect size metrics before data collection to prevent p-hacking.

Common pitfalls:

❌ One-factor-at-a-time (OFAT): Misses interactions, requires more runs than factorial designs
❌ Ignoring blocking: If runs span days/batches/operators, block accordingly or confound results with time trends
❌ Too many levels: Use 2-3 levels initially. More levels increase runs exponentially.
❌ Unmeasured factors: If an important factor isn't controlled/measured, it becomes noise
❌ Changing protocols mid-experiment: Breaks design structure. If necessary, restart or analyze separately.

Quick Reference

Key resources:

resources/template.md: Quick-start templates for common designs (factorial, screening, response surface)
resources/methodology.md: Advanced techniques (optimal designs, Taguchi, mixture experiments, sequential strategies)
resources/evaluators/rubric_design_of_experiments.json: Quality criteria for experiment plans

Typical workflow time:

Simple factorial (2-4 factors): 15-30 minutes
Screening design (8+ factors): 30-45 minutes
Response surface design: 45-60 minutes
Robust design (Taguchi): 60-90 minutes

When to escalate:

User needs mixture experiments (factors must sum to 100%)
Split-plot designs required (hard-to-change factors)
Optimal designs for irregular constraints
Bayesian adaptive designs → Use resources/methodology.md for these advanced cases

Inputs required:

Process/System: What you're experimenting on
Factors: List of controllable inputs with candidate levels
Responses: Measurable outputs (KPIs, metrics)
Constraints: Budget (max runs), time, resources
Objective: Screening, optimization, response surface, or robust design

Outputs produced:

design-of-experiments.md: Complete experiment plan with design matrix, randomization, protocols, analysis approach

lyndonkl/design-of-experiments

skills/design-of-experiments/SKILL.md

Generates structured experimental designs (factorial, response surface, Taguchi) to systematically discover how multiple factors affect outcomes while minimizing experimental runs. Use when optimizing multi-factor systems with limited experimental budget, screening many variables to find the vital few, discovering interactions between parameters, mapping response surfaces for peak performance, validating robustness to noise factors, or when users mention factorial designs, A/B/n testing, parameter tuning, or process optimization.

81 stars

testing

Updated Apr 20, 2026

$ install --global

skillsauth

npx skillsauth add lyndonkl/claude design-of-experiments

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 20, 2026, 6:27 AM9.5s4 files scanned

SKILL.md

name:: design-of-experiments
description:: Generates structured experimental designs (factorial, response surface, Taguchi) to systematically discover how multiple factors affect outcomes while minimizing experimental runs. Use when optimizing multi-factor systems with limited experimental budget, screening many variables to find the vital few, discovering interactions between parameters, mapping response surfaces for peak performance, validating robustness to noise factors, or when users mention factorial designs, A/B/n testing, parameter tuning, or process optimization.

Design of Experiments

Workflow
Common Patterns
Guardrails
Quick Reference

Workflow

Copy this checklist and track your progress:

Design of Experiments Progress:
- [ ] Step 1: Define objectives and constraints
- [ ] Step 2: Identify factors, levels, and responses
- [ ] Step 3: Choose experimental design
- [ ] Step 4: Plan execution details
- [ ] Step 5: Create experiment plan document
- [ ] Step 6: Validate quality

Step 1: Define objectives and constraints

Clarify the experiment goal (screening vs optimization), response metric(s), experimental budget (max runs), time/cost constraints, and success criteria. See Common Patterns for typical objectives.

Step 2: Identify factors, levels, and responses

Step 3: Choose experimental design

Based on objective and constraints:

For screening 5+ factors with limited runs → Use resources/methodology.md for fractional factorial or Plackett-Burman
For optimizing 2-5 factors → Use resources/template.md for full or fractional factorial
For response surface mapping → Use resources/methodology.md for central composite or Box-Behnken
For robust design against noise → Use resources/methodology.md for parameter vs noise factor arrays

Step 4: Plan execution details

Step 5: Create experiment plan document

Step 6: Validate quality

Common Patterns

Pattern 1: Screening (many factors → vital few)

Context: 10-30 candidate factors, limited budget, want to identify 3-5 critical factors
Approach: Plackett-Burman or fractional factorial (Resolution III/IV)
Output: Pareto chart of effect sizes, shortlist for follow-up optimization
Example: Software performance tuning with 15 configuration parameters

Pattern 2: Optimization (find best settings)

Context: 2-5 factors already identified as important, want to find optimal levels
Approach: Full factorial (2^k) or fractional factorial + steepest ascent
Output: Main effects plot, interaction plots, recommended settings
Example: Manufacturing process with temperature, pressure, time factors

Pattern 3: Response Surface (map the landscape)

Context: Need to understand curvature, find maximum/minimum, quantify tradeoffs
Approach: Central Composite Design (CCD) or Box-Behnken
Output: Response surface equation, contour plots, optimal region
Example: Chemical formulation with ingredient ratios

Pattern 4: Robust Design (work despite noise)

Context: Product/process must perform well despite uncontrollable variation
Approach: Taguchi inner-outer array (control × noise factors)
Output: Settings that minimize sensitivity to noise factors
Example: Consumer product that must work across temperature/humidity ranges

Pattern 5: Sequential Experimentation (learn then refine)

Context: High uncertainty, want to learn iteratively with minimal waste
Approach: Screening → Steepest ascent → Response surface → Confirmation
Output: Progressively refined understanding and settings
Example: New product development with unknown factor relationships

Guardrails

Design requirements:

Randomize run order: Eliminates time-order bias and confounding with lurking variables. Use random number generator, not "convenient" sequences.
Replicate center points: For designs with continuous factors, replicate center point runs (3-5 times) to estimate pure error and detect curvature.
Preserve critical interactions: In fractional factorials, avoid confounding important 2-way interactions with main effects. Choose Resolution IV or higher if interactions matter.
Check design balance: Ensure orthogonality (factors are uncorrelated in design matrix). Correlation > 0.3 reduces precision and interpretability.
Define response precisely: Use objective, quantitative, repeatable measurements. Avoid subjective scoring unless calibrated with multiple raters.
Justify sample size: Run power analysis to ensure design can detect meaningful effect sizes with acceptable Type II error risk (beta at most 0.20).
Document assumptions: State expected effect magnitudes, interaction assumptions, noise variance estimates. Design validity depends on these.
Plan for analysis before running: Specify statistical tests, significance level (alpha), effect size metrics before data collection to prevent p-hacking.

Common pitfalls:

❌ One-factor-at-a-time (OFAT): Misses interactions, requires more runs than factorial designs
❌ Ignoring blocking: If runs span days/batches/operators, block accordingly or confound results with time trends
❌ Too many levels: Use 2-3 levels initially. More levels increase runs exponentially.
❌ Unmeasured factors: If an important factor isn't controlled/measured, it becomes noise
❌ Changing protocols mid-experiment: Breaks design structure. If necessary, restart or analyze separately.

Quick Reference

Key resources:

resources/template.md: Quick-start templates for common designs (factorial, screening, response surface)
resources/methodology.md: Advanced techniques (optimal designs, Taguchi, mixture experiments, sequential strategies)
resources/evaluators/rubric_design_of_experiments.json: Quality criteria for experiment plans

Typical workflow time:

Simple factorial (2-4 factors): 15-30 minutes
Screening design (8+ factors): 30-45 minutes
Response surface design: 45-60 minutes
Robust design (Taguchi): 60-90 minutes

When to escalate:

User needs mixture experiments (factors must sum to 100%)
Split-plot designs required (hard-to-change factors)
Optimal designs for irregular constraints
Bayesian adaptive designs → Use resources/methodology.md for these advanced cases

Inputs required:

Process/System: What you're experimenting on
Factors: List of controllable inputs with candidate levels
Responses: Measurable outputs (KPIs, metrics)
Constraints: Budget (max runs), time, resources
Objective: Screening, optimization, response surface, or robust design

Outputs produced:

design-of-experiments.md: Complete experiment plan with design matrix, randomization, protocols, analysis approach

Related Skills

lyndonkl/conf-theme-clustering

testing

VerifiedTrustedCommunity

Cluster a conference's event records into a small set of coarse themes with finer sub-clusters, an explicit outlier bucket, and soft (multi-membership) affinities — using the hybrid embed-then-label pipeline (embed abstracts, reduce, density-cluster, then LLM-label the clusters) when embedding libraries are available, and an LLM-reasoned hierarchical fallback when they are not. Embeddings do the grouping; the LLM only names the groups. Conference-agnostic. Use when turning structured event records into a navigable theme map for preference elicitation and scheduling, when you need 6-8 reasonable themes rather than 20 muddy ones, or when overlapping talks must belong to more than one theme. Trigger keywords - theme clustering, cluster talks, embed then label, soft membership, outlier talks, conference themes, topic map.

127SKILL.mdUpdated Jun 28, 2026

lyndonkl/conf-theme-clustering

lyndonkl/conf-schedule-optimization

development

VerifiedTrustedCommunity

Build a personal conference schedule as a constraint-optimization problem — hard constraints (no time overlap, room-to-room travel time, capacity/registration, the attendee's own must-attends and blackouts) plus a user-owned weighted objective trading interest against breadth, pacing (maximize contiguous free time), and serendipity. Surfaces unbreakable conflicts (two high-value overlapping talks the model cannot rank) as decisions for the human rather than silently picking, and reports what each choice traded away. Conference-agnostic. Use to turn a preference profile plus a theme map into a day-by-day plan, to resolve overlapping sessions, or to balance a packed vs paced schedule. Trigger keywords - schedule optimization, conference schedule, constraint optimization, overlapping talks, contiguous free time, conflict surfacing, packed vs paced.

127SKILL.mdUpdated Jun 28, 2026

lyndonkl/conf-schedule-optimization

lyndonkl/conf-program-extraction

development

VerifiedTrustedCommunity

Parse a heterogeneous conference program (markdown, HTML, PDF-derived text, or JSON) into normalized event records with per-field confidence scores and independent classification axes (topic, depth, format, prerequisites, recorded, capacity). Detects the program's format before extracting, treats every inferred field as uncertain (present vs inferred vs missing), and flags thin or missing abstracts so downstream enrichment can target them. Conference-agnostic. Use when ingesting a conference or event schedule into a structured store, normalizing a talk/session list, or extracting per-session metadata with calibrated confidence. Trigger keywords - program ingestion, parse schedule, session extraction, event records, conference program, talk metadata, per-field confidence.

127SKILL.mdUpdated Jun 28, 2026

lyndonkl/conf-program-extraction

lyndonkl/conf-preference-elicitation

development

VerifiedTrustedCommunity

Build a personalized preference profile from a small number of well-chosen, cluster-grounded questions instead of a long survey. Represents the person's interests as an uncertainty region over the theme map, picks the single highest-information-gain choice-based question (contrasting real talks from different clusters), balances exploiting known interests against exploring uncertain ones, deliberately injects outlier probes to fight selection bias, and stops as soon as the schedule would be stable. Also elicits the user-owned objective weights and hard constraints. Interactive — runs where it can actually ask the person. Conference-agnostic. Use to turn a theme map into a preference profile, to decide what to ask a conference attendee, or to elicit scheduling priorities. Trigger keywords - preference elicitation, ask few questions, information gain, choice-based questions, selection bias probe, objective weights, attendee preferences.

127SKILL.mdUpdated Jun 28, 2026

lyndonkl/conf-preference-elicitation

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/lyndonkl/claude.git

# Copy into Claude Code skills folder (global)
cp -r claude/skills/design-of-experiments ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

lyndonkl/claude

81 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT

Adoption

lyndonkl/design-of-experiments

$ install --global

Security Scan Results

SKILL.md

Design of Experiments

Table of Contents

Workflow

Common Patterns

Guardrails

Quick Reference

Related Skills

lyndonkl/conf-theme-clustering

lyndonkl/conf-schedule-optimization

lyndonkl/conf-program-extraction

lyndonkl/conf-preference-elicitation

lyndonkl/design-of-experiments

$ install --global

Security Scan Results

SKILL.md

Design of Experiments

Table of Contents

Workflow

Common Patterns

Guardrails

Quick Reference

Related Skills

lyndonkl/conf-theme-clustering

lyndonkl/conf-schedule-optimization

lyndonkl/conf-program-extraction

lyndonkl/conf-preference-elicitation