src/autoskillit/skills_extended/make-experiment-diag/SKILL.md
Interactive selection of experimental design lens for visualizing experiment methodology. Routes to the appropriate exp-lens-* skill.
npx skillsauth add talont-org/autoskillit make-experiment-diagInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Select the right experimental design lens for your analysis. Each lens asks one primary question about an experiment's design, surfacing assumptions and failure modes specific to that epistemic dimension.
/autoskillit:make-experiment-diagNEVER:
ALWAYS:
| Your experiment involves... | Use this lens | Question it answers | Philosophical Mode | |-----------------------------|---------------|---------------------|--------------------| | Unclear or shifting claims | Estimand Clarity | What exactly is the claim? | Evidential | | Causal attribution without explicit assumptions | Causal Assumptions | What causal assumptions support this design? | Causal-Structural | | Baseline quality or fairness | Comparator Construction | Is the comparator fair and relevant? | Counterfactual | | Data preprocessing, splits, or feature pipelines | Pipeline Integrity | Could data handling create optimistic bias? | Integrity | | Nondeterminism, seed sensitivity, run-to-run noise | Variance & Stability | Is the signal larger than the noise? | Stability | | Asymmetric tuning, compute, or engineering effort | Fair Comparison | Are alternatives compared symmetrically? | Fairness | | Reproducing results from artifacts alone | Reproducibility & Artifacts | Could an independent party reproduce this? | Transparency | | Metric choice, proxy validity, score interpretation | Measurement Validity | Do measurements justify the interpretation? | Psychometric | | Robustness to preprocessing or modeling choices | Sensitivity & Robustness | Which assumptions are load-bearing? | Robustness | | Generalization claims beyond the test suite | Benchmark Representativeness | Does this generalize beyond the test bed? | Generalizability |
| Your experiment involves... | Use this lens | Question it answers | Philosophical Mode | |-----------------------------|---------------|---------------------|--------------------| | Shared resources, network effects, spillovers | Unit & Interference | What is the unit, and can treatments spill over? | Causal-Structural | | Power, multiplicity, sequential monitoring | Error Budget | Are error risks sized and controlled? | Statistical | | Theory claims needing adversarial stress | Severity Testing | Would this design have caught the error? | Falsificationist | | Assignment mechanism, blocking, stratification | Randomization & Blocking | Where does comparability come from? | Design-Structural | | Confounds, history effects, co-interventions | Validity Threats | What alternative explanations survive? | Adversarial | | Multi-step exploration, adaptive allocation | Iterative Learning | How does this maximize learning per cost? | Decision-Theoretic | | Mixing discovery and confirmation in one study | Exploratory vs Confirmatory | Is this discovery or test? | Boundary | | Deployment risk, fairness, stakeholder harm | Governance & Risk | What risks arise from acting on this result? | Governance |
Ask the user:
What aspect of your experimental design would you like to examine?
Example prompts:
Based on the user's description, match to the most appropriate lens using the selection tables above. If ambiguous, present the top 2-3 candidates and ask the user to choose.
Use the Skill tool to load the selected /autoskillit:exp-lens-{slug} skill.
If the Skill tool cannot be used (disable-model-invocation) or refuses this invocation, do NOT proceed with the exp-lens analysis. Abort this step and inform the user that the lens skill is unavailable.
The loaded lens skill takes over and runs its full analysis workflow.
| Alias | Lens | |-------|------| | estimand | exp-lens-estimand-clarity | | causal | exp-lens-causal-assumptions | | comparator | exp-lens-comparator-construction | | pipeline | exp-lens-pipeline-integrity | | variance | exp-lens-variance-stability | | fairness | exp-lens-fair-comparison | | reproducibility | exp-lens-reproducibility-artifacts | | measurement | exp-lens-measurement-validity | | sensitivity | exp-lens-sensitivity-robustness | | benchmark | exp-lens-benchmark-representativeness | | unit | exp-lens-unit-interference | | error | exp-lens-error-budget | | severity | exp-lens-severity-testing | | randomization | exp-lens-randomization-blocking | | validity | exp-lens-validity-threats | | iterative | exp-lens-iterative-learning | | exploratory | exp-lens-exploratory-confirmatory | | governance | exp-lens-governance-risk |
/autoskillit:mermaid - Shared diagram styling (loaded by individual lenses)/autoskillit:make-arch-diag - Software architecture lens counterpart/autoskillit:verify-diag - Verify diagram accuracy against codebasedevelopment
Generate YAML recipes for .autoskillit/recipes/. Use when user says "make script skill", "generate script", "script a workflow", "write a script", "create a script", "new recipe", "write a pipeline", or when loaded by other skills for script formatting.
data-ai
Create Uncertainty Representation visualization planning spec showing error bar definitions, distribution-aware alternatives, and multi-seed variance protocols. Statistical lens answering "How is uncertainty honestly represented?"
data-ai
Create Temporal Dynamics visualization planning spec showing axis scaling (linear vs log), smoothing disclosure, epoch/step alignment, run aggregation (mean + variance bands), early-stopping markers, and wall-clock vs step-count x-axis. Temporal lens answering "Are training dynamics shown clearly and honestly?"
data-ai
Create Narrative Story Arc visualization planning spec showing visual consistency across the report (same color = same model everywhere), logical figure progression, redundant figure detection, and narrative dependency between figures. Narrative lens answering "Do the figures tell a coherent story across the report?"