src/autoskillit/skills_extended/exp-lens-error-budget/SKILL.md
Analyze statistical error budget showing Type I/II errors, power, minimum detectable effect, multiplicity corrections, and sequential monitoring. Statistical lens answering "Are error risks sized and controlled?"
npx skillsauth add talont-org/autoskillit exp-lens-error-budgetInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Philosophical Mode: Statistical Primary Question: "Are error risks sized and controlled?" Focus: Type I/II Errors, Power, Minimum Detectable Effect, Multiplicity, Sequential Monitoring
/autoskillit:exp-lens-error-budget [context_path] [experiment_plan_path]
/autoskillit:exp-lens-error-budget or /autoskillit:make-experiment-diag errorNEVER:
{{AUTOSKILLIT_TEMP}}/exp-lens-error-budget/run_in_background: true is prohibited)ALWAYS:
Enumerate every statistical test and account for its error contribution
Distinguish per-test error rates from family-wise error rates
Flag any sequential peeking without a formal stopping rule as a critical defect
Evaluate whether the minimum detectable effect is practically meaningful, not just statistically chosen
BEFORE creating any diagram, LOAD the /autoskillit:mermaid skill using the Skill tool - this is MANDATORY
If the Skill tool cannot be used (disable-model-invocation) or refuses this invocation, do NOT proceed with diagram creation. Abort this step and omit the diagram from output.
Write output to {{AUTOSKILLIT_TEMP}}/exp-lens-error-budget/exp_diag_error_budget_{YYYY-MM-DD_HHMMSS}.md
After writing the file, emit the structured output token as literal plain text with no markdown formatting on the token name (the adjudicator performs a regex match):
diagram_path = /absolute/path/to/{{AUTOSKILLIT_TEMP}}/exp-lens-error-budget/exp_diag_error_budget_{...}.md
If positional arg 1 (context_path) is provided and the file exists, read it to obtain IV/DV tables, H0/H1 hypotheses, controlled variables, and success criteria. If positional arg 2 (experiment_plan_path) is provided and exists, read the experiment plan for full methodology. Use this structured context as the foundation for Steps 1-5; skip the CWD exploration for these fields if the context file supplies them.
Spawn Explore subagents to investigate:
Sample Size & Power
Multiple Comparisons
Sequential Analysis
Decision Thresholds
Effect Size Context
For each statistical claim:
For each test, rate alignment as: ALIGNED / CONVENTIONAL / MISALIGNED
If a diagram adds value, show Data → Tests → Thresholds → Conclusions, with labeled error rates.
Write the analysis to: {{AUTOSKILLIT_TEMP}}/exp-lens-error-budget/exp_diag_error_budget_{YYYY-MM-DD_HHMMSS}.md (relative to the current working directory)
Before creating the diagram, verify:
/autoskillit:mermaid skill using the Skill tool/autoskillit:make-experiment-diag - Parent skill/autoskillit:mermaid - MUST BE LOADED before creating diagram/autoskillit:exp-lens-severity-testing/autoskillit:exp-lens-variance-stabilitydevelopment
Generate YAML recipes for .autoskillit/recipes/. Use when user says "make script skill", "generate script", "script a workflow", "write a script", "create a script", "new recipe", "write a pipeline", or when loaded by other skills for script formatting.
data-ai
Create Uncertainty Representation visualization planning spec showing error bar definitions, distribution-aware alternatives, and multi-seed variance protocols. Statistical lens answering "How is uncertainty honestly represented?"
data-ai
Create Temporal Dynamics visualization planning spec showing axis scaling (linear vs log), smoothing disclosure, epoch/step alignment, run aggregation (mean + variance bands), early-stopping markers, and wall-clock vs step-count x-axis. Temporal lens answering "Are training dynamics shown clearly and honestly?"
data-ai
Create Narrative Story Arc visualization planning spec showing visual consistency across the report (same color = same model everywhere), logical figure progression, redundant figure detection, and narrative dependency between figures. Narrative lens answering "Do the figures tell a coherent story across the report?"