src/autoskillit/skills_extended/exp-lens-severity-testing/SKILL.md
Analyze severity of experimental tests — adversarial cases, negative controls, falsification tests, easy-pass detection, and confirmatory theater. Falsificationist lens answering "Would this design have caught the error?"
npx skillsauth add talont-org/autoskillit exp-lens-severity-testingInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Philosophical Mode: Falsificationist Primary Question: "Would this design have caught the error?" Focus: Adversarial Cases, Negative Controls, Falsification Tests, Easy-Pass Detection, Confirmatory Theater
/autoskillit:exp-lens-severity-testing [context_path] [experiment_plan_path]
/autoskillit:exp-lens-severity-testing or /autoskillit:make-experiment-diag severityNEVER:
{{AUTOSKILLIT_TEMP}}/exp-lens-severity-testing/run_in_background: true is prohibited)ALWAYS:
For every positive claim, identify what error the test was capable of detecting
Inventory negative controls and sanity checks explicitly — their absence is a finding
Rate severity before reporting conclusions, not after
Flag confirmatory theater: experiments designed to confirm rather than risk refutation
BEFORE creating any diagram, LOAD the /autoskillit:mermaid skill using the Skill tool - this is MANDATORY
If the Skill tool cannot be used (disable-model-invocation) or refuses this invocation, do NOT proceed with diagram creation. Abort this step and omit the diagram from output.
Write output to {{AUTOSKILLIT_TEMP}}/exp-lens-severity-testing/exp_diag_severity_testing_{YYYY-MM-DD_HHMMSS}.md
After writing the file, emit the structured output token as literal plain text with no markdown formatting on the token name (the adjudicator performs a regex match):
diagram_path = /absolute/path/to/{{AUTOSKILLIT_TEMP}}/exp-lens-severity-testing/exp_diag_severity_testing_{...}.md
If positional arg 1 (context_path) is provided and the file exists, read it to obtain IV/DV tables, H0/H1 hypotheses, controlled variables, and success criteria. If positional arg 2 (experiment_plan_path) is provided and exists, read the experiment plan for full methodology. Use this structured context as the foundation for Steps 1-5; skip the CWD exploration for these fields if the context file supplies them.
Spawn Explore subagents to investigate:
Positive Results Claimed
Negative Controls & Sanity Checks
Adversarial Conditions
Alternative Explanations Tested
Prediction Specificity
For each claim:
Severity ratings: HIGH / MEDIUM / LOW Flag confirmatory theater when design is structured to confirm rather than risk refutation.
Show Claims → HIGH/MEDIUM/LOW severity tests → Severity verdicts.
Write the analysis to: {{AUTOSKILLIT_TEMP}}/exp-lens-severity-testing/exp_diag_severity_testing_{YYYY-MM-DD_HHMMSS}.md (relative to the current working directory)
Before creating the diagram, verify:
/autoskillit:mermaid skill using the Skill tool/autoskillit:make-experiment-diag - Parent skill/autoskillit:mermaid - MUST BE LOADED before creating diagram/autoskillit:exp-lens-error-budget/autoskillit:exp-lens-validity-threatsdevelopment
Generate YAML recipes for .autoskillit/recipes/. Use when user says "make script skill", "generate script", "script a workflow", "write a script", "create a script", "new recipe", "write a pipeline", or when loaded by other skills for script formatting.
data-ai
Create Uncertainty Representation visualization planning spec showing error bar definitions, distribution-aware alternatives, and multi-seed variance protocols. Statistical lens answering "How is uncertainty honestly represented?"
data-ai
Create Temporal Dynamics visualization planning spec showing axis scaling (linear vs log), smoothing disclosure, epoch/step alignment, run aggregation (mean + variance bands), early-stopping markers, and wall-clock vs step-count x-axis. Temporal lens answering "Are training dynamics shown clearly and honestly?"
data-ai
Create Narrative Story Arc visualization planning spec showing visual consistency across the report (same color = same model everywhere), logical figure progression, redundant figure detection, and narrative dependency between figures. Narrative lens answering "Do the figures tell a coherent story across the report?"