a-green-hand-jack/experiment-evidence-router

Name: experiment-evidence-router
Author: a-green-hand-jack

skills/experiment-evidence-router/SKILL.md

npx skillsauth add a-green-hand-jack/ml-research-skills experiment-evidence-router

Experiment Evidence Router

You are a router. Do not solve the experiment task directly.

Your job: classify the task → read the route table → select one child skill → hand off.

Before Routing — Mandatory

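1. Detect scope: compare the output of git rev-parse --git-common-dir with git rev-parse --show-toplevel to tell whether you are in the main checkout or a linked worktree.
2. If memory/BRIEFING.md exists, read it for the active phase and worktree context.

This context indicates whether the task is planning, active execution, or packaging for a paper.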
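
The two pre-routing checks can be sketched in Python. This is a minimal sketch and not part of the skill itself: the names `is_linked_worktree` and `gather_context` are hypothetical, and it assumes that a common git directory other than `<toplevel>/.git` indicates a linked worktree (a submodule would also trip this check, so treat the result as a hint).

```python
import subprocess
from pathlib import Path
from typing import Optional


def is_linked_worktree(common_dir: str, toplevel: str, cwd: str = ".") -> bool:
    """Return True when the shared git dir is not <toplevel>/.git.

    Assumption: that mismatch means a linked worktree. Submodules also
    keep their git dir elsewhere, so this is only a hint.
    """
    # git may print a path relative to cwd (e.g. ".git"), so anchor it first.
    common = (Path(cwd) / common_dir).resolve()
    local_git = (Path(toplevel) / ".git").resolve()
    return common != local_git


def _git(*args: str) -> str:
    """Run a git command in the current directory and return stripped stdout."""
    result = subprocess.run(
        ["git", *args], capture_output=True, text=True, check=True
    )
    return result.stdout.strip()


def gather_context() -> dict:
    """Collect the scope hint and the optional briefing text before routing."""
    common_dir = _git("rev-parse", "--git-common-dir")
    toplevel = _git("rev-parse", "--show-toplevel")
    briefing = Path(toplevel) / "memory" / "BRIEFING.md"
    briefing_text: Optional[str] = (
        briefing.read_text() if briefing.exists() else None
    )
    return {
        "toplevel": toplevel,
        "linked_worktree": is_linked_worktree(common_dir, toplevel),
        "briefing": briefing_text,
    }
```

Calling `gather_context()` inside a repository returns the worktree root, the worktree hint, and the briefing text (or `None` when memory/BRIEFING.md is absent). Keeping the path comparison in its own function makes it testable without a real repository.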

Classification Buckets

| Bucket | Key signals | Route to |
|---|---|---|
| planning | design experiment, ablation plan, hypothesis, baselines, metrics, controls | experiment-design-planner |
| baseline-fairness | are baselines fair, SOTA current, reviewer will object to comparison | baseline-selection-audit |
| compute | GPU hours, budget, smoke test sizing, how long will it take | compute-budget-planner |
| data | dataset, split, contamination, preprocessing, train/val/test protocol | data-pipeline-manager |
| launch | submit new job, create run, SLURM/RunAI/local script, job file | run-experiment |
| status | existing job, queued, stuck, running, finished, ContainerCreating | run-status-monitor |
| eng-failure | NaN, OOM, crash, wrong metrics, slow training, reproducibility failure | experiment-debugger |
| sci-surprise | valid result but negative, surprising, ambiguous, seeds vary, baselines winning | result-diagnosis |
| claim-audit | confound, claim-drift, protocol integrity, attribution, lock claim into paper | research-results-auditor |
| statistics | significance test, p-value, confidence interval, effect size, seed variance | statistical-analysis-planner |
| pivot | direction change, consistent multi-cycle failure, narrow scope, kill project | project-pivot-planner |
| packaging | evidence board, tables, figures, provenance, experiment report | paper-result-asset-builder or experiment-report-writer |
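
To make the table's shape concrete, here is a rough Python sketch of it as a lookup structure with a naive signal counter. The skill names come from the table; the signal lists are a trimmed subset chosen for illustration, and `ROUTES` and `score_buckets` are hypothetical names. Real classification is a judgment call by the agent, so the phrase matching below is only a stand-in.

```python
import re
from typing import Dict, List, Tuple

# bucket -> (signal phrases, child skills). Skills are copied from the table;
# the signal lists are a trimmed, illustrative subset of its "Key signals".
ROUTES: Dict[str, Tuple[List[str], List[str]]] = {
    "planning": (["design experiment", "ablation plan", "hypothesis"],
                 ["experiment-design-planner"]),
    "baseline-fairness": (["baselines fair", "sota", "reviewer"],
                          ["baseline-selection-audit"]),
    "compute": (["gpu hours", "budget", "smoke test", "how long"],
                ["compute-budget-planner"]),
    "data": (["dataset", "split", "contamination", "preprocessing"],
             ["data-pipeline-manager"]),
    "launch": (["submit", "slurm", "runai", "job file"],
               ["run-experiment"]),
    "status": (["queued", "stuck", "running", "finished", "containercreating"],
               ["run-status-monitor"]),
    "eng-failure": (["nan", "oom", "crash", "wrong metrics", "slow training"],
                    ["experiment-debugger"]),
    "sci-surprise": (["negative", "surprising", "ambiguous", "seeds vary"],
                     ["result-diagnosis"]),
    "claim-audit": (["confound", "claim-drift", "protocol integrity", "attribution"],
                    ["research-results-auditor"]),
    "statistics": (["significance", "p-value", "confidence interval",
                    "effect size", "seed variance"],
                   ["statistical-analysis-planner"]),
    "pivot": (["direction change", "narrow scope", "kill project"],
              ["project-pivot-planner"]),
    "packaging": (["evidence board", "tables", "figures", "provenance",
                   "experiment report"],
                  ["paper-result-asset-builder", "experiment-report-writer"]),
}


def score_buckets(task: str) -> Dict[str, int]:
    """Count, per bucket, the signal phrases that start at a word boundary."""
    text = task.lower()
    scores: Dict[str, int] = {}
    for bucket, (signals, _skills) in ROUTES.items():
        # The leading \b lets "crash" match "crashed" but keeps "oom" out of "zoom".
        hits = sum(1 for s in signals if re.search(r"\b" + re.escape(s), text))
        if hits:
            scores[bucket] = hits
    return scores


print(score_buckets("Training crashed with OOM after 200 steps"))
# prints {'eng-failure': 2}
```

Buckets with no matching signal are omitted from the result, so an empty dictionary means the task could not be classified from keywords alone.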

Routing Steps

1. Identify the single most blocking bucket from the table above.
2. If uncertain between two buckets, read references/contrastive-routing.md.
3. If still uncertain, ask one narrowing question before routing.
4. Select exactly one child skill.
5. Hand off: state which skill you are routing to and why.
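
The steps above can be sketched as a small decision function. This is an illustrative sketch under stated assumptions, not the skill's implementation: `Decision` and `decide` are hypothetical names, the numeric scores stand in for the agent's judgment of which bucket is most blocking, and `tie_breaker` stands in for whatever the agent concludes after reading references/contrastive-routing.md.

```python
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class Decision:
    action: str                  # "handoff" or "ask"
    message: str                 # what the router says to the user
    skill: Optional[str] = None  # set only for a handoff


def decide(
    scores: Mapping[str, int],
    skills: Mapping[str, str],
    tie_breaker: Optional[str] = None,
) -> Decision:
    """Pick exactly one child skill, or ask one question when unsure."""
    if not scores:
        # Nothing matched: escalate instead of defaulting to a catch-all.
        return Decision("ask", "I could not classify this task. "
                               "What is blocking you right now?")
    best = max(scores.values())
    tied = sorted(b for b, s in scores.items() if s == best)
    if len(tied) > 1:
        if tie_breaker in tied:
            # The contrastive reference resolved the tie.
            tied = [tie_breaker]
        else:
            # Still uncertain: ask one narrowing question before routing.
            return Decision("ask", f"Is this closer to {tied[0]} or {tied[1]}?")
    bucket = tied[0]
    skill = skills[bucket]
    return Decision(
        "handoff",
        f"Routing to {skill}: the task matches the {bucket} bucket.",
        skill,
    )
```

A usage example: `decide({"eng-failure": 2, "sci-surprise": 1}, {"eng-failure": "experiment-debugger", "sci-surprise": "result-diagnosis"})` returns a handoff to experiment-debugger, while equal scores with no tie-breaker return an "ask" decision and no skill.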

Hard Constraints

Do not debug the experiment yourself.
Do not interpret results yourself.
Do not submit jobs yourself.
If a task spans multiple buckets, route to the bucket that blocks progress first.
If you cannot classify the task, escalate to the user with a clarifying question; do not default to result-diagnosis as a catch-all.

Route ML experiment planning, execution, debugging, result interpretation, and evidence packaging tasks to the correct skill. Use this when the task involves experiments, compute, results, or evidence — instead of guessing between run-experiment, run-status-monitor, experiment-debugger, result-diagnosis, research-results-auditor, statistical-analysis-planner, or paper packaging skills. Do not solve the task directly.

4 stars

development

Updated May 19, 2026

Install this skill globally with one command:

npx skillsauth add a-green-hand-jack/ml-research-skills experiment-evidence-router

Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

| Scanner | What it checks | Status | Percentage shown |
|---|---|---|---|
| Trivy | Container and dependency vulnerability scanner | Clean | 95% |
| Semgrep | Static code analysis for vulnerabilities | Clean | 95% |
| mcp-scan (Snyk) | Model Context Protocol security validation | Clean | 95% |
| Snyk (dep) | Open source security scanning | Skipped | 50% |
| Socket.dev | Supply chain security analysis | Skipped | 50% |
| VirusTotal | Multi-engine malware detection | Skipped | 50% |
| CrowdStrike | Advanced threat intelligence | Skipped | 50% |
| OSV-Scanner | Open Source Vulnerability database check | Skipped | 50% |
| OWASP Dep-Check | | Skipped | 50% |

Last scanned: May 19, 2026, 5:17 AM; 139.1s; 3 files scanned

SKILL.md

name: experiment-evidence-router
description: Route ML experiment planning, execution, debugging, result interpretation, and evidence packaging tasks to the correct skill. Use this when the task involves experiments, compute, results, or evidence, instead of guessing between run-experiment, run-status-monitor, experiment-debugger, result-diagnosis, research-results-auditor, statistical-analysis-planner, or paper packaging skills. Do not solve the task directly.
allowed-tools: Read, Bash

Related Skills

a-green-hand-jack/ml-research-bootstrap

testing

Verified, Trusted, Community

Bootstrap project-local ml-research-skills. Use from global installs when creating a new ML research project, enabling this collection in an existing ML research repo, or deciding whether to install the full bundle locally. Route to project-init for new projects; do not handle paper or experiment work directly.

4 stars, SKILL.md, updated May 26, 2026

a-green-hand-jack/project-ops-router

development

Verified, Trusted, Community

Route project operations tasks (git, memory, bootstrap, remote, workspace, code review, timeline, ops) to the correct skill. Use when the task involves commits, pushes, worktrees, project memory, enabling project-local skills, SSH/server coordination, sidecar runners, or audits. Do not solve the ops task directly.

4 stars, SKILL.md, updated May 19, 2026

a-green-hand-jack/paper-writing-router

testing

Verified, Trusted, Community

Route ML/AI paper writing tasks to the correct skill: contract planning, prose drafting, section writing, consistency editing, review simulation, rebuttal, submission, or citation work. Use when the task involves writing, revising, reviewing, or submitting a paper instead of guessing between paper-writing-assistant, paper-writing-contract-planner, paper-reviewer-simulator, auto-paper-improvement-loop, or citation skills. Do not draft prose directly.

4 stars, SKILL.md, updated May 19, 2026

a-green-hand-jack/ml-research-router

data-ai

Verified, Trusted, Community

Project-local router for ML research skill selection. Use inside an initialized ML research project, or while maintaining this skill repo, when the user describes an ML research/paper/experiment/discovery/ops/release workflow and may not know the skill; route to a domain router or high-signal leaf. Do not use for generic non-ML projects.

4 stars, SKILL.md, updated May 19, 2026

Install manually

# Clone the repo
git clone https://github.com/a-green-hand-jack/ml-research-skills.git

# Copy into Claude Code skills folder (global)
cp -r ml-research-skills/skills/experiment-evidence-router ~/.claude/skills/

Repository

a-green-hand-jack/ml-research-skills

4 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT