skills/ai-research-explore/SKILL.md
Rigor Explore compatible skill slug for meaningful and potentially novel deep learning research candidates. Use when the researcher has chosen the task family, dataset, benchmark, evaluation method, provided SOTA references, and wants candidate-only exploration on top of `current_research` with auditable repo understanding, idea gating, fair comparison, and governed experiments written to `explore_outputs/`. Do not use for README-first trusted reproduction, open-ended direction finding, narrow code-only or run-only exploration, passive repo analysis, verified novelty claims, or implicit experimentation.
npx skillsauth add lllllllama/ai-research-workflow-skills ai-research-exploreInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Use this as the Rigor Explore compatible skill slug after the researcher
explicitly authorizes candidate-only work on top of a durable
current_research anchor. The installed slug remains ai-research-explore for
compatibility. Rigor Explore is for meaningful and potentially novel deep
learning research candidates while preserving scientific rigor, comparability,
reproducibility, and auditable collaboration. Novelty and significance remain
hypotheses before literature contrast, ablation evidence, and fair comparison.
The skill does not promise autonomous discovery, global benchmark completeness,
novelty proof, or trusted reproduction success.
Start from the shared operating principles in
../../references/agent-operating-principles.md, then load
../../references/research-rigor-principles.md for research claims and
../../references/deep-learning-experiment-principles.md when experiment
details affect comparability or reproducibility.
Use this skill only when the request has both:
current_research context such as a branch, commit, checkpoint,
run record, or already-trained local model state.Keep narrow code-only requests on explore-code. Keep narrow run-only requests
on explore-run. Keep passive repository analysis on analyze-project. Keep
README-first reproduction on ai-research-reproduction.
Use a two-loop rhythm:
This rhythm is a guide, not a rigid autonomous loop. Stop at explicit blockers, unclear scientific meaning, exhausted budget, missing anchor/evaluation, or a human checkpoint.
current_research and explicit explore-lane authorization.variant_spec or higher-level research_campaign.analyze-project.explore-code for bounded code
adaptation and explore-run for short-cycle trials or sweeps.minimal-run-and-audit or run-train only when the exploratory plan
requires real execution evidence.analysis_outputs/, sources/, and
explore_outputs/ as appropriate; never present exploratory gains as trusted
reproduction success. Include SCIENTIFIC_CHANGELOG.md and
COMPARABILITY_REPORT.md for candidate scientific meaning and comparison
boundaries.evaluation_source and sota_reference frozen for
the campaign; do not claim they are globally complete.research_campaign is preferred for Rigor Explore campaigns, but it should
stay minimal. The durable core is:
current_researchtask_familydatasetbenchmarkevaluation_sourcesota_referencecompute_budgetUse candidate_ideas, variant_spec, research_lookup, idea_policy,
idea_generation, source_constraints, feasibility_policy, baseline_gate,
and execution_policy as optional guidance, not as fields the agent must fill
for every campaign. See references/research-campaign-spec.md for the advanced
schema and artifact expectations.
references/ai-research-explore-policy.md for lane safety and candidate
semantics.references/research-campaign-spec.md only when a campaign file is
present or the user asks for Rigor Explore campaign governance.../../references/explore-variant-spec.md for run-level variant matrix
details.../../references/research-rigor-principles.md before making novelty,
contribution, SOTA, or comparability statements.../../references/deep-learning-experiment-principles.md when training,
evaluation, baseline, ablation, metric, checkpoint, or dataset details matter.scripts/orchestrate_explore.py and scripts/write_outputs.py for the
existing deterministic artifact workflow.development
Rigor Debug / Rigor Audit skill for deep learning research work. Use when the user pastes a traceback, terminal error, CUDA OOM, checkpoint load failure, shape mismatch, NaN loss symptom, or training failure and wants conservative diagnosis before any patching, with debug fixes clearly separated from research contributions. Do not use for broad refactoring, speculative adaptation, automatic exploratory patching, or general repository familiarization.
development
Rigor Train skill for deep learning research repositories. Use when a documented or selected training command should be run conservatively for startup verification, short-run verification, full kickoff, or resume, with command, config, seed, log, checkpoint, status, and metric evidence written to standardized `train_outputs/`. Do not use for environment setup, exploratory sweeps, speculative idea implementation, or end-to-end orchestration.
tools
Rigor Intake helper for README-first deep learning repo reproduction. Use when the task is specifically to scan a repository, read the README and common project files, extract documented commands, classify inference, evaluation, and training candidates, and return the smallest trustworthy reproduction plan to the main orchestrator. Do not use for environment setup, asset download, command execution, final reporting, paper lookup, or end-to-end orchestration.
tools
Rigor Paper Context helper for README-first deep learning repo reproduction. Use only when the README and repository files leave a narrow reproduction-critical gap and the task is to resolve a specific paper detail such as dataset split, preprocessing, evaluation protocol, checkpoint mapping, or runtime assumption from primary paper sources while recording conflicts. Do not use for general paper summary, repo scanning, environment setup, command execution, title-only paper lookup, or replacing README guidance by default.