skills/autoresearch-skill/SKILL.md
[Hyper] Optimize an existing Codex skill through baseline-first experiments, binary evals, optional guards, and one-mutation-at-a-time iteration. Use for skill autoresearch, measured trigger/workflow improvement, self-optimizing a skill, benchmarking skill changes, or resuming skill experiment artifacts.
npx skillsauth add alpoxdev/hypercore autoresearch-skillInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
@rules/experiment-loop.md @rules/context-sourcing-and-trace.md @rules/validation-and-exit.md @references/reporting-and-score-explanation.md
Improve an existing skill through measurable experiments instead of one large rewrite.
<output_language>
Default all user-facing deliverables, saved artifacts, reports, plans, generated docs, summaries, handoff notes, commit/message drafts, and validation notes to Korean, even when this canonical skill file is written in English.
Preserve source code identifiers, CLI commands, file paths, schema keys, JSON/YAML field names, API names, package names, proper nouns, and quoted source excerpts in their required or original language.
Use a different language only when the user explicitly requests it, an existing target artifact must stay in another language for consistency, or a machine-readable contract requires exact English tokens. If a localized template or reference exists (for example *.ko.md or *.ko.json), prefer it for user-facing artifacts.
</output_language>
<purpose>.hypercore/autoresearch-skill/[skill-name]/: results.tsv, results.json, changelog.md, dashboard.html, and SKILL.md.baseline.<routing_rule>
Use autoresearch-skill when the user wants to optimize an existing skill through repeated experiments and evaluation.
Use skill-maker when the main job is creating a new skill or doing one structural refactor without an experiment loop.
Do not use autoresearch-skill when:
</routing_rule>
<trigger_conditions>
Positive examples:
skills/web-clone/SKILL.md and keep only changes that raise the score.".hypercore."skills/foo and keep only score-improving mutations.".hypercore/autoresearch-skill/foo."Negative examples:
Boundary example:
skill-maker refactoring is usually better.</trigger_conditions>
<supported_targets>
SKILL.md and directly linked rules/ or references/.</supported_targets>
<required_inputs>
Collect these before the first mutation:
plan, run, resume, or review. Default: run when a target and eval intent are clear..hypercore/autoresearch-skill/[skill-name]/ workspace.Guard checks that must not regress. Default: trigger boundary, core size, support links, artifact schema, and renderer smoke checks when applicable.5; interval for timed loops defaults to 2 minutes.Input policy:
When autoresearching this or another skill without a supplied prompt pack:
</required_inputs>
<language_support>
</language_support>
<autoresearch_integration>
This skill is not complete from standalone .hypercore experiment logs alone. When used through $autoresearch, also satisfy this bridge contract.
Default validation mode:
prompt-architect-artifactState storage:
.omx/state/.../autoresearch-state.json:
validation_mode: prompt-architect-artifactcompletion_artifact_path: .omx/specs/autoresearch-{skill-name}/result.jsonvalidator_prompt: architect-review prompt that approves or rejects target skill output and experiment logs against the missionoutput_artifact_path: .hypercore/autoresearch-skill/{skill-name}/results.jsonExit rules:
.hypercore score is necessary evidence, not sufficient evidence.completion_artifact_path exists and architect_review.verdict is approved..hypercore results and .omx/specs/.../result.json.</autoresearch_integration>
<autonomy_contract>
After the baseline plan is explicit:
</autonomy_contract>
<skill_architecture>
Keep the core skill focused on trigger, owned work, workflow, and mutation discipline.
Load support files intentionally:
dashboard.html and results.js from the official dashboard template with scripts/render-dashboard.sh.details/ or standard log files; let the renderer load them into the dashboard instead of editing the HTML template by hand.description, score rationale, score-delta note, changelog entry, and dashboard text in Korean unless the user explicitly requests another language.Artifact lifecycle requirements:
.hypercore/autoresearch-skill/[skill-name]/.SKILL.md.baseline before editing.baseline-files.json or a baseline/ snapshot.results.tsv and results.json after every experiment.dashboard.html as a live view derived from results.json.results.js as the generated bridge for both results.json and detailed content files.results.json.status as running during the loop and complete at exit.file:// URL.When skill structure is weak:
rules/ and detailed knowledge into references/ only when those files will actually be used.</skill_architecture>
<workflow>| Phase | Task | Output |
|------|------|------|
| 0 | Read the target skill and current support-file shape | Baseline understanding |
| 1 | Convert success conditions into binary evals | Eval set |
| 2 | Initialize experiment workspace and artifacts | .hypercore/autoresearch-skill/[skill-name]/ |
| 3 | Run experiment 0 against the unmodified skill | Baseline score |
| 4 | Repeat one-mutation-at-a-time experiments | Keep/discard decision |
| 5 | Verify final results and summarize the run | Final report |
SKILL.md and only the directly linked support files needed for the target behavior.SKILL.md.baseline; snapshot support files too when they are in scope..hypercore/autoresearch-skill/[skill-name]/ at the repository root.results.tsv, results.json, changelog.md, and dashboard.html according to references/artifact-spec.md.scripts/render-dashboard.sh.0 as baseline.results.tsv, results.json, changelog.md, and optional git experiment history.keep or be promoted.<mutation_defaults>
Prefer these mutation types:
description so it triggers on the right requests and avoids neighboring skills.SKILL.md into a directly linked rule file.Avoid these mutation types:
</mutation_defaults>
<deliverables>At exit, leave behind:
.hypercore/autoresearch-skill/[skill-name]/dashboard.html..hypercore/autoresearch-skill/[skill-name]/results.json..hypercore/autoresearch-skill/[skill-name]/results.js or an equivalent file-based bridge..hypercore/autoresearch-skill/[skill-name]/results.tsv..hypercore/autoresearch-skill/[skill-name]/changelog.md..hypercore/autoresearch-skill/[skill-name]/score-explanation.md with Korean score movement, eval/category attribution, and file-level change reasons..hypercore/autoresearch-skill/[skill-name]/final-report.md with the Korean user-facing summary..hypercore/autoresearch-skill/[skill-name]/details/ when the run has detailed prompts, raw eval output, failure excerpts, or review notes too large for results.json..hypercore/autoresearch-skill/[skill-name]/SKILL.md.baseline..hypercore/autoresearch-skill/[skill-name]/baseline-files.json or baseline/ when support files are mutable..omx/specs/autoresearch-[skill-name]/result.json completion artifact.run-contract.md, source-ledger.md, or trace-summary.md when the run uses external/current sources, tools, or delegation.validation_mode and completion_artifact_path bridge state in .omx/state/.../autoresearch-state.json.Follow references/artifact-spec.md for schemas and examples, and references/reporting-and-score-explanation.md for the Korean report contract.
</deliverables> <validation>The run must satisfy:
SKILL.md.results.json, results.tsv, and results.js satisfy references/artifact-spec.md and the dashboard renders from generated data.score_explanation or equivalent score-explanation.md loaded through results.js.dashboard.html.development
[Hyper] Use when working on Vite + TanStack Router projects - enforces architecture rules (layers, routes, hooks, services, conventions) with mandatory validation before any code change. Triggers on file creation, route work, hook patterns, or any structural change in a Vite + TanStack Router codebase.
development
[Hyper] Update semantic versions across node/rust/python projects, keep discovered version files synchronized, and prefer the installed `git-commit` skill for the final git step with a direct fallback when it is unavailable.
development
[Hyper] Use when working on TanStack Start projects and the task involves auth, sessions, cookies, CSRF, secrets, env exposure, server functions/routes, headers/CSP, webhooks, or security review/fixes. Triggers on protecting routes, hardening auth flows, preventing secret leaks, securing server boundaries, or reviewing HTTP/security behavior in a TanStack Start app.
tools
[Hyper] Enforce TanStack Start architecture in existing Start projects, especially project/folder structure, route structure, nested shared folder organization, server functions, loader/client-server boundaries, importProtection, hooks, SSR/hydration, and hypercore conventions. Use before structural code changes, folder-structure reviews, route work, server function work, or architecture audits in TanStack Start codebases.