plugins/hypercore/skills/autoresearch-skill/SKILL.md
[Hyper] Use this skill when optimizing an existing Codex skill through baseline-first experiments, binary evals, Guard checks, and one-mutation-at-a-time iteration. Use for skill autoresearch, measured trigger/workflow improvement, self-optimizing a skill, benchmarking skill changes, or resuming skill experiment artifacts. If invoked without a target, ask for the target skill and eval intent before creating artifacts or mutating files. Do not use for one-off skill creation/refactor, generic docs polish, app QA, commit-only, or push-only requests.
npx skillsauth add alpoxdev/hypercore autoresearch-skillInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
@rules/experiment-loop.md @rules/context-sourcing-and-trace.md @rules/validation-and-exit.md @references/reporting-and-score-explanation.md
Improve an existing skill through measurable experiments instead of one large rewrite.
<output_language>
Default all user-facing deliverables, saved artifacts, reports, plans, generated docs, summaries, handoff notes, commit/message drafts, and validation notes to Korean, even when this canonical skill file is written in English.
Preserve source code identifiers, CLI commands, file paths, schema keys, JSON/YAML field names, API names, package names, proper nouns, and quoted source excerpts in their required or original language.
Use a different language only when the user explicitly requests it, an existing target artifact must stay in another language for consistency, or a machine-readable contract requires exact English tokens. If a localized template or reference exists (for example *.ko.md or *.ko.json), prefer it for user-facing artifacts.
</output_language>
<purpose>.hypercore/autoresearch-skill/[skill-name]/: results.tsv, results.json, changelog.md, dashboard.html, and SKILL.md.baseline.<routing_rule>
Use autoresearch-skill when the user wants to optimize an existing skill through repeated experiments and evaluation.
Use skill-maker when the main job is creating a new skill or doing one structural refactor without an experiment loop.
Use skill-tester when the main job is validating a skill once without a mutation loop.
Use docs-maker when the main job is rewriting a general document, runbook, or prose artifact rather than improving a reusable skill.
Do not use autoresearch-skill when:
</routing_rule>
<missing_target_behavior>
If the user invokes autoresearch-skill, $autoresearch-skill, or a local slash equivalent without a target skill path, existing experiment workspace, or clear skill name:
.hypercore, .omx, skills/, rules, references, scripts, or assets before that answer.</missing_target_behavior>
<trigger_conditions>
Positive examples:
skills/web-clone/SKILL.md and keep only changes that raise the score."skills/foo/SKILL.md and keep only score-improving mutations.".hypercore.".hypercore/autoresearch-skill/foo."Negative examples:
Boundary example:
skill-maker refactoring is usually better.</trigger_conditions>
<supported_targets>
SKILL.md and directly linked rules/ or references/.</supported_targets>
<required_inputs>
Collect these before the first mutation:
plan, run, resume, or review. Default: run when a target and eval intent are clear..hypercore/autoresearch-skill/[skill-name]/ workspace.Guard checks that must not regress. Default: trigger boundary, core size, support links, artifact schema, and renderer smoke checks when applicable.5; interval for timed loops defaults to 2 minutes.Input policy:
<missing_target_behavior> before any write.When autoresearching this or another skill without a supplied prompt pack:
</required_inputs>
<language_support>
</language_support>
<support_file_read_order>
Read only the files needed for the active phase, in this order:
rules/experiment-loop.md before recording experiment 0 or choosing a mutation.rules/context-sourcing-and-trace.md before baseline when tools, delegation, current/external sources, or guard checks affect correctness.references/self-test-pack.md when the user did not supply a prompt pack.references/eval-guide.md before designing or revising the 3 to 6 binary evals.references/artifact-spec.md before creating .hypercore artifacts, rendering dashboard.html, or validating results.json and results.js.references/skill-refactor-guide.md only when a failed eval points to structure, trigger wording, support-file placement, or duplication.references/reporting-and-score-explanation.md before writing Korean score explanations, changelog notes, dashboard-visible labels, and final reports.rules/validation-and-exit.md before declaring the run complete.</support_file_read_order>
<autoresearch_integration>
This skill is not complete from standalone .hypercore experiment logs alone. When used through $autoresearch, also satisfy this bridge contract.
Default validation mode:
prompt-architect-artifactState storage:
.omx/state/.../autoresearch-state.json:.omx/state/[session-or-skill]/autoresearch-state.json; for this skill's self-run, .omx/state/autoresearch-skill/autoresearch-state.json is the concrete path.
validation_mode: prompt-architect-artifactcompletion_artifact_path: .omx/specs/autoresearch-{skill-name}/result.jsonvalidator_prompt: architect-review prompt that approves or rejects target skill output and experiment logs against the missionoutput_artifact_path: .hypercore/autoresearch-skill/{skill-name}/results.jsonExit rules:
.hypercore score is necessary evidence, not sufficient evidence.completion_artifact_path exists and architect_review.verdict is approved..hypercore results and .omx/specs/.../result.json.</autoresearch_integration>
<manual_qa_gate>
Tests alone do not prove completion. For every user-visible criterion in an autoresearch run, capture at least one Manual QA artifact through the real available surface before final reporting.
tmux when the skill behavior is CLI, artifact, or terminal-session shaped.results.json.status as complete until Manual QA artifacts and cleanup receipts exist for the declared criteria.</manual_qa_gate>
<autonomy_contract>
After the baseline plan is explicit:
</autonomy_contract>
<skill_architecture>
Keep the core skill focused on trigger, owned work, workflow, and mutation discipline.
Load support files intentionally:
dashboard.html and results.js from the official dashboard template with scripts/render-dashboard.sh.details/ or standard log files; let the renderer load them into the dashboard instead of editing the HTML template by hand.description, score rationale, score-delta note, changelog entry, and dashboard text in Korean unless the user explicitly requests another language.Artifact lifecycle requirements:
.hypercore/autoresearch-skill/[skill-name]/.SKILL.md.baseline before editing.baseline-files.json or a baseline/ snapshot.results.tsv and results.json after every experiment.dashboard.html as a live view derived from results.json.results.js as the generated bridge for both results.json and detailed content files.results.json.status as running during the loop and complete at exit.file:// URL.When skill structure is weak:
rules/ and detailed knowledge into references/ only when those files will actually be used.</skill_architecture>
<workflow>| Phase | Task | Output |
|------|------|------|
| 0 | Read the target skill and current support-file shape | Baseline understanding |
| 1 | Convert success conditions into binary evals | Eval set |
| 2 | Initialize experiment workspace and artifacts | .hypercore/autoresearch-skill/[skill-name]/ |
| 3 | Run experiment 0 against the unmodified skill | Baseline score |
| 4 | Repeat one-mutation-at-a-time experiments | Keep/discard decision |
| 5 | Verify final results and summarize the run | Final report |
SKILL.md and only the directly linked support files needed for the target behavior.SKILL.md.baseline; snapshot support files too when they are in scope..hypercore/autoresearch-skill/[skill-name]/ at the repository root.results.tsv, results.json, changelog.md, and dashboard.html according to references/artifact-spec.md.scripts/render-dashboard.sh.0 as baseline.results.tsv, results.json, changelog.md, and optional git experiment history.keep or be promoted.<mutation_defaults>
Prefer these mutation types:
description so it triggers on the right requests and avoids neighboring skills.SKILL.md into a directly linked rule file.Avoid these mutation types:
</mutation_defaults>
<deliverables>At exit, leave behind:
.hypercore/autoresearch-skill/[skill-name]/dashboard.html..hypercore/autoresearch-skill/[skill-name]/results.json..hypercore/autoresearch-skill/[skill-name]/results.js or an equivalent file-based bridge..hypercore/autoresearch-skill/[skill-name]/results.tsv..hypercore/autoresearch-skill/[skill-name]/changelog.md..hypercore/autoresearch-skill/[skill-name]/score-explanation.md with Korean score movement, eval/category attribution, and file-level change reasons..hypercore/autoresearch-skill/[skill-name]/final-report.md with the Korean user-facing summary..hypercore/autoresearch-skill/[skill-name]/details/ when the run has detailed prompts, raw eval output, failure excerpts, or review notes too large for results.json..hypercore/autoresearch-skill/[skill-name]/SKILL.md.baseline..hypercore/autoresearch-skill/[skill-name]/baseline-files.json or baseline/ when support files are mutable..omx/specs/autoresearch-[skill-name]/result.json completion artifact.run-contract.md, source-ledger.md, or trace-summary.md when the run uses external/current sources, tools, or delegation.validation_mode and completion_artifact_path bridge state in .omx/state/.../autoresearch-state.json.Follow references/artifact-spec.md for schemas and examples, and references/reporting-and-score-explanation.md for the Korean report contract.
</deliverables> <validation>The run must satisfy:
SKILL.md.results.json, results.tsv, and results.js satisfy references/artifact-spec.md and the dashboard renders from generated data.score_explanation or equivalent score-explanation.md loaded through results.js.dashboard.html.testing
Use this skill when the user asks to create a GitHub issue and move the current AI session onto its matching branch, or when the user provides an existing GitHub issue number/URL and wants the matching branch checked out without extra confirmation. If invoked with no issue/topic, ask what issue to create. Do not use for commit-only, push-only, PR review, or detached worktree management.
development
Use this skill when the user asks to create or update a project-specific DESIGN.md design system document for AI agents, including visual direction, tokens, components, layout, interaction, motion, and light/dark mode variants from user requests, project UI evidence, or design references. Do not use for README/docs authoring, product requirements, architecture rules, or implementing UI code.
testing
Use this skill when the user asks to create a GitHub issue and move the current AI session onto its matching branch, or when the user provides an existing GitHub issue number/URL and wants the matching branch checked out without extra confirmation. If invoked with no issue/topic, ask what issue to create. Do not use for commit-only, push-only, PR review, or detached worktree management.
development
Use this skill when the user asks to create or update a project-specific DESIGN.md design system document for AI agents, including visual direction, tokens, components, layout, interaction, motion, and light/dark mode variants from user requests, project UI evidence, or design references. Do not use for README/docs authoring, product requirements, architecture rules, or implementing UI code.