skills/autoresearch-engineer/SKILL.md
Run Karpathy-style fixed-budget optimization loops for one approved asset and one objective score. Use when the user asks to optimize, auto-research, run overnight experiments, improve a prompt/tool/copy/config, or turn 'is it good?' into a number.
npx skillsauth add jason-easyazz/zoe-ai-assistant autoresearch-engineerInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Use this skill to run a Karpathy autoresearch style loop inside Zoe/Multica: one human-owned program, one editable asset, one locked objective score, and a repeatable keep-or-revert experiment loop.
Reference behavior: karpathy/autoresearch keeps the repository small, treats program.md as the human-edited lightweight skill, lets the agent edit only the single asset file, runs a fixed roughly five-minute experiment, logs the metric, keeps improvements, and resets failed or worse experiments.
When starting setup, say briefly in your own words:
"Hi, I'm now your Auto Research Engineer. We pick one thing in your business, turn 'is it good?' into a single honest number, then I change it, score it, keep what wins, and revert what loses."
Then ask what asset is being optimized first.
Do not begin an experiment loop until all must-haves pass.
Must-haves:
Nice-to-haves:
If any must-have fails, stop and suggest a better-shaped target. Do not pretend a subjective request is an autoresearch candidate.
Create or verify these files before the first run. Names may vary by project, but the roles must be explicit.
Human-owned instructions/program file:
program.md, instructions.md, or a Multica issue description.Agent-editable asset file or files:
Locked scoring file or command:
score.py, scoring.md, a test command, analytics query, or API call, but the definition of better must not change during a run.Never edit the instructions/program file, scoring file, evaluation harness, dependencies, unrelated source files, secrets, or production runtime state unless the human starts a separate approved Zoe engineering ticket.
Use a fresh branch for every run or fix. Branch names should begin with autoresearch/ or codex/autoresearch- and include a short run tag.
Keep a result log outside committed source unless the human explicitly asks for a tracked report. Preferred names:
results.tsv for experiment rounds, untracked by Git by default via **/results.tsv.run.log for the most recent scorer output, untracked by Git via *.log.results.tsv columns:
round commit score status description
Use status values: baseline, keep, discard, crash, or blocked.
Ask only what cannot be discovered from the repo or Multica issue:
For Zoe/Multica production code, also confirm the approved asset paths and use the normal Zoe branch, evidence, validation, PR, and approval process.
After setup and approval, loop until the target is reached, the bounded run limit is reached, or the human stops the run.
run.log without flooding chat.results.tsv.Treat a crash as the hypothesis failing unless the crash is an obvious local typo or import mistake. Fix trivial execution bugs only within the allowed asset files.
Keep a change when:
Discard a change when:
For Zoe repositories, this skill does not bypass engineering governance:
.zoe/AI_ASSISTANT_CHECKLIST.md and any host-specific Zoe rules documented by the operator..zoe/AI_ASSISTANT_CHECKLIST.md and .zoe/manifest.json.When the run stops, summarize:
development
# Web Search Skill ## When to Use Use this skill when the user wants information that requires looking at the live web. ## Trigger Conditions **Use `web_search` (fast, ~3-5s) when:** - Single-source fact lookup: news, exchange rates, sports scores, weather, stock prices - One specific product at one named retailer: "what does Bunnings charge for X" - Simple factual question answerable from one good search result **Use `deep_web_research` (~60s) when:** - ANY mention of location / "near me"
tools
# Touch Panel Skill Zoe drives physical kiosk panels via `panel_*` MCP tools and can SSH into them for diagnostics and repair. ## When to use this skill - User asks about the touch screen / panel / kiosk - Diagnosing why the panel is blank, frozen, or showing wrong content - Restarting, updating, or re-provisioning a panel - Controlling what shows on the panel (navigate, announce, smart-home overlay, etc.) - Registering a new panel or managing panel tokens --- ## Current hardware (productio
tools
# zoe-widget-builder Build new dashboard widgets for Zoe's touch panel and desktop UI. ## Trigger conditions This skill activates when the system message begins with `[ZOE_SELF_BUILD: widget]`. ## Prerequisites - Caller must have admin role. Check via `zoe_self_capabilities` tool (role field). If not admin, reply: "Widget building requires admin access." - Do NOT build if the widget already exists (`zoe_self_capabilities` returns it in existing_widgets). ## Step-by-step workflow ### 1. Chec
tools
# zoe-page-builder Build new HTML pages and views for Zoe's UI at `services/zoe-ui/dist/`. ## Trigger conditions This skill activates when the system message begins with `[ZOE_SELF_BUILD: page]`, or when the user asks to create a new page, dashboard, or view in the Zoe UI. ## Prerequisites - Caller must have admin role. Check via `zoe_self_capabilities` tool. If not admin, reply: "Page building requires admin access." - Do NOT modify any existing page without explicit user instruction. - NE