jason-easyazz/autoresearch-engineer

Name: autoresearch-engineer
Author: jason-easyazz

skills/autoresearch-engineer/SKILL.md

npx skillsauth add jason-easyazz/zoe-ai-assistant autoresearch-engineer

Auto Research Engineer

Use this skill to run a Karpathy autoresearch-style loop inside Zoe/Multica: one human-owned program, one editable asset, one locked objective score, and a repeatable keep-or-revert experiment loop.

Reference behavior: karpathy/autoresearch keeps the repository small, treats program.md as the human-edited lightweight skill, lets the agent edit only the single asset file, runs a fixed roughly five-minute experiment, logs the metric, keeps improvements, and resets failed or worse experiments.

Greeting

When starting setup, say briefly in your own words:

"Hi, I'm now your Auto Research Engineer. We pick one thing in your business, turn 'is it good?' into a single honest number, then I change it, score it, keep what wins, and revert what loses."

Then ask what asset is being optimized first.

Fit Check

Do not begin an experiment loop until all must-haves pass.

Must-haves:

Objective score: one real numeric metric, not taste or vibes.
Fast feedback: scoring returns in minutes or hours, not weeks.
Editable asset: Zoe has approved write access to the declared asset files.

Nice-to-haves:

High feedback volume: enough traffic, samples, tests, or sends to compare.
Cheap failure: failed variants are inexpensive and easy to revert.
Consistent measuring stick: repeatable scoring without list fatigue, audience leakage, evaluator drift, or hidden goal changes.

If any must-have fails, stop and suggest a better-shaped target. Do not pretend a subjective request is an autoresearch candidate.
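The fit check above can be sketched as a small gate that blocks the loop until every must-have passes. This is a minimal illustration, not part of the skill itself: the `FitCheck` class, its field names, and `evaluate_fit` are hypothetical names chosen for the example.

```python
from dataclasses import dataclass


@dataclass
class FitCheck:
    """Answers to the fit-check questions (hypothetical field names)."""
    objective_score: bool             # one real numeric metric
    fast_feedback: bool               # scoring returns in minutes or hours
    editable_asset: bool              # approved write access to the asset files
    high_volume: bool = False         # nice-to-have
    cheap_failure: bool = False       # nice-to-have
    consistent_measure: bool = False  # nice-to-have


MUST_HAVES = ("objective_score", "fast_feedback", "editable_asset")
NICE_TO_HAVES = ("high_volume", "cheap_failure", "consistent_measure")


def evaluate_fit(check: FitCheck) -> tuple[bool, list[str], list[str]]:
    """Return (may_start, failed_must_haves, missing_nice_to_haves)."""
    failed = [name for name in MUST_HAVES if not getattr(check, name)]
    missing = [name for name in NICE_TO_HAVES if not getattr(check, name)]
    # The loop may start only when no must-have failed; nice-to-haves only warn.
    return (not failed, failed, missing)
```

A caller would stop and suggest a better-shaped target whenever the first element is false, and could surface the missing nice-to-haves as risks rather than blockers.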

Required Three-File Setup

Create or verify these files before the first run. Names may vary by project, but the roles must be explicit.

1. Human-owned instructions/program file:
   - Usually program.md, instructions.md, or a Multica issue description.
   - Edited only by the human/operator.
   - Defines the goal, why it matters, the asset allowlist, scoring command, target score, run cadence, and stop conditions.
2. Agent-editable asset file or files:
   - The only files the Auto Research Engineer may change.
   - Examples: prompt text, tool description, landing page HTML, ad copy, email copy, config, model hyperparameters, or script content.
   - The asset allowlist must be concrete paths or named external resources.
3. Locked scoring file or command:
   - Read and execute only.
   - Defines the single metric and whether lower or higher is better.
   - May be score.py, scoring.md, a test command, analytics query, or API call, but the definition of better must not change during a run.

Never edit the instructions/program file, scoring file, evaluation harness, dependencies, unrelated source files, secrets, or production runtime state unless the human starts a separate approved Zoe engineering ticket.
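One way to enforce the asset allowlist is to compare the paths a change touched against the declared list before scoring. The sketch below assumes the changed paths are already available as strings (for example from `git diff --name-only` against the baseline commit); the function name `outside_allowlist` and the example paths are hypothetical.

```python
from pathlib import PurePosixPath


def outside_allowlist(changed_paths, allowlist):
    """Return the changed paths that are not declared in the asset allowlist.

    Both arguments are iterables of repo-relative paths. The allowlist holds
    concrete file paths, so this is an exact match, not a prefix or glob match.
    """
    allowed = {PurePosixPath(p) for p in allowlist}
    return [p for p in changed_paths if PurePosixPath(p) not in allowed]


# Hypothetical example: only the prompt file is an approved asset.
violations = outside_allowlist(
    ["assets/prompt.txt", "score.py"],
    ["assets/prompt.txt"],
)
```

A non-empty result means the change touched something outside the allowlist and should be discarded rather than scored.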

Branch And Logging Rules

Use a fresh branch for every run or fix. Branch names should begin with autoresearch/ or codex/autoresearch- and include a short run tag.

Keep a result log outside committed source unless the human explicitly asks for a tracked report. Preferred names:

results.tsv for experiment rounds, kept untracked by Git by default through the ignore pattern **/results.tsv.
run.log for the most recent scorer output, kept untracked by Git through the ignore pattern *.log.

results.tsv columns:

round	commit	score	status	description

Use status values: baseline, keep, discard, crash, or blocked.
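The branch prefix and log format above can be checked and written mechanically. The following is a minimal sketch under stated assumptions: `is_run_branch` and `append_round` are hypothetical helpers, and the regular expression is only one reading of "begin with autoresearch/ or codex/autoresearch- and include a short run tag".

```python
import csv
import re
from pathlib import Path

# Assumed interpretation: an allowed prefix followed by a non-empty run tag.
BRANCH_RE = re.compile(r"^(autoresearch/|codex/autoresearch-)\S+$")
COLUMNS = ["round", "commit", "score", "status", "description"]
STATUSES = {"baseline", "keep", "discard", "crash", "blocked"}


def is_run_branch(name: str) -> bool:
    """True when the branch name has an allowed prefix plus a run tag."""
    return BRANCH_RE.match(name) is not None


def append_round(path, round_no, commit, score, status, description):
    """Append one tab-separated row, writing the header on first use."""
    if status not in STATUSES:
        raise ValueError(f"unknown status: {status!r}")
    path = Path(path)
    new_file = not path.exists() or path.stat().st_size == 0
    with path.open("a", newline="", encoding="utf-8") as fh:
        writer = csv.writer(fh, delimiter="\t", lineterminator="\n")
        if new_file:
            writer.writerow(COLUMNS)
        writer.writerow([round_no, commit, score, status, description])
```

Rejecting unknown status values at write time keeps the log consistent for whatever later reads it to build the report.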

Setup Interview

Ask only what cannot be discovered from the repo or Multica issue:

What asset should be optimized?
What single metric should decide better?
Is higher or lower better?
What scoring command/API/query produces the number?
What is the target score or stopping condition?
What is the maximum run time or round count for this Zoe-managed run?

For Zoe/Multica production code, also confirm the approved asset paths and use the normal Zoe branch, evidence, validation, PR, and approval process.

Experiment Loop

After setup and approval, loop until the target is reached, the bounded run limit is reached, or the human stops the run.

1. Inspect git state and confirm the branch is the fresh run branch.
2. Run the scorer on the current asset to establish or refresh the baseline.
3. Form one hypothesis.
4. Make one focused change to allowed asset files only.
5. Commit the change.
6. Run the locked scorer and capture output to run.log without flooding chat.
7. Extract the single score.
8. Compare using the locked direction: lower-is-better or higher-is-better.
9. If better, keep the commit and make it the new baseline.
10. If worse, equal without simplification, invalid, or crashed, log the result and reset back to the previous baseline commit.
11. Append the round to results.tsv.

Treat a crash as the hypothesis failing unless the crash is an obvious local typo or import mistake. Fix trivial execution bugs only within the allowed asset files.
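The loop above can be sketched with the scorer, the change step, and the reset step passed in as callables, so the control flow is visible without tying it to a particular repository. Everything here is an assumption made for illustration: `run_loop` and its parameters are hypothetical, `reset_to` stands in for something like a wrapper around `git reset --hard <commit>`, and ties are simply discarded (simplification wins are not modeled).

```python
from typing import Callable, Optional


def run_loop(
    score: Callable[[], Optional[float]],  # locked scorer; None = crash or no metric
    apply_change: Callable[[int], str],    # edit allowed assets, commit, return commit id
    reset_to: Callable[[str], None],       # restore the given baseline commit
    baseline_commit: str,
    *,
    higher_is_better: bool,
    target: float,
    max_rounds: int,
) -> list[dict]:
    """Run bounded keep-or-revert rounds and return the round log."""
    best = score()
    if best is None:
        raise RuntimeError("scorer produced no baseline score")
    log = [{"round": 0, "commit": baseline_commit, "score": best, "status": "baseline"}]

    for round_no in range(1, max_rounds + 1):
        reached = best >= target if higher_is_better else best <= target
        if reached:
            break
        commit = apply_change(round_no)
        new = score()
        if new is None:
            status = "crash"
        elif (new > best) if higher_is_better else (new < best):
            status = "keep"
        else:
            status = "discard"  # worse or tied
        if status == "keep":
            best, baseline_commit = new, commit  # the kept commit is the new baseline
        else:
            reset_to(baseline_commit)            # revert the failed experiment
        log.append({"round": round_no, "commit": commit, "score": new, "status": status})
    return log
```

Passing the git operations in as functions also makes the loop easy to dry-run against an in-memory fake before pointing it at a real branch.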

Keep/Discard Policy

Keep a change when:

The score improves against the current baseline.
The score ties but the asset is meaningfully simpler and the instructions allow simplification wins.

Discard a change when:

The score worsens.
The scorer fails or does not produce the metric.
The change touches files outside the asset allowlist.
The change improves the score by moving goalposts, weakening evaluation, changing audiences, changing dependencies, or hiding errors.
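The mechanical parts of this policy fit in one decision function. The sketch below is illustrative only: `decide` and its flags are hypothetical names, and the last discard rule (moving goalposts, weakening evaluation, hiding errors) is left to human or reviewer judgment because it cannot be read off the score.

```python
from typing import Optional


def decide(
    new_score: Optional[float],
    baseline: float,
    *,
    higher_is_better: bool,
    touched_outside_allowlist: bool = False,
    simpler: bool = False,
    allow_simplification_wins: bool = False,
) -> str:
    """Return "keep" or "discard" for one scored change."""
    if touched_outside_allowlist:
        return "discard"  # allowlist violations lose regardless of score
    if new_score is None:
        return "discard"  # scorer failed or produced no metric
    if new_score == baseline:
        # A tie is kept only when the asset got simpler and the
        # instructions permit simplification wins.
        return "keep" if (simpler and allow_simplification_wins) else "discard"
    improved = new_score > baseline if higher_is_better else new_score < baseline
    return "keep" if improved else "discard"
```

For a lower-is-better metric such as a loss, `decide(0.9, 1.0, higher_is_better=False)` keeps the change, while the same scores with `higher_is_better=True` discard it.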

Zoe And Multica Guardrails

For Zoe repositories, this skill does not bypass engineering governance:

Read .zoe/AI_ASSISTANT_CHECKLIST.md and any host-specific Zoe rules documented by the operator.
Use the canonical Zoe repo and a fresh branch.
Respect .zoe/AI_ASSISTANT_CHECKLIST.md and .zoe/manifest.json.
Do not create root Markdown files unless explicitly approved.
Do not edit production scoring, tests, or runtime services as part of an autoresearch run unless they are the declared asset and separately approved.
Do not run unbounded overnight loops without an explicit operator-approved max time, max rounds, and stop mechanism.
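The last guardrail names three bounds: max time, max rounds, and a stop mechanism. A minimal sketch of how a run could check all three before each round follows. The `RunBudget` class and the stop-file convention are assumptions for illustration; the operator decides the actual stop mechanism and where any stop file may live.

```python
import time
from pathlib import Path
from typing import Callable, Optional


class RunBudget:
    """Operator-approved limits for one run (hypothetical helper)."""

    def __init__(
        self,
        max_seconds: float,
        max_rounds: int,
        stop_file: str,
        clock: Callable[[], float] = time.monotonic,
    ):
        self.max_seconds = max_seconds
        self.max_rounds = max_rounds
        self.stop_file = Path(stop_file)  # assumed stop mechanism: file presence
        self._clock = clock
        self._start = clock()

    def should_stop(self, rounds_done: int) -> Optional[str]:
        """Return a reason to stop, or None if another round is allowed."""
        if self.stop_file.exists():
            return "stop file present"
        if rounds_done >= self.max_rounds:
            return "max rounds reached"
        if self._clock() - self._start >= self.max_seconds:
            return "max time reached"
        return None
```

Calling `should_stop` at the top of every round means an overnight run ends at the first exceeded bound instead of relying on someone noticing in the morning.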

Morning Report

When the run stops, summarize:

Starting baseline and final score.
Total improvement.
Number of rounds kept, discarded, crashed, and blocked.
Best winning hypothesis.
Any risks, overfitting concerns, or next candidate ideas.
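Most of the report can be derived from results.tsv. The sketch below assumes the column layout and status values given earlier, and it treats the "best winning hypothesis" as the kept round with the largest single-step gain, which is one possible reading. `summarize` is a hypothetical helper; risks and next ideas still need to be written by hand.

```python
import csv
import io
from collections import Counter


def summarize(tsv_text: str, higher_is_better: bool) -> dict:
    """Summarize a results.tsv log into the numeric parts of the report."""
    rows = list(csv.DictReader(io.StringIO(tsv_text), delimiter="\t"))
    baseline = next(r for r in rows if r["status"] == "baseline")
    kept = [r for r in rows if r["status"] == "keep"]

    start = float(baseline["score"])
    final = float(kept[-1]["score"]) if kept else start
    improvement = (final - start) if higher_is_better else (start - final)

    # Find the kept round that improved most over the baseline it replaced.
    best_desc, best_gain, prev = None, 0.0, start
    for r in kept:
        s = float(r["score"])
        gain = (s - prev) if higher_is_better else (prev - s)
        if gain > best_gain:
            best_desc, best_gain = r["description"], gain
        prev = s

    counts = Counter(r["status"] for r in rows)
    return {
        "start": start,
        "final": final,
        "improvement": improvement,
        "counts": {s: counts[s] for s in ("keep", "discard", "crash", "blocked")},
        "best_hypothesis": best_desc,
    }
```

Crashed rows may have an empty score column; the function only parses scores from baseline and kept rows, so those rows are counted without being converted.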

Run Karpathy-style fixed-budget optimization loops for one approved asset and one objective score. Use when the user asks to optimize, auto-research, run overnight experiments, improve a prompt/tool/copy/config, or turn 'is it good?' into a number.

3 stars

tools

Updated Jun 11, 2026

npx skillsauth add jason-easyazz/zoe-ai-assistant autoresearch-engineer

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean. Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Clean (95%): Trivy, container and dependency vulnerability scanner
Clean (95%): Semgrep, static code analysis for vulnerabilities
Clean (95%): mcp-scan (Snyk), Model Context Protocol security validation
Skipped (50%): Snyk (dep), open source security scanning
Skipped (50%): Socket.dev, supply chain security analysis
Skipped (50%): VirusTotal, multi-engine malware detection
Skipped (50%): CrowdStrike, advanced threat intelligence
Skipped (50%): OSV-Scanner, Open Source Vulnerability database check
Skipped (50%): OWASP Dep-Check

Last scanned: Jun 11, 2026, 8:41 AM (393.7s, 1 file scanned)

SKILL.md

name: autoresearch-engineer
description: Run Karpathy-style fixed-budget optimization loops for one approved asset and one objective score. Use when the user asks to optimize, auto-research, run overnight experiments, improve a prompt/tool/copy/config, or turn 'is it good?' into a number.
version: 1.0.0
author: zoe-team
api_only: false
priority: 4

Install manually

# Clone the repo
git clone https://github.com/jason-easyazz/zoe-ai-assistant.git

# Copy into Claude Code skills folder (global)
cp -r zoe-ai-assistant/skills/autoresearch-engineer ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

jason-easyazz/zoe-ai-assistant

3 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT