openclaw/autoresearch-pro

Name: autoresearch-pro
Author: openclaw

0xcjl/autoresearch-pro/SKILL.md

npx skillsauth add openclaw/skills autoresearch-pro

autoresearch-pro

Overview

Automatically improve any OpenClaw skill, prompt, or article through iterative mutation-testing: small edits → run test cases → score with checklist → keep improvements, discard regressions.

Inspired by Karpathy/autoresearch.

Supports three optimization modes:

| Mode | Input | Output |
|------|-------|--------|
| Skill | Path to a skill directory | Improved SKILL.md |
| Prompt | A prompt text string | Improved prompt |
| Article | An article/document text | Improved article |

Workflow

Step 1 — Identify Mode and Input

Ask the user to confirm:

Mode 1 — Skill: User says "optimize [skill-name]" or provides a skill path
Mode 2 — Prompt: User says "optimize this prompt" or pastes a prompt
Mode 3 — Article: User says "improve this article" or pastes article text

For Skill mode, resolve the skill path to ~/.openclaw/skills/<skill-name>/SKILL.md. For Prompt/Article mode, keep the text in context (do not write to disk unless needed).
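The path rule above can be sketched in a few lines of Python. This is an illustrative sketch, not part of the skill: the function name `resolve_skill_path` and the optional `root` parameter are assumptions introduced here, and only the `~/.openclaw/skills/<skill-name>/SKILL.md` layout comes from the text.

```python
from __future__ import annotations

from pathlib import Path


def resolve_skill_path(skill_name: str, root: Path | None = None) -> Path:
    """Return <root>/<skill_name>/SKILL.md (hypothetical helper).

    root defaults to ~/.openclaw/skills, the location named in Step 1.
    """
    # Accept only a bare directory name so the result stays inside root.
    if not skill_name or skill_name in {".", ".."} or "/" in skill_name or "\\" in skill_name:
        raise ValueError(f"not a bare skill name: {skill_name!r}")
    base = root if root is not None else Path.home() / ".openclaw" / "skills"
    return base / skill_name / "SKILL.md"
```

Passing an explicit `root` keeps the helper easy to test without touching the real home directory.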

Step 2 — Generate Checklist (10 Questions)

Read the target content first. Then generate 10 diverse, specific yes/no checklist questions relevant to the content type:

For Skill mode:

| # | Dimension | What to Check |
|---|-----------|---------------|
| 1 | Description clarity | Is the frontmatter description precise and actionable? |
| 2 | Trigger coverage | Does it cover the main real-world use cases? |
| 3 | Workflow structure | Are steps clearly sequenced and unambiguous? |
| 4 | Error guidance | Does it handle error states and edge cases? |
| 5 | Tool usage accuracy | Are tool names and parameters correct for OpenClaw? |
| 6 | Example quality | Do examples reflect real usage patterns? |
| 7 | Conciseness | Is content free of redundant repetition? |
| 8 | Freedom calibration | Is instruction specificity appropriate? |
| 9 | Reference quality | Are references and links accurate? |
| 10 | Completeness | Are all sections filled with real content? |

For Prompt mode (10 tailored questions):

| # | Dimension | What to Check |
|---|-----------|---------------|
| 1 | Goal clarity | Does the prompt state a clear, specific goal? |
| 2 | Role/tone | Is the desired role or tone specified? |
| 3 | Input format | Is the input format clearly described? |
| 4 | Output format | Is the expected output format specified? |
| 5 | Constraints | Are key constraints and boundaries stated? |
| 6 | Context sufficiency | Is enough context provided to avoid hallucination? |
| 7 | Edge cases | Does it handle ambiguous or edge case inputs? |
| 8 | Conciseness | Is it free of redundant or contradictory instructions? |
| 9 | Actionability | Are instructions concrete and actionable vs. vague? |
| 10 | Completeness | Are all necessary elements for the task present? |

For Article mode (10 tailored questions):

| # | Dimension | What to Check |
|---|-----------|---------------|
| 1 | Title quality | Does the title clearly convey the main value? |
| 2 | Opening hook | Does the opening grab attention and set expectations? |
| 3 | Logical structure | Are ideas logically organized (not random)? |
| 4 | Argument clarity | Are claims supported with evidence or reasoning? |
| 5 | Conciseness | Is unnecessary padding or repetition removed? |
| 6 | Transition flow | Do paragraphs/sections flow smoothly? |
| 7 | Closing strength | Does the conclusion summarize and inspire action? |
| 8 | Tone consistency | Is the tone consistent throughout? |
| 9 | Readability | Is sentence/paragraph length varied appropriately? |
| 10 | Audience match | Does language match the target audience level? |

Present the 10 questions, numbered 1-10, and ask the user which ones to activate (e.g., "use questions 1, 3, 5, 7"). If the user doesn't specify, use all 10.
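A minimal sketch of how a reply such as "use questions 1, 3, 5, 7" could be turned into the list of active questions. The function name `parse_selection` is hypothetical, and the sketch assumes the reply lists individual numbers; it does not expand ranges such as "1-3".

```python
import re


def parse_selection(reply: str, total: int = 10) -> list[int]:
    """Extract active question numbers from a free-text reply (hypothetical helper)."""
    # Collect every integer in the reply, dropping duplicates.
    picked = sorted({int(n) for n in re.findall(r"\d+", reply)})
    # Ignore numbers outside the 1..total range.
    valid = [n for n in picked if 1 <= n <= total]
    # No usable numbers means the user did not specify: default to all questions.
    return valid or list(range(1, total + 1))
```

For example, `parse_selection("use questions 1, 3, 5, 7")` returns `[1, 3, 5, 7]`, and an empty reply returns all ten numbers.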

Step 3 — Prepare Test Cases

- Skill mode: Generate 3-5 realistic prompts a user would send when using the skill
- Prompt mode: Generate 3-5 test inputs that the prompt would process
- Article mode: Generate 3-5 ways the article might be read or consumed

Store test cases in context — do not write to disk.

Step 4 — Run Autoresearch Loop

Loop configuration:

- Rounds per batch: 30
- Max total rounds: 100
- Pause: After every 30 rounds, show summary and ask user to continue or stop
- Stop conditions: User says stop, OR 100 rounds completed
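To see how these two limits interact, here is a small sketch that lists the round ranges each batch would cover. The helper name `batch_ranges` is an assumption made for illustration. With the defaults, the 100-round cap cuts the fourth batch short at 10 rounds.

```python
def batch_ranges(rounds_per_batch: int = 30, max_rounds: int = 100) -> list[tuple[int, int]]:
    """Return inclusive (first_round, last_round) pairs, one per batch (hypothetical helper)."""
    ranges = []
    start = 1
    while start <= max_rounds:
        # The last batch is truncated when the total-round cap is reached.
        end = min(start + rounds_per_batch - 1, max_rounds)
        ranges.append((start, end))
        start = end + 1
    return ranges


print(batch_ranges())  # [(1, 30), (31, 60), (61, 90), (91, 100)]
```

The user is asked whether to continue at each boundary, so the loop may stop earlier than the last range shown.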

Per-round procedure:

1. Mutate: Make ONE small edit to the target content:
   - Skill mode: edit SKILL.md
   - Prompt mode: edit the prompt string
   - Article mode: edit the article text
2. Test: For each test case, simulate what output the content would produce.
3. Score: Apply each active checklist question (0 or 1 per question). Score = (passed / total) × 100.
4. Decide: If new score ≥ best score → keep the mutation. If lower → revert.
5. Log: Round number, mutation type, score, keep/revert decision.
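The per-round procedure above can be sketched as a simple hill-climbing loop. This is an illustration under stated assumptions, not the skill's actual implementation: `score`, `run_rounds`, `Check`, and `Mutator` are hypothetical names, each checklist question is modeled as a function returning True or False, and the Test step is folded into those functions rather than simulating outputs per test case.

```python
from typing import Callable

Check = Callable[[str], bool]                # one active checklist question (assumed shape)
Mutator = Callable[[str], tuple[str, str]]   # returns (mutation type, edited content)


def score(content: str, checks: list[Check]) -> float:
    """Score = (passed / total) x 100, with 0 or 1 point per question."""
    if not checks:
        raise ValueError("at least one checklist question must be active")
    passed = sum(1 for check in checks if check(content))
    return passed / len(checks) * 100


def run_rounds(content: str, checks: list[Check], mutate: Mutator, rounds: int):
    """Run `rounds` iterations of mutate, score, decide, log."""
    best, best_score = content, score(content, checks)
    log = []
    for round_no in range(1, rounds + 1):
        mutation_type, candidate = mutate(best)   # one small edit per round
        new_score = score(candidate, checks)
        kept = new_score >= best_score            # ties are kept, lower scores reverted
        if kept:
            best, best_score = candidate, new_score
        log.append({"round": round_no, "type": mutation_type,
                    "score": new_score, "kept": kept})
    return best, best_score, log
```

Because a mutation is kept when the score is equal as well as higher, neutral edits accumulate; only strict regressions are reverted.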

Mutation types (pick one per round):

| Type | Description |
|------|-------------|
| A | Add a constraint rule |
| B | Strengthen trigger/coverage |
| C | Add a concrete example |
| D | Tighten vague language |
| E | Improve error/edge case handling |
| F | Remove redundant content |
| G | Improve transitions |
| H | Expand a thin section |
| I | Add cross-reference |
| J | Adjust degree-of-freedom |

Step 5 — Report Results

After each batch (30 rounds):

Batch N (rounds X-Y):
  Best score: XX%
  Mutations kept: N  |  Reverted: N
  Most effective types: [list top 2-3]
Accumulated improvements: [summary]
Continue? (yes/stop)
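As a rough sketch, the batch summary above could be produced from a round log like this. The function `format_batch_report` and the log entry keys (`round`, `type`, `kept`) are assumptions for illustration, and "most effective" is interpreted here as the mutation types kept most often, which the text does not define.

```python
from collections import Counter


def format_batch_report(batch_no: int, log: list[dict], best_score: float, summary: str) -> str:
    """Render the batch summary template from a round log (hypothetical helper)."""
    if not log:
        raise ValueError("log must contain at least one round")
    kept = [entry for entry in log if entry["kept"]]
    reverted = len(log) - len(kept)
    # Assumption: effectiveness is measured by how often a type was kept.
    top_types = [t for t, _ in Counter(e["type"] for e in kept).most_common(3)]
    first, last = log[0]["round"], log[-1]["round"]
    return "\n".join([
        f"Batch {batch_no} (rounds {first}-{last}):",
        f"  Best score: {best_score:.0f}%",
        f"  Mutations kept: {len(kept)}  |  Reverted: {reverted}",
        f"  Most effective types: {', '.join(top_types) or 'none'}",
        f"Accumulated improvements: {summary}",
        "Continue? (yes/stop)",
    ])
```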

After full completion:

- Original score vs. final score
- Top 3 most impactful mutations
- Final improved content (inline or diff)
- File path (skill mode only)

Mutation Strategy Reference

High-impact, low-risk changes:

- Adding explicit constraints where the content is vague
- Expanding coverage to cover edge cases
- Adding concrete examples to abstract instructions
- Tightening soft language ("try to" → "must")

Avoid in one round:

- Large rewrites of entire sections
- Multiple unrelated changes at once
- Changing fundamental scope or purpose

See references/mutation_strategies.md for the full strategy guide.

Mode Selection Quick Reference

| User says | Mode |
|-----------|------|
| "optimize [skill]" / "autoresearch [skill]" | Skill |
| "optimize this prompt" / "improve my prompt" | Prompt |
| "polish this article" / "improve this article" | Article |
| "optimize this document" | Article |

Default to Prompt mode if the input is a text string without a skill path.
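The quick-reference rules, including the Prompt default, might be expressed as a small dispatcher. This is a sketch under assumptions: `detect_mode` and `known_skills` are hypothetical names, the phrase lists are copied from the table, and a skill is recognized only by a known skill name or a path containing `SKILL.md`.

```python
from __future__ import annotations

ARTICLE_PHRASES = ("polish this article", "improve this article", "optimize this document")
PROMPT_PHRASES = ("optimize this prompt", "improve my prompt")


def detect_mode(message: str, known_skills: frozenset[str] = frozenset()) -> str:
    """Map a user message to 'skill', 'prompt', or 'article' (hypothetical helper)."""
    text = message.lower()
    if any(phrase in text for phrase in ARTICLE_PHRASES):
        return "article"
    if any(phrase in text for phrase in PROMPT_PHRASES):
        return "prompt"
    # Skill mode needs something that identifies a skill: a path or a known name.
    words = set(text.replace("/", " ").split())
    if "skill.md" in text or words & {name.lower() for name in known_skills}:
        return "skill"
    # Plain text without a skill path falls back to Prompt mode.
    return "prompt"
```

A real agent would confirm the detected mode with the user, as Step 1 requires, rather than acting on the guess directly.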

Automatically improve OpenClaw skills, prompts, or articles through iterative mutation-testing loops. Inspired by Karpathy's autoresearch. Use when user says 'optimize [skill]', 'autoresearch [skill]', 'improve my skill', 'optimize this prompt', 'improve my prompt', 'polish this article', 'improve this article', or explicitly requests quality improvement for any text-based content. Supports three modes: skill (SKILL.md files), prompt (any prompt text), and article (any document).

3,729 stars

testing

Updated Apr 10, 2026

npx skillsauth add openclaw/skills autoresearch-pro

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

| Scanner | Description | Status | Reported % |
|---------|-------------|--------|------------|
| Trivy | Container and dependency vulnerability scanner | Clean | 95% |
| Semgrep | Static code analysis for vulnerabilities | Clean | 95% |
| mcp-scan (Snyk) | Model Context Protocol security validation | Clean | 95% |
| Snyk (dep) | Open source security scanning | Skipped | 50% |
| Socket.dev | Supply chain security analysis | Skipped | 50% |
| VirusTotal | Multi-engine malware detection | Skipped | 50% |
| CrowdStrike | Advanced threat intelligence | Skipped | 50% |
| OSV-Scanner | Open Source Vulnerability database check | Skipped | 50% |
| OWASP Dep-Check | | Skipped | 50% |

Last scanned: Apr 3, 2026, 12:29 AM · 203.1s · 1 file scanned

Install manually

# Clone the repo
git clone https://github.com/openclaw/skills.git

# Copy into Claude Code skills folder (global), creating it if needed
mkdir -p ~/.claude/skills
cp -r skills/0xcjl/autoresearch-pro ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

openclaw/skills

3,729 stars

Compatible with: Claude Code, OpenAI Codex CLI, ChatGPT