obolnetwork/autoresearch

Name: autoresearch
Author: obolnetwork

internal/embed/skills/autoresearch/SKILL.md

npx skillsauth add obolnetwork/obol-stack autoresearch

Autoresearch

Autonomous LLM optimization: the agent iterates on train.py, runs 5-minute GPU experiments, measures validation bits-per-byte (val_bpb), and publishes the best checkpoint as a sellable Ollama model.

When to Use

Optimizing a base model for a specific domain or task
Running automated training experiments to improve val_bpb
Publishing an optimized model checkpoint to Ollama
Selling an optimized model via x402 payment-gated inference

When NOT to Use

Selling an existing model without optimization — use sell
Buying remote inference — use buy-x402
Cluster diagnostics — use obol-stack

Quick Start

1. Prepare Data

Place your training and validation data in the autoresearch working directory:

autoresearch/
  train.bin        # training data (tokenized)
  val.bin          # validation data (tokenized)
  train.py         # training script (agent modifies this)
  results.tsv      # experiment log (appended by each run)

2. Run Experiments

The agent modifies train.py and runs experiments in a loop. Each experiment:

Has a 5-minute time budget on GPU
Produces a checkpoint and a val_bpb measurement
Is tracked as a git commit with status (keep/discard) in results.tsv
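
One way to enforce the 5-minute budget is to run training as a subprocess with a timeout. The sketch below is illustrative only: `run_experiment` is a hypothetical helper, and the source does not say how the skill actually launches or stops `train.py`.

```python
import subprocess
import sys

BUDGET_SECONDS = 5 * 60  # the 5-minute budget described above


def run_experiment(cmd, budget_seconds=BUDGET_SECONDS):
    """Run one training command; return (finished, returncode).

    finished is False when the run exceeded the budget and was killed,
    in which case returncode is None.
    """
    try:
        proc = subprocess.run(cmd, timeout=budget_seconds)
    except subprocess.TimeoutExpired:
        return False, None
    return True, proc.returncode


# Hypothetical usage (assumes train.py is in the current directory):
#   finished, code = run_experiment([sys.executable, "train.py"])
```

A run that times out produces no usable measurement, so a caller would presumably record it with a discard status.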

The results.tsv file is tab-separated with columns:

commit_hash	val_bpb	status	description
a1b2c3d	1.042	keep	baseline transformer
e4f5g6h	1.038	keep	added RMSNorm
i7j8k9l	1.051	discard	unstable lr schedule
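
Because the log is plain TSV with a header row, picking the winner takes only the standard library. The following is a minimal sketch of that selection (keep rows only, lowest val_bpb wins), not the actual code in publish.py; `select_best` is a hypothetical name.

```python
import csv
import io


def select_best(tsv_text):
    """Return the 'keep' row with the lowest val_bpb, or None if none exist."""
    reader = csv.DictReader(
        io.StringIO(tsv_text), delimiter="\t", quoting=csv.QUOTE_NONE
    )
    kept = [row for row in reader if row["status"] == "keep"]
    if not kept:
        return None
    return min(kept, key=lambda row: float(row["val_bpb"]))


# Sample rows copied from the example log above.
SAMPLE = (
    "commit_hash\tval_bpb\tstatus\tdescription\n"
    "a1b2c3d\t1.042\tkeep\tbaseline transformer\n"
    "e4f5g6h\t1.038\tkeep\tadded RMSNorm\n"
    "i7j8k9l\t1.051\tdiscard\tunstable lr schedule\n"
)

best = select_best(SAMPLE)
print(best["commit_hash"], best["val_bpb"])  # e4f5g6h 1.038
```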

3. Publish the Best Model

Once experiments are complete, use publish.py to find the best checkpoint, register it with Ollama, and optionally sell it:

# Publish to Ollama only
python3 scripts/publish.py /path/to/autoresearch

# Publish and sell via x402
python3 scripts/publish.py /path/to/autoresearch \
  --sell \
  --wallet 0xYourWalletAddress \
  --price 0.002 \
  --chain base-sepolia

Commands

| Command | Description |
|---------|-------------|
| publish.py <dir> | Find best experiment, create Ollama model, generate provenance |
| publish.py <dir> --sell --wallet <addr> --price <p> --chain <c> | Publish and sell via obol sell inference |
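
For readers who want to see how these flags fit together, here is a hedged sketch of a matching command-line parser. The flag names come from the table above; the help strings, the choice to keep the price as a string, and the rule that --sell needs all three companion flags are assumptions, not documented behaviour of publish.py.

```python
import argparse


def build_parser():
    parser = argparse.ArgumentParser(prog="publish.py")
    parser.add_argument("directory", help="autoresearch working directory")
    parser.add_argument("--sell", action="store_true", help="also sell via x402")
    parser.add_argument("--wallet", help="wallet address that receives payment")
    # Kept as a string so the value is passed along exactly as typed.
    parser.add_argument("--price", help="price per request, e.g. 0.002")
    parser.add_argument("--chain", help="chain name, e.g. base-sepolia")
    return parser


def parse_args(argv):
    parser = build_parser()
    args = parser.parse_args(argv)
    # Assumption: selling requires wallet, price, and chain together.
    if args.sell and not (args.wallet and args.price and args.chain):
        parser.error("--sell requires --wallet, --price, and --chain")
    return args
```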

How It Works

Experiment loop: The agent edits train.py, runs training for up to 5 minutes, measures val_bpb on the validation set, and commits the result with a keep/discard verdict.
Selection: publish.py reads results.tsv, filters for status=keep, and selects the experiment with the lowest val_bpb (lower is better — fewer bits per byte means better compression / prediction).
Provenance: A JSON provenance file is generated with:
- framework: training framework used
- metricName: metric identifier (val_bpb)
- metricValue: winning metric value as a string
- trainHash: SHA-256 hash of train.py at the winning commit, prefixed with sha256:
- paramCount: model parameter count as a string
- experimentId: git commit hash of the winning experiment
Ollama registration: A Modelfile is generated from the checkpoint and ollama create registers the model locally.
Sell (optional): If --sell is passed, publish.py runs obol sell inference with the --provenance-file flag pointing at the provenance JSON, so buyers can verify the optimization lineage.
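
The provenance fields listed above can be assembled with `hashlib` and `json`. This is a sketch under stated assumptions: `build_provenance` is a hypothetical helper, the `sha256:` prefix is written as `sha256:<hex digest>`, and the framework name and parameter count are placeholder values.

```python
import hashlib
import json


def build_provenance(train_source, val_bpb, param_count, commit_hash, framework):
    """Assemble the provenance fields as a JSON-ready dict.

    train_source is the raw bytes of train.py at the winning commit.
    """
    digest = hashlib.sha256(train_source).hexdigest()
    return {
        "framework": framework,
        "metricName": "val_bpb",
        "metricValue": str(val_bpb),      # stored as a string
        "trainHash": "sha256:" + digest,  # assumed prefix format
        "paramCount": str(param_count),   # stored as a string
        "experimentId": commit_hash,
    }


provenance = build_provenance(
    train_source=b"print('train')\n",  # stand-in for the real train.py bytes
    val_bpb="1.038",
    param_count=124_000_000,           # hypothetical parameter count
    commit_hash="e4f5g6h",
    framework="pytorch",               # hypothetical; the source names no framework
)
print(json.dumps(provenance, indent=2))
```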

Architecture

Agent (autoresearch loop)
  |
  +-- edit train.py
  +-- run experiment (5-min budget)
  +-- measure val_bpb
  +-- commit results.tsv
  |
  v
publish.py
  |
  +-- read results.tsv → best experiment
  +-- git show <commit>:train.py → SHA-256 trainHash
  +-- generate provenance.json
  +-- generate Modelfile → ollama create
  +-- (optional) obol sell inference --provenance-file
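
The `git show <commit>:train.py` step matters because it hashes the file as committed, not whatever currently sits in the working tree. A minimal sketch follows, assuming train.py lives at the repository root; `train_hash_at_commit` is a hypothetical name, and the `run` parameter exists only so the function can be exercised without a real repository.

```python
import hashlib
import subprocess


def train_hash_at_commit(repo_dir, commit, run=subprocess.run):
    """Return the SHA-256 hex digest of train.py as stored at `commit`."""
    result = run(
        ["git", "-C", str(repo_dir), "show", f"{commit}:train.py"],
        capture_output=True,  # collect the file contents from stdout
        check=True,           # raise if the commit or path does not exist
    )
    return hashlib.sha256(result.stdout).hexdigest()


# Hypothetical usage:
#   digest = train_hash_at_commit("/path/to/autoresearch", "a1b2c3d")
```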

Constraints

Python stdlib + uv — no pip install; uv for environment management
5-minute time budget — each experiment must complete within 5 minutes
GPU required — training runs on local GPU (Ollama must have GPU access)
GGUF checkpoint required — Ollama only accepts GGUF format; convert other formats (.pt, .safetensors) with llama.cpp/convert_hf_to_gguf.py
Git repo required — autoresearch directory must be a git repository for commit tracking
results.tsv format — tab-separated: commit_hash, val_bpb, status, description
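
Several of these constraints can be checked before publishing. The sketch below is a hypothetical preflight helper, not part of the skill. It relies on the fact that GGUF files begin with the four bytes `GGUF`, and it simplifies the git check to looking for `.git` directly in the working directory, so a directory nested inside a larger repository would be reported as a problem.

```python
from pathlib import Path

EXPECTED_HEADER = ["commit_hash", "val_bpb", "status", "description"]


def preflight(workdir, checkpoint):
    """Return a list of problems; an empty list means the checks passed."""
    workdir, checkpoint = Path(workdir), Path(checkpoint)
    problems = []

    if not (workdir / ".git").exists():
        problems.append("working directory is not a git repository")

    results = workdir / "results.tsv"
    if not results.is_file():
        problems.append("results.tsv is missing")
    else:
        with results.open(encoding="utf-8") as handle:
            header = handle.readline().rstrip("\r\n").split("\t")
        if header != EXPECTED_HEADER:
            problems.append("results.tsv header does not match the expected columns")

    if not checkpoint.is_file():
        problems.append("checkpoint file is missing")
    else:
        with checkpoint.open("rb") as handle:
            if handle.read(4) != b"GGUF":  # GGUF magic bytes
                problems.append("checkpoint is not a GGUF file")

    return problems
```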

OASF Registration

When registering an autoresearch-optimized model on-chain via ERC-8004:

Skills: devops_mlops/model_versioning
Domains: research_and_development/scientific_research

References

references/autoresearch-overview.md — val_bpb metric, time budget, and the train.py modification loop

Run autonomous LLM optimization experiments (autoresearch) and publish optimized models for paid inference via x402.

6 stars

data-ai

Updated May 2, 2026

npx skillsauth add obolnetwork/obol-stack autoresearch

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Trivy (container and dependency vulnerability scanner): Clean, 95%
Semgrep (static code analysis for vulnerabilities): Clean, 95%
mcp-scan (Snyk) (Model Context Protocol security validation): Clean, 95%
Snyk (dep) (open source security scanning): Skipped, 50%
Socket.dev (supply chain security analysis): Skipped, 50%
VirusTotal (multi-engine malware detection): Skipped, 50%
CrowdStrike (advanced threat intelligence): Skipped, 50%
OSV-Scanner (Open Source Vulnerability database check): Skipped, 50%
OWASP Dep-Check: Skipped, 50%

Last scanned: May 2, 2026, 6:36 AM (103.6s, 2 files scanned)

SKILL.md

name: autoresearch
description: Run autonomous LLM optimization experiments (autoresearch) and publish optimized models for paid inference via x402.
metadata: { "openclaw": { "emoji": "🧪", "requires": { "bins": ["python3", "uv"] } } }

Related Skills

obolnetwork/agent-factory

data-ai

Spawn durable child Hermes agents from inside Obol Stack. Creates child namespaces, optional profile/env Secrets, Agent CRDs, and optional ServiceOffers for x402-paid child services.

SKILL.md, updated May 22, 2026

obolnetwork/buy-x402

data-ai

Buy from any x402-gated endpoint. Two flows: `pay` for one-shot HTTP services (single authorization, no sidecar), and `buy` for long-running paid inference (pre-authorized batch via PurchaseRequest, exposed as `paid/<remote-model>`). Supports USDC (EIP-3009) and OBOL (Permit2). Zero signer access at runtime: spending is capped by design and nothing moves on-chain until a voucher is spent.

SKILL.md, updated May 2, 2026

obolnetwork/sell

testing

Sell access to services via x402 payment gating. Create ServiceOffer CRDs that automatically health-check upstreams, create payment-gated routes, and optionally pull models and register on ERC-8004. Supports inference, HTTP, and fine-tuning service types.

SKILL.md, updated Apr 20, 2026

obolnetwork/monetize-guide

testing

End-to-end guide for monetizing GPU resources or HTTP services through obol-stack. Covers pre-flight checks, model detection, pricing research, selling via x402, ERC-8004 registration, and verification. Use this skill when the user wants to monetize their machine.

SKILL.md, updated Apr 20, 2026

Install manually

# Clone the repo
git clone https://github.com/obolnetwork/obol-stack.git

# Copy into Claude Code skills folder (global)
cp -r obol-stack/internal/embed/skills/autoresearch ~/.claude/skills/

Repository

obolnetwork/obol-stack

6 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT