Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

aaaaqwq/experiment-runner-run

Name: experiment-runner-run
Author: aaaaqwq

skills/claude-skills-open/skills/agents/experiment-runner-run/SKILL.md

npx skillsauth add aaaaqwq/agi-super-team experiment-runner-run

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Experiment Runner (proj-012)

Adaptive experiment runner for survival arena. Runs experiments one by one, analyzes results, commits. Between experiments -- human/AI decides what to change next.

Workflow (adaptive)

1. --status        → view what's been done
2. --next          → run the next pending experiment
3. Analysis        → view analysis.json, understand the result
4. Decision        → what to change? (only environment conditions, not behavior)
5. Edit YAML       → modify the next experiment or add a new one
6. Repeat from #2

Rule: we only change conditions (pressure, resources, architecture). Never hardcode agent behavior.

When to use

"run experiment" / "next experiment"
"what is the experiment status"
"run EXP-011c"
"run the next 2 experiments"
proj-012 experiment pipeline

Dependencies

Python 3, PyYAML (pip install pyyaml)
Claude CLI (for LLM queries in the arena)
Git (for committing results)

Paths

| What | Path | |------|------| | Arena | $AGENTS_PATH/survival-arena/arena.py | | Orchestrator | $AGENTS_PATH/survival-arena/run_experiments.py | | Plan (YAML) | $AGENTS_PATH/survival-arena/experiments.yaml | | Results | $AGENTS_PATH/survival-arena/experiment_results.json | | Logs | $AGENTS_PATH/survival-arena/logs/experiments/ | | Documentation | $PROJECT_ROOT/projects/docs/proj-012-agi-consciousness/ |

How to execute

Next experiment (main mode)

cd $AGENTS_PATH/survival-arena
python3 run_experiments.py --next

Next N experiments

python3 run_experiments.py --next 2

View status

python3 run_experiments.py --status

Run a specific experiment

python3 run_experiments.py --experiment EXP-011c

Run all pending (batch mode)

python3 run_experiments.py --resume

View commands without running

python3 run_experiments.py --dry-run --next 3

Check arena config

python3 arena.py --config-dump --upkeep-base 0 --architecture single

arena.py parameters

| Parameter | Description | Default | |-----------|-------------|---------| | --upkeep-base N | Pressure: maintenance cost per turn | 2 | | --regen-rate N | Resource regeneration rate per turn | 3 | | --num-nodes N | Number of resource nodes (distributed across clusters) | 12 | | --child-ratio F | Child token share | 0.35 | | --repro-threshold N | Reproduction threshold | 120 | | --repro-cost N | Reproduction cost | 70 | | --architecture TYPE | single / dual-same / dual-split / dual-kahneman | dual-kahneman | | --experiment-id ID | Identifier for logs | - | | --config-dump | Show config as JSON and exit | - | | --model MODEL | haiku / sonnet | sonnet | | --turns N | Number of turns | 50 | | --seed N | Random seed | - | | --parallel N | Parallel LLM calls | 4 |

run_experiments.py parameters

| Parameter | Description | |-----------|-------------| | --next [N] | Run next N pending (default: 1) | | --status | Show status and exit | | --phase P2 | Run a specific phase | | --experiment EXP-011c | Run a single experiment | | --resume | Run all pending (batch) | | --dry-run | Show commands without executing | | --plan FILE | Path to experiments.yaml |

Phases (roadmap, adapts as we go)

| Phase | What we test | Initial experiments | |-------|-------------|---------------------| | P1 | Validation of v4.2c (map + clusters) | 3 | | P2 | Yerkes-Dodson (pressure) | 5 | | P3 | Architecture (phase transition) | 4 | | P4 | Emergent parenting | 3 | | P5 | Model phenotypes | 4 | | P6 | Long evolution (200t) | 2 |

What the orchestrator does for each experiment

Reads config from experiments.yaml (merge: defaults < phase < experiment)
Builds CLI command for arena.py
Runs subprocess, timeout 2 hours
Analyzes JSONL
Saves to logs/experiments/EXP-XXX/ (config.json, analysis.json, console.txt)
Updates experiment_results.json
Commits to git
On error -- retries 2 times, 30s backoff

Analysis results

For each experiment computes:

Shannon entropy (action distribution diversity)
Social action % (TRADE + COMMUNICATE + REPRODUCE)
MOVE+GATHER % (survival focus)
GATHER success rate (v4.2c: do agents understand the map)
MOVE % (migration to clusters)
Dual-system distribution (panic/normal/strategic %)
Parent-child trades
NAP detection (alliance/pact/peace keywords)
Population dynamics (start/end/max/min)
Reproductions count
Max generation reached

Troubleshooting

| Problem | Solution | |---------|----------| | pyyaml not found | pip3 install pyyaml | | Timeout on 200t experiment | Increase timeout in run_experiments.py (7200 -> 14400) | | Rate limit from API | Decrease --parallel (4 -> 2) | | Cannot find log | Check that arena.py creates a file in logs/ | | Git commit failed | Check that you're on main, no conflicts |

Related skills

git-workflow -- commit procedure

aaaaqwq/experiment-runner-run

skills/claude-skills-open/skills/agents/experiment-runner-run/SKILL.md

Run survival arena experiments

22 stars

tools

Updated Mar 26, 2026

$ install --global

skillsauth

npx skillsauth add aaaaqwq/agi-super-team experiment-runner-run

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 28, 2026, 11:33 PM133.3s1 file scanned

SKILL.md

name:: experiment-runner-run
description:: Run survival arena experiments

Experiment Runner (proj-012)

Adaptive experiment runner for survival arena. Runs experiments one by one, analyzes results, commits. Between experiments -- human/AI decides what to change next.

Workflow (adaptive)

1. --status        → view what's been done
2. --next          → run the next pending experiment
3. Analysis        → view analysis.json, understand the result
4. Decision        → what to change? (only environment conditions, not behavior)
5. Edit YAML       → modify the next experiment or add a new one
6. Repeat from #2

Rule: we only change conditions (pressure, resources, architecture). Never hardcode agent behavior.

When to use

"run experiment" / "next experiment"
"what is the experiment status"
"run EXP-011c"
"run the next 2 experiments"
proj-012 experiment pipeline

Dependencies

Python 3, PyYAML (pip install pyyaml)
Claude CLI (for LLM queries in the arena)
Git (for committing results)

Paths

How to execute

Next experiment (main mode)

cd $AGENTS_PATH/survival-arena
python3 run_experiments.py --next

Next N experiments

python3 run_experiments.py --next 2

View status

python3 run_experiments.py --status

Run a specific experiment

python3 run_experiments.py --experiment EXP-011c

Run all pending (batch mode)

python3 run_experiments.py --resume

View commands without running

python3 run_experiments.py --dry-run --next 3

Check arena config

python3 arena.py --config-dump --upkeep-base 0 --architecture single

arena.py parameters

run_experiments.py parameters

Phases (roadmap, adapts as we go)

What the orchestrator does for each experiment

Reads config from experiments.yaml (merge: defaults < phase < experiment)
Builds CLI command for arena.py
Runs subprocess, timeout 2 hours
Analyzes JSONL
Saves to logs/experiments/EXP-XXX/ (config.json, analysis.json, console.txt)
Updates experiment_results.json
Commits to git
On error -- retries 2 times, 30s backoff

Analysis results

For each experiment computes:

Shannon entropy (action distribution diversity)
Social action % (TRADE + COMMUNICATE + REPRODUCE)
MOVE+GATHER % (survival focus)
GATHER success rate (v4.2c: do agents understand the map)
MOVE % (migration to clusters)
Dual-system distribution (panic/normal/strategic %)
Parent-child trades
NAP detection (alliance/pact/peace keywords)
Population dynamics (start/end/max/min)
Reproductions count
Max generation reached

Troubleshooting

Related skills

git-workflow -- commit procedure

Related Skills

aaaaqwq/code-exemplars-blueprint-generator

development

VerifiedTrustedCommunity

Technology-agnostic prompt generator that creates customizable AI prompts for scanning codebases and identifying high-quality code exemplars. Supports multiple programming languages (.NET, Java, JavaScript, TypeScript, React, Angular, Python) with configurable analysis depth, categorization methods, and documentation formats to establish coding standards and maintain consistency across development teams.

37SKILL.mdUpdated Apr 11, 2026

aaaaqwq/code-exemplars-blueprint-generator

aaaaqwq/chrome-devtools

tools

VerifiedTrustedCommunity

Expert-level browser automation, debugging, and performance analysis using Chrome DevTools MCP. Use for interacting with web pages, capturing screenshots, analyzing network traffic, and profiling performance.

37SKILL.mdUpdated Apr 11, 2026

aaaaqwq/chrome-devtools

aaaaqwq/breakdown-feature-implementation

data-ai

VerifiedTrustedCommunity

Prompt for creating detailed feature implementation plans, following Epoch monorepo structure.

37SKILL.mdUpdated Apr 11, 2026

aaaaqwq/breakdown-feature-implementation

aaaaqwq/boost-prompt

tools

VerifiedTrustedCommunity

Interactive prompt refinement workflow: interrogates scope, deliverables, constraints; copies final markdown to clipboard; never writes code. Requires the Joyride extension.

37SKILL.mdUpdated Apr 11, 2026

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/aaaaqwq/agi-super-team.git

# Copy into Claude Code skills folder (global)
cp -r agi-super-team/skills/claude-skills-open/skills/agents/experiment-runner-run ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

aaaaqwq/agi-super-team

22 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT