skills/claude-skills-open/skills/agents/experiment-runner-run/SKILL.md
Run survival arena experiments
npx skillsauth add aaaaqwq/agi-super-team experiment-runner-run

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
Adaptive experiment runner for the survival arena. It runs experiments one at a time, analyzes the results, and commits them. Between experiments, a human or AI decides what to change next.
1. --status → view what's been done
2. --next → run the next pending experiment
3. Analysis → view analysis.json, understand the result
4. Decision → what to change? (only environment conditions, not behavior)
5. Edit YAML → modify the next experiment or add a new one
6. Repeat from #2
Rule: we only change conditions (pressure, resources, architecture). Never hardcode agent behavior.
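
The rule above can be checked mechanically before an experiment is added to the plan. The following is a minimal sketch, assuming each experiment's configuration is available as a flat dict whose keys mirror the arena.py flags. The key names, the `CONDITION_KEYS` set, and both helper functions are illustrative and not taken from the project.

```python
# Hypothetical whitelist: key names mirror the arena.py flags, but treating
# exactly these keys as "environment conditions" is an assumption.
CONDITION_KEYS = {
    "upkeep_base",      # pressure
    "regen_rate",       # resources
    "num_nodes",        # resources
    "child_ratio",      # reproduction economics
    "repro_threshold",
    "repro_cost",
    "architecture",     # architecture
}


def changed_keys(previous: dict, proposed: dict) -> set:
    """Return the keys whose values differ between two configs."""
    all_keys = set(previous) | set(proposed)
    return {k for k in all_keys if previous.get(k) != proposed.get(k)}


def violates_rule(previous: dict, proposed: dict) -> set:
    """Return the changed keys that are not environment conditions."""
    return changed_keys(previous, proposed) - CONDITION_KEYS


baseline = {"upkeep_base": 2, "regen_rate": 3, "architecture": "dual-kahneman"}
harsher = {**baseline, "upkeep_base": 4}                           # condition change
scripted = {**baseline, "agent_prompt": "always share resources"}  # behavior change

print(violates_rule(baseline, harsher))   # set()
print(violates_rule(baseline, scripted))  # {'agent_prompt'}
```

An empty set means the proposed experiment only changes conditions. Any other result names the keys that would hardcode behavior.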
Requires pyyaml (pip install pyyaml)

| What | Path |
|------|------|
| Arena | $AGENTS_PATH/survival-arena/arena.py |
| Orchestrator | $AGENTS_PATH/survival-arena/run_experiments.py |
| Plan (YAML) | $AGENTS_PATH/survival-arena/experiments.yaml |
| Results | $AGENTS_PATH/survival-arena/experiment_results.json |
| Logs | $AGENTS_PATH/survival-arena/logs/experiments/ |
| Documentation | $PROJECT_ROOT/projects/docs/proj-012-agi-consciousness/ |
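
The paths in the table can be resolved from the environment before running anything. The following is a minimal sketch, assuming `AGENTS_PATH` is set as an environment variable. The function names and dictionary keys are hypothetical, and the documentation path is omitted because it depends on a separate `PROJECT_ROOT` variable.

```python
import os
from pathlib import Path


def arena_paths(agents_path=None) -> dict:
    """Map the table's entries to concrete paths under survival-arena/."""
    root = agents_path or os.environ.get("AGENTS_PATH")
    if not root:
        raise RuntimeError("AGENTS_PATH is not set")
    base = Path(root) / "survival-arena"
    return {
        "arena": base / "arena.py",
        "orchestrator": base / "run_experiments.py",
        "plan": base / "experiments.yaml",
        "results": base / "experiment_results.json",
        "logs": base / "logs" / "experiments",
    }


def missing_paths(agents_path=None) -> list:
    """List the entries that do not exist on disk."""
    paths = arena_paths(agents_path)
    return [name for name, path in paths.items() if not path.exists()]
```

Calling `missing_paths()` before a run gives an early warning when the arena checkout is incomplete. The results file and logs directory may legitimately be missing before the first experiment.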
cd $AGENTS_PATH/survival-arena
python3 run_experiments.py --next
python3 run_experiments.py --next 2
python3 run_experiments.py --status
python3 run_experiments.py --experiment EXP-011c
python3 run_experiments.py --resume
python3 run_experiments.py --dry-run --next 3
python3 arena.py --config-dump --upkeep-base 0 --architecture single
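
When the runner is driven from another script, the invocations above can be assembled as argument lists. This is a sketch that uses only the flags shown in this document. The `build_command` helper itself is hypothetical and not part of run_experiments.py.

```python
import sys


def build_command(next_n=None, experiment=None, status=False,
                  resume=False, dry_run=False, python=sys.executable):
    """Assemble an argv list for run_experiments.py from the documented flags."""
    cmd = [python, "run_experiments.py"]
    if dry_run:
        cmd.append("--dry-run")
    if status:
        cmd.append("--status")
    if resume:
        cmd.append("--resume")
    if experiment is not None:
        cmd += ["--experiment", experiment]
    if next_n is not None:
        cmd.append("--next")
        if next_n != 1:  # a bare --next already means one experiment
            cmd.append(str(next_n))
    return cmd


print(build_command(next_n=3, dry_run=True, python="python3"))
# ['python3', 'run_experiments.py', '--dry-run', '--next', '3']
```

The resulting list can be passed to `subprocess.run(cmd, cwd=...)` with the survival-arena directory as the working directory, which matches the `cd` step above.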
| Parameter | Description | Default |
|-----------|-------------|---------|
| --upkeep-base N | Pressure: maintenance cost per turn | 2 |
| --regen-rate N | Resource regeneration rate per turn | 3 |
| --num-nodes N | Number of resource nodes (distributed across clusters) | 12 |
| --child-ratio F | Child token share | 0.35 |
| --repro-threshold N | Reproduction threshold | 120 |
| --repro-cost N | Reproduction cost | 70 |
| --architecture TYPE | single / dual-same / dual-split / dual-kahneman | dual-kahneman |
| --experiment-id ID | Identifier for logs | - |
| --config-dump | Show config as JSON and exit | - |
| --model MODEL | haiku / sonnet | sonnet |
| --turns N | Number of turns | 50 |
| --seed N | Random seed | - |
| --parallel N | Parallel LLM calls | 4 |
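
To make the defaults concrete, the table can be mirrored as an argparse parser. This is a reconstruction from the table only, not the actual parser in arena.py. The argument types (integers for `N`, float for `F`) are assumptions, and the real `--config-dump` output may contain other fields.

```python
import argparse
import json


def build_parser() -> argparse.ArgumentParser:
    """Parser that mirrors the documented arena.py flags and defaults."""
    parser = argparse.ArgumentParser(description="Sketch of the arena.py flags")
    parser.add_argument("--upkeep-base", type=int, default=2)
    parser.add_argument("--regen-rate", type=int, default=3)
    parser.add_argument("--num-nodes", type=int, default=12)
    parser.add_argument("--child-ratio", type=float, default=0.35)
    parser.add_argument("--repro-threshold", type=int, default=120)
    parser.add_argument("--repro-cost", type=int, default=70)
    parser.add_argument(
        "--architecture", default="dual-kahneman",
        choices=["single", "dual-same", "dual-split", "dual-kahneman"])
    parser.add_argument("--experiment-id", default=None)
    parser.add_argument("--config-dump", action="store_true")
    parser.add_argument("--model", default="sonnet", choices=["haiku", "sonnet"])
    parser.add_argument("--turns", type=int, default=50)
    parser.add_argument("--seed", type=int, default=None)
    parser.add_argument("--parallel", type=int, default=4)
    return parser


# Same flags as the --config-dump example in the command list.
args = build_parser().parse_args(
    ["--config-dump", "--upkeep-base", "0", "--architecture", "single"])
config = {k: v for k, v in vars(args).items() if k != "config_dump"}
print(json.dumps(config, indent=2))
```

Flags that are not passed keep the defaults from the table, so an experiment only needs to state the conditions it changes.
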
| Parameter | Description |
|-----------|-------------|
| --next [N] | Run next N pending (default: 1) |
| --status | Show status and exit |
| --phase P2 | Run a specific phase |
| --experiment EXP-011c | Run a single experiment |
| --resume | Run all pending (batch) |
| --dry-run | Show commands without executing |
| --plan FILE | Path to experiments.yaml |
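
The selection behind `--next [N]` can be pictured as taking the first N planned experiments that have no result yet. This is a sketch of that idea only. The actual schema of experiments.yaml and experiment_results.json is not shown in this document, so the plain ID lists below are assumptions, and every ID except EXP-011c is made up.

```python
def next_pending(plan_ids, completed_ids, n=1):
    """Return the first n planned experiment IDs that are not yet completed."""
    done = set(completed_ids)
    pending = [exp_id for exp_id in plan_ids if exp_id not in done]
    return pending[:n]


plan = ["EXP-011a", "EXP-011b", "EXP-011c", "EXP-012a"]  # illustrative plan order
completed = ["EXP-011a"]

print(next_pending(plan, completed))     # ['EXP-011b']
print(next_pending(plan, completed, 2))  # ['EXP-011b', 'EXP-011c']
```

Keeping the plan order is what makes the adaptive loop work: editing the YAML between runs changes what the next call picks up.
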
| Phase | What we test | Initial experiments |
|-------|-------------|---------------------|
| P1 | Validation of v4.2c (map + clusters) | 3 |
| P2 | Yerkes-Dodson (pressure) | 5 |
| P3 | Architecture (phase transition) | 4 |
| P4 | Emergent parenting | 3 |
| P5 | Model phenotypes | 4 |
| P6 | Long evolution (200t) | 2 |
logs/experiments/EXP-XXX/ (config.json, analysis.json, console.txt)

For each experiment computes:
| Problem | Solution |
|---------|----------|
| pyyaml not found | pip3 install pyyaml |
| Timeout on 200t experiment | Increase timeout in run_experiments.py (7200 -> 14400) |
| Rate limit from API | Decrease --parallel (4 -> 2) |
| Cannot find log | Check that arena.py creates a file in logs/ |
| Git commit failed | Check that you're on main, no conflicts |
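
The timeout and rate-limit rows can be illustrated with a small wrapper. This is a sketch under assumptions: how run_experiments.py actually enforces its timeout is not shown here, and halving `--parallel` is just one way to get from 4 to 2. Both function names are hypothetical.

```python
import subprocess
import sys


def run_with_timeout(cmd, timeout_s=7200):
    """Run a command; return its exit code, or None if it timed out."""
    try:
        completed = subprocess.run(cmd, timeout=timeout_s)
    except subprocess.TimeoutExpired:
        return None
    return completed.returncode


def reduced_parallel(current, minimum=1):
    """Halve the --parallel value after a rate-limit error, e.g. 4 -> 2."""
    return max(minimum, current // 2)


# A trivial child process stands in for a real experiment run.
print(run_with_timeout([sys.executable, "-c", "pass"], timeout_s=30))  # 0
print(reduced_parallel(4))  # 2
```

For a 200-turn experiment, the same wrapper would be called with `timeout_s=14400`, matching the increase suggested in the table.
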
git-workflow -- commit procedure