adaptationio/auto-claude-optimization

Name: auto-claude-optimization
Author: adaptationio

.claude/skills/auto-claude-optimization/SKILL.md

npx skillsauth add adaptationio/skrillz auto-claude-optimization

Auto-Claude Optimization

Performance tuning, cost reduction, and efficiency improvements.

Performance Overview

Key Metrics

| Metric | Impact | Optimization |
|--------|--------|--------------|
| API latency | Build speed | Model selection, caching |
| Token usage | Cost | Prompt efficiency, context limits |
| Memory queries | Speed | Embedding model, index tuning |
| Build iterations | Time | Spec quality, QA settings |

Model Optimization

Model Selection

| Model | Speed | Cost | Quality | Use Case |
|-------|-------|------|---------|----------|
| claude-opus-4-5-20251101 | Slow | High | Best | Complex features |
| claude-sonnet-4-5-20250929 | Fast | Medium | Good | Standard features |

# Override model in .env
AUTO_BUILD_MODEL=claude-sonnet-4-5-20250929
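
The table above can be reduced to a small lookup, shown here as a rough sketch. The `pick_model` helper and the complexity labels it accepts are assumptions made for illustration and are not part of Auto-Claude; only the model IDs come from the table.

```python
# Hypothetical helper: map a task's complexity to a model ID from the table.
OPUS = "claude-opus-4-5-20251101"      # slow, high cost, best quality
SONNET = "claude-sonnet-4-5-20250929"  # fast, medium cost, good quality


def pick_model(complexity: str) -> str:
    """Return Opus for complex features, Sonnet for everything else."""
    return OPUS if complexity.strip().lower() == "complex" else SONNET


if __name__ == "__main__":
    for level in ("simple", "standard", "complex"):
        # Print a line that could be pasted into .env
        print(f"{level}: AUTO_BUILD_MODEL={pick_model(level)}")
```

Defaulting to Sonnet keeps the cheaper model as the fallback for any label the helper does not recognize.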

Extended Thinking Tokens

Configure thinking budget per agent:

| Agent | Default | Recommended |
|-------|---------|-------------|
| Spec creation | 16000 | Keep default for quality |
| Planning | 5000 | Reduce to 3000 for speed |
| Coding | 0 | Keep disabled |
| QA Review | 10000 | Reduce to 5000 for speed |

# In agent configuration
max_thinking_tokens=5000  # or None to disable
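
As a minimal sketch, the budgets in the table could be kept in one place and looked up per agent. The agent keys (`spec`, `planning`, `coding`, `qa`) and the `thinking_budget` function are hypothetical names; how Auto-Claude stores these values internally is not stated here.

```python
from typing import Optional

# Default and speed-tuned budgets copied from the table above.
# The agent keys are hypothetical labels, not Auto-Claude identifiers.
DEFAULT_BUDGETS = {"spec": 16000, "planning": 5000, "coding": 0, "qa": 10000}
SPEED_BUDGETS = {"spec": 16000, "planning": 3000, "coding": 0, "qa": 5000}


def thinking_budget(agent: str, prefer_speed: bool = False) -> Optional[int]:
    """Return max_thinking_tokens for an agent, or None when thinking is disabled."""
    table = SPEED_BUDGETS if prefer_speed else DEFAULT_BUDGETS
    tokens = table[agent]
    # A budget of 0 means thinking is off, which maps to None
    return tokens if tokens > 0 else None


if __name__ == "__main__":
    for agent in DEFAULT_BUDGETS:
        print(agent, thinking_budget(agent, prefer_speed=True))
```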

Token Optimization

Reduce Context Size

Smaller spec files

# Keep specs concise
# Bad: 5000 word spec
# Good: 500 word spec with clear criteria

Limit codebase scanning

# In context/builder.py
MAX_CONTEXT_FILES = 50  # Reduce from 100

Use targeted searches

# Instead of full codebase scan
# Focus on relevant directories

Efficient Prompts

Optimize system prompts in apps/backend/prompts/:

<!-- Bad: Verbose -->
You are an expert software developer who specializes in building
high-quality, production-ready applications. You have extensive
experience with many programming languages and frameworks...

<!-- Good: Concise -->
Expert full-stack developer. Build production-quality code.
Follow existing patterns. Test thoroughly.

Memory Optimization

# Use efficient embedding model
OPENAI_EMBEDDING_MODEL=text-embedding-3-small

# Or offline with smaller model
OLLAMA_EMBEDDING_MODEL=all-minilm
OLLAMA_EMBEDDING_DIM=384

Speed Optimization

Parallel Execution

# Enable more parallel agents (default: 4)
MAX_PARALLEL_AGENTS=8

Reduce QA Iterations

# Limit QA loop iterations
MAX_QA_ITERATIONS=10  # Default: 50

# Skip QA for quick iterations
python run.py --spec 001 --skip-qa

Faster Spec Creation

# Force simple complexity for quick tasks
python spec_runner.py --task "Fix typo" --complexity simple

# Skip research phase
SKIP_RESEARCH_PHASE=true python spec_runner.py --task "..."

API Timeout Tuning

# Reduce timeout for faster failure detection
API_TIMEOUT_MS=120000  # 2 minutes (default: 10 minutes)

Cost Management

Monitor Token Usage

# Enable cost tracking
ENABLE_COST_TRACKING=true

# View usage report
python usage_report.py --spec 001

Cost Reduction Strategies

Use cheaper models for simple tasks

# For simple specs
AUTO_BUILD_MODEL=claude-sonnet-4-5-20250929 python spec_runner.py --task "..."

Limit context window

MAX_CONTEXT_TOKENS=50000  # Reduce from 100000

Batch similar tasks

# Create specs together, run together
python spec_runner.py --task "Add feature A"
python spec_runner.py --task "Add feature B"
python run.py --spec 001
python run.py --spec 002

Use local models for memory

# Ollama for memory (free)
GRAPHITI_LLM_PROVIDER=ollama
GRAPHITI_EMBEDDER_PROVIDER=ollama

Cost Estimation

| Operation | Estimated Tokens | Cost (Opus) | Cost (Sonnet) |
|-----------|-----------------|-------------|---------------|
| Simple spec | 10k | ~$0.30 | ~$0.06 |
| Standard spec | 50k | ~$1.50 | ~$0.30 |
| Complex spec | 200k | ~$6.00 | ~$1.20 |
| Build (simple) | 50k | ~$1.50 | ~$0.30 |
| Build (standard) | 200k | ~$6.00 | ~$1.20 |
| Build (complex) | 500k | ~$15.00 | ~$3.00 |
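
Every row in the table works out to the same flat rate: about $30 per million tokens for Opus and $6 per million for Sonnet. The sketch below reproduces that arithmetic. The rates are inferred from the table and should be read as blended estimates, not published pricing; `estimate_cost` is a hypothetical helper.

```python
# Per-million-token rates implied by the table (e.g. $0.30 / 10k tokens for Opus).
# These are blended estimates from this document, not published pricing.
RATE_PER_MILLION = {"opus": 30.00, "sonnet": 6.00}


def estimate_cost(tokens: int, model: str) -> float:
    """Estimate USD cost for a token count at the table's implied rate."""
    return round(tokens * RATE_PER_MILLION[model] / 1_000_000, 2)


if __name__ == "__main__":
    # A standard spec (50k tokens) followed by a standard build (200k tokens)
    total = 50_000 + 200_000
    print(estimate_cost(total, "opus"))    # 7.5
    print(estimate_cost(total, "sonnet"))  # 1.5
```

For actual spend, the usage report from cost tracking is the better source, since real costs depend on the input/output token split.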

Memory System Optimization

Embedding Performance

# Faster embeddings
OPENAI_EMBEDDING_MODEL=text-embedding-3-small  # 1536 dim, fast

# Higher quality (slower)
OPENAI_EMBEDDING_MODEL=text-embedding-3-large  # 3072 dim

# Offline (free, no API calls; speed depends on local hardware)
OLLAMA_EMBEDDING_MODEL=all-minilm
OLLAMA_EMBEDDING_DIM=384

Query Optimization

# Limit search results
memory.search("query", limit=10)  # Instead of 100

# Use semantic caching
ENABLE_MEMORY_CACHE=true

Database Maintenance

# Compact database periodically
python -c "from integrations.graphiti.memory import compact_database; compact_database()"

# Clear old episodes
python query_memory.py --cleanup --older-than 30d

Build Efficiency

Spec Quality = Build Speed

High-quality specs reduce iterations:

# Good spec (fewer iterations)
## Acceptance Criteria
- [ ] User can log in with email/password
- [ ] Invalid credentials show error message
- [ ] Successful login redirects to /dashboard
- [ ] Session persists for 24 hours

# Bad spec (more iterations)
## Acceptance Criteria
- [ ] Login works

Subtask Granularity

Optimal subtask size:

Too large: Agent gets stuck, needs recovery
Too small: Overhead per subtask
Optimal: 30-60 minutes of work each

Parallel Work

Let agents spawn subagents for parallel execution:

Main Coder
├── Subagent 1: Frontend (parallel)
├── Subagent 2: Backend (parallel)
└── Subagent 3: Tests (parallel)
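
The fan-out in the diagram can be sketched with a bounded thread pool. This is an illustration of the pattern only: `run_subagent` is a hypothetical stand-in for real subagent work, and the worker cap reuses the default of 4 stated earlier for `MAX_PARALLEL_AGENTS`.

```python
from concurrent.futures import ThreadPoolExecutor

MAX_PARALLEL_AGENTS = 4  # the default stated earlier in this document


def run_subagent(area: str) -> str:
    """Stand-in for real subagent work (hypothetical)."""
    return f"{area}: done"


def run_parallel(areas: list[str]) -> list[str]:
    """Run one subagent per area, capped at MAX_PARALLEL_AGENTS workers."""
    with ThreadPoolExecutor(max_workers=MAX_PARALLEL_AGENTS) as pool:
        # map() preserves input order even if tasks finish out of order
        return list(pool.map(run_subagent, areas))


if __name__ == "__main__":
    print(run_parallel(["Frontend", "Backend", "Tests"]))
```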

Environment Tuning

Optimal .env Configuration

# Performance-focused configuration
AUTO_BUILD_MODEL=claude-sonnet-4-5-20250929
API_TIMEOUT_MS=180000
MAX_PARALLEL_AGENTS=6

# Memory optimization
GRAPHITI_LLM_PROVIDER=ollama
GRAPHITI_EMBEDDER_PROVIDER=ollama
OLLAMA_LLM_MODEL=llama3.2:3b
OLLAMA_EMBEDDING_MODEL=all-minilm
OLLAMA_EMBEDDING_DIM=384

# Reduce verbosity
DEBUG=false
ENABLE_FANCY_UI=false
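
For scripts that need to read these settings, a minimal sketch of typed environment lookups follows. The `env_int` and `env_bool` helpers are assumptions for illustration; how Auto-Claude itself parses `.env` values is not stated here. The fallback values in the example are the defaults quoted in this document (4 agents, 10 minute timeout).

```python
import os


def env_int(name: str, default: int) -> int:
    """Read an integer setting from the environment, falling back to a default."""
    raw = os.environ.get(name)
    return int(raw) if raw not in (None, "") else default


def env_bool(name: str, default: bool = False) -> bool:
    """Treat 'true', '1', and 'yes' (any case) as True."""
    raw = os.environ.get(name)
    if raw is None:
        return default
    return raw.strip().lower() in ("true", "1", "yes")


if __name__ == "__main__":
    print(env_int("MAX_PARALLEL_AGENTS", 4))
    print(env_int("API_TIMEOUT_MS", 600_000))
    print(env_bool("DEBUG"))
```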

Resource Limits

# Use the C library allocator instead of pymalloc
# (this changes the allocator; it does not cap memory usage)
export PYTHONMALLOC=malloc

# Set max file descriptors
ulimit -n 4096

Benchmarking

Measure Build Time

# Time a build
time python run.py --spec 001

# Compare models
time AUTO_BUILD_MODEL=claude-opus-4-5-20251101 python run.py --spec 001
time AUTO_BUILD_MODEL=claude-sonnet-4-5-20250929 python run.py --spec 001

Profile Memory Usage

# Monitor memory
watch -n 1 'ps aux | grep python | head -5'

# Profile CPU time per function (cProfile measures time, not memory)
python -m cProfile -o profile.stats run.py --spec 001
python -c "import pstats; p = pstats.Stats('profile.stats'); p.sort_stats('cumulative').print_stats(20)"
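
Since cProfile reports time and not memory, the standard library's `tracemalloc` module is one way to measure allocations directly. The sketch below wraps an arbitrary callable; `peak_memory_kib` is a hypothetical helper and the workload is a placeholder.

```python
import tracemalloc


def peak_memory_kib(func, *args, **kwargs):
    """Run func and return (result, peak traced memory in KiB)."""
    tracemalloc.start()
    try:
        result = func(*args, **kwargs)
        _, peak = tracemalloc.get_traced_memory()
    finally:
        tracemalloc.stop()
    return result, peak / 1024


if __name__ == "__main__":
    # Stand-in workload; replace with the code path you want to measure
    _, peak = peak_memory_kib(lambda: [bytes(1024) for _ in range(1000)])
    print(f"peak: {peak:.0f} KiB")
```

`tracemalloc` only sees allocations made through Python's allocator, so memory used by native extensions may not show up.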

Quick Wins

Immediate Optimizations

Switch to Sonnet for most tasks

AUTO_BUILD_MODEL=claude-sonnet-4-5-20250929

Use Ollama for memory

GRAPHITI_LLM_PROVIDER=ollama
GRAPHITI_EMBEDDER_PROVIDER=ollama

Skip QA for prototypes
```
python run.py --spec 001 --skip-qa
```

Force simple complexity for small tasks

python spec_runner.py --task "..." --complexity simple

Medium-Term Improvements

Optimize prompts in apps/backend/prompts/
Configure project-specific security allowlist
Set up memory caching
Tune parallel agent count

Long-Term Strategies

Self-hosted LLM for memory (Ollama)
Caching layer for common operations
Incremental context building
Project-specific prompt optimization

Related Skills

auto-claude-memory: Memory configuration
auto-claude-build: Build process
auto-claude-troubleshooting: Debugging

Auto-Claude performance optimization and cost management. Use when optimizing token usage, reducing API costs, improving build speed, or tuning agent performance.

6 stars

development

Updated Mar 28, 2026

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

| Status | Scanner | Description | % |
|--------|---------|-------------|---|
| Clean | Trivy | Container and dependency vulnerability scanner | 95% |
| Clean | Semgrep | Static code analysis for vulnerabilities | 95% |
| Clean | mcp-scan (Snyk) | Model Context Protocol security validation | 95% |
| Skipped | Snyk (dep) | Open source security scanning | 50% |
| Skipped | Socket.dev | Supply chain security analysis | 50% |
| Skipped | VirusTotal | Multi-engine malware detection | 50% |
| Skipped | CrowdStrike | Advanced threat intelligence | 50% |
| Skipped | OSV-Scanner | Open Source Vulnerability database check | 50% |
| Skipped | OWASP Dep-Check | | 50% |

Last scanned: Mar 30, 2026, 11:17 PM (67.1s, 1 file scanned)

SKILL.md

name: auto-claude-optimization
description: Auto-Claude performance optimization and cost management. Use when optimizing token usage, reducing API costs, improving build speed, or tuning agent performance.
version: 1.0.0
auto-claude-version: 2.7.2

Install manually

# Clone the repo
git clone https://github.com/adaptationio/skrillz.git

# Copy into Claude Code skills folder (global)
cp -r skrillz/.claude/skills/auto-claude-optimization ~/.claude/skills/

Repository

adaptationio/skrillz

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT