Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

bereniketech/model-selection

Name: model-selection
Author: bereniketech

skills/agents-orchestration/model-selection/SKILL.md

npx skillsauth add bereniketech/claude_kit model-selection

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Model Selection

1. Model Profiles

| Model | Speed | Cost | Best for | |-------|-------|------|----------| | Haiku 4.5 | Fastest | Cheapest (~20× cheaper than Opus) | High-volume reads, scraping, summarization, classification, simple transforms | | Sonnet 4.6 | Fast | Mid | Implementation, code generation, debugging, most everyday tasks | | Opus 4.7 | Slowest | Most expensive | Architecture decisions, hard debugging, complex reasoning, final review |

Rule: Default to Sonnet. Downgrade to Haiku for volume, upgrade to Opus only when Sonnet fails after 2 attempts or the decision affects the whole system.

2. Decision Tree

Task involves architecture / system-wide design decisions?
└── Yes → Opus

Task involves complex debugging that Sonnet can't crack after 2 tries?
└── Yes → Opus

Task involves writing or reviewing implementation code?
└── Yes → Sonnet

Task involves reading many documents / large data to extract a summary?
└── Yes → Haiku

Task involves classification, tagging, or simple extraction at scale?
└── Yes → Haiku

Not sure?
└── → Sonnet

3. Sub-Agent Cost Pattern

When orchestrating multi-agent workflows, assign models by role:

Main thread (Opus/Sonnet) — orchestrates, makes decisions, reviews final output
    ↓ spawns
Sub-agents (Haiku) — scrape, read, summarize, classify, filter
    ↓ report back
Main thread — synthesizes Haiku summaries, produces final answer

Example: Scraping 50 articles to find the 3 most relevant: Haiku reads all 50 and returns a ranked list of 5. Sonnet reads those 5 and writes the final synthesis. Cost reduction: ~85%.

4. Setting Model per Agent

In Claude Code, specify the model when spawning a sub-agent:

Agent({
  model: "haiku",        // for cheap volume work
  description: "Scrape and summarize 50 articles",
  prompt: "..."
})

Available model values: "haiku", "sonnet", "opus"

5. When to Upgrade Mid-Task

Upgrade from Sonnet → Opus when:

Sonnet has given the same wrong answer 2+ times
The task is architectural (touching interfaces, data models, module boundaries)
The task requires cross-file reasoning across >10 files simultaneously
A mistake here would require significant rework

Rule: Don't use Opus as a default just because it's "better." The quality gap on routine tasks is small; the cost gap is large.

6. Cost Reference (approximate)

| Task | Haiku | Sonnet | Opus | |------|-------|--------|------| | Read + summarize 10K tokens | ~$0.001 | ~$0.005 | ~$0.02 | | Write 500-line feature | ~$0.01 | ~$0.05 | ~$0.20 | | Full codebase review (100K tokens) | ~$0.01 | ~$0.05 | ~$0.20 |

Running 10 Haiku sub-agents reading 10K tokens each ≈ cost of 1 Sonnet reading the same.

bereniketech/model-selection

skills/agents-orchestration/model-selection/SKILL.md

Choose the right Claude model (Haiku / Sonnet / Opus) for each task and sub-agent to maximize output quality while minimizing token cost.

testing

Updated May 3, 2026

$ install --global

skillsauth

npx skillsauth add bereniketech/claude_kit model-selection

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: May 3, 2026, 3:59 AM187.3s1 file scanned

SKILL.md

name:: model-selection
description:: Choose the right Claude model (Haiku / Sonnet / Opus) for each task and sub-agent to maximize output quality while minimizing token cost.

Model Selection

1. Model Profiles

Rule: Default to Sonnet. Downgrade to Haiku for volume, upgrade to Opus only when Sonnet fails after 2 attempts or the decision affects the whole system.

2. Decision Tree

Task involves architecture / system-wide design decisions?
└── Yes → Opus

Task involves complex debugging that Sonnet can't crack after 2 tries?
└── Yes → Opus

Task involves writing or reviewing implementation code?
└── Yes → Sonnet

Task involves reading many documents / large data to extract a summary?
└── Yes → Haiku

Task involves classification, tagging, or simple extraction at scale?
└── Yes → Haiku

Not sure?
└── → Sonnet

3. Sub-Agent Cost Pattern

When orchestrating multi-agent workflows, assign models by role:

Main thread (Opus/Sonnet) — orchestrates, makes decisions, reviews final output
    ↓ spawns
Sub-agents (Haiku) — scrape, read, summarize, classify, filter
    ↓ report back
Main thread — synthesizes Haiku summaries, produces final answer

Example: Scraping 50 articles to find the 3 most relevant: Haiku reads all 50 and returns a ranked list of 5. Sonnet reads those 5 and writes the final synthesis. Cost reduction: ~85%.

4. Setting Model per Agent

In Claude Code, specify the model when spawning a sub-agent:

Agent({
  model: "haiku",        // for cheap volume work
  description: "Scrape and summarize 50 articles",
  prompt: "..."
})

Available model values: "haiku", "sonnet", "opus"

5. When to Upgrade Mid-Task

Upgrade from Sonnet → Opus when:

Sonnet has given the same wrong answer 2+ times
The task is architectural (touching interfaces, data models, module boundaries)
The task requires cross-file reasoning across >10 files simultaneously
A mistake here would require significant rework

Rule: Don't use Opus as a default just because it's "better." The quality gap on routine tasks is small; the cost gap is large.

6. Cost Reference (approximate)

Running 10 Haiku sub-agents reading 10K tokens each ≈ cost of 1 Sonnet reading the same.

Related Skills

bereniketech/anti-reversing-techniques

testing

VerifiedTrustedCommunity

AUTHORIZED USE ONLY: This skill contains dual-use security techniques. Before proceeding with any bypass or analysis: > 1.

SKILL.mdUpdated May 3, 2026

bereniketech/anti-reversing-techniques

bereniketech/active-directory-attacks

testing

Community

Provide comprehensive techniques for attacking Microsoft Active Directory environments. Covers reconnaissance, credential harvesting, Kerberos attacks, lateral movement, privilege escalation, and domain dominance for red team operations and penetration testing.

SKILL.mdUpdated May 3, 2026

bereniketech/active-directory-attacks

Security Scans

mcp-scan — Pending Scan

Semgrep — Pending Scan

Trivy — Pending Scan

OWASP — Pending Scan

VirusTotal — Pending Scan

bereniketech/zeroize-audit

development

VerifiedTrustedCommunity

Detects missing zeroization of sensitive data in source code and identifies zeroization removed by compiler optimizations, with assembly-level analysis, and control-flow verification. Use for auditing C/C++/Rust code handling secrets, keys, passwords, or other sensitive data.

SKILL.mdUpdated May 3, 2026

bereniketech/zeroize-audit

bereniketech/wcag-audit-patterns

development

VerifiedTrustedCommunity

Comprehensive guide to auditing web content against WCAG 2.2 guidelines with actionable remediation strategies.

SKILL.mdUpdated May 3, 2026

bereniketech/wcag-audit-patterns

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/bereniketech/claude_kit.git

# Copy into Claude Code skills folder (global)
cp -r claude_kit/skills/agents-orchestration/model-selection ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

bereniketech/claude_kit

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT