templates/.claude/skills/multi-model-verification/SKILL.md
Parallel code verification using multiple models with severity classification
Install this skill globally with one command: `npx skillsauth add baekenough/oh-my-customcode multi-model-verification`. Works with Claude Code, Cursor, and Windsurf.
Parallel code verification using multiple AI models for comprehensive quality assurance. Each model focuses on a different aspect, and results are aggregated with severity classification.
Inspired by Pi Coding Agent Workflow Extension's multi-model verification pattern. Instead of a single reviewer, multiple models verify code simultaneously, each with specialized focus areas.
| Model | Role | Focus Areas |
|-------|------|-------------|
| opus | Architecture Reviewer | Design patterns, separation of concerns, extensibility, security architecture |
| sonnet | Code Quality Reviewer | Logic correctness, error handling, edge cases, performance patterns |
| haiku | Style & Convention Reviewer | Naming conventions, formatting, documentation, code organization |
| Severity | Description | Action Required |
|----------|-------------|-----------------|
| CRITICAL | Bugs, security vulnerabilities, data loss risks | Must fix before merge |
| WARNING | Code smells, suboptimal patterns, missing error handling | Should fix, justify if skipped |
| INFO | Style suggestions, minor improvements, alternative approaches | Optional improvement |
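The severity levels above can be modeled as an ordered type so that findings can be compared and gated on. The following is a minimal sketch, not part of the skill itself; the `Severity`, `Finding`, and `blocks_merge` names and the sample findings are hypothetical.

```python
from dataclasses import dataclass
from enum import IntEnum


class Severity(IntEnum):
    """Severity levels from the table above; a higher value is more urgent."""
    INFO = 1
    WARNING = 2
    CRITICAL = 3


@dataclass(frozen=True)
class Finding:
    """One reviewer finding (hypothetical shape, for illustration only)."""
    reviewer: str       # "opus", "sonnet", or "haiku"
    file: str
    line: int
    severity: Severity
    description: str


def blocks_merge(findings: list[Finding]) -> bool:
    """CRITICAL findings must be fixed before merge; other levels do not block."""
    return any(f.severity is Severity.CRITICAL for f in findings)


# Made-up sample data.
findings = [
    Finding("sonnet", "api.py", 42, Severity.CRITICAL, "unchecked None dereference"),
    Finding("haiku", "api.py", 7, Severity.INFO, "docstring missing"),
]
print(blocks_merge(findings))  # True
```

Using an `IntEnum` keeps the ordering explicit, so "minimum severity" checks reduce to a plain comparison.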
Agent Teams (`CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS=1`) for full parallel execution

- Agent(opus) → Architecture & design review
- Agent(sonnet) → Code quality & correctness review
- Agent(haiku) → Style & convention review

When Agent Teams is available, create a verification team:
```
TeamCreate("verification-team")
├── architect-reviewer (opus) → Architecture review
├── quality-reviewer (sonnet) → Code quality review
└── style-reviewer (haiku) → Style review
```
Members communicate findings via SendMessage for cross-cutting concerns.
When Agent Teams is unavailable, spawn parallel agents:
```
[1] Agent(general-purpose):opus → Architecture review
[2] Agent(general-purpose):sonnet → Code quality review
[3] Agent(general-purpose):haiku → Style & convention review
```
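The fan-out pattern above, three reviewers started together and collected once all finish, can be sketched in plain Python with `asyncio`. This is only an illustration of the concurrency shape: `run_reviewer` is a hypothetical stub, not the agent runtime's API, and it returns placeholder data instead of calling a model.

```python
import asyncio

# Role assignments taken from the model table in this document.
REVIEWERS = {
    "opus": "Architecture review",
    "sonnet": "Code quality review",
    "haiku": "Style & convention review",
}


async def run_reviewer(model: str, focus: str, diff: str) -> dict:
    """Hypothetical stand-in for spawning one review agent.

    A real implementation would call the agent runtime here; this stub
    only echoes its inputs so the pattern can be run locally.
    """
    await asyncio.sleep(0)  # yield control, as a real network call would
    return {"model": model, "focus": focus, "chars_reviewed": len(diff)}


async def verify(diff: str) -> list:
    # Start every reviewer at once and wait for all results.
    # asyncio.gather returns results in the order the tasks were passed.
    tasks = [run_reviewer(model, focus, diff) for model, focus in REVIEWERS.items()]
    return await asyncio.gather(*tasks)


results = asyncio.run(verify("def f(): pass"))
for result in results:
    print(result["model"], "->", result["focus"])
```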
```markdown
## Multi-Model Verification Report

### Summary
- Files reviewed: {count}
- Findings: {critical} CRITICAL, {warning} WARNING, {info} INFO
- Reviewers: opus (architecture), sonnet (quality), haiku (style)

### CRITICAL
[opus] {file}:{line} — {description}
[sonnet] {file}:{line} — {description}

### WARNING
[sonnet] {file}:{line} — {description}
[haiku] {file}:{line} — {description}

### INFO
[haiku] {file}:{line} — {description}

### Consensus
Issues flagged by 2+ reviewers:
- {file}:{line} — {description} (flagged by: opus, sonnet)
```
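One way the Consensus section could be computed is by grouping findings by location and keeping locations flagged by at least two distinct reviewers. The sketch below assumes two findings refer to the same issue when they share the same file and line; the source does not say how matching is done, and the sample findings are made up.

```python
from collections import defaultdict

# Each finding: (reviewer, severity, file, line, description). Sample data only.
findings = [
    ("opus", "CRITICAL", "auth.py", 10, "token compared without constant-time check"),
    ("sonnet", "CRITICAL", "auth.py", 10, "token comparison leaks timing"),
    ("sonnet", "WARNING", "db.py", 55, "missing error handling on connect"),
    ("haiku", "INFO", "db.py", 3, "module docstring missing"),
]


def consensus(findings, threshold=2):
    """Map (file, line) to the sorted reviewers that flagged it,
    keeping only locations flagged by at least `threshold` distinct reviewers."""
    by_location = defaultdict(set)
    for reviewer, _severity, file, line, _description in findings:
        by_location[(file, line)].add(reviewer)
    return {
        location: sorted(reviewers)
        for location, reviewers in by_location.items()
        if len(reviewers) >= threshold
    }


for (file, line), reviewers in consensus(findings).items():
    print(f"- {file}:{line} (flagged by: {', '.join(reviewers)})")
# prints: - auth.py:10 (flagged by: opus, sonnet)
```

Collecting reviewers in a set means a single model reporting the same line twice does not count as consensus.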
- `dev-review` skill as an enhanced verification mode
- `structured-dev-cycle` skill at the "Verify Implementation" stage

| Parameter | Default | Description |
|-----------|---------|-------------|
| models | [opus, sonnet, haiku] | Models to use for verification |
| severity_filter | all | Minimum severity to report (critical, warning, all) |
| consensus_threshold | 2 | Number of reviewers needed for consensus flag |
| include_suggestions | true | Include INFO-level suggestions |
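The parameters above could be represented as a small config object with the listed defaults. This is a sketch under stated assumptions: the `VerificationConfig` class and its `reports` method are hypothetical, and the rule that `include_suggestions=False` hides INFO findings even when `severity_filter` is `all` is an interpretation, not something the source specifies.

```python
from dataclasses import dataclass, field

SEVERITY_RANK = {"INFO": 1, "WARNING": 2, "CRITICAL": 3}
# Lowest rank admitted by each severity_filter value.
FILTER_MIN_RANK = {"all": 1, "warning": 2, "critical": 3}


@dataclass
class VerificationConfig:
    """Defaults mirror the parameter table; the class itself is hypothetical."""
    models: list = field(default_factory=lambda: ["opus", "sonnet", "haiku"])
    severity_filter: str = "all"
    consensus_threshold: int = 2
    include_suggestions: bool = True

    def reports(self, severity: str) -> bool:
        """Return True if a finding of this severity should appear in the report."""
        if severity == "INFO" and not self.include_suggestions:
            return False  # assumed interaction between the two settings
        return SEVERITY_RANK[severity] >= FILTER_MIN_RANK[self.severity_filter]


default = VerificationConfig()
strict = VerificationConfig(severity_filter="warning")
print(default.reports("INFO"))     # True
print(strict.reports("INFO"))      # False
print(strict.reports("CRITICAL"))  # True
```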
| Scenario | Recommended |
|----------|-------------|
| Pre-merge review of critical code | Yes |
| Architecture changes | Yes |
| Security-sensitive code | Yes |
| Simple formatting changes | No (use single reviewer) |
| Rapid prototyping | No (overhead too high) |
Multi-model verification uses 3x the tokens of a single review. Reserve it for the scenarios marked "Yes" in the table above.