.claude/skills/multi-ai-research/SKILL.md
Comprehensive research and analysis using Claude (subagents), Gemini CLI, and Codex CLI. Multi-perspective research with cross-verification, iterative refinement, and 100% citation coverage. Use for security analysis, architecture research, code quality assessment, performance analysis, or any research requiring rigorous verification and multiple AI perspectives.
npx skillsauth add adaptationio/skrillz multi-ai-researchInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Harnesses three AI systems (Claude via Task tool, Gemini CLI, Codex CLI) for comprehensive research and analysis with multi-perspective verification and iterative refinement.
Purpose: Produce analysis more thorough than any single AI could achieve through specialized roles, cross-validation, and systematic verification.
Key Innovation: Not just parallel execution - specialized research roles with cross-verification and iterative refinement until production-ready (quality ≥95/100, 100% citations, zero gaps).
The 3 AI Systems:
Quality Guarantees:
Use this skill for:
# Run complete analysis automatically
bash .claude/skills/multi-ai-research/scripts/analyze.sh "Security analysis of authentication system"
This will:
Ask Claude Code to use this skill:
"Use multi-ai-research to analyze [objective]"
Claude will:
Duration: 5-10 minutes
Output: .analysis/ANALYSIS_PLAN.md
Claude creates comprehensive plan:
Duration: 10-20 minutes
Output: .analysis/research/*.md
All three systems research simultaneously:
Claude Subagent:
Gemini CLI:
Codex CLI:
Duration: 15-30 minutes
Output: .analysis/analysis/code-patterns.md
Claude Analysis Agent with extended thinking:
Duration: 10-20 minutes Outputs:
.analysis/SYNTHESIS_REPORT.md.analysis/verification/cross-check.mdSynthesis (Claude with extended thinking):
Verification (Verification Subagent):
Duration: 10-30 minutes
Output: .analysis/iterations/ITERATION_2.md
If quality <95 or gaps exist:
Duration: 5-10 minutes
Output: .analysis/ANALYSIS_FINAL.md
Comprehensive final report:
Total Time: 45-90 minutes for comprehensive analysis
What it checks:
Example:
Use multi-ai-research for "Security audit of authentication system"
Output:
What it examines:
Example:
Use multi-ai-research for "Architecture analysis of microservices system"
Output:
What it analyzes:
Example:
Use multi-ai-research for "Code quality assessment for ./src"
Output:
What it identifies:
Example:
Use multi-ai-research for "Performance bottleneck identification"
Output:
What it compiles:
Example:
Use multi-ai-research for "Research GraphQL federation patterns 2024-2025"
Output:
Never reads files blindly. Always uses 3-level approach:
Level 1: Metadata (glob) - ~50 tokens
glob "**/*.{ts,js,py}" # Understand structure
glob "**/*.md" # Find documentation
glob "**/package.json" # Check dependencies
Level 2: Patterns (grep) - ~5k tokens
grep "export class|interface" --glob "**/*.ts"
grep "TODO|FIXME|BUG" --glob "**/*"
grep "password|secret|token" --glob "**/*.ts"
Level 3: Reading (read) - ~50k tokens
read "src/auth/login.ts" # Only critical files
read "docs/architecture.md"
Result: 90%+ reduction in unnecessary file reads
All state saved to files, not context:
.analysis/
├── ANALYSIS_PLAN.md # Strategy and assignments
├── research/
│ ├── claude-docs.md # Claude research
│ ├── gemini-web.md # Gemini research
│ └── codex-github.md # Codex research
├── analysis/
│ ├── code-patterns.md # Pattern analysis
│ └── architecture-map.md # System map
├── verification/
│ └── cross-check.md # Verification results
├── iterations/
│ ├── ITERATION_1.md # First pass
│ └── ITERATION_2.md # Gap fills
└── ANALYSIS_FINAL.md # Complete report
Benefits:
High Confidence (★★★★★): All 3 sources agree + code verification Medium Confidence (★★★☆☆): 2/3 sources agree Requires Investigation (★★☆☆☆): Sources conflict
Example:
## JWT Implementation (High Confidence ★★★★★)
**Claude**: "Uses JWT with HS256" (src/auth/jwt.ts:15)
**Gemini**: "HS256 is industry standard 2024" (URL)
**Codex**: "150+ repos use HS256 pattern" (GitHub)
**Code**: Verified at src/auth/jwt.ts:18-22
**Recommendation**: Implementation correct per standards
Comprehensive rubric (0-100):
Quality Gates:
Iteration 1 (Breadth): Broad coverage, identifies gaps Iteration 2 (Depth): Fill gaps, improve quality Iteration 3 (Polish): Final verification, perfection
Automatic iteration until:
Research Agent (Haiku):
Analysis Agent (Sonnet):
Verification Agent (Haiku):
Strengths:
Use for:
Strengths:
Use for:
Required:
Optional but Recommended:
npm install -g @google/gemini-clinpm install -g @openai/codexNote: Skill works with Claude-only fallback if Gemini/Codex unavailable.
# Install
npm install -g @google/gemini-cli
# Authenticate (OAuth - free)
gemini
# Follow browser authentication
# Test
gemini -p "test prompt"
# Install
npm install -g @openai/codex
# Authenticate (ChatGPT Plus/Pro account)
codex login
# Follow browser authentication
# Test
codex exec "test prompt"
Claude:
Gemini:
Codex:
Objective: "Security audit of authentication system"
Phase 2 - Parallel Research:
├─ Claude: Analyzes src/auth/* for patterns
├─ Gemini: Researches "OAuth 2.0 security best practices 2024"
└─ Codex: Finds GitHub examples of secure auth
Phase 3 - Analysis:
└─ Claude: Identifies 3 critical, 5 high priority issues
Phase 4 - Synthesis:
└─ All agree: Missing rate limiting (CRITICAL)
- Claude: No rate limit found in src/auth/login.ts
- Gemini: OWASP recommends max 5 attempts/hour
- Codex: 150+ repos use express-rate-limit
- Recommendation: Implement with Redis backend
Final Report:
├─ Executive summary
├─ 8 issues (3 critical, 5 high) with fixes
├─ OWASP Top 10 coverage
├─ Specific code examples
└─ Priority implementation plan
Quality: 97/100 ✓
Objective: "Analyze microservices architecture"
Phase 2:
├─ Claude: Maps services via glob + grep
├─ Gemini: Researches microservices patterns 2024
└─ Codex: Finds service mesh examples
Phase 3:
└─ Claude: Identifies 7 services, 12 integration points
Phase 4:
└─ Synthesis: Service communication patterns
- Consensus: REST for external, gRPC for internal
- Trade-offs documented
- Scaling strategies from Codex examples
Final Report:
├─ Component map (7 services, dependencies)
├─ Integration analysis (12 patterns)
├─ Scalability assessment
└─ Modernization recommendations
Quality: 96/100 ✓
Objective: "Research state management patterns for React 2024"
Phase 2:
├─ Claude: Reviews React docs + examples
├─ Gemini: Web research "React state management 2024"
└─ Codex: Analyzes top 50 React repos
Phase 3:
└─ Pattern analysis: 5 major approaches identified
Phase 4:
└─ Synthesis by use case:
- Small apps: Context (all sources agree)
- Medium apps: Zustand (Gemini + Codex recommend)
- Large apps: Redux Toolkit (battle-tested, Codex data)
- Server state: TanStack Query (trending, Gemini research)
Final Report:
├─ Decision tree by project size
├─ Pros/cons with sources
├─ Migration strategies
└─ Code examples from Codex
Quality: 98/100 ✓
❌ "Analyze the code"
✅ "Security analysis of authentication module for OWASP Top 10 compliance"
Multi-pass verification catches issues. If quality <95, iteration happens automatically.
Check .analysis/ folder during execution to see progress.
Every claim has file:line or URL. Use for validation and deep dives.
Large projects can span sessions:
Session 1: Initial analysis → ITERATION_1.md
Session 2: Gap filling → ITERATION_2.md
Session 3: Final polish → ANALYSIS_FINAL.md
High-value insights often come from comparing AI perspectives.
Cause: Gaps in coverage or missing citations
Solution: Automatic iteration 2 fills gaps
Check: .analysis/verification/cross-check.md for details
Cause: Verification flags uncited claims Solution: Iteration adds missing attributions Prevention: All agents trained to cite sources
Fallback: Claude-only analysis with warning
Impact: Reduced perspectives but still comprehensive
Install: npm install -g @google/gemini-cli @openai/codex
Resolution: Synthesis phase investigates conflicts Method: Check ground truth (actual code/docs) Output: Documented reasoning for resolution
anthropic-expert: Anthropic product expertisecodex-cli: Codex integration patternsgemini-cli: Gemini integration patternstri-ai-collaboration: General tri-AI workflowsanalysis: Code/skill/process analysis# Full automated analysis
bash .claude/skills/multi-ai-research/scripts/analyze.sh "objective"
# Interactive with Claude Code
# Just ask: "Use multi-ai-research for [objective]"
| File | Purpose |
|------|---------|
| .analysis/ANALYSIS_PLAN.md | Strategy and assignments |
| .analysis/research/ | All AI research outputs |
| .analysis/SYNTHESIS_REPORT.md | Multi-source synthesis |
| .analysis/ANALYSIS_FINAL.md | Complete final report |
| Metric | Threshold | Meaning | |--------|-----------|---------| | Quality Score | ≥95/100 | Production-ready | | Citation Coverage | 100% | All claims sourced | | Completeness | ≥95% | All objectives met | | Critical Gaps | 0 | No missing essentials |
| Type | Time | Iterations | |------|------|------------| | Security | 45-60 min | 1-2 | | Architecture | 60-90 min | 1-2 | | Code Quality | 30-45 min | 1 | | Performance | 45-60 min | 1-2 | | Research | 30-60 min | 1 |
multi-ai-research delivers production-ready analysis through systematic multi-AI collaboration, rigorous verification, and iterative refinement - ensuring nothing is missed and every claim is verified.
development
Setup secure web-based terminal access to WSL2 from mobile/tablet via ttyd + ngrok/Cloudflare/Tailscale. One-command install, start, stop, status. Use when you need remote terminal access, web terminal, browser-based shell, or mobile access to WSL2 environment.
development
Complete development workflows where Claude writes the code while Gemini and Codex provide research, planning, reviews, and different perspectives. Claude remains the main developer. Use for complex projects requiring expert planning and multi-perspective reviews.
development
Systematic progress tracking for skill development. Manages task states (pending/in_progress/completed), updates in real-time, reports progress, identifies blockers, and maintains momentum. Use when tracking skill development, coordinating work, or reporting progress.
testing
Comprehensive testing workflow orchestrating functional testing, example validation, integration testing, and usability assessment. Sequential workflow for complete skill testing from examples through scenarios to integration validation. Use when conducting thorough testing, pre-deployment validation, ensuring skill functionality, or comprehensive quality checks.