skills/golem-powers/coderabbit/SKILL.md
Use when reviewing uncommitted changes, preparing PRs, requesting or receiving code review, handling CodeRabbit/Greptile/Bugbot/GitHub PR comments, checking security/secrets/a11y/code quality, or deciding whether to accept or reject reviewer feedback. Runs AI review via CLI and covers review triage, false-positive pushback, red/blue team profiles, PR-ready gates. NOT for: runtime debugging or test execution.
npx skillsauth add etanhey/golems coderabbitInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Fast AI code reviews via CodeRabbit CLI. Free for open source.
Works in any git repo. Free tier covers open source repos. For private repos, ensure CodeRabbit is configured in the repo settings.
coderabbit review --agent # Codex/AI-agent local review output
cr review --plain # Human-readable review
cr review --prompt-only # Minimal-token prompt output for specific workflows
cr review --type uncommitted # Only unstaged changes
cr review --base main # Compare against main branch
In Codex environments, use coderabbit review --agent (not --plain) for the local review gate.
Bound the local CLI wait: hard timeout at about 3 minutes. If the CLI hangs while writing comments, hits a rate limit, or reports review-limit exhaustion mid-loop, kill/stop it, record the limitation, and proceed through commit/push on fresh local test evidence. Do not block the PR loop indefinitely on a local CLI hang. After the PR exists, request @coderabbitai review and read the resulting bot status/comments before merge.
| Workflow | Use Case | |----------|----------| | review | Standard code review | | verify | Quick verification for Ralph V-* stories | | security | Security-focused review | | accessibility | A11y audit for UI changes | | secrets | Scan for hardcoded secrets/keys | | pr-ready | Pre-PR comprehensive check | | red-team | Adversarial security/reliability review profile | | blue-team | Architecture/quality/maintainability review profile |
| Flag | Best For | Token Usage |
|------|----------|-------------|
| --agent | Codex/AI agents running a local gate | Medium |
| --plain | Humans reading in terminal | High |
| --prompt-only | AI agents (Ralph, Claude) | Low |
| (default) | Interactive TUI | N/A |
For V-* verification stories, CodeRabbit runs FIRST as a fast pre-check:
cr review --prompt-only --type committed - Quick scanThis reduces Claude API costs and catches obvious issues fast.
Technical evaluation, not emotional performance. Verify before implementing. Push back when a suggestion is wrong for this codebase.
Response pattern:
Forbidden responses:
Instead, state the technical requirement, ask a clarifying question, or fix it silently.
| Comment type | Action | |---|---| | Real bug / Security | Fix immediately | | Important improvement | Fix before proceeding | | Style preference | Fix if genuinely better; skip if bikeshed | | Over-engineering | Skip with reasoning | | False positive | Skip with reasoning |
Implementation order for multi-item feedback:
Max 3 review-fix rounds for nitpicks. Push back when the suggestion breaks existing behavior, lacks context, violates YAGNI, conflicts with user-stated architecture, or is technically incorrect. If you were wrong, say "Checked X and you're correct. Fixing." Then fix it.
For high-stakes changes, pair CodeRabbit with the evaluator agent (claude --agent evaluator) for deeper qualitative scoring:
The evaluator is deliberately adversarial -- it compensates for LLM optimism bias in code review. See ~/Gits/orchestrator/standards/evaluator-grading.md for the full grading rubric.
When to add the evaluator gate:
Optional .coderabbit.yaml in repo root for custom rules:
reviews:
language: en
path_filters:
- "!**/*.test.ts"
- "!**/node_modules/**"
curl -fsSL https://cli.coderabbit.ai/install.sh | shcr auth logintools
The human-eval UX contract for Phoenix views: turn-by-turn scrollable replay (not a scorecard), hide-but-copyable IDs, collapsed thinking, identity chips, tool filters, tiny frozen starter datasets, mark-wrong-in-thread, mobile-first. Use when: building or reviewing ANY Phoenix/eval view, annotation UI, session replay, or human-grading surface. Triggers: phoenix view, eval UI, annotation view, session replay, human eval UX, grading interface. NOT for: Phoenix data pipelines/ingest (capture scripts have their own specs).
tools
macOS systems specialist — AppKit NSPanel architecture, launchd services, socket activation, MCP bridge resilience, syspolicyd, and high-frequency SwiftUI dashboards. Use when building menu-bar apps, LaunchAgents, debugging syspolicyd/Gatekeeper/TCC, resilient UDS/MCP bridges, or SwiftUI dashboards at 10Hz+.
development
Bulk LLM-judging protocol for fleet-dispatched verdict runs (KG cluster, eval harness). Use when: dispatching or running judge workers (J1/J2/RT), planning bulk-apply from verdict JSONL, or triaging evidence_degraded outputs. Triggers: judge fleet, bulk judge, R3 verdicts, kg-judge, RT gate, evidence_degraded. NOT for: single-item code review, Phoenix view UX (use phoenix-human-view), or non-judge eval pipelines.
development
Quiet-down protocol for sprint close: when the fleet wraps, delete ALL polling crons and monitors, send ONE final dashboard + ONE message, then go SILENT. Use when: fleet wraps, all workers done, overnight queue exhausted, sprint close, Etan asleep/away with nothing approved left. Triggers: fleet wrap, wrap the fleet, stand down, going quiet, sprint close. NOT for: mid-sprint monitoring (keep your loops), spawning a successor (use /session-handoff first).