.agent/skills/speckit-brutalreview/SKILL.md
Brutally honest end-to-end app review — code, UX, security, and product strategy with comprehensive analysis and actionable recommendations.
npx skillsauth add a2mus/smart-da3m speckit-brutalreviewInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
$ARGUMENTS
You MUST consider the user input before proceeding (if not empty).
Conduct a comprehensive, brutally honest review of the application covering code quality, UX/UI, security vulnerabilities, performance bottlenecks, and product strategy. Generate a detailed markdown report with specific findings, file/line references, and actionable improvement proposals.
Save the final review report to: .specify/reviews/brutal-review-[YYYY-MM-DD].md
Create screenshots directory: .specify/reviews/screenshots/
Screenshot naming convention: [YYYY-MM-DD]-[NNN]-[category]-[brief-description].png
Example: 2025-03-25-001-security-missing-csp.png
Categories: ux, security, performance, architecture, accessibility, mobile
Run .specify/scripts/powershell/check-prerequisites.ps1 -Json -CheckCode -DetectProjectType from repo root to detect project type and code statistics.
Parse JSON output for:
PROJECT_TYPE: web-app | cli-tool | library | mobile-app | unknownHAS_CODE: boolean indicating if codebase has substantial filesFILE_STATS: counts by file type/extensionTECH_STACK: detected frameworks/librariesENTRY_POINTS: main application entry filesIf HAS_CODE is false: Abort with error: "No substantial code found to review."
Focus Areas: If user provided --focus=[area], prioritize that section but still cover all phases.
Supported focus areas: security, ux, performance, architecture
Detect Project Type and Adjust Approach:
Web Applications (package.json + web assets):
npm run dev or npm start or python -m http.serverCLI Tools (pyproject.toml scripts or binary entry):
<tool> --help or <tool> -hLibraries (setup.py, Cargo.toml, package.json with "lib"):
Mobile Apps (React Native, Flutter, etc.):
android/, ios/, pubspec.yaml)Live Testing (where applicable):
First Impressions (< 5 seconds):
Core Flow Testing:
Responsive & Cross-Browser (Web):
Performance Feel:
Accessibility Quick-Check:
Read the entire codebase systematically. Evaluate against these dimensions:
Cite specific file paths and line numbers for each finding.
# TODO, # FIXME, # HACK commentsStep back from code and evaluate holistically:
Based on findings, propose 5–7 high-impact improvements, ordered by effort-to-impact ratio.
For each:
Create comprehensive markdown report at .specify/reviews/brutal-review-[YYYY-MM-DD].md
# 🔥 Brutal Review — [App/Project Name]
> Reviewed: [Date] | Reviewer: Senior Staff Engineer + Product Manager
> Project Type: [Detected Type] | Focus: [User Focus or "Complete"]
## Executive Summary
[3–5 sentence overall verdict. What's the single biggest thing holding this app back?]
## 🚨 Critical Flaws (Ship-Blockers)
[Issues that would embarrass you in production or pose security/data-loss risk]
- **[SEVERITY]** [Description] — `file:line` — [Screenshot: ./screenshots/...]
## 🎨 UX/UI Roast
[Visual, interaction, and experience problems — with embedded screenshots]
## 🏗️ Architecture & Code Smells
[Structural issues, tech debt, and maintainability concerns]
- **[SMELL]** [Description] — `file:line`
## 🔒 Security Concerns
[Vulnerabilities, missing protections, risky patterns]
- **[CRITICAL/HIGH/MEDIUM]** [Vulnerability] — `file:line` — [Mitigation suggestion]
## ⚡ Performance Issues
[Bottlenecks, waste, and optimization opportunities]
## 🧪 Testing & Quality Gaps
[Missing tests, weak CI, untested critical paths]
## 💡 Feature & Improvement Proposals
[Ranked table of proposed improvements]
| # | What | Why | How | Impact |
|---|------|-----|-----|--------|
| 1 | ... | ... | ... | 🔴 Critical |
## Scorecard
| Dimension | Grade | Notes |
|--------------------|-------|-------|
| Architecture | ?/10 | |
| Code Quality | ?/10 | |
| Security | ?/10 | |
| Performance | ?/10 | |
| UX/UI | ?/10 | |
| Test Coverage | ?/10 | |
| Product Readiness | ?/10 | |
| **Overall** | ?/10 | |
## Appendix: Screenshot Index
[List all screenshots with descriptions and paths]
## Next Actions
- [ ] Address critical security flaws before any production deployment
- [ ] Run `/speckit.implement` to fix ship-blocking issues
- [ ] Create checklist for improvement proposals
- [ ] Schedule follow-up review in 2 weeks
For each screenshot taken during review:
.specify/reviews/screenshots/[YYYY-MM-DD]-[NNN]-[category]-[description].pngScreenshot Guidelines:
Use these heuristics to auto-detect project type:
Web Application:
package.json exists with dependencies or devDependenciessrc/, public/, static/, or dist/react, vue, svelte, next, nuxt in dependenciesCLI Tool:
pyproject.toml with [project.scripts] sectionsetup.py or setup.cfg with console_scriptsbin/ or .specify/scripts/ directory with executable files#!/usr/bin/env) in main filesLibrary:
setup.py with packages but no scripts or minimal scriptsCargo.toml with [lib] sectionpackage.json with "main" or "exports" pointing to library codeMobile App:
pubspec.yaml (Flutter)android/ and ios/ directories (React Native/Flutter) capacitor.config.json or ionic.config.json (Ionic)Backend/API:
requirements.txt with Flask, Django, FastAPICargo.toml with web framework dependenciesDockerfile with web server configuration$ARGUMENTS
testing
Perform cross-artifact consistency analysis across spec.md, plan.md, and tasks.md. Use after task generation to identify gaps, duplications, and inconsistencies before implementation.
development
Run comprehensive verification on current codebase state.
testing
Intelligently sync your fork with the upstream spec-kit repository. Reads both versions of every changed file, compares quality, and produces the richest possible result by blending the best of each.
development
Impeccable UI design workflow — create distinctive, production-grade interfaces or enhance existing ones. Integrates design context gathering, anti-pattern detection, heuristic scoring, and systematic polish. Works after speckit-brainstorm (new design) or on existing UI code (enhancement mode).