skills/critique/SKILL.md
Evaluate an interface from a UX perspective, assessing hierarchy, information architecture, emotional resonance, cognitive load, and overall quality with quantitative scoring and actionable feedback. Use when the user wants an overall design or UX review—not when the main need is measurable accessibility/performance diagnosis, or final micro-detail polish.
npx skillsauth add aladicf/better-web-ui critiqueInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Users start this workflow with /critique. Once this skill is active, load $frontend-design — it contains design principles, anti-patterns, and the Context Gathering Protocol. Follow that protocol before proceeding — if no design context exists yet, you MUST load $setup first. Additionally gather: what the interface is trying to accomplish.
Conduct a holistic design critique, evaluating whether the interface actually works — not just technically, but as a designed experience. Think like a design director giving feedback.
Consult the hierarchy checklist for grayscale tests, action prioritization, and label/value treatment. Consult the text hierarchy and readability for line length, line-height, baseline, alignment, and title restraint. Consult the cognitive load for working-memory limits and the 8-item checklist. Consult the interaction design when evaluating familiar patterns, target sizing, focus treatment, and overlay behavior. Consult the search and findability when the interface depends on site search, command palettes, autosuggest, result relevance, or no-results recovery. Consult the legacy modernization when the critique involves legacy systems, old/new seams, migration candidates, or high-risk operational workflows. Consult the ai slop detection for the consolidated anti-pattern list. Consult the action hierarchy when evaluating primary/secondary/tertiary actions. Consult the semantic color when color is carrying meaning. Consult the surface separation when judging borders, card usage, overlap, or background-shift decisions. Consult the image treatment when screenshots, icons, and media affect readability or polish.
Treat the shared frontend-design references as canonical for hierarchy, readability, and cognitive-load doctrine, then keep this skill focused on evaluation, scoring, and prioritization.
When empty states are relevant, evaluate the zero-data surface itself through empty-state thinking, and evaluate broader activation, aha moments, and first-run education through onboard thinking.
Evaluate the interface across these dimensions:
This is the most important check. Does this look like every other AI-generated interface from 2024-2025?
Review the design against ALL the DON'T guidelines in the frontend-design skill and the ai slop detection reference — they are the fingerprints of AI-generated work. Check for the AI color palette, gradient text, dark mode with glowing accents, glassmorphism, hero metric layouts, identical card grids, generic fonts, and all other tells.
The test: If you showed this to someone and said "AI made this," would they believe you immediately? If yes, that's the problem.
Consult cognitive-load for the working memory rule and 8-item checklist
Structure your feedback as a design director would:
Consult heuristics-scoring
Score each of Nielsen's 10 heuristics 0–4. Present as a table:
| # | Heuristic | Score | Key Issue | |---|-----------|-------|-----------| | 1 | Visibility of System Status | ? | [specific finding or "—" if solid] | | 2 | Match System / Real World | ? | | | 3 | User Control and Freedom | ? | | | 4 | Consistency and Standards | ? | | | 5 | Error Prevention | ? | | | 6 | Recognition Rather Than Recall | ? | | | 7 | Flexibility and Efficiency | ? | | | 8 | Aesthetic and Minimalist Design | ? | | | 9 | Error Recovery | ? | | | 10 | Help and Documentation | ? | | | Total | | ??/40 | [Rating band] |
Be honest with scores. A 4 means genuinely excellent. Most real interfaces score 20–32.
Start here. Pass/fail: Does this look AI-generated? List specific tells from the skill's Anti-Patterns section. Be brutally honest.
Include a one-line hierarchy verdict immediately after the pass/fail statement.
A brief gut reaction — what works, what doesn't, and the single biggest opportunity.
Highlight 2–3 things done well. Be specific about why they work.
The 3–5 most impactful design problems, ordered by importance.
For each issue, tag with P0–P3 severity (consult heuristics-scoring for severity definitions):
Favor issues related to weak hierarchy, arbitrary systems, unclear action priority, and noisy decoration over surface-level nitpicks.
When relevant, explicitly call out: too many borders, ambiguous grouping, too many font sizes, too many unsystematic shades, loud section titles, flattened action hierarchy, line-length/line-height mismatches, label:value anti-patterns, unnecessary colored-link emphasis, non-right-aligned number columns, weak surface-separation strategy, scaled-down screenshot legibility failures, scaled-up icon chunkiness, and overlap clashes where layers are not cleanly separated.
Consult personas
Auto-select 2–3 personas most relevant to this interface type (use the selection table in the reference). If AGENTS.md contains a ## Design Context section from setup, also generate 1–2 project-specific personas from the audience/brand info.
For each selected persona, walk through the primary user action and list specific red flags found:
Alex (Power User): No keyboard shortcuts detected. Form requires 8 clicks for primary action. Forced modal onboarding. ⚠️ High abandonment risk.
Jordan (First-Timer): Icon-only nav in sidebar. Technical jargon in error messages ("404 Not Found"). No visible help. ⚠️ Will abandon at step 2.
Be specific — name the exact elements and interactions that fail each persona. Don't write generic persona descriptions; write what broke for them.
Quick notes on smaller issues worth addressing.
Remember:
After presenting findings, use targeted questions based on what was actually found. ask the user directly to clarify what you cannot infer. These answers will shape the action plan.
Ask questions along these lines (adapt to the specific findings — do NOT ask generic questions):
Priority direction: Based on the issues found, ask which category matters most to the user right now. For example: "I found problems with visual hierarchy, color usage, and information overload. Which area should we tackle first?" Offer the top 2–3 issue categories as options.
Design intent: If the critique found a tonal mismatch, ask whether it was intentional. For example: "The interface feels clinical and corporate. Is that the intended tone, or should it feel warmer, bolder, or more playful?" Offer 2–3 tonal directions as options based on what would fix the issues found.
Scope: Ask how much the user wants to take on. For example: "I found N issues. Want to address everything, or focus on the top 3?" Offer scope options like "Top 3 only", "All issues", "Critical issues only".
Constraints (optional — only ask if relevant): If the findings touch many areas, ask if anything is off-limits. For example: "Should any sections stay as-is?" This prevents the plan from touching things the user considers done.
Rules for questions:
After receiving the user's answers, present a prioritized action summary reflecting the user's priorities and scope from Phase 3.
List recommended commands in priority order, based on the user's answers:
/command-name — Brief description of what to fix (specific context from critique findings)/command-name — Brief description (specific context)
...Rules for recommendations:
/polish as the final step if any fixes were recommendedAfter presenting the summary, tell the user:
You can ask me to run these one at a time, all at once, or in any order you prefer.
Re-run
/critiqueafter fixes to see your score improve.
development
Build or improve a UI testing strategy covering visual regression, interaction testing, and accessibility assertions. Use when the user asks to add tests, set up testing, fix flaky tests, improve test coverage, validate UI behavior, catch visual bugs, or establish confidence in shipping frontend changes.
development
Design security-conscious interfaces that protect users without frustrating them. Use when the user asks about MFA, password UX, breach notifications, trust indicators, secure forms, account recovery, or making security feel safe rather than scary.
development
Design or improve search experiences, result presentation, and filtering interfaces. Use when the user asks to add search, redesign search results, improve findability, build autocomplete, add filters, or fix zero-results dead ends.
development
Plan, implement, or improve an internationalization and localization strategy for UI content, formatting, and regional adaptation. Use when the user asks to add i18n, localize, translate, support multiple languages, handle regional formats, manage locale switching, or build a multilingual product.