skills/dbx-design-judgment/SKILL.md
Use when the user asks to judge, audit, critique, review, shape, redesign, or produce a design plan for a UI, flow, PRD, screenshot, prototype, live product, component, design system, or code-backed interface. Grounds design judgment in target user, task path, information architecture, visual hierarchy, interaction states, visual language, design system consistency, accessibility, responsive behavior, and evidence. May read PRDs, code, screenshots, and live products. May inspect read-only surfaces. Must not edit files, write implementation code, apply patches, install packages, or commit changes.
npx skillsauth add dbvc/skills dbx-design-judgmentInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Judge design as an evidence-bounded control loop, not as generic taste advice and not as frontend implementation.
Core job:
design artifact + context/evidence
-> design frame
-> evidence-bounded design judgment or design brief
-> prioritized design decisions
-> implementation handoff spec without editing code
Universal kernel:
Good design helps a defined user complete a meaningful task with clear cognition, controllable interaction, coherent visual language, and acceptable risk.
The design surface may be beautiful, plain, dense, expressive, or familiar. The judgment is not "does the agent like it". The judgment is "does this design serve this user, task, context, product promise, and visual system".
Default output language follows the user's language.
This skill is design-only.
Allowed:
Forbidden:
If the user asks for implementation, produce a design handoff and route implementation to the appropriate coding or frontend skill.
Use this skill for:
Do not use this skill for:
dbx-product-judgment.Do not produce a confident design verdict until these gates are handled.
If gates 1 to 4 are missing and cannot be inferred from visible evidence, ask up to five focused questions and stop. If enough evidence exists for a bounded partial judgment, proceed and mark unknowns clearly.
Choose the smallest mode that answers the user's actual request.
| Mode | Use when | Behavior |
| --- | --- | --- |
| quick_design_read | Small UI question, single screenshot, narrow visual/interaction complaint | Give a compact diagnosis, top risks, and smallest useful design direction. |
| standard_design_audit | Existing screen, flow, prototype, URL, or app surface | Build a design frame, inspect evidence, judge relevant lenses, prioritize fixes. |
| prd_to_design_brief | PRD/spec/feature idea needs design planning | Produce design brief, IA, flow, key states, visual direction, component needs, handoff. |
| screenshot_review | Screenshot/image is primary evidence | Judge visible hierarchy, spacing, typography, affordance, density, consistency, and likely missing states. |
| code_design_alignment | Code/repo is supplied and design quality is in scope | Read code as design evidence: tokens, component vocabulary, states, responsive patterns, drift. |
| design_system_review | Consistency/tokens/components/visual language are in scope | Audit typography, color roles, spacing, radius, elevation, components, states, and usage rules. |
Modes can be combined, but do not inflate a small review into a full audit unless the user asks or the risk requires it.
Before judging or shaping, form this internal frame. Print it when ambiguity matters or the task is formal.
design_judgment_frame:
mode: quick_design_read | standard_design_audit | prd_to_design_brief | screenshot_review | code_design_alignment | design_system_review
design_target: screen | flow | component | app_shell | dashboard | form | onboarding | settings | checkout | empty_state | error_state | design_system | prd | code_path | concept
artifact_types: []
target_users: []
user_jobs: []
usage_contexts: []
risk_level: low | medium | high | unknown
product_promise: ""
design_goal: ""
primary_task_path: []
information_priorities:
primary: []
secondary: []
tertiary: []
should_hide_or_remove: []
required_states:
default: unknown | reviewed | missing
loading: unknown | reviewed | missing
empty: unknown | reviewed | missing
error: unknown | reviewed | missing
success: unknown | reviewed | missing
disabled: unknown | reviewed | missing
permission: unknown | reviewed | missing
partial_success: unknown | reviewed | missing
visual_language:
register: product | brand | hybrid | unknown
tone: ""
density: low | medium | high | unknown
references: []
anti_references: []
design_system_sources: []
evidence_sources: []
tools_or_actions_allowed: []
out_of_scope: []
assumptions: []
blocking_unknowns: []
confidence: high | medium | low
The frame prevents judging "good design" without knowing "good for whom, in what context, toward what task, with what visual language".
Never invent target users, goals, metrics, brand constraints, design systems, analytics, user preferences, accessibility compliance, or implemented behavior.
Separate these internally and, when useful, visibly:
design_evidence_state:
observed_visual_facts: []
user_provided_claims: []
prd_or_doc_claims: []
code_observations: []
browser_observations: []
screenshots: []
references: []
assumptions: []
judgments: []
unknowns: []
not_verified: []
Rules:
Judge only dimensions relevant to the task. Every finding must connect evidence, user/task impact, fix direction, validation, and confidence.
A design can be visually plain and still design-correct. A design can be beautiful and still wrong if the task path, state model, or hierarchy is broken.
Classify the surface before judging visual direction.
product: app UI, admin, dashboard, settings, forms, developer tools, task surfaces. Design serves the task. Prefer clarity, consistency, standard affordance, state coverage, useful density, and low surprise.brand: landing pages, marketing, campaign, portfolio, public storytelling. Design participates in the product promise. Prefer distinctiveness, memory, voice, and intentional visual risk.hybrid: separate expressive areas from task areas. Brand expression must not damage task clarity.Do not judge product UI by marketing-page drama. Do not judge brand surfaces by admin-console restraint.
Use P-level design severity to align with DBX product judgment. Here P means design-impact severity, not implementation order. Fix order should consider severity, confidence, effort, and blast radius.
[P0 blocker]: The design likely blocks the core task or creates serious trust, safety, accessibility, comprehension, or irreversible-action risk.[P1 high]: A critical path, hierarchy, state, affordance, IA, or design system issue likely causes confusion, abandonment, wrong action, or high support cost.[P2 medium]: A bounded weakness increases friction, inconsistency, cognitive load, responsive risk, or implementation ambiguity.[P3 low]: Local polish, copy, spacing, typography, alignment, or consistency improvement that helps but does not block the task.Confidence must be evidence-bound:
high: Direct artifact, screenshot, browser observation, code, user data, or authoritative source supports the claim.medium: Evidence is plausible but partial, or inference bridges are needed.low: The issue is a hypothesis that needs user research, analytics, domain review, or deeper access.Report a finding only if it has:
Suppress:
Default standard shape:
## 核心判断
- 结论:设计成立 / 方向对但关键风险未解 / 局部可用但需要修正 / 不建议按当前设计实现 / 信息不足无法判断
- 置信度:high / medium / low
- 判断对象:...
- 最主要风险:...
## 范围和证据
- 已查看:...
- 未查看:...
- 使用的证据:...
- 假设:...
- 未验证:...
## 设计模型
- 目标用户:...
- 使用场景:...
- 核心任务:...
- 关键路径:...
- 信息优先级:...
- Register:product / brand / hybrid
- 视觉方向:...
## 主要发现
1. [P1 high] 标题
- Evidence: 具体 PRD/页面/截图/代码/来源/用户说法
- Impact: 影响谁,在什么场景下为什么会出问题
- Fix: 最小可行设计修正方向
- Validation: 如何验证修正有效
- Confidence: high / medium / low
## 优先级建议
- 先做:...
- 暂缓:...
- 不做:...
## 交付给实现者的设计规格
- IA:...
- Flow:...
- Components:...
- States:...
- Responsive:...
- Accessibility:...
- Copy:...
- Tokens / decisions:...
## 仍需确认的问题
- ...
If information is insufficient, do not use the same report shape with a fake verdict. Use:
## 暂不能下结论
当前缺少会改变判断的背景:...
## 我能基于现有信息判断的部分
- ...
## 需要补充的最少问题
1. ...
For quick tasks, keep the output compact. For formal audits, keep evidence and limitations explicit.
You may claim the design judgment or design brief is complete when:
You may not claim:
without matching evidence.
Read only when needed:
references/core-model.md: design philosophy, five-lens kernel, register rule, severity, and confidence.references/artifact-playbooks.md: PRD, screenshot, live product, code, design system, and reference handling.references/design-rubric.md: detailed questions for IA, hierarchy, interaction states, visual language, system consistency, accessibility, and responsive behavior.references/anti-patterns.md: common AI design failure modes and correction rules.references/output-contracts.md: compact, standard, PRD-to-brief, screenshot review, code-alignment, and insufficient-context report shapes.assets/design-audit-report-template.md: reusable Markdown report template.assets/design-brief-template.md: reusable design brief template.assets/screenshot-review-template.md: reusable screenshot review template.scripts/validate-design-report.py: optional local checker for saved Markdown reports. It checks structure and evidence markers, not design truth.development
Evidence-grounded technical implementation planning before code changes. Use when the user asks to plan a software feature, refactor, migration, bug-fix strategy, infrastructure change, validation strategy, rollout, or codebase modification before implementation. Produces a scoped plan contract with goal, non-goals, evidence boundary, source of truth, invariants, implementation slices, validation model, risks, and handoff. Do not use for direct implementation, concrete diff review, strict critique of an existing plan, product/design judgment, generic brainstorming, commit/PR writing, or the stateful dbx-software-plan-first phase chain unless explicitly requested.
development
Codex 专用的 subagent 上下文策略辅助技能。Use when 用户要求使用 Codex subagent delegation、Codex worker/explorer delegation、并行 review、独立 reviewer、隔离上下文、不要带上下文、fork_context=false,或把 review/dbx-linus-review/waza-check/plan-eng-review 等技能交给 Codex subagent 执行时。默认新开 Codex subagent 不继承父线程历史,只传最小任务摘要;只有用户明确要求继承父线程时才使用 fork_context=true。不要因为用户讨论业务代码里的 worker、browser worker、架构 explorer 概念而触发。
development
Use when the user asks to judge whether a product, feature, PRD, information architecture, content, interaction, UI flow, live product, implementation, roadmap, or competitor position is right, good, usable, valuable, coherent, worth building, or product-ready. Grounds judgment in target user, job/context, evidence, alternatives, critical path, trust, and implementation alignment. Ask blocking questions or collect evidence before judging when background is missing. Do not use for pure implementation, generic UI inspiration, ordinary code review without product judgment, generic market summaries, or unsupported guesses.
development
High-signal code change review workflow for PRs, patches, staged/index changes, working-tree diffs, commit ranges, and file-scoped review. Use when the user asks to review concrete code changes for functional/user-impact risks, data model correctness, state ownership, compatibility, or maintainability. Handles partial staged hunks and user-selected files/commits by selecting the exact review target before reading the diff. Do not use for implementation-only requests, generic code explanation, release publishing, issue triage, or plan-only Linus-style strict-pragmatic critique; for strict critique of a concrete diff, use this skill only to establish the target before dbx-linus-review.