skills/dbx-product-judgment/SKILL.md
Use when the user asks to judge whether a product, feature, PRD, information architecture, content, interaction, UI flow, live product, implementation, roadmap, or competitor position is right, good, usable, valuable, coherent, worth building, or product-ready. Grounds judgment in target user, job/context, evidence, alternatives, critical path, trust, and implementation alignment. Ask blocking questions or collect evidence before judging when background is missing. Do not use for pure implementation, generic UI inspiration, ordinary code review without product judgment, generic market summaries, or unsupported guesses.
npx skillsauth add dbvc/skills dbx-product-judgmentInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Judge product correctness as an evidence-bounded control loop, not as a taste checklist.
Core job:
product artifact + context/evidence -> product contract -> evidence-bounded judgment -> prioritized fixes and validation plan
The universal kernel is stable across domains: a product is right when it helps a defined user in a real context achieve a valuable state change with acceptable cost, risk, trust, and implementation support. Domain facts, user groups, regulations, norms, competitors, and success metrics are not universal. Ground them in supplied artifacts, user answers, observed product behavior, code, PRD, or current external sources.
Default output language follows the user's language.
Use this skill for:
Do not use this skill for:
Do not produce a confident product verdict until these gates are handled.
If gates 1 to 4 are missing and cannot be inferred from visible evidence, ask up to five blocking questions and stop. If enough evidence exists for a bounded partial judgment, proceed and mark unknowns clearly.
Never invent target users, business goals, metrics, domain rules, competitor facts, conversion rates, technical architecture, or user behavior.
Separate these categories internally and, when useful, visibly:
product_evidence_state:
observed_facts: []
user_provided_claims: []
document_claims: []
code_observations: []
external_sources: []
assumptions: []
judgments: []
unknowns: []
not_verified: []
Rules:
Choose the smallest mode that can answer the user’s actual question.
| Mode | Use when | Behavior |
| --- | --- | --- |
| quick_product_read | Small product/feature question, screenshot, single flow, early idea | Give a compact verdict, top risks, and next validation action. |
| standard_product_audit | Default for product, feature, PRD, flow, or implementation judgment | Build the product contract, inspect evidence, judge core dimensions, prioritize fixes. |
| deep_product_audit | High-stakes, ambiguous, large product, major roadmap, B2B workflow, regulated or revenue-critical flow | Add competitor/domain research, implementation alignment, risk model, validation design, and uncertainty map. |
| prd_review | PRD, requirements, roadmap, user stories, specs | Judge problem framing, users, success criteria, scope, edge cases, risks, metrics, implementation handoff. |
| live_product_walkthrough | URL, prototype, app, web product, or interactive surface | Explore representative flows, take screenshots when visual/interaction evidence matters, avoid external writes without approval. |
| implementation_alignment | Code/repo/API/data model is provided and product correctness is in scope | Map product promise to entities, state, APIs, errors, performance, telemetry, and validation. |
| competitive_judgment | User asks to compare competitors or market alternatives | Research current alternatives, compare on user job and switching cost, avoid stale or unsupported claims. |
| validation_design | User asks whether to build, ship, iterate, or test | Convert judgment into falsifiable assumptions, metrics, experiments, and decision gates. |
Modes can be combined, but do not inflate the audit if the user asked for a small judgment.
Before judging, form this internal contract. Print it when ambiguity matters or the task is formal.
product_judgment_contract:
mode: quick_product_read | standard_product_audit | deep_product_audit | prd_review | live_product_walkthrough | implementation_alignment | competitive_judgment | validation_design
judgment_target: overall_product | feature | flow | ia | content | interaction | ui | implementation | roadmap | competitor_position | validation_plan
artifact_types: []
target_users: []
jobs_to_be_done: []
usage_contexts: []
product_promise: ""
desired_state_change: ""
success_criteria: []
evidence_sources: []
tools_or_actions_allowed: []
out_of_scope: []
assumptions: []
blocking_unknowns: []
confidence: high | medium | low
The contract is not decoration. It prevents judging “good” without knowing “good for whom, in what situation, toward what outcome”.
Use evidence in this order when available:
Artifact-specific rules:
Judge only dimensions relevant to the task. For each dimension, connect the judgment to user outcome, evidence, and trade-off.
A product can be visually simple but product-correct. A product can be beautiful and still wrong if the value path is broken.
Use product severity, not aesthetic intensity.
[P0 blocker]: The product likely fails the core job, creates serious user harm, violates trust/safety, or should not ship without redesign or expert review.[P1 high]: Important user flow, value proposition, data/state model, trust mechanism, or validation assumption is likely wrong or unprotected.[P2 medium]: Real but bounded weakness that increases friction, confusion, support cost, implementation risk, or learning uncertainty.[P3 low]: Local polish, clarity, hierarchy, copy, or simplification that helps but does not change product viability.Confidence must be evidence-bound:
high: Direct artifact, observation, code, user data, or authoritative source supports the claim.medium: Evidence is plausible but partial, or inference bridges are needed.low: The issue is a hypothesis that needs user research, analytics, domain review, or deeper product access.Report a finding only if it has:
Suppress:
Use this pass only when code, architecture, APIs, or technical implementation is in scope.
Build this internal model:
implementation_alignment_model:
product_promise: ""
changed_or_relevant_user_paths: []
core_entities: []
state_owners: []
lifecycles: []
public_contracts: []
persistence_or_schema: []
permissions_or_privacy: []
async_failure_surfaces: []
performance_surfaces: []
observability_and_metrics: []
validation_coverage: []
Judge whether the implementation makes the intended product behavior natural, reliable, observable, and evolvable. Prefer findings about wrong entity boundaries, duplicated sources of truth, hidden state, missing error states, broken contracts, privacy gaps, missing metrics, and fragile flows over style comments.
Default shape:
## 核心判断
- 结论:做对 / 方向对但未证明 / 局部成立但关键风险未解 / 不建议继续 / 信息不足无法判断
- 置信度:high / medium / low
- 判断对象:...
- 最主要风险:...
## 范围和证据
- 已查看:...
- 未查看:...
- 使用的证据:...
- 假设:...
- 未验证:...
## 产品模型
- 目标用户:...
- 核心场景:...
- 用户要完成的状态变化:...
- 当前关键路径:...
- 成功标准:...
## 主要发现
1. [P1 high] 标题
- Evidence: 具体 PRD/页面/截图/代码/来源/用户说法
- Impact: 影响谁,在什么场景下为什么会出问题
- Fix: 最小可行修正方向
- Validation: 如何验证修正有效
- Confidence: high / medium / low
## 优先级建议
- 先做:...
- 暂缓:...
- 不做:...
## 验证计划
- 定性验证:...
- 定量验证:...
- 技术验证:...
- Guardrail:...
## 仍需确认的问题
- ...
If information is insufficient, do not use the same report shape with a fake verdict. Use:
## 暂不能下结论
当前缺少会改变判断的背景:...
## 我能基于现有信息判断的部分
...
## 需要补充的最少问题
1. ...
For quick tasks, keep the output compact. For formal audits, keep evidence and limitations explicit.
You may claim the product judgment is complete when:
You may not claim:
Read only when needed:
references/evidence-policy.md: no-guess rules, question gates, live-product/tool safety, source hierarchy.references/product-judgment-kernel.md: detailed universal rubric, severity, confidence, and anti-patterns.references/artifact-playbooks.md: PRD, live URL, screenshot, code, competitor, analytics, and user-feedback playbooks.references/output-contracts.md: compact, standard, deep, PRD, live walkthrough, implementation alignment, and insufficient-context report shapes.assets/product-audit-report-template.md: reusable Markdown report template.assets/product-context-question-bank.md: optional question bank when context is missing.scripts/validate-product-report.py: optional local checker for saved Markdown reports. It checks structure and evidence markers, not product truth.development
Evidence-grounded technical implementation planning before code changes. Use when the user asks to plan a software feature, refactor, migration, bug-fix strategy, infrastructure change, validation strategy, rollout, or codebase modification before implementation. Produces a scoped plan contract with goal, non-goals, evidence boundary, source of truth, invariants, implementation slices, validation model, risks, and handoff. Do not use for direct implementation, concrete diff review, strict critique of an existing plan, product/design judgment, generic brainstorming, commit/PR writing, or the stateful dbx-software-plan-first phase chain unless explicitly requested.
development
Codex 专用的 subagent 上下文策略辅助技能。Use when 用户要求使用 Codex subagent delegation、Codex worker/explorer delegation、并行 review、独立 reviewer、隔离上下文、不要带上下文、fork_context=false,或把 review/dbx-linus-review/waza-check/plan-eng-review 等技能交给 Codex subagent 执行时。默认新开 Codex subagent 不继承父线程历史,只传最小任务摘要;只有用户明确要求继承父线程时才使用 fork_context=true。不要因为用户讨论业务代码里的 worker、browser worker、架构 explorer 概念而触发。
development
High-signal code change review workflow for PRs, patches, staged/index changes, working-tree diffs, commit ranges, and file-scoped review. Use when the user asks to review concrete code changes for functional/user-impact risks, data model correctness, state ownership, compatibility, or maintainability. Handles partial staged hunks and user-selected files/commits by selecting the exact review target before reading the diff. Do not use for implementation-only requests, generic code explanation, release publishing, issue triage, or plan-only Linus-style strict-pragmatic critique; for strict critique of a concrete diff, use this skill only to establish the target before dbx-linus-review.
development
Use when the user asks to judge, audit, critique, review, shape, redesign, or produce a design plan for a UI, flow, PRD, screenshot, prototype, live product, component, design system, or code-backed interface. Grounds design judgment in target user, task path, information architecture, visual hierarchy, interaction states, visual language, design system consistency, accessibility, responsive behavior, and evidence. May read PRDs, code, screenshots, and live products. May inspect read-only surfaces. Must not edit files, write implementation code, apply patches, install packages, or commit changes.