blueprint-plugin/skills/confidence-scoring/SKILL.md
Assess quality of PRPs and work-orders using systematic confidence scoring. Use when evaluating readiness for execution or subagent delegation.
This skill provides systematic evaluation of PRPs (Product Requirement Prompts) and work-orders to determine their readiness for execution or delegation.
| Use this skill when... | Use blueprint-prp-create instead when... |
|---|---|
| Scoring a draft PRP/work-order before execution or delegation | Authoring the PRP itself from research and context |
| Deciding whether a work-order is ready for a subagent or needs refinement | Producing the work-order content (use blueprint-work-order) |
| Reviewing existing PRPs/work-orders for context completeness and validation gates | Generating the PRD that the PRP derives from (use blueprint-derive-plans) |
Evaluates whether all necessary context is explicitly provided.
| Score | Criteria |
|-------|----------|
| 10 | All file paths explicit with line numbers, all code snippets included, library versions specified, integration points documented |
| 8-9 | Most context provided, minor gaps that can be inferred from codebase |
| 6-7 | Key context present but some discovery required |
| 4-5 | Significant context missing, will need exploration |
| 1-3 | Minimal context, extensive discovery needed |
Checklist:
src/auth.py:45-60)

Evaluates how clear the implementation approach is.
| Score | Criteria |
|-------|----------|
| 10 | Pseudocode covers all cases, step-by-step clear, edge cases addressed |
| 8-9 | Main path clear, most edge cases covered |
| 6-7 | Implementation approach clear, some details need discovery |
| 4-5 | High-level only, significant ambiguity |
| 1-3 | Vague requirements, unclear approach |
Checklist:
Evaluates whether known pitfalls are documented with mitigations.
| Score | Criteria |
|-------|----------|
| 10 | All known pitfalls documented, each has mitigation, library-specific issues covered |
| 8-9 | Major gotchas covered, mitigations clear |
| 6-7 | Some gotchas documented, may discover more |
| 4-5 | Few gotchas mentioned, incomplete coverage |
| 1-3 | No gotchas documented |
Checklist:
Evaluates whether executable validation commands are provided.
| Score | Criteria |
|-------|----------|
| 10 | All quality gates have executable commands, expected outcomes specified |
| 8-9 | Main validation commands present, most outcomes specified |
| 6-7 | Some validation commands, gaps in coverage |
| 4-5 | Minimal validation commands |
| 1-3 | No executable validation |
Checklist:
Evaluates whether test cases are specified.
| Score | Criteria |
|-------|----------|
| 10 | All test cases specified with assertions, edge cases covered |
| 8-9 | Main test cases specified, most assertions included |
| 6-7 | Key test cases present, some gaps |
| 4-5 | Few test cases, minimal detail |
| 1-3 | No test cases specified |
Checklist:
For PRPs: Overall = (Context + Implementation + Gotchas + Validation) / 4

For work-orders: Overall = (Context + Gotchas + TestCoverage + Validation) / 4
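The two averages above can be sketched in a few lines of Python. This is only an illustration: it assumes the first formula applies to PRPs and the second to work-orders, and the names `PRP_DIMENSIONS`, `WORK_ORDER_DIMENSIONS`, and `overall_score` are hypothetical, not part of the skill.

```python
# Hypothetical dimension sets; the PRP / work-order split is an assumption.
PRP_DIMENSIONS = ("context", "implementation", "gotchas", "validation")
WORK_ORDER_DIMENSIONS = ("context", "gotchas", "test_coverage", "validation")


def overall_score(scores: dict[str, int], dimensions: tuple[str, ...]) -> float:
    """Return the unweighted mean of the given dimensions, rounded to one decimal."""
    missing = [d for d in dimensions if d not in scores]
    if missing:
        raise ValueError(f"missing dimension scores: {missing}")
    for d in dimensions:
        # Each rubric in this document scores a dimension from 1 to 10.
        if not 1 <= scores[d] <= 10:
            raise ValueError(f"{d} must be between 1 and 10, got {scores[d]}")
    return round(sum(scores[d] for d in dimensions) / len(dimensions), 1)


if __name__ == "__main__":
    prp = {"context": 9, "implementation": 8, "gotchas": 8, "validation": 9}
    print(overall_score(prp, PRP_DIMENSIONS))  # 8.5
```

Each dimension carries equal weight here because the formulas divide a plain sum by 4; a weighted variant would be a departure from what the document specifies.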
| Score | Readiness | Recommendation |
|-------|-----------|----------------|
| 9-10 | Excellent | Ready for autonomous subagent execution |
| 7-8 | Good | Ready for execution with some discovery |
| 5-6 | Fair | Needs refinement before execution |
| 3-4 | Poor | Significant gaps, recommend research phase |
| 1-2 | Inadequate | Restart with proper research |
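The readiness table lists integer bands, while overall scores are usually fractional (for example 8.5). One possible way to map a score to a band is sketched below. It assumes each band's lower number acts as an inclusive threshold, which is an interpretation rather than something the table states; `READINESS_BANDS` and `readiness` are hypothetical names.

```python
# (lower bound, readiness, recommendation), highest band first.
# Using the lower bound of each band as an inclusive threshold is an assumption.
READINESS_BANDS = [
    (9.0, "Excellent", "Ready for autonomous subagent execution"),
    (7.0, "Good", "Ready for execution with some discovery"),
    (5.0, "Fair", "Needs refinement before execution"),
    (3.0, "Poor", "Significant gaps, recommend research phase"),
    (1.0, "Inadequate", "Restart with proper research"),
]


def readiness(score: float) -> tuple[str, str]:
    """Map an overall score (1-10) to its readiness label and recommendation."""
    if not 1.0 <= score <= 10.0:
        raise ValueError(f"score must be between 1 and 10, got {score}")
    for lower, label, recommendation in READINESS_BANDS:
        if score >= lower:
            return label, recommendation
    raise AssertionError("unreachable: every valid score is >= 1.0")


if __name__ == "__main__":
    print(readiness(8.5))  # ('Good', 'Ready for execution with some discovery')
```

Under this reading, a score of 8.5 falls in the "Good" band and 5.0 in the "Fair" band.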
## Confidence Score: X.X/10
| Dimension | Score | Notes |
|-----------|-------|-------|
| Context Completeness | X/10 | [specific observation] |
| Implementation Clarity | X/10 | [specific observation] |
| Gotchas Documented | X/10 | [specific observation] |
| Validation Coverage | X/10 | [specific observation] |
| **Overall** | **X.X/10** | |
**Assessment:** Ready for execution
**Strengths:**
- [Key strength 1]
- [Key strength 2]
**Recommendations (optional):**
- [Minor improvement 1]
## Confidence Score: X.X/10
| Dimension | Score | Notes |
|-----------|-------|-------|
| Context Completeness | X/10 | [specific gap] |
| Implementation Clarity | X/10 | [specific gap] |
| Gotchas Documented | X/10 | [specific gap] |
| Validation Coverage | X/10 | [specific gap] |
| **Overall** | **X.X/10** | |
**Assessment:** Needs refinement before execution
**Gaps to Address:**
- [ ] [Gap 1 with suggested action]
- [ ] [Gap 2 with suggested action]
- [ ] [Gap 3 with suggested action]
**Next Steps:**
1. [Specific research action]
2. [Specific documentation action]
3. [Specific validation action]
## Confidence Score: 8.5/10
| Dimension | Score | Notes |
|-----------|-------|-------|
| Context Completeness | 9/10 | All files explicit, code snippets with line refs |
| Implementation Clarity | 8/10 | Pseudocode covers main path, one edge case unclear |
| Gotchas Documented | 8/10 | Redis connection pool, JWT format issues covered |
| Validation Coverage | 9/10 | All gates have commands, outcomes specified |
| **Overall** | **8.5/10** | |
**Assessment:** Ready for execution
**Strengths:**
- Comprehensive codebase intelligence with actual code snippets
- Validation gates are copy-pasteable
- Known library gotchas well-documented
**Recommendations:**
- Consider documenting concurrent token refresh edge case
## Confidence Score: 5.0/10
| Dimension | Score | Notes |
|-----------|-------|-------|
| Context Completeness | 4/10 | File paths vague ("somewhere in auth/") |
| Implementation Clarity | 6/10 | High-level approach clear, no pseudocode |
| Gotchas Documented | 3/10 | No library-specific gotchas |
| Validation Coverage | 7/10 | Test command present, missing lint/type check |
| **Overall** | **5.0/10** | |
**Assessment:** Needs refinement before execution
**Gaps to Address:**
- [ ] Add explicit file paths (use `grep` to find them)
- [ ] Add pseudocode for token generation logic
- [ ] Research jsonwebtoken gotchas (check GitHub issues)
- [ ] Add linting and type checking commands
**Next Steps:**
1. Run `/prp:curate-docs jsonwebtoken` to create ai_docs entry
2. Use Explore agent to find exact file locations
3. Add validation gate commands from project's package.json
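The report layout used in the templates and examples above is regular enough to generate. The following is a minimal sketch, assuming the caller supplies dimension names, scores, and notes; `render_report` is a hypothetical helper, and it only emits the heading, table, and assessment line, not the strengths or gaps sections.

```python
def render_report(rows: list[tuple[str, int, str]], assessment: str) -> str:
    """Render the confidence table as Markdown.

    rows: (dimension name, score out of 10, note) for each dimension.
    The overall score is the unweighted mean of the row scores.
    """
    if not rows:
        raise ValueError("at least one dimension is required")
    overall = sum(score for _, score, _ in rows) / len(rows)
    lines = [
        f"## Confidence Score: {overall:.1f}/10",
        "| Dimension | Score | Notes |",
        "|-----------|-------|-------|",
    ]
    lines += [f"| {name} | {score}/10 | {note} |" for name, score, note in rows]
    lines.append(f"| **Overall** | **{overall:.1f}/10** | |")
    lines.append(f"**Assessment:** {assessment}")
    return "\n".join(lines)


if __name__ == "__main__":
    print(render_report(
        [
            ("Context Completeness", 4, 'File paths vague ("somewhere in auth/")'),
            ("Implementation Clarity", 6, "High-level approach clear, no pseudocode"),
            ("Gotchas Documented", 3, "No library-specific gotchas"),
            ("Validation Coverage", 7, "Test command present, missing lint/type check"),
        ],
        "Needs refinement before execution",
    ))
```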
This skill is automatically applied when:
- `/prp:create` generates a new PRP
- `/blueprint:work-order` generates a work-order

The confidence score determines:
`grep` to find exact file locations