skills/confidence/SKILL.md
Rate confidence that the current work fully solves the stated requirement. Supports plan validation, code review, and bug analysis modes. Use before committing to autonomous execution, after implementation, or during investigation. Triggers on confidence check, validate plan, rate confidence, or quality gate.
npx skillsauth add mthines/gw-tools confidenceInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Rate your confidence that the current work fully solves the stated requirement.
Check the arguments: $ARGUMENTS
| Argument | Default | Validates | When to use |
| -------------- | ------- | -------------------------------- | --------------------------------------------------- |
| plan | | Implementation plan completeness | After Phase 1 planning, before autonomous execution |
| code | yes | Code implementation correctness | After writing code, before PR |
| bug-analysis | | Root cause analysis accuracy | During investigation, before proposing fix |
If no argument is provided, default to code.
If arguments contain "fix" (e.g., code fix, plan fix), run in Fix Mode — after the review, automatically apply fixes for any concerns found.
plan mode| Dimension | Weight | What to evaluate | | ---------------- | ------ | ---------------------------------------------------------------------------------------------------------------- | | Completeness | 40% | Are ALL Phase 0 requirements captured? All sections populated? Could a new session execute from this plan alone? | | Feasibility | 30% | Is the technical approach sound? Are patterns consistent with the codebase? Are risks identified? | | No ambiguity | 30% | Are implementation steps specific enough to execute without interpretation? Are edge cases addressed? |
code mode| Dimension | Weight | What to evaluate | | ------------------ | ------ | ------------------------------------------------------------- | | Correctness | 40% | Does the logic actually address the problem as described? | | Completeness | 30% | Are all cases, edge cases, and requirements covered? | | No regressions | 30% | Could this break existing behavior or introduce side effects? |
bug-analysis mode| Dimension | Weight | What to evaluate | | ------------------------ | ------ | ---------------------------------------------------------------------------- | | Evidence strength | 40% | Is the analysis backed by concrete evidence (logs, traces, code paths)? | | Root cause certainty | 30% | Is this the root cause or just a symptom? How deep did the investigation go? | | Fix confidence | 30% | Will the proposed fix resolve the issue without introducing new problems? |
You MUST output in this exact format:
## Confidence: X%
| Dimension | Score | Notes |
|-----------|-------|-------|
| <dim 1> | X% | ... |
| <dim 2> | X% | ... |
| <dim 3> | X% | ... |
Calculate the overall score as the weighted average using the weights above.
Be honest and critical — do not inflate scores. A low score with clear reasoning is more valuable than a false 95%.
| Score | Action | | ------------- | ------------------------------------------------------------------------------------------- | | 90-100% | Proceed — work is ready | | 70-89% | List specific concerns and what would raise confidence. If in Fix Mode, apply fixes. | | Below 70% | Recommend concrete next steps to validate or fix. Do NOT proceed with autonomous execution. |
When used as a quality gate before autonomous execution:
If confidence is below 90%, do up to 2 iterations of additional research, analysis, and evidence collection to raise the score. After each iteration, re-run the confidence assessment. If still below 90% after 2 iterations, present findings to the user and ask whether to proceed or refine further.
Skip this section entirely if not in Fix Mode.
When running in Fix Mode (plan fix, code fix, bug-analysis fix), automatically address every concern that lowered your score:
Fix these without asking — they are low-risk and mechanical:
After applying each fix, briefly note what was changed (one line per fix).
For issues requiring more thought:
For each, output:
### [Issue title]
**Why:** [1-sentence explanation]
**Fix plan:**
1. [Step 1]
2. [Step 2]
**Files involved:** [list]
Then execute the plan.
After all fixes are applied:
tools
Use the `gw` CLI for ALL Git worktree work — creating, navigating, listing, removing, syncing, updating, checking out PRs, troubleshooting. Replaces raw `git worktree`, `cd ../wt`, `git checkout -b`, and manual file copies. Triggers on: "spin up a branch", "work on a feature", "check out PR", "switch branch without stashing", "create a worktree", "parallel branches", "clean up branches", "gw", "gw add", "gw checkout", "git worktree", or any branch workflow.
tools
Configure .gw/config.json for gw-tools repos — auto-copy files, hooks, cleanup thresholds, update strategy, and the config migration system. Use when: setting up gw for a new project, adding or changing a config field, adding a hook, configuring auto-copy patterns, asking what fields gw config supports, running gw init, adding a migration, bumping configVersion, keeping schema.json in sync, or troubleshooting missing env files in worktrees.
development
Autonomous feature development workflow using isolated worktrees. Use to autonomously implement features from task description through tested PR delivery. Handles worktree creation, implementation, testing, iteration, documentation, and PR creation. Triggers on autonomous feature development, end-to-end implementation, or "implement X autonomously."
development
Generate a walkthrough artifact (walkthrough.md) summarizing completed work for PR delivery. Gathers information from plan.md, git history, and test results to produce a comprehensive summary. Use at Phase 6 before creating the draft PR. Triggers on create walkthrough, generate walkthrough, write walkthrough artifact.