Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

mthines/confidence

Name: confidence
Author: mthines

skills/confidence/SKILL.md

npx skillsauth add mthines/gw-tools confidence

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Confidence Assessment

Rate your confidence that the current work fully solves the stated requirement.

Mode Detection

Check the arguments: $ARGUMENTS

| Argument | Default | Validates | When to use | | -------------- | ------- | -------------------------------- | --------------------------------------------------- | | plan | | Implementation plan completeness | After Phase 1 planning, before autonomous execution | | code | yes | Code implementation correctness | After writing code, before PR | | bug-analysis | | Root cause analysis accuracy | During investigation, before proposing fix |

If no argument is provided, default to code.

If arguments contain "fix" (e.g., code fix, plan fix), run in Fix Mode — after the review, automatically apply fixes for any concerns found.

Assessment Dimensions

For `plan` mode

| Dimension | Weight | What to evaluate | | ---------------- | ------ | ---------------------------------------------------------------------------------------------------------------- | | Completeness | 40% | Are ALL Phase 0 requirements captured? All sections populated? Could a new session execute from this plan alone? | | Feasibility | 30% | Is the technical approach sound? Are patterns consistent with the codebase? Are risks identified? | | No ambiguity | 30% | Are implementation steps specific enough to execute without interpretation? Are edge cases addressed? |

For `code` mode

| Dimension | Weight | What to evaluate | | ------------------ | ------ | ------------------------------------------------------------- | | Correctness | 40% | Does the logic actually address the problem as described? | | Completeness | 30% | Are all cases, edge cases, and requirements covered? | | No regressions | 30% | Could this break existing behavior or introduce side effects? |

For `bug-analysis` mode

| Dimension | Weight | What to evaluate | | ------------------------ | ------ | ---------------------------------------------------------------------------- | | Evidence strength | 40% | Is the analysis backed by concrete evidence (logs, traces, code paths)? | | Root cause certainty | 30% | Is this the root cause or just a symptom? How deep did the investigation go? | | Fix confidence | 30% | Will the proposed fix resolve the issue without introducing new problems? |

Output Format

You MUST output in this exact format:

## Confidence: X%

| Dimension | Score | Notes |
|-----------|-------|-------|
| <dim 1>   | X%    | ...   |
| <dim 2>   | X%    | ...   |
| <dim 3>   | X%    | ...   |

Calculate the overall score as the weighted average using the weights above.

Be honest and critical — do not inflate scores. A low score with clear reasoning is more valuable than a false 95%.

Score Thresholds

| Score | Action | | ------------- | ------------------------------------------------------------------------------------------- | | 90-100% | Proceed — work is ready | | 70-89% | List specific concerns and what would raise confidence. If in Fix Mode, apply fixes. | | Below 70% | Recommend concrete next steps to validate or fix. Do NOT proceed with autonomous execution. |

Iteration Protocol (plan mode)

When used as a quality gate before autonomous execution:

If confidence is below 90%, do up to 2 iterations of additional research, analysis, and evidence collection to raise the score. After each iteration, re-run the confidence assessment. If still below 90% after 2 iterations, present findings to the user and ask whether to proceed or refine further.

Auto-Fix (Fix Mode Only)

Skip this section entirely if not in Fix Mode.

When running in Fix Mode (plan fix, code fix, bug-analysis fix), automatically address every concern that lowered your score:

Simple Fixes (apply immediately)

Fix these without asking — they are low-risk and mechanical:

Missing edge case handling with obvious implementation
Missing null/undefined checks
Off-by-one errors or incorrect boundary conditions
Typos in strings, comments, or variable names
Missing return types or type annotations where the type is clear
Small logic errors with an unambiguous correction
(plan mode) Missing sections, incomplete requirements, vague implementation steps

After applying each fix, briefly note what was changed (one line per fix).

Complex Fixes (plan, then apply)

For issues requiring more thought:

Missing test coverage for uncovered paths
Incomplete implementations (missing cases, unhandled states)
Architectural concerns or incorrect abstractions
(plan mode) Fundamental approach issues, missing technical design

For each, output:

### [Issue title]
**Why:** [1-sentence explanation]
**Fix plan:**
1. [Step 1]
2. [Step 2]
**Files involved:** [list]

Then execute the plan.

Post-Fix Re-Assessment

After all fixes are applied:

Re-run the confidence assessment with updated scores
List what was fixed and how each fix improved the score
If confidence is still below 90%, list remaining concerns that could not be auto-fixed

mthines/confidence

skills/confidence/SKILL.md

Rate confidence that the current work fully solves the stated requirement. Supports plan validation, code review, and bug analysis modes. Use before committing to autonomous execution, after implementation, or during investigation. Triggers on confidence check, validate plan, rate confidence, or quality gate.

6 stars

development

Updated Apr 19, 2026

$ install --global

skillsauth

npx skillsauth add mthines/gw-tools confidence

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 19, 2026, 4:17 AM8.2s1 file scanned

SKILL.md

name:: confidence
description:: >
license:: MIT
author:: mthines
version:: 1.0.0
workflow_type:: advisory

Confidence Assessment

Rate your confidence that the current work fully solves the stated requirement.

Mode Detection

Check the arguments: $ARGUMENTS

If no argument is provided, default to code.

If arguments contain "fix" (e.g., code fix, plan fix), run in Fix Mode — after the review, automatically apply fixes for any concerns found.

Assessment Dimensions

For `plan` mode

For `code` mode

For `bug-analysis` mode

Output Format

You MUST output in this exact format:

## Confidence: X%

| Dimension | Score | Notes |
|-----------|-------|-------|
| <dim 1>   | X%    | ...   |
| <dim 2>   | X%    | ...   |
| <dim 3>   | X%    | ...   |

Calculate the overall score as the weighted average using the weights above.

Be honest and critical — do not inflate scores. A low score with clear reasoning is more valuable than a false 95%.

Score Thresholds

Iteration Protocol (plan mode)

When used as a quality gate before autonomous execution:

If confidence is below 90%, do up to 2 iterations of additional research, analysis, and evidence collection to raise the score. After each iteration, re-run the confidence assessment. If still below 90% after 2 iterations, present findings to the user and ask whether to proceed or refine further.

Auto-Fix (Fix Mode Only)

Skip this section entirely if not in Fix Mode.

When running in Fix Mode (plan fix, code fix, bug-analysis fix), automatically address every concern that lowered your score:

Simple Fixes (apply immediately)

Fix these without asking — they are low-risk and mechanical:

Missing edge case handling with obvious implementation
Missing null/undefined checks
Off-by-one errors or incorrect boundary conditions
Typos in strings, comments, or variable names
Missing return types or type annotations where the type is clear
Small logic errors with an unambiguous correction
(plan mode) Missing sections, incomplete requirements, vague implementation steps

After applying each fix, briefly note what was changed (one line per fix).

Complex Fixes (plan, then apply)

For issues requiring more thought:

Missing test coverage for uncovered paths
Incomplete implementations (missing cases, unhandled states)
Architectural concerns or incorrect abstractions
(plan mode) Fundamental approach issues, missing technical design

For each, output:

### [Issue title]
**Why:** [1-sentence explanation]
**Fix plan:**
1. [Step 1]
2. [Step 2]
**Files involved:** [list]

Then execute the plan.

Post-Fix Re-Assessment

After all fixes are applied:

Re-run the confidence assessment with updated scores
List what was fixed and how each fix improved the score
If confidence is still below 90%, list remaining concerns that could not be auto-fixed

Related Skills

mthines/git-worktree-workflows

tools

VerifiedTrustedCommunity

Use the `gw` CLI for ALL Git worktree work — creating, navigating, listing, removing, syncing, updating, checking out PRs, troubleshooting. Replaces raw `git worktree`, `cd ../wt`, `git checkout -b`, and manual file copies. Triggers on: "spin up a branch", "work on a feature", "check out PR", "switch branch without stashing", "create a worktree", "parallel branches", "clean up branches", "gw", "gw add", "gw checkout", "git worktree", or any branch workflow.

8SKILL.mdUpdated Apr 19, 2026

mthines/git-worktree-workflows

mthines/gw-config-management

tools

VerifiedTrustedCommunity

Configure .gw/config.json for gw-tools repos — auto-copy files, hooks, cleanup thresholds, update strategy, and the config migration system. Use when: setting up gw for a new project, adding or changing a config field, adding a hook, configuring auto-copy patterns, asking what fields gw config supports, running gw init, adding a migration, bumping configVersion, keeping schema.json in sync, or troubleshooting missing env files in worktrees.

7SKILL.mdUpdated Apr 19, 2026

mthines/gw-config-management

mthines/autonomous-workflow

development

VerifiedTrustedCommunity

Autonomous feature development workflow using isolated worktrees. Use to autonomously implement features from task description through tested PR delivery. Handles worktree creation, implementation, testing, iteration, documentation, and PR creation. Triggers on autonomous feature development, end-to-end implementation, or "implement X autonomously."

7SKILL.mdUpdated Apr 19, 2026

mthines/autonomous-workflow

mthines/create-walkthrough

development

VerifiedTrustedCommunity

Generate a walkthrough artifact (walkthrough.md) summarizing completed work for PR delivery. Gathers information from plan.md, git history, and test results to produce a comprehensive summary. Use at Phase 6 before creating the draft PR. Triggers on create walkthrough, generate walkthrough, write walkthrough artifact.

6SKILL.mdUpdated Apr 19, 2026

mthines/create-walkthrough

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/mthines/gw-tools.git

# Copy into Claude Code skills folder (global)
cp -r gw-tools/skills/confidence ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

mthines/gw-tools

6 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT

Adoption

mthines/confidence

$ install --global

Security Scan Results

SKILL.md

Confidence Assessment

Mode Detection

Assessment Dimensions

For plan mode

For code mode

For bug-analysis mode

Output Format

Score Thresholds

Iteration Protocol (plan mode)

Auto-Fix (Fix Mode Only)

Simple Fixes (apply immediately)

Complex Fixes (plan, then apply)

Post-Fix Re-Assessment

Related Skills

mthines/git-worktree-workflows

mthines/gw-config-management

mthines/autonomous-workflow

mthines/create-walkthrough

mthines/confidence

$ install --global

Security Scan Results

SKILL.md

Confidence Assessment

Mode Detection

Assessment Dimensions

For plan mode

For code mode

For bug-analysis mode

Output Format

Score Thresholds

Iteration Protocol (plan mode)

Auto-Fix (Fix Mode Only)

Simple Fixes (apply immediately)

Complex Fixes (plan, then apply)

Post-Fix Re-Assessment

Related Skills

mthines/git-worktree-workflows

mthines/gw-config-management

mthines/autonomous-workflow

mthines/create-walkthrough

For `plan` mode

For `code` mode

For `bug-analysis` mode

For `plan` mode

For `code` mode

For `bug-analysis` mode