Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

acaprino/parallel-debugging

Name: parallel-debugging
Author: acaprino

plugins/agent-teams/skills/parallel-debugging/SKILL.md

npx skillsauth add acaprino/anvil-toolset parallel-debugging

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Parallel Debugging

Framework for debugging complex issues using the Analysis of Competing Hypotheses (ACH) methodology with parallel agent investigation.

When to Use This Skill

Bug has multiple plausible root causes
Initial debugging attempts haven't identified the issue
Issue spans multiple modules or components
Need systematic root cause analysis with evidence
Want to avoid confirmation bias in debugging

Hypothesis Generation Framework

Generate hypotheses across 6 failure mode categories:

1. Logic Error

Incorrect conditional logic (wrong operator, missing case)
Off-by-one errors in loops or array access
Missing edge case handling
Incorrect algorithm implementation

2. Data Issue

Invalid or unexpected input data
Type mismatch or coercion error
Null/undefined/None where value expected
Encoding or serialization problem
Data truncation or overflow

3. State Problem

Race condition between concurrent operations
Stale cache returning outdated data
Incorrect initialization or default values
Unintended mutation of shared state
State machine transition error

4. Integration Failure

API contract violation (request/response mismatch)
Version incompatibility between components
Configuration mismatch between environments
Missing or incorrect environment variables
Network timeout or connection failure

5. Resource Issue

Memory leak causing gradual degradation
Connection pool exhaustion
File descriptor or handle leak
Disk space or quota exceeded
CPU saturation from inefficient processing

6. Environment

Missing runtime dependency
Wrong library or framework version
Platform-specific behavior difference
Permission or access control issue
Timezone or locale-related behavior

Evidence Collection Standards

What Constitutes Evidence

| Evidence Type | Strength | Example | | ----------------- | -------- | --------------------------------------------------------------- | | Direct | Strong | Code at file.ts:42 shows if (x > 0) should be if (x >= 0) | | Correlational | Medium | Error rate increased after commit abc123 | | Testimonial | Weak | "It works on my machine" | | Absence | Variable | No null check found in the code path |

Citation Format

Always cite evidence with file:line references:

**Evidence**: The validation function at `src/validators/user.ts:87`
does not check for empty strings, only null/undefined. This allows
empty email addresses to pass validation.

Confidence Levels

| Level | Criteria | | ------------------- | ----------------------------------------------------------------------------------- | | High (>80%) | Multiple direct evidence pieces, clear causal chain, no contradicting evidence | | Medium (50-80%) | Some direct evidence, plausible causal chain, minor ambiguities | | Low (<50%) | Mostly correlational evidence, incomplete causal chain, some contradicting evidence |

Result Arbitration Protocol

After all investigators report:

Step 1: Categorize Results

Confirmed: High confidence, strong evidence, clear causal chain
Plausible: Medium confidence, some evidence, reasonable causal chain
Falsified: Evidence contradicts the hypothesis
Inconclusive: Insufficient evidence to confirm or falsify

Step 2: Compare Confirmed Hypotheses

If multiple hypotheses are confirmed, rank by:

Confidence level
Number of supporting evidence pieces
Strength of causal chain
Absence of contradicting evidence

Step 3: Determine Root Cause

If one hypothesis clearly dominates: declare as root cause
If multiple hypotheses are equally likely: may be compound issue (multiple contributing causes)
If no hypotheses confirmed: generate new hypotheses based on evidence gathered

Step 4: Validate Fix

Before declaring the bug fixed:

[ ] Fix addresses the identified root cause
[ ] Fix doesn't introduce new issues
[ ] Original reproduction case no longer fails
[ ] Related edge cases are covered
[ ] Relevant tests are added or updated

Runtime Evidence Pattern

When arbitration leaves hypotheses Plausible or Inconclusive and static evidence is exhausted, inject targeted runtime logs to disambiguate. The pattern is designed so cleanup is mechanical (grep + delete) and the logs never leak into production.

Log Convention

Every injected log MUST follow this format:

[DEBUG] [{file}:{line}] {short description} { {relevant vars} }

Concrete examples per language:

// JS/TS
console.log("[DEBUG] [auth.ts:42] before token verify", { tokenLen: token.length, hasUser: !!user })

# Python
print(f"[DEBUG] [auth.py:42] before token verify", {"token_len": len(token), "has_user": user is not None})
# or
logger.debug("[DEBUG] [auth.py:42] before token verify token_len=%d has_user=%s", len(token), user is not None)

// Rust
eprintln!("[DEBUG] [auth.rs:42] before token verify token_len={} has_user={}", token.len(), user.is_some());

// Go
fmt.Fprintf(os.Stderr, "[DEBUG] [auth.go:42] before token verify token_len=%d has_user=%v\n", len(token), user != nil)

The [DEBUG] prefix is non-negotiable. It is the cleanup anchor.

Strategic Placement

Pick 3-5 points, not more. Flooding logs makes signal harder to extract.

| Location | Captures | | ----------------- | ----------------------------------- | | Function entry | Confirms execution path, args | | Before async call | State just before the operation | | After async call | Result, error, timing | | Conditional | Which branch was taken | | Catch block | Error name, message, partial state |

What NOT to Log

Passwords, tokens, API keys (use length or present/absent markers instead)
PII (email, phone, names) unless masked
Full request/response bodies (use sizes and selected fields)
Inside hot loops without rate limiting

Cleanup Protocol

After the bug is verified fixed, remove every [DEBUG] log. The grep is the source of truth — if grep returns zero matches, cleanup is complete.

# JS/TS
grep -rn '\[DEBUG\]' . --include='*.ts' --include='*.tsx' --include='*.js' --include='*.jsx'

# Python
grep -rn '\[DEBUG\]' . --include='*.py'

# Rust
grep -rn '\[DEBUG\]' . --include='*.rs'

# Go
grep -rn '\[DEBUG\]' . --include='*.go'

Multi-line console statements that span more than the matched line must be removed in full. Verify the file still parses after deletion.

Iteration Cap

A debug session should perform at most 2 rounds of log injection. If after the second round the hypotheses are still Inconclusive, escalate to the user with a written summary of what has been ruled out and what additional context (MCP access, production logs, a minimal reproduction) would be needed.

Specialized Investigation Agents

When a hypothesis falls into one of the 6 failure mode categories above, prefer a specialized investigator over a generic team-debugger. The specialized agent loads the right knowledge base automatically and produces higher-precision findings.

| Failure mode category | Preferred specialized agent | Notes | |---|---|---| | Logic Error | senior-review:code-auditor | Failure-flow tracing + pattern consistency | | Data Issue | senior-review:code-auditor or senior-review:security-auditor | The latter when the data crosses a trust boundary | | State Problem (concurrency, cache, mutation) | senior-review:ui-race-auditor for UI; senior-review:distributed-flow-auditor for cross-service | | | Integration Failure | senior-review:distributed-flow-auditor | Both sides of the contract | | Resource Issue | senior-review:code-auditor + react-development:react-performance-optimizer (if frontend) | | | Environment | senior-review:chicken-egg-detector | Startup cycles, init order, config bootstrap |

team-debugger remains the fallback when no specialized agent matches the hypothesis cleanly, or when the investigation is too cross-cutting for a single specialist.

Sub-spawning caveat

When a team-debugger spawns a specialized sub-agent via the Agent tool (e.g. to deepen one of the 6 categories), that sub-agent itself cannot spawn further sub-agents (Claude Agent SDK restriction: one-level subagent nesting). If a hypothesis requires deeper delegation, the debugger reports this to the team lead rather than chaining indirectly; the lead has the team-level view to decide whether to re-spawn at the top level or escalate to the user.

Output Persistence

The team-debugger agent and the specialized investigators all accept a spawner-provided output file path. When the orchestrator spawns an investigator and wants the structured report on disk (the default in /team-review and similar pipelines), the spawn prompt must include Write your final report to <path>. Investigators write directly with the Write tool rather than returning the report only as message text.

Reference: docs/references/agent-teams-best-practices.md § Operational do's and don'ts.

acaprino/parallel-debugging

plugins/agent-teams/skills/parallel-debugging/SKILL.md

Debug complex issues using competing hypotheses with parallel investigation, evidence collection, and root cause arbitration. Use this skill when debugging bugs with multiple potential causes, performing root cause analysis, or organizing parallel investigation workflows.

2 stars

development

Updated May 20, 2026

$ install --global

skillsauth

npx skillsauth add acaprino/anvil-toolset parallel-debugging

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: May 20, 2026, 4:04 AM39.1s2 files scanned

SKILL.md

name:: parallel-debugging
description:: >
version:: 1.2.0

Parallel Debugging

Framework for debugging complex issues using the Analysis of Competing Hypotheses (ACH) methodology with parallel agent investigation.

When to Use This Skill

Bug has multiple plausible root causes
Initial debugging attempts haven't identified the issue
Issue spans multiple modules or components
Need systematic root cause analysis with evidence
Want to avoid confirmation bias in debugging

Hypothesis Generation Framework

Generate hypotheses across 6 failure mode categories:

1. Logic Error

Incorrect conditional logic (wrong operator, missing case)
Off-by-one errors in loops or array access
Missing edge case handling
Incorrect algorithm implementation

2. Data Issue

Invalid or unexpected input data
Type mismatch or coercion error
Null/undefined/None where value expected
Encoding or serialization problem
Data truncation or overflow

3. State Problem

Race condition between concurrent operations
Stale cache returning outdated data
Incorrect initialization or default values
Unintended mutation of shared state
State machine transition error

4. Integration Failure

API contract violation (request/response mismatch)
Version incompatibility between components
Configuration mismatch between environments
Missing or incorrect environment variables
Network timeout or connection failure

5. Resource Issue

Memory leak causing gradual degradation
Connection pool exhaustion
File descriptor or handle leak
Disk space or quota exceeded
CPU saturation from inefficient processing

6. Environment

Missing runtime dependency
Wrong library or framework version
Platform-specific behavior difference
Permission or access control issue
Timezone or locale-related behavior

Evidence Collection Standards

What Constitutes Evidence

Citation Format

Always cite evidence with file:line references:

**Evidence**: The validation function at `src/validators/user.ts:87`
does not check for empty strings, only null/undefined. This allows
empty email addresses to pass validation.

Confidence Levels

Result Arbitration Protocol

After all investigators report:

Step 1: Categorize Results

Confirmed: High confidence, strong evidence, clear causal chain
Plausible: Medium confidence, some evidence, reasonable causal chain
Falsified: Evidence contradicts the hypothesis
Inconclusive: Insufficient evidence to confirm or falsify

Step 2: Compare Confirmed Hypotheses

If multiple hypotheses are confirmed, rank by:

Confidence level
Number of supporting evidence pieces
Strength of causal chain
Absence of contradicting evidence

Step 3: Determine Root Cause

If one hypothesis clearly dominates: declare as root cause
If multiple hypotheses are equally likely: may be compound issue (multiple contributing causes)
If no hypotheses confirmed: generate new hypotheses based on evidence gathered

Step 4: Validate Fix

Before declaring the bug fixed:

[ ] Fix addresses the identified root cause
[ ] Fix doesn't introduce new issues
[ ] Original reproduction case no longer fails
[ ] Related edge cases are covered
[ ] Relevant tests are added or updated

Runtime Evidence Pattern

Log Convention

Every injected log MUST follow this format:

[DEBUG] [{file}:{line}] {short description} { {relevant vars} }

Concrete examples per language:

// JS/TS
console.log("[DEBUG] [auth.ts:42] before token verify", { tokenLen: token.length, hasUser: !!user })

# Python
print(f"[DEBUG] [auth.py:42] before token verify", {"token_len": len(token), "has_user": user is not None})
# or
logger.debug("[DEBUG] [auth.py:42] before token verify token_len=%d has_user=%s", len(token), user is not None)

// Rust
eprintln!("[DEBUG] [auth.rs:42] before token verify token_len={} has_user={}", token.len(), user.is_some());

// Go
fmt.Fprintf(os.Stderr, "[DEBUG] [auth.go:42] before token verify token_len=%d has_user=%v\n", len(token), user != nil)

The [DEBUG] prefix is non-negotiable. It is the cleanup anchor.

Strategic Placement

Pick 3-5 points, not more. Flooding logs makes signal harder to extract.

What NOT to Log

Passwords, tokens, API keys (use length or present/absent markers instead)
PII (email, phone, names) unless masked
Full request/response bodies (use sizes and selected fields)
Inside hot loops without rate limiting

Cleanup Protocol

After the bug is verified fixed, remove every [DEBUG] log. The grep is the source of truth — if grep returns zero matches, cleanup is complete.

# JS/TS
grep -rn '\[DEBUG\]' . --include='*.ts' --include='*.tsx' --include='*.js' --include='*.jsx'

# Python
grep -rn '\[DEBUG\]' . --include='*.py'

# Rust
grep -rn '\[DEBUG\]' . --include='*.rs'

# Go
grep -rn '\[DEBUG\]' . --include='*.go'

Multi-line console statements that span more than the matched line must be removed in full. Verify the file still parses after deletion.

Iteration Cap

Specialized Investigation Agents

team-debugger remains the fallback when no specialized agent matches the hypothesis cleanly, or when the investigation is too cross-cutting for a single specialist.

Sub-spawning caveat

Output Persistence

Reference: docs/references/agent-teams-best-practices.md § Operational do's and don'ts.

Related Skills

acaprino/review-quality-gates

development

VerifiedTrustedCommunity

Quality gates for multi-reviewer code review pipelines: adversarial verification panel, completeness critic, reviewer pipeline conventions, and the context sharing pattern for parallel reviewers. TRIGGER WHEN: running /senior-review:team-review quality gates; running /senior-review:code-review Steps 4b/4c (adversarial verification and completeness check); consolidating or deduplicating findings from multiple parallel reviewers. DO NOT TRIGGER WHEN: single-reviewer style review without a consolidation phase, or generic team coordination (the upstream agent-teams skills cover that).

6SKILL.mdUpdated Jul 28, 2026

acaprino/review-quality-gates

acaprino/abstraction-architect

development

VerifiedTrustedCommunity

Knowledge base for pure-architecture decisions on when to unify duplicated logic into a shared abstraction versus leave it duplicated. Covers the canonical theory (Rule of Three, DRY/WET/AHA, Wrong Abstraction, Locality of Behaviour, Bounded Contexts, Tidy First options framing, CUPID vs SOLID), 12 essential-duplication patterns that justify unification, 12 wrong-abstraction patterns that justify inlining or decomposition, an operational decision frame, and a verified reading list. TRIGGER WHEN: the user is making an architectural decision about whether to centralize, extract, or remove a layer; reviewing an abstraction for premature generality; auditing scattered cross-cutting concerns; spawned by the abstraction-architect agent during /abstraction-architect:audit or as the Abstraction dimension of /senior-review:team-review or /senior-review:code-review; the user asks "should I extract this into a service" / "is this DRY enough" / "is this wrong abstraction". DO NOT TRIGGER WHEN: the task is code formatting and readability cleanup (use clean-code:clean-code), Python-specific refactoring with metrics (use python-development:python-refactor), generic dead-code removal (use senior-review:cleanup-dead-code), security review (use senior-review:security-auditor), or pure pattern-consistency review without an architecture lens (use senior-review:code-auditor).

6SKILL.mdUpdated May 26, 2026

acaprino/abstraction-architect

acaprino/frontend-css

development

VerifiedTrustedCommunity

Unified web frontend knowledge base covering CSS architecture, UX psychology, UI components, distinctive aesthetics, and interface design generation. TRIGGER WHEN: working on web styling, design systems, component decisions, responsive strategy, distinctive frontend aesthetics, or exploring multiple interface designs. DO NOT TRIGGER WHEN: the task is purely backend or unrelated to web frontend.

6SKILL.mdUpdated May 13, 2026

acaprino/frontend-css

acaprino/stripe

development

VerifiedTrustedCommunity

Stripe payments knowledge base - API patterns, checkout optimization, subscription lifecycle, pricing strategies, webhook reliability, Firebase integration, cost analysis, and revenue modeling. Loaded by stripe-integrator and revenue-optimizer agents; also consumable directly when the user asks for Stripe-specific patterns without needing an agent. TRIGGER WHEN: working with Stripe API (Payment Intents, Customers, Subscriptions, Checkout Sessions, Connect, webhooks, tax, usage-based billing), pricing strategy, or revenue modeling. DO NOT TRIGGER WHEN: payment work is non-Stripe (PayPal, Square, crypto) or the task is generic e-commerce unrelated to payments.

6SKILL.mdUpdated Apr 20, 2026

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/acaprino/anvil-toolset.git

# Copy into Claude Code skills folder (global)
cp -r anvil-toolset/plugins/agent-teams/skills/parallel-debugging ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

acaprino/anvil-toolset

2 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT