Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

jamie-bitflight/analyze-test-failures

Name: analyze-test-failures
Author: jamie-bitflight

plugins/python3-development/skills/analyze-test-failures/SKILL.md

npx skillsauth add jamie-bitflight/claude_skills analyze-test-failures

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Analyze Test Failures

Analyze failing test cases with a balanced, investigative approach.

Context

Consult ../python3-development/references/python3-standards.md when shared testing or quality rules from this plugin apply; full standards, graphs, and amendment process are documented there.

When tests fail, there are two primary possibilities:

False positive: The test itself is incorrect
True positive: The test discovered a genuine bug

Assuming tests are wrong by default is a dangerous anti-pattern that defeats the purpose of testing.

Analysis Process

1. Initial Analysis

Read the failing test carefully, understanding its intent
Examine the test's assertions and expected behavior
Review the error message and stack trace

2. Investigate the Implementation

Check the actual implementation being tested
Trace through the code path that leads to the failure
Verify that implementation matches documented behavior

3. Apply Critical Thinking

For each failing test, ask:

What behavior is the test trying to verify?
Is this behavior clearly documented or implied by the API design?
Does the current implementation actually provide this behavior?
Could this be an edge case the implementation missed?

4. Make a Determination

Classify the failure as one of:

| Classification | Meaning | | ---------------------- | --------------------------------- | | Test Bug | Test's expectations are incorrect | | Implementation Bug | Code doesn't behave as it should | | Ambiguous | Intended behavior is unclear |

5. Document Reasoning

Provide clear explanation including:

Evidence supporting the conclusion
Specific mismatch between expectation and reality
Recommended fix (to test or implementation)

Example Analyses

Example 1: Ambiguous Behavior

Scenario: Test expects calculateDiscount(100, 0.2) to return 20, but it returns 80

Analysis:

Test assumes function returns discount amount
Implementation returns price after discount
Function name is ambiguous

Determination: Ambiguous Recommendation: Check documentation or clarify intended behavior

Example 2: Implementation Bug

Scenario: Test expects validateEmail("[email protected]") to return true, but it returns false

Analysis:

Test provides a valid email format
Implementation regex is missing support for dots in domain
Other valid emails also fail

Determination: Implementation Bug Recommendation: Fix the regex to properly validate email addresses per RFC standards

Example 3: Test Bug

Scenario: Test expects divide(10, 0) to return 0, but it throws an error

Analysis:

Test assumes division by zero returns 0
Implementation throws DivisionByZeroError
Standard mathematical behavior is to treat as undefined/error

Determination: Test Bug Recommendation: Update test to expect an error, not 0

Output Format

For each failing test, provide:

Test: [test name/description]
Failure: [what failed and how]

Investigation:
- Test expects: [expected behavior]
- Implementation does: [actual behavior]
- Root cause: [why they differ]

Determination: [Test Bug | Implementation Bug | Ambiguous]

Recommendation:
[Specific fix to either test or implementation]

Key Principles

NEVER automatically assume the test is wrong
ALWAYS consider that the test might have found a real bug
When uncertain, lean toward investigating the implementation
Tests are often your specification - they define expected behavior
A failing test is a gift - it's either catching a bug or clarifying requirements

Related Skills

test-failure-mindset: Set investigative approach for session
comprehensive-test-review: Full test suite review

jamie-bitflight/analyze-test-failures

plugins/python3-development/skills/analyze-test-failures/SKILL.md

Use when analyzing failing test cases to determine whether failures indicate genuine bugs or test implementation issues. Activates on "analyze failing tests", "debug test failures", "investigate test errors", or when provided with specific failing test names or output. Applies balanced investigative reasoning — does not auto-fix tests without establishing root cause.

39 stars

development

Updated Apr 28, 2026

$ install --global

skillsauth

npx skillsauth add jamie-bitflight/claude_skills analyze-test-failures

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 28, 2026, 2:17 PM303.2s1 file scanned

SKILL.md

name:: analyze-test-failures
description:: Use when analyzing failing test cases to determine whether failures indicate genuine bugs or test implementation issues. Activates on "analyze failing tests", "debug test failures", "investigate test errors", or when provided with specific failing test names or output. Applies balanced investigative reasoning — does not auto-fix tests without establishing root cause.
argument-hint:: <test_file_or_test_name>
user-invocable:: true

Analyze Test Failures

Analyze failing test cases with a balanced, investigative approach.

Context

Consult ../python3-development/references/python3-standards.md when shared testing or quality rules from this plugin apply; full standards, graphs, and amendment process are documented there.

When tests fail, there are two primary possibilities:

False positive: The test itself is incorrect
True positive: The test discovered a genuine bug

Assuming tests are wrong by default is a dangerous anti-pattern that defeats the purpose of testing.

Analysis Process

1. Initial Analysis

Read the failing test carefully, understanding its intent
Examine the test's assertions and expected behavior
Review the error message and stack trace

2. Investigate the Implementation

Check the actual implementation being tested
Trace through the code path that leads to the failure
Verify that implementation matches documented behavior

3. Apply Critical Thinking

For each failing test, ask:

What behavior is the test trying to verify?
Is this behavior clearly documented or implied by the API design?
Does the current implementation actually provide this behavior?
Could this be an edge case the implementation missed?

4. Make a Determination

Classify the failure as one of:

5. Document Reasoning

Provide clear explanation including:

Evidence supporting the conclusion
Specific mismatch between expectation and reality
Recommended fix (to test or implementation)

Example Analyses

Example 1: Ambiguous Behavior

Scenario: Test expects calculateDiscount(100, 0.2) to return 20, but it returns 80

Analysis:

Test assumes function returns discount amount
Implementation returns price after discount
Function name is ambiguous

Determination: Ambiguous Recommendation: Check documentation or clarify intended behavior

Example 2: Implementation Bug

Scenario: Test expects validateEmail("[email protected]") to return true, but it returns false

Analysis:

Test provides a valid email format
Implementation regex is missing support for dots in domain
Other valid emails also fail

Determination: Implementation Bug Recommendation: Fix the regex to properly validate email addresses per RFC standards

Example 3: Test Bug

Scenario: Test expects divide(10, 0) to return 0, but it throws an error

Analysis:

Test assumes division by zero returns 0
Implementation throws DivisionByZeroError
Standard mathematical behavior is to treat as undefined/error

Determination: Test Bug Recommendation: Update test to expect an error, not 0

Output Format

For each failing test, provide:

Test: [test name/description]
Failure: [what failed and how]

Investigation:
- Test expects: [expected behavior]
- Implementation does: [actual behavior]
- Root cause: [why they differ]

Determination: [Test Bug | Implementation Bug | Ambiguous]

Recommendation:
[Specific fix to either test or implementation]

Key Principles

NEVER automatically assume the test is wrong
ALWAYS consider that the test might have found a real bug
When uncertain, lean toward investigating the implementation
Tests are often your specification - they define expected behavior
A failing test is a gift - it's either catching a bug or clarifying requirements

Related Skills

test-failure-mindset: Set investigative approach for session
comprehensive-test-review: Full test suite review

Related Skills

jamie-bitflight/xdg-base-directory

development

VerifiedTrustedCommunity

When an application needs to store config, data, cache, or state files. When designing where user-specific files should live. When code writes to ~/.appname or hardcoded home paths. When implementing cross-platform file storage with platformdirs.

39SKILL.mdUpdated Apr 30, 2026

jamie-bitflight/xdg-base-directory

jamie-bitflight/verification-gate

testing

VerifiedTrustedCommunity

Enforce mandatory pre-action verification checkpoints to prevent pattern-matching from overriding explicit reasoning. Use this skill when about to execute implementation actions (Bash, Write, Edit) to verify hypothesis-action alignment. Blocks execution when hypothesis unverified or action targets different system than hypothesis identified. Critical for preventing cognitive dissonance where correct diagnosis leads to wrong implementation.

39SKILL.mdUpdated Apr 30, 2026

jamie-bitflight/verification-gate

jamie-bitflight/twelve-factor-app

tools

VerifiedTrustedCommunity

Reference guide for the Twelve-Factor App methodology — 15 principles (12 original + 3 modern extensions) for building portable, resilient, cloud-native applications. Use when evaluating application architecture, designing cloud-native services, reviewing codebases for methodology compliance, advising on configuration, scaling, observability, security, and deployment patterns. Incorporates the 2025 open-source community evolution and cloud-native reinterpretations of each factor.

39SKILL.mdUpdated Apr 30, 2026

jamie-bitflight/twelve-factor-app

jamie-bitflight/user-docs-to-ai-skill

tools

VerifiedTrustedCommunity

Converts user-facing documentation (how-to guides, tutorials, API references, examples) in any format — Markdown, PDF, DOCX, PPTX, XLSX, AsciiDoc, RST, HTML, Jupyter notebooks, man pages, TOML/YAML/JSON configs, and plain text — into Claude Code skill directories with SKILL.md plus thematically grouped references/*.md files. Use when given a docs directory or mixed-format documentation to transform into an AI skill. Uses MCP file-reader server for binary formats.

39SKILL.mdUpdated Apr 30, 2026

jamie-bitflight/user-docs-to-ai-skill

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/jamie-bitflight/claude_skills.git

# Copy into Claude Code skills folder (global)
cp -r claude_skills/plugins/python3-development/skills/analyze-test-failures ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

jamie-bitflight/claude_skills

39 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT