Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

drn/ci-investigate

Name: ci-investigate
Author: drn

agents/skills/ci-investigate/SKILL.md

npx skillsauth add drn/dots ci-investigate

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

CI Failure Investigator

Investigate flaky CI failures across multiple workflow runs to identify patterns, categorize root causes, and propose targeted fixes.

Arguments

$ARGUMENTS - Optional: job name filter, number of runs to check, or branch name. Examples: "test-core-rspec", "--runs 10", "master"

If no arguments are provided, investigate the most recent failing workflow on the current branch.

Context

Current branch: !git branch --show-current
Repo remote: !git remote get-url origin 2>/dev/null | head -1
CI config: !find . -maxdepth 2 $ -name ".circleci" -o -name ".github" $ -type d 2>/dev/null | head -5
CircleCI config snippet: !head -30 .circleci/config.yml 2>/dev/null | head -30

Instructions

Investigate CI failures by fetching multiple workflow runs, extracting test results, and categorizing failure patterns.

Step 1: Determine Scope

Parse $ARGUMENTS for:

Job name filter (e.g., "test-core-rspec") -- only analyze jobs matching this name
Run count (e.g., "--runs 10") -- how many recent runs to check (default: 5, max: 15)
Branch -- which branch to investigate (default: current branch from context above)

Derive the project slug from the git remote URL. For GitHub repos, format is "gh/org/repo".

Report the scope to the user before proceeding:

Project slug
Branch
Job filter (if any)
Number of runs to fetch

Step 2: Fetch Recent Workflow Runs

Detect the CI provider from the context above and use the appropriate tools.

CircleCI (if .circleci/ exists): Use ToolSearch to find available CircleCI MCP tools (search "circleci"). Then:

Fetch recent pipelines for the project slug and branch
For each pipeline, get its workflows
For each workflow, get jobs and their statuses
Filter to failed jobs (and optionally by job name from Step 1)

GitHub Actions (if .github/workflows/ exists): Use the gh CLI:

gh run list --branch {branch} --limit {N} to get recent runs
gh run view {run_id} to get job details
gh run view {run_id} --log-failed to get failure output

Collect up to the target number of failed runs. If a workflow has no failures, skip it but note it as a passing run (useful for calculating flake rate).

Track:

Total workflow runs examined
Number with failures vs passing
Which specific jobs failed in each run

Step 3: Extract Failure Details

For each failed job found in Step 2:

CircleCI:

Fetch test results for the failed job (with artifacts and logs if the MCP tools support it)
If 0 test failures but non-zero exit code, fetch raw job logs. Flag as infrastructure failure.

GitHub Actions:

Use gh run view {run_id} --log-failed to get failure output
Parse test framework output (JUnit XML, RSpec, Jest, etc.) for structured results

For each failure, extract:

Test file path
Test name
Error message and stack trace
Node/container number (for parallel test splitting issues)

Use parallel Task tool calls to fetch test results for multiple jobs simultaneously when possible.

Step 4: Categorize Failures

Group failures by test file + test name. For each group, classify the root cause:

| Category | Signals | Common Fix | |----------|---------|------------| | Deterministic | Fails every run, same error | Fix the test or code -- this is a real bug | | Timing flake | Intermittent, error involves time comparison, values differ by milliseconds | Freeze time in tests, use tolerance matchers | | Parallel collision | Hard-coded IDs, PK violations, "Duplicate entry" | Use sequences/auto-increment, avoid hard-coded IDs | | Test isolation | Order-dependent failures, shared mutable state between tests | Reset state between tests, avoid global side effects | | Infrastructure | 0 test failures + exit code 1, OOM killed, container timeout | Retry or investigate resource limits | | External dependency | Timeout connecting to external service, API errors | Add retry logic or stub external calls in tests |

If a failure does not clearly fit one category, mark it as Unclassified and include the full error for manual review.

Step 5: Report Findings

Present the findings in this format:

## CI Investigation Report

**Scope:** {project} / {branch} / {job filter or "all jobs"}
**Runs analyzed:** {N} ({pass_count} passed, {fail_count} failed)
**Overall flake rate:** {fail_count/N * 100}%

### Failure Groups (by frequency)

#### 1. {test_file}:{test_name} -- {category}
- **Frequency:** {X}/{N} runs ({percentage}%)
- **Error:** {1-2 line error summary}
- **Affected nodes:** {node numbers if relevant}
- **Root cause:** {explanation}
- **Proposed fix:**
  {specific code change or strategy}

#### 2. ...

### Infrastructure Failures
{List any jobs that failed with 0 test failures}

### Cross-Repo Issues
{Flag any fixes needed in shared gems (Nucleus, etc.) or CI configuration}

### Recommended Priority
1. {highest frequency flake} -- affects X% of runs
2. ...

Step 6: Offer Next Steps

After presenting the report, ask the user if they want to:

Fix a specific flake -- implement the proposed fix for a chosen failure group
Investigate deeper -- fetch more runs or drill into a specific failure
Export the report -- save findings to a file for reference

drn/ci-investigate

agents/skills/ci-investigate/SKILL.md

Investigate flaky CI failures across multiple workflow runs to identify patterns, categorize root causes, and propose fixes. Use when asked to investigate CI failures, find flaky tests, diagnose test flakiness, or understand why CI is failing repeatedly.

23 stars

development

Updated Apr 21, 2026

$ install --global

skillsauth

npx skillsauth add drn/dots ci-investigate

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 21, 2026, 10:41 AM52.2s1 file scanned

SKILL.md

name:: ci-investigate
description:: Investigate flaky CI failures across multiple workflow runs to identify patterns, categorize root causes, and propose fixes. Use when asked to investigate CI failures, find flaky tests, diagnose test flakiness, or understand why CI is failing repeatedly.

CI Failure Investigator

Investigate flaky CI failures across multiple workflow runs to identify patterns, categorize root causes, and propose targeted fixes.

Arguments

$ARGUMENTS - Optional: job name filter, number of runs to check, or branch name. Examples: "test-core-rspec", "--runs 10", "master"

If no arguments are provided, investigate the most recent failing workflow on the current branch.

Context

Current branch: !git branch --show-current
Repo remote: !git remote get-url origin 2>/dev/null | head -1
CI config: !find . -maxdepth 2 $ -name ".circleci" -o -name ".github" $ -type d 2>/dev/null | head -5
CircleCI config snippet: !head -30 .circleci/config.yml 2>/dev/null | head -30

Instructions

Investigate CI failures by fetching multiple workflow runs, extracting test results, and categorizing failure patterns.

Step 1: Determine Scope

Parse $ARGUMENTS for:

Job name filter (e.g., "test-core-rspec") -- only analyze jobs matching this name
Run count (e.g., "--runs 10") -- how many recent runs to check (default: 5, max: 15)
Branch -- which branch to investigate (default: current branch from context above)

Derive the project slug from the git remote URL. For GitHub repos, format is "gh/org/repo".

Report the scope to the user before proceeding:

Project slug
Branch
Job filter (if any)
Number of runs to fetch

Step 2: Fetch Recent Workflow Runs

Detect the CI provider from the context above and use the appropriate tools.

CircleCI (if .circleci/ exists): Use ToolSearch to find available CircleCI MCP tools (search "circleci"). Then:

Fetch recent pipelines for the project slug and branch
For each pipeline, get its workflows
For each workflow, get jobs and their statuses
Filter to failed jobs (and optionally by job name from Step 1)

GitHub Actions (if .github/workflows/ exists): Use the gh CLI:

gh run list --branch {branch} --limit {N} to get recent runs
gh run view {run_id} to get job details
gh run view {run_id} --log-failed to get failure output

Collect up to the target number of failed runs. If a workflow has no failures, skip it but note it as a passing run (useful for calculating flake rate).

Track:

Total workflow runs examined
Number with failures vs passing
Which specific jobs failed in each run

Step 3: Extract Failure Details

For each failed job found in Step 2:

CircleCI:

Fetch test results for the failed job (with artifacts and logs if the MCP tools support it)
If 0 test failures but non-zero exit code, fetch raw job logs. Flag as infrastructure failure.

GitHub Actions:

Use gh run view {run_id} --log-failed to get failure output
Parse test framework output (JUnit XML, RSpec, Jest, etc.) for structured results

For each failure, extract:

Test file path
Test name
Error message and stack trace
Node/container number (for parallel test splitting issues)

Use parallel Task tool calls to fetch test results for multiple jobs simultaneously when possible.

Step 4: Categorize Failures

Group failures by test file + test name. For each group, classify the root cause:

If a failure does not clearly fit one category, mark it as Unclassified and include the full error for manual review.

Step 5: Report Findings

Present the findings in this format:

## CI Investigation Report

**Scope:** {project} / {branch} / {job filter or "all jobs"}
**Runs analyzed:** {N} ({pass_count} passed, {fail_count} failed)
**Overall flake rate:** {fail_count/N * 100}%

### Failure Groups (by frequency)

#### 1. {test_file}:{test_name} -- {category}
- **Frequency:** {X}/{N} runs ({percentage}%)
- **Error:** {1-2 line error summary}
- **Affected nodes:** {node numbers if relevant}
- **Root cause:** {explanation}
- **Proposed fix:**
  {specific code change or strategy}

#### 2. ...

### Infrastructure Failures
{List any jobs that failed with 0 test failures}

### Cross-Repo Issues
{Flag any fixes needed in shared gems (Nucleus, etc.) or CI configuration}

### Recommended Priority
1. {highest frequency flake} -- affects X% of runs
2. ...

Step 6: Offer Next Steps

After presenting the report, ask the user if they want to:

Fix a specific flake -- implement the proposed fix for a chosen failure group
Investigate deeper -- fetch more runs or drill into a specific failure
Export the report -- save findings to a file for reference

Related Skills

drn/slides

development

VerifiedTrustedCommunity

Build a self-contained, single-file HTML presentation deck from talking points or a source doc, using a terminal/TUI-styled template with keyboard, tap, and swipe navigation. Use when the user wants to create slides, build a presentation or deck, turn talking points or a doc into a talk, make an HTML slideshow, or produce a presentation as a shareable artifact (instead of Google Slides).

23SKILL.mdUpdated Jun 13, 2026

drn/markdown-preview

development

VerifiedTrustedCommunity

Render a Markdown file to GitHub-flavored HTML and open a styled local preview (light + dark) in the browser. Use when the user wants to preview markdown, see how a README renders on GitHub, check that relative screenshots or images display correctly, or get a GitHub-like local preview without installing grip or glow.

23SKILL.mdUpdated Jun 13, 2026

drn/complete

tools

VerifiedTrustedCommunity

Mark the current Argus task as complete. Use when the work for the current worktree is done and the user wants the task to transition to the "complete" status.

23SKILL.mdUpdated Jun 13, 2026

drn/orchestrate

development

VerifiedTrustedCommunity

Launch a dynamic Workflow where the top-tier session model (Fable) handles planning and orchestration while implementation subagents run on Sonnet for routine tasks and Opus for complex ones. Use when the user wants to orchestrate a build, a dynamic workflow, a model-tiered build, fable planning with sonnet and opus implementation, or tiered agents.

23SKILL.mdUpdated Jun 12, 2026

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/drn/dots.git

# Copy into Claude Code skills folder (global)
cp -r dots/agents/skills/ci-investigate ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

drn/dots

23 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT