Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

primatrix/beaver-audit

Name: beaver-audit
Author: primatrix

plugins/beaver/skills/beaver-audit/SKILL.md

npx skillsauth add primatrix/skills beaver-audit

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Beaver Audit

Audit the decomposition quality of a size/L parent Issue's sub-tasks. Checks three dimensions: coverage, atomicity, and test definitions.

References beaver-engine for: guardrails (Section 3), label ops (Section 4), transition execution (Section 6).

Prerequisites

gh auth status must succeed
Target Issue must be size/L with sub-issues

Workflow

Step 1: Load parent Issue

gh api repos/{owner}/{repo}/issues/{number} \
  --jq '{title, body, labels: [.labels[].name], milestone: (.milestone.title // null)}'

Verify it has size/L label. If not, inform the user this skill is for size/L issues only.

Step 2: Load all sub-issues

gh api repos/{owner}/{repo}/issues/{number}/sub_issues \
  -H "X-GitHub-Api-Version: 2026-03-10" \
  --jq '.[] | {number, title, body, labels: [.labels[].name]}'

If no sub-issues found, inform user and exit.

Step 3: LLM Audit — three checks

For each sub-issue, evaluate:

A. Coverage

Compare the parent Issue's Objective and Acceptance Criteria against the combined scope of all sub-issues. Identify:

Covered modules/requirements
Gaps: requirements in the parent that no sub-issue addresses

B. Atomicity (200 LOC)

For each sub-issue, estimate whether the implementation can fit within 200 lines of core code (excluding tests, docs, generated files). Flag sub-issues that appear too large.

Criteria for "too large":

Touches multiple independent modules
Requires both API + UI changes
Description implies significant new infrastructure

C. Test Definition

Check each sub-issue's body for a testing section. Look for:

Explicit "Test Method" or "How to Test" or "Test Plan" section
Specific test scenarios or commands
Mark as missing if no testing guidance found

Step 4: Generate audit report

Present as a table:

## Beaver Audit Report: #{parent_number} {parent_title}

### Coverage Analysis
- Covered: {list of covered requirements}
- Gaps: {list of uncovered requirements, or "None"}

### Sub-task Details

| # | Title | Atomicity | Test Def | Issues |
|---|-------|-----------|----------|--------|
| {n} | {title} | pass/warn | pass/fail | {details} |

### Summary
- Total sub-tasks: {count}
- Passing all checks: {count}
- Needing attention: {count}

Step 5: Apply labels for failures

For each sub-issue with missing test definition:

gh api repos/{owner}/{repo}/issues/{sub_number}/labels --method POST -f "labels[]=beaver/missing-test"

For each sub-issue flagged as too large:

gh api repos/{owner}/{repo}/issues/{sub_number}/labels --method POST -f "labels[]=beaver/needs-split"

If coverage gaps found, add to parent:

gh api repos/{owner}/{repo}/issues/{number}/labels --method POST -f "labels[]=beaver/missing-context"

Step 6: Post audit summary as Issue comment

Write the generated report to a temporary file first, then post it:

AUDIT_REPORT_FILE=$(mktemp)
cat > "$AUDIT_REPORT_FILE" << 'BEAVEREOF'
{rendered_audit_report}
BEAVEREOF

gh api repos/{owner}/{repo}/issues/{number}/comments --method POST \
  --raw-field body=@"$AUDIT_REPORT_FILE"
rm "$AUDIT_REPORT_FILE"

Step 7: Conditional transition

If ALL checks pass (no gaps, all atomic, all have test defs):

Ask user: "All checks passed. Transition parent to status/ready-to-develop?"
If confirmed: execute transition per engine Section 6

If ANY check fails:

Keep current status
Inform user what needs fixing

Constraints

Only works on size/L issues with sub-issues
Atomicity is an LLM estimate, not exact — flag as warning not failure
Never auto-transition without user confirmation
Audit comments in English where applicable

primatrix/beaver-audit

plugins/beaver/skills/beaver-audit/SKILL.md

Audit the decomposition of a size/L Beaver issue into sub-tasks. Checks coverage, atomicity (200 LOC limit), and test definitions. Trigger when the user wants to review task decomposition quality.

testing

Updated Apr 15, 2026

$ install --global

skillsauth

npx skillsauth add primatrix/skills beaver-audit

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 15, 2026, 8:19 AM44.4s1 file scanned

SKILL.md

name:: beaver-audit
description:: Audit the decomposition of a size/L Beaver issue into sub-tasks. Checks coverage, atomicity (200 LOC limit), and test definitions. Trigger when the user wants to review task decomposition quality.
argument-hint:: <issue-number>

Beaver Audit

Audit the decomposition quality of a size/L parent Issue's sub-tasks. Checks three dimensions: coverage, atomicity, and test definitions.

References beaver-engine for: guardrails (Section 3), label ops (Section 4), transition execution (Section 6).

Prerequisites

gh auth status must succeed
Target Issue must be size/L with sub-issues

Workflow

Step 1: Load parent Issue

gh api repos/{owner}/{repo}/issues/{number} \
  --jq '{title, body, labels: [.labels[].name], milestone: (.milestone.title // null)}'

Verify it has size/L label. If not, inform the user this skill is for size/L issues only.

Step 2: Load all sub-issues

gh api repos/{owner}/{repo}/issues/{number}/sub_issues \
  -H "X-GitHub-Api-Version: 2026-03-10" \
  --jq '.[] | {number, title, body, labels: [.labels[].name]}'

If no sub-issues found, inform user and exit.

Step 3: LLM Audit — three checks

For each sub-issue, evaluate:

A. Coverage

Compare the parent Issue's Objective and Acceptance Criteria against the combined scope of all sub-issues. Identify:

Covered modules/requirements
Gaps: requirements in the parent that no sub-issue addresses

B. Atomicity (200 LOC)

For each sub-issue, estimate whether the implementation can fit within 200 lines of core code (excluding tests, docs, generated files). Flag sub-issues that appear too large.

Criteria for "too large":

Touches multiple independent modules
Requires both API + UI changes
Description implies significant new infrastructure

C. Test Definition

Check each sub-issue's body for a testing section. Look for:

Explicit "Test Method" or "How to Test" or "Test Plan" section
Specific test scenarios or commands
Mark as missing if no testing guidance found

Step 4: Generate audit report

Present as a table:

## Beaver Audit Report: #{parent_number} {parent_title}

### Coverage Analysis
- Covered: {list of covered requirements}
- Gaps: {list of uncovered requirements, or "None"}

### Sub-task Details

| # | Title | Atomicity | Test Def | Issues |
|---|-------|-----------|----------|--------|
| {n} | {title} | pass/warn | pass/fail | {details} |

### Summary
- Total sub-tasks: {count}
- Passing all checks: {count}
- Needing attention: {count}

Step 5: Apply labels for failures

For each sub-issue with missing test definition:

gh api repos/{owner}/{repo}/issues/{sub_number}/labels --method POST -f "labels[]=beaver/missing-test"

For each sub-issue flagged as too large:

gh api repos/{owner}/{repo}/issues/{sub_number}/labels --method POST -f "labels[]=beaver/needs-split"

If coverage gaps found, add to parent:

gh api repos/{owner}/{repo}/issues/{number}/labels --method POST -f "labels[]=beaver/missing-context"

Step 6: Post audit summary as Issue comment

Write the generated report to a temporary file first, then post it:

AUDIT_REPORT_FILE=$(mktemp)
cat > "$AUDIT_REPORT_FILE" << 'BEAVEREOF'
{rendered_audit_report}
BEAVEREOF

gh api repos/{owner}/{repo}/issues/{number}/comments --method POST \
  --raw-field body=@"$AUDIT_REPORT_FILE"
rm "$AUDIT_REPORT_FILE"

Step 7: Conditional transition

If ALL checks pass (no gaps, all atomic, all have test defs):

Ask user: "All checks passed. Transition parent to status/ready-to-develop?"
If confirmed: execute transition per engine Section 6

If ANY check fails:

Keep current status
Inform user what needs fixing

Constraints

Only works on size/L issues with sub-issues
Atomicity is an LLM estimate, not exact — flag as warning not failure
Never auto-transition without user confirmation
Audit comments in English where applicable

Related Skills

primatrix/memory-profile

development

VerifiedTrustedCommunity

Use when analyzing TPU pretraining HBM occupancy from a profile directory — locates the static HBM peak (the same number TensorBoard's Memory Viewer shows), enumerates every buffer alive at the peak schedule moment with size / HLO instruction / opcode / op_name, and rolls the alive set up by opcode and op_name. Reads compile-time `*.hlo_proto.pb` (BufferAssignmentProto) as the primary source; runtime `*.xplane.pb` allocator events are a secondary, often-truncated signal.

SKILL.mdUpdated May 27, 2026

primatrix/memory-profile

primatrix/compute-breakdown

testing

VerifiedTrustedCommunity

Use when analyzing TPU pretraining compute efficiency from xplane.pb — produces source-line-aggregated HLO duration tables, layer-scoped breakdowns, non-compute (padding/cast/copy) audits, and v7x roofline shortfall vs theoretical peak. Reads schema documented by profile-anatomy.

SKILL.mdUpdated May 25, 2026

primatrix/compute-breakdown

primatrix/plugins/tpu-perf/skills/comm-analysis

tools

VerifiedTrustedCommunity

--- name: comm-analysis description: Use when analyzing communication on a TPU pretraining profile — extracts every comm primitive (async + sync, TC + SparseCore), attributes axes via HLO replica_groups, computes per-row NCCL bus BW vs per-axis peak ICI BW (peak_link × k_torus_dims × directions_per_dim; TPUv7x: 200 GB/s bidir per link on a 3D torus; util% requires `--mesh-spec` with topology), and reports per-step compute/comm overlap. Builds on profile-anatomy. --- # Communication Analysis **

SKILL.mdUpdated May 25, 2026

primatrix/plugins/tpu-perf/skills/comm-analysis

primatrix/profile-anatomy

documentation

VerifiedTrustedCommunity

Use when reading TPU pretraining profiles (xplane.pb, trace.json.gz) — describes the on-disk layout, the XSpace/XPlane/XLine/XEvent/XStat hierarchy, and provides reference scripts that future tpu-perf skills can read as schema documentation.

SKILL.mdUpdated May 24, 2026

primatrix/profile-anatomy

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/primatrix/skills.git

# Copy into Claude Code skills folder (global)
cp -r skills/plugins/beaver/skills/beaver-audit ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

primatrix/skills

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT