Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

bcbeidel/check-resolver

Name: check-resolver
Author: bcbeidel

plugins/build/skills/check-resolver/SKILL.md

npx skillsauth add bcbeidel/wos check-resolver

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

/build:check-resolver

Evaluate a root-level resolver in three tiers: deterministic artifact and path checks (no LLM), per-dimension semantic evaluation (one locked-rubric LLM call), and cross-artifact reachability + staleness against disk state.

This skill follows the check-skill pattern. Tier-1 detection is in 3 scripts emitting JSON envelopes via _common.py (11 rule_ids total). Tier-2 has 4 judgment dimensions read inline by the primary agent. Tier-3 cross-artifact checks are mechanized as Tier-1 rule_ids (dark-capability) or opt-in (--run-evals).

The audit rubric mirrors the authoring principles in resolver-best-practices.md. When the principles doc changes, the dimensions follow.

When to use

Also fires when the user phrases the request as:

"check RESOLVER.md"
"are my filing rules current"

Workflow

1. Discover Resolver Artifacts

Walk up from the target directory looking for RESOLVER.md. The first ancestor with one becomes the resolver root; all checks scope to that resolver and its subtree.

Locate three artifacts at the resolver root:

RESOLVER.md
AGENTS.md (for the pointer check)
.resolver/evals.yml (sibling to RESOLVER.md)

Report: "Found resolver at <resolver root>. Auditing N filing rows, M context rows, K eval cases."

If no RESOLVER.md is found anywhere up to the filesystem root, emit FAIL and stop — nothing to audit. To audit every resolver in a repo with nested resolvers, run this skill once per resolver root.

2. Tier-1 Deterministic Checks

Invoke the three detection scripts:

python3 plugins/build/skills/check-resolver/scripts/check_pointer.py <resolver-root>
python3 plugins/build/skills/check-resolver/scripts/check_resolver.py <resolver-root>
python3 plugins/build/skills/check-resolver/scripts/check_evals.py <resolver-root>

Each script emits a JSON array of envelopes. Parse stdout per script. The combined Tier-1 rule set:

Script-to-rules map (11 Tier-1 rule_ids):

| Script | rule_ids | Severity | |---|---|---| | check_pointer.py | pointer-present | fail | | check_pointer.py | pointer-resolves | fail | | check_resolver.py | markers-present | fail | | check_resolver.py | filing-paths-resolve | fail | | check_resolver.py | context-paths-resolve | fail | | check_resolver.py | filing-rows-unique | fail | | check_resolver.py | context-rows-unique | fail | | check_resolver.py | dark-capability | warn | | check_resolver.py | mtime-stale | warn | | check_evals.py | evals-parse | fail | | check_evals.py | eval-pass-stale | warn |

Each finding's recommended_changes is canonical — copy it through verbatim. recommended_changes is REQUIRED on every finding.

Tier-2 exclusion list. Any FAIL in pointer-present, pointer-resolves, markers-present, filing-paths-resolve, context-paths-resolve, filing-rows-unique, context-rows-unique, or evals-parse excludes the resolver from Tier-2 — a malformed or unreachable resolver shouldn't burn LLM budget.

WARN findings (dark-capability, mtime-stale, eval-pass-stale) never exclude. They surface alongside Tier-2 findings.

3. Tier-2 Judgment Dimensions

For resolvers that passed the Tier-2 exclusion gate, evaluate against the 4 judgment rules at references/check-*.md:

| File | Dimension | Severity | |---|---|---| | check-filing-coverage.md | D1 — every depth-1 directory classified (filing / context / out-of-scope / ambient / delegated) | warn | | check-context-actionability.md | D2 — context rows list 1-4 concrete entries, not vague prose | warn | | check-eval-representativeness.md | D3 — evals exercise both filing/context routing; ≥1 case per filing row; ≥15% negative | warn | | check-brief-presence-and-content.md | D4 — .briefs/<slug>.brief.md exists with 5 H2s; So-what is specific | warn |

Evaluator policy: see check-skill-pattern.md §Evaluator policy. Read all 4 rule files first, then evaluate the resolver in one LLM call.

Include RESOLVER.md verbatim in the Tier-2 prompt — never summarize. Include the directory scan output, .resolver/evals.yml, and .briefs/<slug>.brief.md (if present).

4. Tier-3 Cross-Artifact Checks

Dark-capabilities scan. Mechanized as Tier-1 dark-capability (already part of check_resolver.py's output). For every directory under the resolver root (depth 1–2), classify as: in-filing, in-context, in-out-of-scope, ambient (.git, node_modules, dist, build, .cache, .venv, target, __pycache__, .resolver), or delegated (nested RESOLVER.md). Anything unclassified surfaces as warn. Subdirectories of a filing dir are not auto-classified.

Managed-region drift. Currently judgment-evaluated as part of D1 (filing-coverage). Future work could mechanize as a separate Tier-3 rule that diffs the live managed region against a fresh regeneration.

Optional: --run-evals. When invoked with --run-evals, execute each case in .resolver/evals.yml against a Claude call with RESOLVER.md in context. Each failing case surfaces as a Tier-3 finding. This step is opt-in (slow and costs LLM calls).

5. Report Findings

Merge findings from all three sources (3 detection scripts' JSON envelopes + 4 Tier-2 judgment findings + optional --run-evals results) into a unified table:

| Tier | rule_id | Location | Status | Reasoning |
|------|---------|----------|--------|-----------|

Sort: fail before warn before inapplicable; Tier-1 before Tier-2 before Tier-3 within severity. Each finding's Recommendation: line copies through recommended_changes verbatim.

Close with:

Resolver audited — no findings or
Resolver audited, N findings (X fail, Y warn)

6. Opt-In Repair Loop

Ask exactly once:

"Apply fixes? Enter y (all), n (skip), or comma-separated numbers."

For each selected finding, route per the recipe in recommended_changes:

Direct edit — managed-region row corrections, AGENTS.md pointer text, eval-pass timestamp refresh. Show diff; write on confirmation.
Routed to another skill — large structural drift → /build:build-resolver --regenerate; missing filing rows → /build:build-resolver --add-filing <type>.
Tier-2 judgment — filing coverage, context actionability, eval representativeness, brief content quality. Ask the user; rewrite the section; show diff; write on confirmation.

After each applied fix, re-run the affected Tier-1 script (or re-judge the Tier-2 dimension). Terminate when the user enters n or exhausts findings.

Anti-Pattern Guards

LLM-evaluating path existence. Path existence is Tier-1's job (deterministic file checks); paths either resolve or they don't.
Per-dimension Tier-2 calls. Use one locked-rubric call per resolver — a unified rubric produces stable scoring.
Hand-managed region edits treated as valid. Any row in the managed region that doesn't regenerate from disk is drift — FAIL or WARN depending on whether the row still resolves.
Reporting without recommendations. Every finding's recommended_changes is canonical; copy it through.
Silent out-of-scope expansion. If the user asks to suppress a dark-capability finding, add the directory to the explicit out-of-scope list in RESOLVER.md; don't silently ignore.
Re-evaluating scripted rules in Tier-2. Scripts are authoritative for the 11 Tier-1 rules; trust the pass envelope.
Suppressing the inapplicable envelope. When a sub-artifact (e.g., .resolver/evals.yml) is missing, the affected Tier-1 rule emits fail — do not collapse downstream rules to silent skip.
Embellishing scripts' recommended_changes. Each rule's recipe constant is canonical guidance sourced from resolver-best-practices.md. Copy it through; do not paraphrase.

Key Instructions

Run Tier-1 first; the FAIL exclusion list above gates judgment evaluation.
Present the 4 Tier-2 dimensions in a single locked-rubric call; per-dimension calls degrade agreement.
Include RESOLVER.md verbatim in the Tier-2 prompt — never summarize.
The dark-capability scan is gated to depth 1–2 — deeper scans overwhelm with transient build outputs.
Run evals only when --run-evals is passed; eval execution is slow.
Recovery: read-only outside the Repair Loop; edits revertable via git diff / git checkout.

Handoff

Chainable to: /build:build-resolver --regenerate (rebuild managed region); /build:build-resolver --add-filing <type> (add missing filing row).

bcbeidel/check-resolver

plugins/build/skills/check-resolver/SKILL.md

Audit a root-level resolver — verify AGENTS.md pointer, managed-region integrity, filing-table coverage against disk, context-table actionability, and trigger-eval pass rate. Use when the user wants to "audit a resolver", "validate routing table", or "find dark capabilities".

1 stars

testing

Updated May 8, 2026

$ install --global

skillsauth

npx skillsauth add bcbeidel/wos check-resolver

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: May 8, 2026, 3:26 AM130.3s1 file scanned

SKILL.md

name:: check-resolver
description:: Audit a root-level resolver — verify AGENTS.md pointer, managed-region integrity, filing-table coverage against disk, context-table actionability, and trigger-eval pass rate. Use when the user wants to "audit a resolver", "validate routing table", or "find dark capabilities".
allowed-tools:: Read, Write, Edit, Bash, Grep, Glob
argument-hint:: [target directory — defaults to CWD; walks up to the nearest RESOLVER.md and audits that one]
user-invocable:: true
license:: MIT

/build:check-resolver

The audit rubric mirrors the authoring principles in resolver-best-practices.md. When the principles doc changes, the dimensions follow.

When to use

Also fires when the user phrases the request as:

"check RESOLVER.md"
"are my filing rules current"

Workflow

1. Discover Resolver Artifacts

Walk up from the target directory looking for RESOLVER.md. The first ancestor with one becomes the resolver root; all checks scope to that resolver and its subtree.

Locate three artifacts at the resolver root:

RESOLVER.md
AGENTS.md (for the pointer check)
.resolver/evals.yml (sibling to RESOLVER.md)

Report: "Found resolver at <resolver root>. Auditing N filing rows, M context rows, K eval cases."

2. Tier-1 Deterministic Checks

Invoke the three detection scripts:

python3 plugins/build/skills/check-resolver/scripts/check_pointer.py <resolver-root>
python3 plugins/build/skills/check-resolver/scripts/check_resolver.py <resolver-root>
python3 plugins/build/skills/check-resolver/scripts/check_evals.py <resolver-root>

Each script emits a JSON array of envelopes. Parse stdout per script. The combined Tier-1 rule set:

Script-to-rules map (11 Tier-1 rule_ids):

Each finding's recommended_changes is canonical — copy it through verbatim. recommended_changes is REQUIRED on every finding.

WARN findings (dark-capability, mtime-stale, eval-pass-stale) never exclude. They surface alongside Tier-2 findings.

3. Tier-2 Judgment Dimensions

For resolvers that passed the Tier-2 exclusion gate, evaluate against the 4 judgment rules at references/check-*.md:

Evaluator policy: see check-skill-pattern.md §Evaluator policy. Read all 4 rule files first, then evaluate the resolver in one LLM call.

Include RESOLVER.md verbatim in the Tier-2 prompt — never summarize. Include the directory scan output, .resolver/evals.yml, and .briefs/<slug>.brief.md (if present).

4. Tier-3 Cross-Artifact Checks

5. Report Findings

Merge findings from all three sources (3 detection scripts' JSON envelopes + 4 Tier-2 judgment findings + optional --run-evals results) into a unified table:

| Tier | rule_id | Location | Status | Reasoning |
|------|---------|----------|--------|-----------|

Sort: fail before warn before inapplicable; Tier-1 before Tier-2 before Tier-3 within severity. Each finding's Recommendation: line copies through recommended_changes verbatim.

Close with:

Resolver audited — no findings or
Resolver audited, N findings (X fail, Y warn)

6. Opt-In Repair Loop

Ask exactly once:

"Apply fixes? Enter y (all), n (skip), or comma-separated numbers."

For each selected finding, route per the recipe in recommended_changes:

Direct edit — managed-region row corrections, AGENTS.md pointer text, eval-pass timestamp refresh. Show diff; write on confirmation.
Routed to another skill — large structural drift → /build:build-resolver --regenerate; missing filing rows → /build:build-resolver --add-filing <type>.
Tier-2 judgment — filing coverage, context actionability, eval representativeness, brief content quality. Ask the user; rewrite the section; show diff; write on confirmation.

After each applied fix, re-run the affected Tier-1 script (or re-judge the Tier-2 dimension). Terminate when the user enters n or exhausts findings.

Anti-Pattern Guards

LLM-evaluating path existence. Path existence is Tier-1's job (deterministic file checks); paths either resolve or they don't.
Per-dimension Tier-2 calls. Use one locked-rubric call per resolver — a unified rubric produces stable scoring.
Hand-managed region edits treated as valid. Any row in the managed region that doesn't regenerate from disk is drift — FAIL or WARN depending on whether the row still resolves.
Reporting without recommendations. Every finding's recommended_changes is canonical; copy it through.
Silent out-of-scope expansion. If the user asks to suppress a dark-capability finding, add the directory to the explicit out-of-scope list in RESOLVER.md; don't silently ignore.
Re-evaluating scripted rules in Tier-2. Scripts are authoritative for the 11 Tier-1 rules; trust the pass envelope.
Suppressing the inapplicable envelope. When a sub-artifact (e.g., .resolver/evals.yml) is missing, the affected Tier-1 rule emits fail — do not collapse downstream rules to silent skip.
Embellishing scripts' recommended_changes. Each rule's recipe constant is canonical guidance sourced from resolver-best-practices.md. Copy it through; do not paraphrase.

Key Instructions

Run Tier-1 first; the FAIL exclusion list above gates judgment evaluation.
Present the 4 Tier-2 dimensions in a single locked-rubric call; per-dimension calls degrade agreement.
Include RESOLVER.md verbatim in the Tier-2 prompt — never summarize.
The dark-capability scan is gated to depth 1–2 — deeper scans overwhelm with transient build outputs.
Run evals only when --run-evals is passed; eval execution is slow.
Recovery: read-only outside the Repair Loop; edits revertable via git diff / git checkout.

Handoff

Chainable to: /build:build-resolver --regenerate (rebuild managed region); /build:build-resolver --add-filing <type> (add missing filing row).

Related Skills

bcbeidel/check-help-skill

tools

VerifiedTrustedCommunity

Use when the user wants to "audit a help skill", "review my plugin index", or "verify my help-skill is up to date". Audits a plugins/<plugin>/skills/help/SKILL.md against the help-skill rubric — coverage, freshness, frontmatter fidelity, plus five judgment dimensions and a trigger-collision check.

1SKILL.mdUpdated May 3, 2026

bcbeidel/check-help-skill

bcbeidel/build-help-skill

tools

VerifiedTrustedCommunity

Use when the user wants to "scaffold a help skill", "add a /<plugin>:help command", or "build a plugin index skill", or wants to give a plugin an orientation surface that lists its skills and common workflows. Produces a SKILL.md at plugins/<plugin>/skills/help/SKILL.md.

1SKILL.mdUpdated May 3, 2026

bcbeidel/build-help-skill

bcbeidel/check-skill-pair

tools

VerifiedTrustedCommunity

Audits pair-level integrity of a primitive-pair (the artifact `/build:build-skill-pair` produces) by walking the four required artifact slots — principles doc, `build-<primitive>/SKILL.md`, `check-<primitive>/SKILL.md`, and the `primitive-routing.md` registration — and reports cross-artifact issues a per-SKILL.md checker cannot see: missing principles doc, divergent principles paths between halves, absent routing registration, missing build→check handoff. Per-half structural compliance with the unified pattern (`check-skill-pattern.md`) is delegated to `plugins/build/_shared/scripts/check_skill_pattern.py`. Use when the user wants to "audit a skill pair", "review a primitive pair", or "validate the skill pair for X". Not for auditing a single SKILL.md — route to `/build:check-skill`. Not for re-distilling a stale principles doc — route to `/build:build-skill-pair`.

1SKILL.mdUpdated Apr 24, 2026

bcbeidel/check-skill-pair

bcbeidel/check-readme

tools

VerifiedTrustedCommunity

Audits a project's top-level README.md against 28 deterministic checks across seven scripts (secret scanning, H1 uniqueness & position, heading-hierarchy skips, section presence & order, TOC threshold, line count & length, code-block language tags, shell-prompt prefixes, smart quotes in code, relative-link resolution, fragment-anchor resolution, image alt text, badge/image byte size, destructive-command flagging, pipe-to-shell patterns, TLS-disable instructions, non-reserved hostnames/IPs, emoji in headings, LICENSE file presence & link, CONTRIBUTING link, TODO/FIXME/XXX markers, README gitignore status) plus seven judgment dimensions and a Tier-3 cross-README collision check. Use when the user wants to "audit a README", "lint a README", or "run linters on README.md". Not for sub-package READMEs (different rubric) or docs-site pages (different toolchain).

1SKILL.mdUpdated Apr 24, 2026

bcbeidel/check-readme

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/bcbeidel/wos.git

# Copy into Claude Code skills folder (global)
cp -r wos/plugins/build/skills/check-resolver ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

bcbeidel/wos

1 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT