Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

nwave-ai/nw-abr-critique-dimensions

Name: nw-abr-critique-dimensions
Author: nwave-ai

plugins/nw/skills/nw-abr-critique-dimensions/SKILL.md

npx skillsauth add nwave-ai/nwave nw-abr-critique-dimensions

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Agent Quality Critique Dimensions

Use these dimensions when reviewing or validating agent definitions.

Dimension 1: Template Compliance

Does the agent follow official Claude Code format?

Check: YAML frontmatter with name and description (required) | Markdown body as system prompt | No embedded YAML config blocks | No activation-instructions or IDE-FILE-RESOLUTION sections | Skills referenced in frontmatter, not inline

Severity: High -- non-compliant agents may not load correctly.

Dimension 2: Size and Focus

Check: Core definition under 400 lines | Domain knowledge in Skills | Single clear responsibility | No monolithic sections (>50 lines without structure) | No redundant Claude default behaviors

Measurement: wc -l {agent-file}. Target: 200-400 lines.

Severity: High -- oversized agents suffer context rot.

Dimension 3: Divergence Quality

Does the agent specify only what diverges from Claude defaults?

Check: No file operation instructions | No generic quality principles ("be thorough") | No tool usage guidelines | Core principles are domain-specific and non-obvious | Each instruction justifies why Claude wouldn't do this naturally

Severity: Medium -- redundant instructions waste tokens, cause overtriggering.

Dimension 4: Safety Implementation

Check: Tools restricted via frontmatter tools field | maxTurns set | No prose-based security layers (use hooks) | No embedded enterprise safety frameworks | permissionMode set for risky actions

Severity: High -- prose safety is ineffective and token-wasteful.

Dimension 5: Language and Tone

Check: No "CRITICAL:", "MANDATORY:", "ABSOLUTE" language | Direct statements ("Do X" not "You MUST X") | Affirmative phrasing ("Do Y" not "Don't do X") | Consistent terminology | No repetitive emphasis

Severity: Medium -- aggressive language causes overtriggering on Opus 4.6.

Dimension 6: Examples Quality

Check: 3-5 canonical examples present | Cover critical/subtle decisions (not obvious cases) | Good/bad paired where useful | Concise (not full implementations)

Severity: Medium -- missing examples cause edge case failures.

Dimension 7: Skill Loading Effectiveness

Does the agent ensure skills are actually loaded during execution?

Check: Skill Loading Strategy table present for agents with 3+ skills | Every frontmatter skill has matching Load: directive in workflow | Skills path documented (~/.claude/skills/nw-{skill-name}/SKILL.md) | Phase-gated loading (not "load everything at start")

Severity: High — orphan skills (declared but never loaded) mean sub-agents operate without domain knowledge. The skills: frontmatter field is declarative only; Claude Code does not auto-load skill files.

Gold standard: nw-product-owner.md — Skill Loading Strategy table mapping phases to skills with triggers + explicit Load: directives in each workflow phase.

Dimension 8: Token Efficiency

Is the agent definition compressed without losing semantic content?

Check: No verbose prose where pipe-delimited lists suffice | Imperative voice throughout | No filler words ("in order to", "it is important to") | ### Example N: headers preserved verbatim (not inlined) | AskUserQuestion options preserved with numbered descriptions | Code blocks preserved verbatim | No duplicate content already in skills

Severity: Medium — bloated definitions waste context window and degrade performance via context rot.

Compression safe: prose descriptions, bullet lists, related items -> pipe-delimited Compression unsafe: example headers, code blocks, decision tree options, YAML frontmatter

Dimension 9: Priority Validation

Questions: 1. Is this the largest bottleneck? (Evidence required) | 2. Simpler alternatives considered? | 3. Constraint prioritization correct? | 4. Architecture data-justified?

Severity: High if agent addresses secondary concern while larger problem exists.

Review Output Format

review:
  agent: "{agent-name}"
  dimensions:
    template_compliance: {pass|fail}
    size_and_focus: {pass|fail}
    divergence_quality: {pass|fail}
    safety_implementation: {pass|fail}
    language_and_tone: {pass|fail}
    examples_quality: {pass|fail}
    skill_loading: {pass|fail|n/a}
    token_efficiency: {pass|fail}
    priority_validation: {pass|fail}
  issues:
    - dimension: "{dimension}"
      severity: "{high|medium|low}"
      finding: "{description}"
      recommendation: "{fix}"
  verdict: "{approved|revisions_needed}"

Failure Conditions

Review blocked (verdict: revisions_needed) if: any high-severity dimension fails | 3+ medium-severity fail | Agent exceeds 400 lines without Skills extraction | Zero examples provided | Agent with 3+ skills missing Skill Loading Strategy table

nwave-ai/nw-abr-critique-dimensions

plugins/nw/skills/nw-abr-critique-dimensions/SKILL.md

Review dimensions for validating agent quality - template compliance, safety, testing, and priority validation

352 stars

testing

Updated Apr 16, 2026

$ install --global

skillsauth

npx skillsauth add nwave-ai/nwave nw-abr-critique-dimensions

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 9, 2026, 2:41 AM5.6s1 file scanned

SKILL.md

name:: nw-abr-critique-dimensions
agent:: nw-agent-builder-reviewer
description:: Review dimensions for validating agent quality - template compliance, safety, testing, and priority validation
user-invocable:: false
disable-model-invocation:: true

Agent Quality Critique Dimensions

Use these dimensions when reviewing or validating agent definitions.

Dimension 1: Template Compliance

Does the agent follow official Claude Code format?

Severity: High -- non-compliant agents may not load correctly.

Dimension 2: Size and Focus

Check: Core definition under 400 lines | Domain knowledge in Skills | Single clear responsibility | No monolithic sections (>50 lines without structure) | No redundant Claude default behaviors

Measurement: wc -l {agent-file}. Target: 200-400 lines.

Severity: High -- oversized agents suffer context rot.

Dimension 3: Divergence Quality

Does the agent specify only what diverges from Claude defaults?

Severity: Medium -- redundant instructions waste tokens, cause overtriggering.

Dimension 4: Safety Implementation

Check: Tools restricted via frontmatter tools field | maxTurns set | No prose-based security layers (use hooks) | No embedded enterprise safety frameworks | permissionMode set for risky actions

Severity: High -- prose safety is ineffective and token-wasteful.

Dimension 5: Language and Tone

Severity: Medium -- aggressive language causes overtriggering on Opus 4.6.

Dimension 6: Examples Quality

Check: 3-5 canonical examples present | Cover critical/subtle decisions (not obvious cases) | Good/bad paired where useful | Concise (not full implementations)

Severity: Medium -- missing examples cause edge case failures.

Dimension 7: Skill Loading Effectiveness

Does the agent ensure skills are actually loaded during execution?

Gold standard: nw-product-owner.md — Skill Loading Strategy table mapping phases to skills with triggers + explicit Load: directives in each workflow phase.

Dimension 8: Token Efficiency

Is the agent definition compressed without losing semantic content?

Severity: Medium — bloated definitions waste context window and degrade performance via context rot.

Compression safe: prose descriptions, bullet lists, related items -> pipe-delimited Compression unsafe: example headers, code blocks, decision tree options, YAML frontmatter

Dimension 9: Priority Validation

Questions: 1. Is this the largest bottleneck? (Evidence required) | 2. Simpler alternatives considered? | 3. Constraint prioritization correct? | 4. Architecture data-justified?

Severity: High if agent addresses secondary concern while larger problem exists.

Review Output Format

review:
  agent: "{agent-name}"
  dimensions:
    template_compliance: {pass|fail}
    size_and_focus: {pass|fail}
    divergence_quality: {pass|fail}
    safety_implementation: {pass|fail}
    language_and_tone: {pass|fail}
    examples_quality: {pass|fail}
    skill_loading: {pass|fail|n/a}
    token_efficiency: {pass|fail}
    priority_validation: {pass|fail}
  issues:
    - dimension: "{dimension}"
      severity: "{high|medium|low}"
      finding: "{description}"
      recommendation: "{fix}"
  verdict: "{approved|revisions_needed}"

Failure Conditions

Related Skills

nwave-ai/nw-distill

testing

VerifiedTrustedCommunity

Acceptance test creation methodology for the DISTILL wave. Domain knowledge for the acceptance designer agent: port-to-port principle, prior wave reading, wave-decision reconciliation, graceful degradation, and document back-propagation.

563SKILL.mdUpdated May 21, 2026

nwave-ai/nw-collaboration-and-handoffs

development

VerifiedTrustedCommunity

Cross-agent collaboration protocols, workflow handoff patterns, and commit message formats for TDD/Mikado/refactoring workflows

563SKILL.mdUpdated Apr 16, 2026

nwave-ai/nw-collaboration-and-handoffs

nwave-ai/nw-roadmap

development

VerifiedTrustedCommunity

Creates a phased roadmap.json for a feature goal with acceptance criteria and TDD steps. Use when planning implementation steps before execution.

563SKILL.mdUpdated Apr 9, 2026

nwave-ai/nw-distill

testing

VerifiedTrustedCommunity

563SKILL.mdUpdated Apr 9, 2026

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/nwave-ai/nwave.git

# Copy into Claude Code skills folder (global)
cp -r nwave/plugins/nw/skills/nw-abr-critique-dimensions ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

nwave-ai/nwave

352 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT