Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

jmagly/claims-validator

Name: claims-validator
Author: jmagly

agentic/code/addons/aiwg-utils/skills/claims-validator/SKILL.md

npx skillsauth add jmagly/aiwg claims-validator

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

claims-validator

Validate documentation for unsupported claims, made-up metrics, and unverifiable statements.

Triggers

Alternate expressions and non-obvious activations (primary phrases are matched automatically from the skill description):

"fact-check this" → claim validation
"verify [claim]" → specific claim check

Purpose

This skill identifies statements that make claims without evidence, including:

Performance metrics without benchmarks or data
Time/cost estimates without basis
Percentage claims without citation
Comparative statements without baselines
Features described as implemented that don't exist
Marketing superlatives presented as facts

Behavior

When triggered, this skill:

Scans for metric claims:
- Percentage improvements ("40% faster", "reduces by 60%")
- Time estimates ("saves 2-3 hours", "in minutes not hours")
- Cost projections ("$50-150/month", "ROI of 3x")
- Performance numbers ("99x faster", "sub-millisecond")
Identifies unsupported comparatives:
- "faster than", "better than", "more efficient"
- "best", "leading", "revolutionary", "game-changing"
- "comprehensive", "complete", "full-featured"
Checks for feature claims:
- Commands or flags mentioned that don't exist in codebase
- Features described in present tense that aren't implemented
- Integration claims without actual integration code
Validates citations:
- Claims that reference data should have sources
- Benchmarks should link to methodology
- Statistics should be reproducible
Generates report:
- List each claim found
- Classification (metric, comparative, feature, cost)
- Recommendation (remove, add citation, verify, rephrase)

Claim Categories

Metrics Without Data

# Flagged
"Time Saved: 92-96% (9-15 hours → 45-60 minutes)"
"99x faster routing"
"45x cache speedup"

# Problem
No benchmark data, methodology, or reproducible test

# Fix
Remove claim, or add: "Based on [benchmark/test], measured [how]"

Cost Estimates Without Basis

# Flagged
"Budget $20-50/month for moderate use"
"Light usage: ~$10-20/month"
"Enterprise teams may see $100-500+/month"

# Problem
No actual usage data, varies wildly by use case

# Fix
Remove specific numbers, or link to pricing calculator/methodology

Time Estimates Without Data

# Flagged
"Deploy Full SDLC Framework (2 Minutes)"
"5 minutes, replaces 2-4 hours manual work"
"campaign setup from 2-3 weeks → 1 week"

# Problem
No measurement, varies by project complexity

# Fix
Remove time claims, describe what it does instead

Comparative Claims Without Baseline

# Flagged
"faster than manual processes"
"more efficient than traditional approaches"
"better than existing solutions"

# Problem
No specific comparison, no baseline defined

# Fix
Remove comparison, or specify exactly what's being compared

Feature Claims for Unimplemented Features

# Flagged
"aiwg -migrate-workspace  # Optional migration tool"
"Run 'config-validator --fix' to apply automated fixes"

# Problem
Command doesn't exist in codebase

# Fix
Remove until implemented, or mark as "Planned:"

Marketing Superlatives

# Flagged
"comprehensive", "revolutionary", "game-changing"
"best-in-class", "industry-leading", "cutting-edge"
"seamless", "effortless", "zero-friction"

# Problem
Subjective claims that can't be verified

# Fix
Replace with specific, factual descriptions

Validation Report Format

# Claims Validation Report

**Document**: README.md
**Date**: 2025-12-09
**Claims Found**: 12
**Issues**: 8

## Summary

| Category | Found | Unsupported | Action Needed |
|----------|-------|-------------|---------------|
| Metrics | 5 | 4 | Remove or cite |
| Time estimates | 3 | 3 | Remove |
| Cost estimates | 2 | 2 | Remove |
| Comparatives | 1 | 1 | Rephrase |
| Features | 1 | 1 | Remove (not implemented) |

## Issues

### 1. Unsupported Metric
**Line 204**: "Time Savings: 20-98% reduction across 5 core use cases"
**Problem**: No benchmark data or methodology
**Action**: Remove claim

### 2. Cost Estimate Without Data
**Line 1180**: "Light usage: ~$10-20/month"
**Problem**: No actual usage data to support estimate
**Action**: Remove specific numbers

### 3. Non-existent Feature
**Line 583**: "aiwg -migrate-workspace"
**Problem**: Command not implemented
**Action**: Remove until implemented

## Recommendations

1. **Remove all percentage-based time savings claims**
2. **Remove cost estimates** - direct users to measure their own usage
3. **Remove non-existent commands** - only document what exists
4. **Replace superlatives with specifics** - say what it does, not how good it is

Usage Examples

Validate a Document

User: "Check README.md for unsupported claims"

Output:
"Claims Validation: README.md

Found 8 unsupported claims:

Metrics (4):
- Line 204: '20-98% reduction' - no data
- Line 362: '56-63% time saved' - no data
- Line 588: '99x faster routing' - no benchmark
- Line 589: '45x cache speedup' - no benchmark

Time Estimates (2):
- Line 187: '5 minutes, replaces 2-4 hours' - unmeasured
- Line 366: '2 Minutes' in heading - varies by project

Non-existent Features (2):
- Line 583: 'aiwg -migrate-workspace' - not implemented
- Line 651: 'aiwg -rollback-workspace' - not implemented

Recommendation: Remove these claims. Describe what features
do, not how much time/money they save."

Quick Check

User: "Any BS in this section?"

Output:
"Found 2 issues:

1. 'Revolutionary approach' - marketing speak, be specific
2. '10x productivity gain' - no measurement

Suggest: Replace with factual descriptions of functionality."

Integration

This skill complements:

Voice Framework: Voice defines how to write, claims-validator checks what you claim
config-validator: Validates config files, claims-validator validates prose claims

What This Skill Does NOT Flag

Factual descriptions of features that exist
Documented benchmarks with methodology
Qualified statements ("may vary", "depending on", "in our testing")
User testimonials clearly attributed
Comparative claims with specific baselines cited

Output Location

Validation reports: .aiwg/reports/claims-validation.md

References

@$AIWG_ROOT/agentic/code/addons/aiwg-utils/README.md — aiwg-utils addon overview
@$AIWG_ROOT/agentic/code/addons/aiwg-utils/rules/research-before-decision.md — Verify claims before accepting them
@$AIWG_ROOT/agentic/code/addons/aiwg-utils/rules/vague-discretion.md — Requirements for measurable, verifiable claims
@$AIWG_ROOT/docs/cli-reference.md — CLI reference for validation commands

jmagly/claims-validator

agentic/code/addons/aiwg-utils/skills/claims-validator/SKILL.md

Validate documentation for unsupported claims, made-up metrics, and unverifiable statements

126 stars

testing

Updated May 3, 2026

$ install --global

skillsauth

npx skillsauth add jmagly/aiwg claims-validator

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: May 4, 2026, 2:56 AM172.3s1 file scanned

SKILL.md

namespace:: aiwg
name:: claims-validator
platforms:: [all]
description:: Validate documentation for unsupported claims, made-up metrics, and unverifiable statements

claims-validator

Validate documentation for unsupported claims, made-up metrics, and unverifiable statements.

Triggers

Alternate expressions and non-obvious activations (primary phrases are matched automatically from the skill description):

"fact-check this" → claim validation
"verify [claim]" → specific claim check

Purpose

This skill identifies statements that make claims without evidence, including:

Performance metrics without benchmarks or data
Time/cost estimates without basis
Percentage claims without citation
Comparative statements without baselines
Features described as implemented that don't exist
Marketing superlatives presented as facts

Behavior

When triggered, this skill:

Scans for metric claims:
- Percentage improvements ("40% faster", "reduces by 60%")
- Time estimates ("saves 2-3 hours", "in minutes not hours")
- Cost projections ("$50-150/month", "ROI of 3x")
- Performance numbers ("99x faster", "sub-millisecond")
Identifies unsupported comparatives:
- "faster than", "better than", "more efficient"
- "best", "leading", "revolutionary", "game-changing"
- "comprehensive", "complete", "full-featured"
Checks for feature claims:
- Commands or flags mentioned that don't exist in codebase
- Features described in present tense that aren't implemented
- Integration claims without actual integration code
Validates citations:
- Claims that reference data should have sources
- Benchmarks should link to methodology
- Statistics should be reproducible
Generates report:
- List each claim found
- Classification (metric, comparative, feature, cost)
- Recommendation (remove, add citation, verify, rephrase)

Claim Categories

Metrics Without Data

# Flagged
"Time Saved: 92-96% (9-15 hours → 45-60 minutes)"
"99x faster routing"
"45x cache speedup"

# Problem
No benchmark data, methodology, or reproducible test

# Fix
Remove claim, or add: "Based on [benchmark/test], measured [how]"

Cost Estimates Without Basis

# Flagged
"Budget $20-50/month for moderate use"
"Light usage: ~$10-20/month"
"Enterprise teams may see $100-500+/month"

# Problem
No actual usage data, varies wildly by use case

# Fix
Remove specific numbers, or link to pricing calculator/methodology

Time Estimates Without Data

# Flagged
"Deploy Full SDLC Framework (2 Minutes)"
"5 minutes, replaces 2-4 hours manual work"
"campaign setup from 2-3 weeks → 1 week"

# Problem
No measurement, varies by project complexity

# Fix
Remove time claims, describe what it does instead

Comparative Claims Without Baseline

# Flagged
"faster than manual processes"
"more efficient than traditional approaches"
"better than existing solutions"

# Problem
No specific comparison, no baseline defined

# Fix
Remove comparison, or specify exactly what's being compared

Feature Claims for Unimplemented Features

# Flagged
"aiwg -migrate-workspace  # Optional migration tool"
"Run 'config-validator --fix' to apply automated fixes"

# Problem
Command doesn't exist in codebase

# Fix
Remove until implemented, or mark as "Planned:"

Marketing Superlatives

# Flagged
"comprehensive", "revolutionary", "game-changing"
"best-in-class", "industry-leading", "cutting-edge"
"seamless", "effortless", "zero-friction"

# Problem
Subjective claims that can't be verified

# Fix
Replace with specific, factual descriptions

Validation Report Format

# Claims Validation Report

**Document**: README.md
**Date**: 2025-12-09
**Claims Found**: 12
**Issues**: 8

## Summary

| Category | Found | Unsupported | Action Needed |
|----------|-------|-------------|---------------|
| Metrics | 5 | 4 | Remove or cite |
| Time estimates | 3 | 3 | Remove |
| Cost estimates | 2 | 2 | Remove |
| Comparatives | 1 | 1 | Rephrase |
| Features | 1 | 1 | Remove (not implemented) |

## Issues

### 1. Unsupported Metric
**Line 204**: "Time Savings: 20-98% reduction across 5 core use cases"
**Problem**: No benchmark data or methodology
**Action**: Remove claim

### 2. Cost Estimate Without Data
**Line 1180**: "Light usage: ~$10-20/month"
**Problem**: No actual usage data to support estimate
**Action**: Remove specific numbers

### 3. Non-existent Feature
**Line 583**: "aiwg -migrate-workspace"
**Problem**: Command not implemented
**Action**: Remove until implemented

## Recommendations

1. **Remove all percentage-based time savings claims**
2. **Remove cost estimates** - direct users to measure their own usage
3. **Remove non-existent commands** - only document what exists
4. **Replace superlatives with specifics** - say what it does, not how good it is

Usage Examples

Validate a Document

User: "Check README.md for unsupported claims"

Output:
"Claims Validation: README.md

Found 8 unsupported claims:

Metrics (4):
- Line 204: '20-98% reduction' - no data
- Line 362: '56-63% time saved' - no data
- Line 588: '99x faster routing' - no benchmark
- Line 589: '45x cache speedup' - no benchmark

Time Estimates (2):
- Line 187: '5 minutes, replaces 2-4 hours' - unmeasured
- Line 366: '2 Minutes' in heading - varies by project

Non-existent Features (2):
- Line 583: 'aiwg -migrate-workspace' - not implemented
- Line 651: 'aiwg -rollback-workspace' - not implemented

Recommendation: Remove these claims. Describe what features
do, not how much time/money they save."

Quick Check

User: "Any BS in this section?"

Output:
"Found 2 issues:

1. 'Revolutionary approach' - marketing speak, be specific
2. '10x productivity gain' - no measurement

Suggest: Replace with factual descriptions of functionality."

Integration

This skill complements:

Voice Framework: Voice defines how to write, claims-validator checks what you claim
config-validator: Validates config files, claims-validator validates prose claims

What This Skill Does NOT Flag

Factual descriptions of features that exist
Documented benchmarks with methodology
Qualified statements ("may vary", "depending on", "in our testing")
User testimonials clearly attributed
Comparative claims with specific baselines cited

Output Location

Validation reports: .aiwg/reports/claims-validation.md

References

@$AIWG_ROOT/agentic/code/addons/aiwg-utils/README.md — aiwg-utils addon overview
@$AIWG_ROOT/agentic/code/addons/aiwg-utils/rules/research-before-decision.md — Verify claims before accepting them
@$AIWG_ROOT/agentic/code/addons/aiwg-utils/rules/vague-discretion.md — Requirements for measurable, verifiable claims
@$AIWG_ROOT/docs/cli-reference.md — CLI reference for validation commands

Related Skills

jmagly/radar-status

data-ai

VerifiedTrustedCommunity

Report which research-corpus radar sidecars are overdue for refresh. Computes staleness (days since last refresh vs the cadence window) for every radar, sorted most-overdue-first. Runs via `aiwg corpus radar-status`.

140SKILL.mdUpdated May 28, 2026

jmagly/radar-report

data-ai

VerifiedTrustedCommunity

Aggregate research-corpus radar sidecars into a corpus or per-cluster freshness report — totals, overdue count, per-cluster / per-GRADE / per-trajectory breakdowns, an overdue table, and per-radar rationale snippets. Runs via `aiwg corpus radar-report`.

140SKILL.mdUpdated May 28, 2026

jmagly/radar-init

testing

VerifiedTrustedCommunity

Scaffold radar/freshness sidecars for research-corpus REFs. Pulls title/authors from the citation sidecar and GRADE from the analysis doc, defaults the refresh cadence from GRADE and the cluster from a corpus-local map, and stamps documentation/radar/REF-XXX-radar.md. Runs via `aiwg corpus radar-init`.

140SKILL.mdUpdated May 28, 2026

jmagly/profile-temporal

data-ai

VerifiedTrustedCommunity

Compute an entity's publication trajectory — per-year paper counts, topic drift, hot-streak detection (≥3 consecutive A-grade years), and career phase. Runs via `aiwg corpus profile-temporal`.

140SKILL.mdUpdated May 28, 2026

jmagly/profile-temporal

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/jmagly/aiwg.git

# Copy into Claude Code skills folder (global)
cp -r aiwg/agentic/code/addons/aiwg-utils/skills/claims-validator ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

jmagly/aiwg

126 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT