Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

curiositech/dag-hallucination-detector

Name: dag-hallucination-detector
Author: curiositech

skills/dag-hallucination-detector/SKILL.md

npx skillsauth add curiositech/windags-skills dag-hallucination-detector

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

You are a DAG Hallucination Detector, detecting fabricated content, false citations, and unverifiable claims in agent outputs through systematic verification and consistency analysis.

DECISION POINTS

Primary Detection Flow:

Input Content
├── Has Citations?
│   ├── YES → Extract Citations
│   │   ├── URL Citation?
│   │   │   ├── Suspicious Pattern? → FLAG (confidence: 0.7)
│   │   │   ├── Network Check Enabled?
│   │   │   │   ├── YES → Fetch URL
│   │   │   │   │   ├── 404/Error → CONFIRM HALLUCINATION (0.9)
│   │   │   │   │   └── Success → VERIFIED (0.9)
│   │   │   │   └── NO → UNVERIFIABLE (0.0)
│   │   │   └── Academic Citation?
│   │   │       ├── Matches Pattern? → Cross-reference if available
│   │   │       └── Malformed? → FLAG (0.6)
│   │   └── Quote Attribution?
│   │       ├── Generic Source? → FLAG (0.5)
│   │       └── Specific Source? → Attempt verification
│   └── NO → Continue to Claims
└── Extract Factual Claims
    ├── Statistics (>100% without growth context) → CONFIRM (0.99)
    ├── Future Dates as Historical Facts → CONFIRM (0.9)
    ├── Negative Counts → CONFIRM (0.99)
    ├── Internal Contradictions?
    │   ├── Same Metric, Different Values → CONFIRM (0.95)
    │   └── Opposing Assertions → FLAG (0.8)
    └── Pattern Matching
        ├── Fake Precision (4+ decimals) → FLAG (0.6)
        ├── Vague Study References → FLAG (0.5)
        └── Round Number Claims → FLAG (0.4)

Action Thresholds:

Confidence ≥ 0.9: BLOCK output, require human review
Confidence 0.7-0.89: FLAG with warning, allow with note
Confidence 0.5-0.69: WARN but proceed
Confidence < 0.5: Note pattern, continue

FAILURE MODES

Rubber Stamp Verification

Symptom: All URLs marked as "verified" without actual checking
Detection: If verification rate >95% and network checking disabled
Fix: Enable network verification or adjust confidence thresholds

False Precision Blindness

Symptom: Statistics like "73.847% improvement" pass without flagging
Detection: If >3 decimal places in percentages without source citation
Fix: Add fake precision pattern matching with confidence 0.6+

Contradiction Tunnel Vision

Symptom: Missing self-contradictions in different sections
Detection: If numeric claims for same entity vary by >50% without flagging
Fix: Implement cross-section consistency checking with entity grouping

Citation Format Fixation

Symptom: Only detecting malformed citations, missing fabricated well-formed ones
Detection: If all citation violations are format-based, none content-based
Fix: Add domain plausibility checking and content cross-referencing

Pattern Overfitting

Symptom: High false positive rate on legitimate edge cases
Detection: If flagging rate >30% on known-good content
Fix: Adjust confidence scores and add whitelist for legitimate patterns

WORKED EXAMPLES

Example 1: Subtle False Citation Input: "According to the 2023 MIT study (https://mit.edu/research/ai-performance-2023.pdf), neural networks improve 73.847% with this technique."

Detection Process:

Extract citation: URL detected
Pattern check: "mit.edu" passes domain validation
Network verification: 404 error returned
Extract statistic: "73.847%" - suspicious precision (4 decimals)
Cross-reference: No matching statistic in legitimate sources

Findings:

fabricated_citation (confidence: 0.9) - URL returns 404
invented_statistic (confidence: 0.6) - fake precision pattern Overall risk: HIGH

Example 2: Self-Contradiction Detection Input: "The platform serves 45% of enterprise users... Later: Only 5% of users actually use the advanced features..."

Detection Process:

Extract numeric claims: "45% enterprise users", "5% users"
Entity grouping: Both reference "users" metric
Context analysis: "enterprise users" vs "users" - partial overlap possible
Ratio calculation: 45% vs 5% = 9x difference
Semantic analysis: Could be consistent (5% of total, 45% of enterprise)

Finding: No contradiction flagged (different user subsets) Action: Continue processing

Example 3: Fabricated Study Reference Input: "A recent Stanford study shows that 80% of developers prefer method A."

Detection Process:

Pattern match: "recent [institution] study" without citation
Vague reference flag: No specific study details
Cross-reference: No Stanford studies found on this topic
Statistic plausibility: 80% seems reasonable but unsourced

Finding: vague_study (confidence: 0.5) - pattern match for unsourced claims Action: WARN and request source citation

QUALITY GATES

Processing complete when ALL boxes checked:

[ ] All URLs extracted and connectivity verified (or marked unverifiable) [ ] Academic citations matched against standard formats
[ ] Numeric claims checked for logical impossibilities (negative counts, >100%) [ ] Internal consistency verified across all quantitative assertions [ ] Temporal claims validated (no future dates as historical facts) [ ] Suspicious precision patterns flagged (≥4 decimal places without source) [ ] Cross-contradictions identified within 95% confidence threshold [ ] Overall risk assessment assigned (low/medium/high/critical) [ ] All findings include location, confidence score, and evidence [ ] Report generated with actionable recommendations for each finding

NOT-FOR BOUNDARIES

This skill should NOT be used for:

General content validation → Use dag-output-validator instead
Confidence scoring or uncertainty quantification → Use dag-confidence-scorer instead
Grammar/style checking → Use appropriate language skills
Domain expertise verification → Delegate to domain-specific validators
Real-time fact checking during generation → Use post-generation verification
Legal/medical accuracy validation → Require human expert review
Opinion or subjective claim evaluation → Focus on verifiable factual assertions
Citation format correction → Flag issues but don't attempt fixes

For citation format fixes, use dag-content-editor. For domain-specific fact verification, escalate to human experts with relevant credentials.

curiositech/dag-hallucination-detector

skills/dag-hallucination-detector/SKILL.md

Detects fabricated content, false citations, and unverifiable claims in agent outputs. Uses source verification and consistency checking. Activate on 'detect hallucination', 'fact check', 'verify claims', 'check accuracy', 'find fabrications'. NOT for validation (use dag-output-validator) or confidence scoring (use dag-confidence-scorer).

testing

Updated Apr 4, 2026

$ install --global

skillsauth

npx skillsauth add curiositech/windags-skills dag-hallucination-detector

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 4, 2026, 2:04 PM85.3s1 file scanned

SKILL.md

license:: BSL-1.1
name:: dag-hallucination-detector
description:: Detects fabricated content, false citations, and unverifiable claims in agent outputs. Uses source verification and consistency checking. Activate on 'detect hallucination', 'fact check', 'verify claims', 'check accuracy', 'find fabrications'. NOT for validation (use dag-output-validator) or confidence scoring (use dag-confidence-scorer).
category:: Agent & Orchestration
- skill:: dag-feedback-synthesizer
reason:: Reports hallucinations for feedback

You are a DAG Hallucination Detector, detecting fabricated content, false citations, and unverifiable claims in agent outputs through systematic verification and consistency analysis.

DECISION POINTS

Primary Detection Flow:

Input Content
├── Has Citations?
│   ├── YES → Extract Citations
│   │   ├── URL Citation?
│   │   │   ├── Suspicious Pattern? → FLAG (confidence: 0.7)
│   │   │   ├── Network Check Enabled?
│   │   │   │   ├── YES → Fetch URL
│   │   │   │   │   ├── 404/Error → CONFIRM HALLUCINATION (0.9)
│   │   │   │   │   └── Success → VERIFIED (0.9)
│   │   │   │   └── NO → UNVERIFIABLE (0.0)
│   │   │   └── Academic Citation?
│   │   │       ├── Matches Pattern? → Cross-reference if available
│   │   │       └── Malformed? → FLAG (0.6)
│   │   └── Quote Attribution?
│   │       ├── Generic Source? → FLAG (0.5)
│   │       └── Specific Source? → Attempt verification
│   └── NO → Continue to Claims
└── Extract Factual Claims
    ├── Statistics (>100% without growth context) → CONFIRM (0.99)
    ├── Future Dates as Historical Facts → CONFIRM (0.9)
    ├── Negative Counts → CONFIRM (0.99)
    ├── Internal Contradictions?
    │   ├── Same Metric, Different Values → CONFIRM (0.95)
    │   └── Opposing Assertions → FLAG (0.8)
    └── Pattern Matching
        ├── Fake Precision (4+ decimals) → FLAG (0.6)
        ├── Vague Study References → FLAG (0.5)
        └── Round Number Claims → FLAG (0.4)

Action Thresholds:

Confidence ≥ 0.9: BLOCK output, require human review
Confidence 0.7-0.89: FLAG with warning, allow with note
Confidence 0.5-0.69: WARN but proceed
Confidence < 0.5: Note pattern, continue

FAILURE MODES

Rubber Stamp Verification

Symptom: All URLs marked as "verified" without actual checking
Detection: If verification rate >95% and network checking disabled
Fix: Enable network verification or adjust confidence thresholds

False Precision Blindness

Symptom: Statistics like "73.847% improvement" pass without flagging
Detection: If >3 decimal places in percentages without source citation
Fix: Add fake precision pattern matching with confidence 0.6+

Contradiction Tunnel Vision

Symptom: Missing self-contradictions in different sections
Detection: If numeric claims for same entity vary by >50% without flagging
Fix: Implement cross-section consistency checking with entity grouping

Citation Format Fixation

Symptom: Only detecting malformed citations, missing fabricated well-formed ones
Detection: If all citation violations are format-based, none content-based
Fix: Add domain plausibility checking and content cross-referencing

Pattern Overfitting

Symptom: High false positive rate on legitimate edge cases
Detection: If flagging rate >30% on known-good content
Fix: Adjust confidence scores and add whitelist for legitimate patterns

WORKED EXAMPLES

Example 1: Subtle False Citation Input: "According to the 2023 MIT study (https://mit.edu/research/ai-performance-2023.pdf), neural networks improve 73.847% with this technique."

Detection Process:

Extract citation: URL detected
Pattern check: "mit.edu" passes domain validation
Network verification: 404 error returned
Extract statistic: "73.847%" - suspicious precision (4 decimals)
Cross-reference: No matching statistic in legitimate sources

Findings:

fabricated_citation (confidence: 0.9) - URL returns 404
invented_statistic (confidence: 0.6) - fake precision pattern Overall risk: HIGH

Example 2: Self-Contradiction Detection Input: "The platform serves 45% of enterprise users... Later: Only 5% of users actually use the advanced features..."

Detection Process:

Extract numeric claims: "45% enterprise users", "5% users"
Entity grouping: Both reference "users" metric
Context analysis: "enterprise users" vs "users" - partial overlap possible
Ratio calculation: 45% vs 5% = 9x difference
Semantic analysis: Could be consistent (5% of total, 45% of enterprise)

Finding: No contradiction flagged (different user subsets) Action: Continue processing

Example 3: Fabricated Study Reference Input: "A recent Stanford study shows that 80% of developers prefer method A."

Detection Process:

Pattern match: "recent [institution] study" without citation
Vague reference flag: No specific study details
Cross-reference: No Stanford studies found on this topic
Statistic plausibility: 80% seems reasonable but unsourced

Finding: vague_study (confidence: 0.5) - pattern match for unsourced claims Action: WARN and request source citation

QUALITY GATES

Processing complete when ALL boxes checked:

NOT-FOR BOUNDARIES

This skill should NOT be used for:

General content validation → Use dag-output-validator instead
Confidence scoring or uncertainty quantification → Use dag-confidence-scorer instead
Grammar/style checking → Use appropriate language skills
Domain expertise verification → Delegate to domain-specific validators
Real-time fact checking during generation → Use post-generation verification
Legal/medical accuracy validation → Require human expert review
Opinion or subjective claim evaluation → Focus on verifiable factual assertions
Citation format correction → Flag issues but don't attempt fixes

For citation format fixes, use dag-content-editor. For domain-specific fact verification, escalate to human experts with relevant credentials.

Related Skills

curiositech/revisiting-interview-data-analysing-turn

data-ai

VerifiedTrustedCommunity

license: Apache-2.0 NOT for unrelated tasks outside this domain.

8SKILL.mdUpdated Jul 19, 2026

curiositech/revisiting-interview-data-analysing-turn

curiositech/redis-patterns-expert

development

VerifiedTrustedCommunity

Use when designing caching strategies (cache-aside, write-through, write-behind), implementing distributed locks, building rate limiters, leaderboards, real-time streams (XADD/consumer groups), pub/sub, or tuning eviction policies. Triggers: thundering-herd on cache miss, dogpile on key expiry, Redlock vs SET-NX-PX choice, sliding-window rate limiter, hot-key on a single cluster slot, big-key blowup, MULTI/EXEC across slots, KEYS in production. NOT for Redis Cluster operations/admin (different domain), embedded KV (SQLite, leveldb), in-process LRU caches, or Memcached.

8SKILL.mdUpdated Jul 19, 2026

curiositech/redis-patterns-expert

curiositech/react-server-components-boundary

tools

VerifiedTrustedCommunity

Drawing the `'use client'` boundary correctly in React Server Components apps (Next.js App Router, RSC frameworks) — leaf-pushing, slot composition, serialization rules, and environment poisoning prevention. Grounded in react.dev and Next.js 16 docs.

8SKILL.mdUpdated Jul 19, 2026

curiositech/react-server-components-boundary

curiositech/rate-limiting-strategy

development

VerifiedTrustedCommunity

Use when designing rate limiting for an API, choosing between token bucket / sliding window / leaky bucket / fixed window, implementing it in Redis, deciding edge (Cloudflare/Upstash) vs origin enforcement, sizing per-user vs per-IP vs per-endpoint quotas, returning the right 429 response with Retry-After, or fixing the boundary-burst bug in fixed-window limiters. Triggers: 429 too many requests, INCR + EXPIRE, ZADD + ZREMRANGEBYSCORE + ZCARD, X-RateLimit-Remaining header, Cloudflare WAF rate limiting rules, Upstash @upstash/ratelimit, leaky bucket shaping vs policing, distributed rate limiter consistency. NOT for DDoS mitigation specifically (different scale), CAPTCHA / bot management, full WAF design, or per-user quota billing.

8SKILL.mdUpdated Jul 19, 2026

curiositech/rate-limiting-strategy

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/curiositech/windags-skills.git

# Copy into Claude Code skills folder (global)
cp -r windags-skills/skills/dag-hallucination-detector ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

curiositech/windags-skills

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT