Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

curiositech/conditions-for-intuitive-expertise-a-fai

Name: conditions-for-intuitive-expertise-a-fai
Author: curiositech

skills/conditions-for-intuitive-expertise-a-fai/SKILL.md

npx skillsauth add curiositech/windags-skills conditions-for-intuitive-expertise-a-fai

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Conditions for Intuitive Expertise

Use this skill when the hard problem is not generating an answer, but deciding whether intuition, expert judgment, or agent confidence deserves trust in the first place.

When to Use

An agent is confidently fluent in a domain where outcome feedback is weak, delayed, or confounded.
You need to decide whether a task should route to algorithmic scoring, expert judgment, or a hybrid review path.
A system keeps overreaching from one domain into an adjacent but structurally different one.
You are designing escalation logic for ambiguous, noisy, or high-stakes decisions.
You need to audit whether repeated practice in a domain actually builds skill or just entrenches confident noise.

NOT for Boundaries

This skill is not the primary lens for:

Deterministic implementation work with explicit acceptance criteria and fast feedback.
Pure syntax debugging, schema repair, or tasks where correctness is directly testable.
Situations where the right move is already specified by hard policy or exact computation.
Generic "trust the expert" arguments that do not examine environment validity or feedback quality.

Core Mental Models

Environment Validity Comes First

Expert intuition is only trustworthy when the environment has stable, learnable regularities. If the domain is mostly noise, confidence will still form; it just will not track truth.

Learning Opportunity Is the Second Gate

Even a valid environment will not produce expertise without timely, accurate feedback. Delayed, corrupted, or selectively remembered feedback produces practiced error rather than practiced skill.

Confidence Measures Coherence, Not Accuracy

Confidence tracks how internally consistent the available cues feel. In low-validity environments, high confidence is often a hazard signal rather than reassurance.

Expertise Is Fractionated

Skill at one subtask does not reliably transfer to a neighboring subtask that merely feels similar. The boundary problem matters more than the prestige of the prior success.

Match Decision Mechanism to Ecology

Use judgment-heavy approaches when tacit cues are real and feedback is clean. Use algorithms or structured ensembles when noise dominates and intuitions cannot be trained safely.

Decision Points

flowchart TD
  A[New decision domain] --> B{Stable cues that predict outcomes?}
  B -->|No| C[Low-validity environment]
  B -->|Yes| D{Fast, accurate, repeated feedback?}
  D -->|No| E[Wicked or degraded learning environment]
  D -->|Yes| F[High-validity environment]
  C --> G[Prefer algorithmic or structured ensemble path]
  E --> H[Use hybrid path with suppressed confidence and external checks]
  F --> I{Task matches demonstrated expertise boundary?}
  I -->|No| J[Escalate or re-route for fractionated expertise]
  I -->|Yes| K[Allow expert judgment or recognition-primed handling]

1. Classify the Environment Before the Actor

Ask whether the domain contains repeatable cue-to-outcome regularities.
Ask whether the learner receives honest enough feedback to tune those cues.
Only after that should you decide whether judgment deserves weight.

2. Separate Confidence Review from Accuracy Review

Treat raw confidence as a report about felt coherence.
If the environment is low-validity, confidence should not drive autonomy.
If confidence and evidence disagree, trust the evidence and investigate the cue story.

3. Check for Fractionated Expertise

Compare the current task structure to the situations that created the skill.
If the resemblance is lexical but not structural, downgrade trust.
Adjacent domains need fresh validation, not inherited authority.

Failure Modes

1. Confidence-As-Evidence

The system treats fluent, high-confidence output as proof of correctness. This repeats the exact illusion-of-validity failure the paper warns about.

2. Invalid-Environment Optimism

A team assumes repetition in a noisy domain will produce expertise. It instead produces stronger stories, better rhetoric, and no real predictive gain.

3. Wicked-Feedback Training

Feedback arrives late, is politically distorted, or only surfaces successes. Agents then learn the wrong cues and become confidently brittle.

4. Fractionation Blindness

A skill that works for one subtask is invoked on a neighboring task with different causal structure. The output sounds plausible because the vocabulary overlaps, but the competence boundary has already been crossed.

5. Escalation Theater

Human review is added only after confidence gets high, rather than when environment validity or task-boundary fit is poor. Review then becomes a rubber stamp instead of a real safeguard.

Worked Examples

Example 1: Stock Commentary Agent vs. Valuation Model

A team wants an "expert market intuition" agent to decide whether a stock is underpriced. The framework says the environment is low-validity and feedback is heavily confounded, so the agent should not get autonomy based on fluent market narratives. Route instead to an actuarial or ensemble baseline, then use judgment only for anomaly explanation.

Example 2: Imaging Triage with Domain-Limited Judgment

A radiology-adjacent agent is strong on spotting abnormalities in one imaging modality and is asked to generalize to another that shares surface vocabulary but different cue structure. The framework flags fractionated expertise, so the system should degrade autonomy and require modality-specific validation before trusting the carryover.

Quality Gates

The environment has been explicitly classified as high-validity, degraded-validity, or low-validity.
Feedback-loop quality has been examined, not assumed.
Confidence is treated as coherence metadata rather than direct evidence.
The task has been checked against demonstrated competence boundaries.
Escalation rules are stricter in low-validity domains than in high-validity ones.

Reference Files

| File | Load when... | | --- | --- | | references/validity-environment-and-agent-trust.md | You need the full environment-validity diagnostic and its implications for trust. | | references/algorithms-vs-intuition-routing-framework.md | You are choosing between algorithmic, judgment-heavy, or hybrid routing. | | references/overconfidence-and-the-illusion-of-validity.md | High-confidence output needs scrutiny and calibration policy. | | references/fractured-expertise-and-domain-boundary-detection.md | A skill appears adjacent to the task, but you are unsure the competence really transfers. | | references/recognition-primed-decision-making-for-agents.md | The environment is moderately valid and time pressure favors recognition plus simulation over exhaustive analysis. | | references/the-boundary-problem-knowing-when-not-to-trust-yourself.md | You are designing escalation logic or competence self-monitoring. |

Anti-Patterns

Using confidence scores as a primary autonomy gate in noisy domains.
Assuming years of exposure imply expertise without checking feedback quality.
Porting a proven skill into adjacent tasks because the terms look familiar.
Treating algorithmic methods as universally inferior or universally superior.
Auditing outputs without auditing the ecology that produced them.

Shibboleths

You have internalized this skill if you naturally ask:

"What kind of environment is this before we talk about trust?"
"What feedback actually taught this agent or expert?"
"Is this the same task structure, or just an adjacent one?"
"Does confidence here mean evidence, or just coherence?"

curiositech/conditions-for-intuitive-expertise-a-fai

skills/conditions-for-intuitive-expertise-a-fai/SKILL.md

Diagnose when intuitive judgment, agent confidence, or expert routing can be trusted by classifying environment validity, feedback quality, and task-boundary fit. Use for confidence calibration, agent routing, expertise audits, and escalation design. NOT for deterministic implementation tasks, pure syntax debugging, or domains with explicit verifiable answers.

3 stars

development

Updated May 4, 2026

$ install --global

skillsauth

npx skillsauth add curiositech/windags-skills conditions-for-intuitive-expertise-a-fai

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: May 1, 2026, 3:33 AM399.3s7 files scanned

SKILL.md

name:: conditions-for-intuitive-expertise-a-fai
description:: >-
license:: Apache-2.0
category:: Cognitive Science & Decision Making
- skill:: task-decomposer
reason:: Fractionated expertise is a decomposition problem before it becomes a routing failure.
kind:: legacy-recovered
sourceDocument:: Conditions for Intuitive Expertise: A Failure to Disagree
sourceArtifact:: .claude/skills/conditions-for-intuitive-expertise-a-fai/_book_identity.json
importedFrom:: legacy-recovery

Conditions for Intuitive Expertise

Use this skill when the hard problem is not generating an answer, but deciding whether intuition, expert judgment, or agent confidence deserves trust in the first place.

When to Use

An agent is confidently fluent in a domain where outcome feedback is weak, delayed, or confounded.
You need to decide whether a task should route to algorithmic scoring, expert judgment, or a hybrid review path.
A system keeps overreaching from one domain into an adjacent but structurally different one.
You are designing escalation logic for ambiguous, noisy, or high-stakes decisions.
You need to audit whether repeated practice in a domain actually builds skill or just entrenches confident noise.

NOT for Boundaries

This skill is not the primary lens for:

Deterministic implementation work with explicit acceptance criteria and fast feedback.
Pure syntax debugging, schema repair, or tasks where correctness is directly testable.
Situations where the right move is already specified by hard policy or exact computation.
Generic "trust the expert" arguments that do not examine environment validity or feedback quality.

Core Mental Models

Environment Validity Comes First

Expert intuition is only trustworthy when the environment has stable, learnable regularities. If the domain is mostly noise, confidence will still form; it just will not track truth.

Learning Opportunity Is the Second Gate

Even a valid environment will not produce expertise without timely, accurate feedback. Delayed, corrupted, or selectively remembered feedback produces practiced error rather than practiced skill.

Confidence Measures Coherence, Not Accuracy

Confidence tracks how internally consistent the available cues feel. In low-validity environments, high confidence is often a hazard signal rather than reassurance.

Expertise Is Fractionated

Skill at one subtask does not reliably transfer to a neighboring subtask that merely feels similar. The boundary problem matters more than the prestige of the prior success.

Match Decision Mechanism to Ecology

Use judgment-heavy approaches when tacit cues are real and feedback is clean. Use algorithms or structured ensembles when noise dominates and intuitions cannot be trained safely.

Decision Points

flowchart TD
  A[New decision domain] --> B{Stable cues that predict outcomes?}
  B -->|No| C[Low-validity environment]
  B -->|Yes| D{Fast, accurate, repeated feedback?}
  D -->|No| E[Wicked or degraded learning environment]
  D -->|Yes| F[High-validity environment]
  C --> G[Prefer algorithmic or structured ensemble path]
  E --> H[Use hybrid path with suppressed confidence and external checks]
  F --> I{Task matches demonstrated expertise boundary?}
  I -->|No| J[Escalate or re-route for fractionated expertise]
  I -->|Yes| K[Allow expert judgment or recognition-primed handling]

1. Classify the Environment Before the Actor

Ask whether the domain contains repeatable cue-to-outcome regularities.
Ask whether the learner receives honest enough feedback to tune those cues.
Only after that should you decide whether judgment deserves weight.

2. Separate Confidence Review from Accuracy Review

Treat raw confidence as a report about felt coherence.
If the environment is low-validity, confidence should not drive autonomy.
If confidence and evidence disagree, trust the evidence and investigate the cue story.

3. Check for Fractionated Expertise

Compare the current task structure to the situations that created the skill.
If the resemblance is lexical but not structural, downgrade trust.
Adjacent domains need fresh validation, not inherited authority.

Failure Modes

1. Confidence-As-Evidence

The system treats fluent, high-confidence output as proof of correctness. This repeats the exact illusion-of-validity failure the paper warns about.

2. Invalid-Environment Optimism

A team assumes repetition in a noisy domain will produce expertise. It instead produces stronger stories, better rhetoric, and no real predictive gain.

3. Wicked-Feedback Training

Feedback arrives late, is politically distorted, or only surfaces successes. Agents then learn the wrong cues and become confidently brittle.

4. Fractionation Blindness

5. Escalation Theater

Human review is added only after confidence gets high, rather than when environment validity or task-boundary fit is poor. Review then becomes a rubber stamp instead of a real safeguard.

Worked Examples

Example 1: Stock Commentary Agent vs. Valuation Model

Example 2: Imaging Triage with Domain-Limited Judgment

Quality Gates

The environment has been explicitly classified as high-validity, degraded-validity, or low-validity.
Feedback-loop quality has been examined, not assumed.
Confidence is treated as coherence metadata rather than direct evidence.
The task has been checked against demonstrated competence boundaries.
Escalation rules are stricter in low-validity domains than in high-validity ones.

Reference Files

Anti-Patterns

Using confidence scores as a primary autonomy gate in noisy domains.
Assuming years of exposure imply expertise without checking feedback quality.
Porting a proven skill into adjacent tasks because the terms look familiar.
Treating algorithmic methods as universally inferior or universally superior.
Auditing outputs without auditing the ecology that produced them.

Shibboleths

You have internalized this skill if you naturally ask:

"What kind of environment is this before we talk about trust?"
"What feedback actually taught this agent or expert?"
"Is this the same task structure, or just an adjacent one?"
"Does confidence here mean evidence, or just coherence?"

Related Skills

curiositech/revisiting-interview-data-analysing-turn

data-ai

VerifiedTrustedCommunity

license: Apache-2.0 NOT for unrelated tasks outside this domain.

8SKILL.mdUpdated Jul 19, 2026

curiositech/revisiting-interview-data-analysing-turn

curiositech/redis-patterns-expert

development

VerifiedTrustedCommunity

Use when designing caching strategies (cache-aside, write-through, write-behind), implementing distributed locks, building rate limiters, leaderboards, real-time streams (XADD/consumer groups), pub/sub, or tuning eviction policies. Triggers: thundering-herd on cache miss, dogpile on key expiry, Redlock vs SET-NX-PX choice, sliding-window rate limiter, hot-key on a single cluster slot, big-key blowup, MULTI/EXEC across slots, KEYS in production. NOT for Redis Cluster operations/admin (different domain), embedded KV (SQLite, leveldb), in-process LRU caches, or Memcached.

8SKILL.mdUpdated Jul 19, 2026

curiositech/redis-patterns-expert

curiositech/react-server-components-boundary

tools

VerifiedTrustedCommunity

Drawing the `'use client'` boundary correctly in React Server Components apps (Next.js App Router, RSC frameworks) — leaf-pushing, slot composition, serialization rules, and environment poisoning prevention. Grounded in react.dev and Next.js 16 docs.

8SKILL.mdUpdated Jul 19, 2026

curiositech/react-server-components-boundary

curiositech/rate-limiting-strategy

development

VerifiedTrustedCommunity

Use when designing rate limiting for an API, choosing between token bucket / sliding window / leaky bucket / fixed window, implementing it in Redis, deciding edge (Cloudflare/Upstash) vs origin enforcement, sizing per-user vs per-IP vs per-endpoint quotas, returning the right 429 response with Retry-After, or fixing the boundary-burst bug in fixed-window limiters. Triggers: 429 too many requests, INCR + EXPIRE, ZADD + ZREMRANGEBYSCORE + ZCARD, X-RateLimit-Remaining header, Cloudflare WAF rate limiting rules, Upstash @upstash/ratelimit, leaky bucket shaping vs policing, distributed rate limiter consistency. NOT for DDoS mitigation specifically (different scale), CAPTCHA / bot management, full WAF design, or per-user quota billing.

8SKILL.mdUpdated Jul 19, 2026

curiositech/rate-limiting-strategy

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/curiositech/windags-skills.git

# Copy into Claude Code skills folder (global)
cp -r windags-skills/skills/conditions-for-intuitive-expertise-a-fai ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

curiositech/windags-skills

3 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT