curiositech/cse-state-of-practice

Name: cse-state-of-practice
Author: curiositech

skills/cse-state-of-practice/SKILL.md

npx skillsauth add curiositech/windags-skills cse-state-of-practice

Cognitive Systems Engineering State of Practice

Apply insights from cognitive systems engineering research to design resilient agent architectures, diagnose coordination failures, and encode expert knowledge that performs under pressure.

Decision Points

Agent Architecture Design

START: Need to design multi-agent system
│
├─ Is this a well-defined, stable task sequence?
│  ├─ YES → Use pipeline architecture BUT build 3 failure recovery paths
│  └─ NO → Use goal-oriented architecture with alternative methods
│
├─ Does task require expertise under time pressure?
│  ├─ YES → Implement Recognition-Primed Decision Making pattern
│  │        (situation recognition → rapid simulation → action)
│  └─ NO → Standard deliberative architecture acceptable
│
└─ Will humans supervise or intervene?
   ├─ YES → Mandatory: mode transparency + shared state representation
   └─ NO → Focus on agent-to-agent coordination interfaces
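
The three questions in the tree above are independent, so each one contributes its own recommendation. A minimal sketch of that reading follows; `TaskProfile` and `recommend_architecture` are hypothetical names introduced for illustration, not part of the skill.

```python
from dataclasses import dataclass


@dataclass
class TaskProfile:
    """Hypothetical answers to the three questions in the decision tree."""
    stable_sequence: bool
    expertise_under_time_pressure: bool
    human_supervision: bool


def recommend_architecture(profile: TaskProfile) -> list[str]:
    """Return one recommendation per question, in the order the tree asks them."""
    recommendations = []
    if profile.stable_sequence:
        recommendations.append("pipeline architecture with 3 failure recovery paths")
    else:
        recommendations.append("goal-oriented architecture with alternative methods")
    if profile.expertise_under_time_pressure:
        recommendations.append("recognition-primed decision making pattern")
    else:
        recommendations.append("standard deliberative architecture")
    if profile.human_supervision:
        recommendations.append("mode transparency + shared state representation")
    else:
        recommendations.append("agent-to-agent coordination interfaces")
    return recommendations


# Example: an unstable task, time-pressured expertise, humans in the loop.
plan = recommend_architecture(TaskProfile(False, True, True))
```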

Task Decomposition Strategy

Given complex task to decompose:
│
├─ Can expert describe complete process reliably?
│  ├─ YES → Verify with Critical Decision Method anyway
│  └─ NO → Use structured cognitive task analysis FIRST
│
├─ Are there natural failure/degradation points?
│  ├─ YES → Design alternative paths for each failure mode
│  └─ NO → Suspicious - dig deeper for hidden failure modes
│
└─ Will agents need to adapt methods to context?
   ├─ YES → Separate goals from methods in specification
   └─ NO → Fixed sequence acceptable (rare case)
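
One possible way to separate goals from methods is to attach an ordered list of interchangeable methods to each goal and fall through to the next method when one fails. The sketch below assumes that design; `Goal`, `fetch_from_api`, and `fetch_from_cache` are hypothetical names.

```python
from dataclasses import dataclass, field
from typing import Callable, Optional


@dataclass
class Goal:
    """A goal plus an ordered list of alternative methods (hypothetical structure)."""
    name: str
    methods: list[Callable[[], Optional[str]]] = field(default_factory=list)

    def achieve(self) -> str:
        """Try each method in order; a method fails by raising or returning None."""
        errors = []
        for method in self.methods:
            try:
                result = method()
            except Exception as exc:  # treat a failed method as a degradation point
                errors.append(f"{method.__name__}: {exc}")
                continue
            if result is not None:
                return result
            errors.append(f"{method.__name__}: returned no result")
        raise RuntimeError(f"goal '{self.name}' failed; tried {errors}")


def fetch_from_api() -> Optional[str]:
    raise ConnectionError("API unavailable")  # simulated tool outage


def fetch_from_cache() -> Optional[str]:
    return "cached report"


goal = Goal("obtain report", [fetch_from_api, fetch_from_cache])
result = goal.achieve()  # falls back to the cache when the API method fails
```

The goal stays fixed while the method list can grow as new failure modes are discovered.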

Failure Diagnosis Protocol

System failing unexpectedly:
│
├─ Does it work in demos but fail in production?
│  └─ YES → Invariant sequence assumption violated
│
├─ Are humans surprised by agent actions?
│  └─ YES → Automation surprise - check mode transparency
│
├─ Do agents fail when tools/data unavailable?
│  └─ YES → Missing alternative paths in decomposition
│
└─ Does agent understand but can't execute effectively?
   └─ YES → Knowing-doing gap - check situated context
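
The protocol above amounts to a symptom-to-diagnosis lookup. A minimal sketch, assuming symptoms are recorded as short string keys (the key names are hypothetical):

```python
# Maps each observable symptom to the diagnosis given in the protocol.
DIAGNOSES = {
    "works_in_demo_fails_in_production": "Invariant sequence assumption violated",
    "humans_surprised_by_actions": "Automation surprise: check mode transparency",
    "fails_when_tools_unavailable": "Missing alternative paths in decomposition",
    "understands_but_cannot_execute": "Knowing-doing gap: check situated context",
}


def diagnose(observed: set[str]) -> list[str]:
    """Return every diagnosis whose symptom was observed, in protocol order."""
    return [diagnosis for symptom, diagnosis in DIAGNOSES.items() if symptom in observed]


findings = diagnose({"fails_when_tools_unavailable", "humans_surprised_by_actions"})
```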

Failure Modes

1. Pipeline Worship

Symptoms: System works perfectly in happy path, crashes at first unexpected condition
Detection Rule: If you hear "we need to handle the edge case" more than once, you're in this anti-pattern
Root Cause: Designed for idealized sequence, no alternative paths
Fix: Redesign with goal/method separation, build 3 recovery paths for most common failures

2. Behavioral Specification Fallacy

Symptoms: Agent mimics expert actions but can't adapt to novel situations
Detection Rule: If the expert says "I don't know how I knew that" and the system specification doesn't capture cue recognition, you're in this anti-pattern
Root Cause: Encoded surface behavior without underlying reasoning structure
Fix: Use Critical Decision Method to elicit tacit knowledge and situation assessment patterns

3. Silent Mode Transitions

Symptoms: Humans/agents surprised when system changes behavior or strategy
Detection Rule: If stakeholders say "I had no idea it was doing that," automation surprise is occurring
Root Cause: State changes not communicated across coordination boundaries
Fix: Make every mode transition an explicit coordination event with shared state updates
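
A minimal sketch of the fix for silent mode transitions, assuming a single shared state object that every supervisor or peer agent can subscribe to. `ModeChange` and `SharedState` are hypothetical names, as are the mode labels.

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class ModeChange:
    agent: str
    old_mode: str
    new_mode: str
    reason: str


@dataclass
class SharedState:
    """Holds each agent's current mode and notifies subscribers on every change."""
    modes: dict[str, str] = field(default_factory=dict)
    subscribers: list[Callable[[ModeChange], None]] = field(default_factory=list)

    def set_mode(self, agent: str, new_mode: str, reason: str) -> None:
        old_mode = self.modes.get(agent, "unset")
        if old_mode == new_mode:
            return  # nothing changed, so there is nothing to announce
        self.modes[agent] = new_mode
        change = ModeChange(agent, old_mode, new_mode, reason)
        for notify in self.subscribers:
            notify(change)  # the transition is an explicit coordination event


log: list[str] = []
state = SharedState()
state.subscribers.append(
    lambda c: log.append(f"{c.agent}: {c.old_mode} -> {c.new_mode} ({c.reason})")
)
state.set_mode("planner", "explore", "initial assignment")
state.set_mode("planner", "exploit", "confidence threshold reached")
```

Because the only way to change a mode is through `set_mode`, a transition cannot happen without the shared state and its observers being updated.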

4. Representation Divergence

Symptoms: Agent handoffs produce errors despite individual agents working correctly
Detection Rule: If output from Agent A is misinterpreted by Agent B consistently
Root Cause: Agents maintain different models of task/world state
Fix: Explicit shared ontology and interface state verification
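
Interface state verification can be as simple as checking every handoff message against a vocabulary both agents share. The sketch below assumes a dictionary-based message; the field names and allowed values in `HANDOFF_SCHEMA` are hypothetical.

```python
# Shared ontology for one handoff: each field and the values both agents agree on.
HANDOFF_SCHEMA = {
    "risk_level": ("low", "medium", "high"),
    "status": ("approve", "iterate", "escalate", "reject"),
}


def verify_handoff(message: dict) -> list[str]:
    """Return a list of problems; an empty list means the handoff is well-formed."""
    problems = []
    for field_name, allowed in HANDOFF_SCHEMA.items():
        if field_name not in message:
            problems.append(f"missing field: {field_name}")
        elif message[field_name] not in allowed:
            problems.append(f"unknown value for {field_name}: {message[field_name]!r}")
    return problems


ok = verify_handoff({"risk_level": "high", "status": "approve"})
bad = verify_handoff({"risk_level": "critical"})
```

Running the check on the receiving side catches divergence at the boundary instead of several steps downstream.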

5. Novice Architecture for Expert Tasks

Symptoms: System follows rules perfectly but fails under pressure or novel conditions
Detection Rule: If system can't explain WHY it chose an action, only WHAT rule it followed
Root Cause: Rule-based architecture deployed for expertise-requiring task
Fix: Upgrade to recognition-primed or case-based reasoning architecture
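
A rough sketch of the recognition step in a case-based design: the agent matches observed cues against a case library and reports which cues drove the choice, so it can explain why it acted. The sketch omits the mental-simulation step, and the case library contents are hypothetical; real entries would come from expert elicitation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Case:
    situation: str
    cues: frozenset
    action: str


# Hypothetical case library for illustration only.
CASE_LIBRARY = [
    Case("database overload", frozenset({"slow queries", "connection errors"}), "shed load"),
    Case("bad deploy", frozenset({"error spike", "recent release"}), "roll back"),
]


def recognize(observed_cues: set) -> tuple:
    """Return (action, explanation) for the case sharing the most cues."""
    best = max(CASE_LIBRARY, key=lambda case: len(case.cues & observed_cues))
    matched = best.cues & observed_cues
    if not matched:
        return "escalate", "no known situation matched the observed cues"
    return best.action, f"recognized '{best.situation}' from cues {sorted(matched)}"


action, why = recognize({"error spike", "recent release", "slow queries"})
```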

Worked Examples

Example 1: Multi-Agent Code Review System

Scenario: Design system where Agent A writes code, Agent B reviews, Agent C handles deployment

Initial Design (Flawed):

Linear pipeline: Code → Review → Deploy
Binary review outcome: Pass/Fail
Fixed criteria checklist

CSE Analysis Reveals:

Expert code reviewers don't use checklists - they recognize code smells and risk patterns
Reviews adapt to code complexity, author experience, deployment criticality
Real reviewers often iterate with authors, not just reject

Improved Design:

Agent A: Code Generation
├─ Includes intention metadata (what problem it solves, why this approach)
└─ Context flags (urgency, risk level, author confidence)

Agent B: Recognition-Primed Review
├─ Situation assessment (code type, risk factors, author patterns)
├─ Pattern matching against failure libraries
└─ Graduated response: approve/iterate/escalate/reject

Agent C: Context-Sensitive Deployment
├─ Deployment strategy adapts to review confidence + context flags
└─ Rollback paths pre-planned based on risk assessment

Key Decision Points Applied:

Separated goals (ensure code quality) from methods (checklist vs pattern recognition)
Added alternative paths (iteration loop, escalation)
Made handoff state explicit (intention metadata, confidence levels)
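
To make the graduated response concrete, here is one possible policy for Agent B. The `Submission` fields mirror the handoff state described above, but the thresholds and the rule for high-risk changes are assumptions chosen for illustration.

```python
from dataclasses import dataclass


@dataclass
class Submission:
    intention: str                 # what problem the change solves and why this approach
    risk_level: str                # "low", "medium", or "high"
    matched_failure_patterns: int  # hits against a failure-pattern library


def review(sub: Submission) -> str:
    """Return approve, iterate, escalate, or reject (hypothetical thresholds)."""
    if sub.matched_failure_patterns == 0:
        # Clean code still gets a second look when the stakes are high.
        return "escalate" if sub.risk_level == "high" else "approve"
    if sub.matched_failure_patterns <= 2:
        return "iterate"  # work with the author instead of rejecting outright
    return "reject"


outcome = review(Submission("fix retry loop", "medium", 1))
```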

Example 2: Diagnosing Customer Service Agent Failures

Problem: AI customer service agent handles routine queries well but escalates too often on complex issues

Diagnosis Process:

Check for Invariant Sequence: Agent follows script linearly, can't adapt when customer deviates
Examine Situation Recognition: Agent uses keyword matching, not contextual assessment
Test Alternative Paths: Agent has no recovery strategies when first approach fails

Root Cause: Behavioral specification fallacy - system trained on successful interaction transcripts but missing expert reasoning about when/how to adapt

Solution:

Interview expert human agents using Critical Decision Method
Identify situation types and recognition cues
Build case library of adaptation strategies
Add confidence scoring to enable graduated escalation
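
Graduated escalation can be sketched as a mapping from a confidence score to an action, so that handing off to a human is the last step and not the only alternative. The thresholds and action names below are assumptions.

```python
def escalation_action(confidence: float) -> str:
    """Map a confidence score in [0, 1] to a graduated response (hypothetical cutoffs)."""
    if not 0.0 <= confidence <= 1.0:
        raise ValueError("confidence must be between 0 and 1")
    if confidence >= 0.8:
        return "answer"
    if confidence >= 0.5:
        return "answer_and_confirm"
    if confidence >= 0.3:
        return "try_alternative_strategy"
    return "escalate_to_human"


actions = [escalation_action(c) for c in (0.9, 0.6, 0.35, 0.1)]
```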

Quality Gates

Task completion checklist for CSE-informed agent design:

[ ] Alternative Path Coverage: System has defined recovery paths for 3 most likely failure modes
[ ] Situation Recognition: Agent can classify situation type, not just process inputs
[ ] Mode Transparency: All state changes are observable by supervisors/coordinators
[ ] Representation Alignment: Agent handoffs use explicit, shared state models
[ ] Expertise Stage Match: Architecture complexity matches required expertise level
[ ] Tacit Knowledge Elicitation: Used structured methods (not just self-report) for expert knowledge
[ ] Context Sensitivity: System adapts methods to situational factors
[ ] Knowing-Doing Verification: Tested execution capability, not just comprehension
[ ] Coordination Failure Recovery: System handles representational divergence gracefully
[ ] Automation Surprise Prevention: Mode changes communicated across all coordination boundaries
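
If the checklist is tracked programmatically, a small helper can report which gates remain open. This is a sketch only; `unmet_gates` is a hypothetical helper and the gate names are copied from the checklist.

```python
QUALITY_GATES = [
    "Alternative Path Coverage",
    "Situation Recognition",
    "Mode Transparency",
    "Representation Alignment",
    "Expertise Stage Match",
    "Tacit Knowledge Elicitation",
    "Context Sensitivity",
    "Knowing-Doing Verification",
    "Coordination Failure Recovery",
    "Automation Surprise Prevention",
]


def unmet_gates(passed: set[str]) -> list[str]:
    """Return the gates not yet passed, in checklist order."""
    unknown = passed - set(QUALITY_GATES)
    if unknown:
        raise ValueError(f"unknown gate names: {sorted(unknown)}")
    return [gate for gate in QUALITY_GATES if gate not in passed]


remaining = unmet_gates({"Mode Transparency", "Context Sensitivity"})
```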

NOT-FOR Boundaries

This skill is NOT for:

Pure UI/UX design problems → Use interaction design skills instead
Simple deterministic tasks with no failure modes → Use standard pipeline architecture
Tasks where behavioral observation captures all relevant expertise → Use direct behavioral modeling
Systems with no human supervision or agent coordination → Use individual agent optimization

Use OTHER skills for:

Interface layout and visual design → UI/UX skills
Mathematical optimization problems → Operations research skills
Data pipeline engineering → Data engineering skills
Individual agent prompt optimization → Prompt engineering skills

This skill IS specifically for:

Multi-agent coordination architecture
Expert knowledge elicitation and encoding
Failure mode prediction and mitigation
Human-AI handoff design
Complex task decomposition under uncertainty

Review of Cognitive Systems Engineering applications and current practice in safety-critical domains

data-ai

Updated Apr 4, 2026

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

npx skillsauth add curiositech/windags-skills cse-state-of-practice

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Trivy: Container and dependency vulnerability scanner. Clean (95%)
Semgrep: Static code analysis for vulnerabilities. Clean (95%)
mcp-scan (Snyk): Model Context Protocol security validation. Clean (95%)
Snyk (dep): Open source security scanning. Skipped (50%)
Socket.dev: Supply chain security analysis. Skipped (50%)
VirusTotal: Multi-engine malware detection. Skipped (50%)
CrowdStrike: Advanced threat intelligence. Skipped (50%)
OSV-Scanner: Open Source Vulnerability database check. Skipped (50%)
OWASP Dep-Check: Skipped (50%)

Last scanned: Apr 4, 2026, 1:59 PM (30.3s, 21 files scanned)

SKILL.md

license: Apache-2.0
name: cse-state-of-practice
description: Review of Cognitive Systems Engineering applications and current practice in safety-critical domains
category: Cognitive Science & Decision Making

Install manually

# Clone the repo
git clone https://github.com/curiositech/windags-skills.git

# Copy into Claude Code skills folder (global)
cp -r windags-skills/skills/cse-state-of-practice ~/.claude/skills/

Repository: curiositech/windags-skills

Compatible with: Claude Code, OpenAI Codex CLI, ChatGPT