skills/shadowbox-expertise-transfer/SKILL.md
ShadowBox training method for accelerating expertise transfer through expert decision comparison exercises
npx skillsauth add curiositech/windags-skills shadowbox-expertise-transferInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Accelerate expertise transfer through comparison-based calibration. Trainees experience the gap between their thinking and expert thinking, then discover why that gap exists through self-directed reflection. Creates 18% performance improvement through divergence detection, not instruction.
IF scenario reveals critical cue recognition gaps
→ Use ATTENTION box ("What are you noticing right now?")
IF scenario reveals prioritization failures
→ Use ACTION PRIORITY box ("What should be done first?")
IF scenario involves information gathering phase
→ Use INFORMATION box ("What information do you need?")
IF scenario has temporal dynamics/future states
→ Use ANTICIPATION box ("What do you expect to happen?")
IF scenario requires situational evaluation
→ Use ASSESSMENT box ("What's your evaluation of the situation?")
IF scenario involves ongoing tracking needs
→ Use MONITORING box ("What will you track going forward?")
IF expert consensus ≥ 90%
→ Present as strong pattern, show minority rationale
→ Flag as "core expertise marker"
IF expert consensus 70-89%
→ Present as typical expert range
→ Emphasize reasoning quality over conformity
IF expert consensus 60-69%
→ Present as legitimate disagreement zone
→ Deep dive into minority positions with full rationale
IF expert consensus < 60%
→ Flag scenario as too ambiguous or poorly designed
→ Consider revision or removal
IF trainee response matches expert consensus (70%+)
→ Show expert rationale to reinforce pattern
→ Minimal intervention
IF trainee response partially aligns (30-70% match)
→ Reveal expert distribution + rationale
→ Ask: "What differences do you notice?"
IF trainee response diverges completely (<30% match)
→ Show expert consensus + minority positions
→ Force reflection: "Why might experts have seen this differently?"
IF trainee shows frustration with expert disagreement
→ Reinforce that 100% consensus never occurs
→ Frame disagreement as expertise complexity, not confusion
Symptoms: Adding explanatory content, principles, or "here's what you should learn" guidance alongside scenarios Detection Rule: If you find yourself explaining what the gap means or providing teaching moments beyond expert rationale Root Cause: Discomfort with self-directed discovery process Fix: Remove all instructional content. Trust comparison process. When learners ask "what should I learn?", redirect: "What differences do you notice between your response and the expert panel?"
Symptoms: Removing scenarios where experts disagree or trying to adjudicate "correct" expert responses Detection Rule: If expert consensus consistently exceeds 90% across all scenarios or minority positions are hidden Root Cause: Misunderstanding that expertise involves defensible reasoning under uncertainty, not convergence Fix: Embrace 70-85% consensus as ideal teaching range. Actively share minority expert positions with full rationale. Frame disagreement as valuable learning about reasoning quality.
Symptoms: Allowing trainees to revise earlier responses after seeing new information or expert responses Detection Rule: If trainees can change previous box responses after subsequent reveals Root Cause: Avoiding discomfort of consequential early decisions Fix: Lock responses permanently once submitted. Make "no look-back" rule explicit. The discomfort of living with flawed initial assessment is where learning happens.
Symptoms: Using same box type repeatedly or randomly selecting box types without strategic purpose Detection Rule: If all decision points use same cognitive dimension or box selection appears arbitrary Root Cause: Not understanding that different dimensions assess different expertise facets Fix: Map each decision point to specific cognitive patterns you want to reveal. Use dimensional diagnosis over time to identify specific expertise gaps (e.g., strong at noticing, weak at prioritizing).
Symptoms: Evaluating whether trainee responses "match" expert responses rather than comparing reasoning patterns Detection Rule: If feedback focuses on whether conclusions align rather than reasoning quality Root Cause: Treating method as assessment tool rather than learning intervention Fix: Compare rationale, not just conclusions. A trainee with different but sound reasoning may be developing expertise along different pathway—explore rather than correct.
Context: Training AI agents to review system architecture decisions. Scenario involves microservices decomposition decision with performance, maintainability, and team structure trade-offs.
Stage 1 - Information Gathering
Stage 2 - Initial Assessment
Stage 3 - Action Priority
Key Trade-offs Revealed:
Session completion checklist:
Do NOT use ShadowBox method for:
Delegate to other skills when:
cognitive-bias-identification skill insteadexpertise-assessment-frameworks skill insteadcompetency-progression-mapping skill insteadexpert-shadowing-protocols skill insteaddecision-quality-assessment skill insteadAppropriate domains:
tools
Building resilient distributed systems with circuit breakers, retries with full-jitter exponential backoff, retry budgets (per-request 3-attempt + per-client 10% ratio per Google SRE), deadline propagation, and the cascading-failure math (4 layers × 3 retries = 64x amplification). Grounded in Resilience4j, Microsoft Cloud Patterns, AWS Architecture Blog (Marc Brooker), and Google SRE Book.
testing
Designing HTTP cache headers that work correctly across browsers, CDNs, and shared proxies — `Cache-Control` directives per RFC 9111, `stale-while-revalidate` and `stale-if-error` per RFC 5861, the Vary header for varying responses, and surrogate keys for tag-based purging. Grounded in IETF RFCs and Cloudflare/Fastly docs.
development
Use when designing or fixing a Content Security Policy on a real site, choosing between nonce-based and hash-based CSP, adding strict-dynamic, debugging "Refused to execute inline script" errors, deploying CSP in report-only mode first, configuring report-to / report-uri, or auditing an existing policy for unsafe-inline / unsafe-eval / wildcards. Triggers: "CSP blocks legitimate inline script", strict-dynamic, nonce-{RANDOM}, sha256-{HASH}, object-src none, base-uri none, frame-ancestors, Trusted Types, X-Content-Security-Policy obsolete, report-only vs enforced. NOT for general HTTP security headers (HSTS, COOP/COEP), Trusted Types deep dive, CORS configuration, or building a WAF.
tools
Choosing and operating an HTTP API versioning strategy that doesn't break clients — Stripe's date-based pinned versions, the Deprecation/Sunset header pair (RFC 9745 + RFC 8594), URI vs header vs media-type approaches, and the version-transformer pattern. Grounded in Stripe's published architecture and IETF RFCs.