skills/mlb-regression-flagger/SKILL.md
Identifies fantasy baseball players whose surface stats (wOBA, ERA, batting average) are diverging from their underlying Statcast quality (xwOBA, FIP, xBA) — emits a `regression_index` from -100 (very lucky, sell high) to +100 (very unlucky, buy low). Primary signal for buy-low/sell-high decisions on trades and waivers. Use when user mentions "buy low", "sell high", "regression candidate", "lucky", "unlucky", "xwOBA gap", "ERA-FIP gap", "BABIP", "due for regression", or is deciding whether to trade for / trade away a player based on over- or under-performance.
npx skillsauth add lyndonkl/claude mlb-regression-flaggerInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Scenario: User asks "Is Junior Caminero a buy-low candidate or am I chasing noise?"
Raw data gathered from Baseball Savant + FanGraphs (cite URLs):
Regression index calculation:
regression_index = (xwOBA - wOBA) x 500
= (.380 - .310) x 500
= +35
Secondary check — BABIP: .255 is below the .240 lower-outlier threshold for hitters only if we tighten further, but .255 plus elite contact quality confirms unlucky. Not a "will correct to .300" player, but unlikely to stay at .255 given the batted-ball profile.
Verdict: regression_index = +35 → BUY. Caminero has been unlucky; underlying contact quality is elite. Expect his surface stats to catch up over the next 30 days.
User-facing translation (per CLAUDE.md rule #5 — beginner writing):
"Junior Caminero's raw numbers look mediocre (.310 wOBA — that's a catch-all hitting stat), but the quality of his actual contact (how hard and at what angle he hits the ball) suggests he should be hitting .380. The gap is one of the biggest in the league. ADD if available, or OFFER a trade for him before his stats correct."
Signal file emission (per signal-framework.md):
---
type: regression
date: 2026-04-17
emitted_by: mlb-regression-flagger
player: Junior Caminero
regression_index: +35
direction: BUY
confidence: 0.78
source_urls:
- https://baseballsavant.mlb.com/savant-player/junior-caminero-...
- https://www.fangraphs.com/players/junior-caminero/...
---
Copy this checklist and track progress:
Regression Flagger Progress:
- [ ] Step 1: Define the candidate list (who to check)
- [ ] Step 2: Web-search Savant for expected stats (xwOBA, xERA, xBA)
- [ ] Step 3: Web-search FanGraphs for surface stats (wOBA, ERA, BABIP, HR/FB)
- [ ] Step 4: Compute regression_index per player
- [ ] Step 5: Apply secondary flags (BABIP, HR/FB)
- [ ] Step 6: Emit signal file + user-facing translation
Step 1: Define the candidate list
Who are we checking? Three entry points:
Step 2: Web-search Baseball Savant for expected stats
See resources/methodology.md for exact queries and URL patterns.
xwOBA, xBA, xSLG, Barrel %, Hard-Hit %xERA, Expected Batting Average against, Whiff %, Chase %source_urls for the signal fileStep 3: Web-search FanGraphs for surface stats
See resources/methodology.md for query patterns.
wOBA, BABIP, AVG, OBP, SLGERA, FIP, xFIP, HR/FB %, LOB %Step 4: Compute regression_index
See resources/methodology.md for the formulas.
regression_index = (xwOBA - wOBA) x 500, clamped to ±100regression_index = (FIP - ERA) x 50, clamped to ±100
Step 5: Apply secondary flags
regression_index. When they agree, confidence goes up; when they disagree, confidence drops and the player goes into a "watchlist" bucket instead of an actionable BUY/SELL.Step 6: Emit signal file + user-facing translation
signals/YYYY-MM-DD-regression.md with frontmatter per signal-framework.mdBUY / SELL / HOLD / ADD / DROP (per CLAUDE.md rule #6)Pattern 1: The Elite-Contact, Bad-Luck Hitter (classic BUY)
+35, BUY.Pattern 2: The Lucky Journeyman (classic SELL)
-22, SELL.Pattern 3: The Pitcher-Lucky Starter (SELL)
(FIP - ERA) x 50 = (4.20 - 2.50) x 50 = +85. But for pitchers a large FIP-ERA gap means he has been LUCKY → flip sign → -85, SELL. See methodology for sign convention.Pattern 4: The Pitcher-Unlucky Starter (BUY)
Pattern 5: Conflicting Signals (WATCHLIST, not actionable)
regression_index with reduced confidence (< 0.5) and place on watchlist for another week.Sample-size minimum: Do not emit a regression_index for a hitter with fewer than 80 PAs or a pitcher with fewer than 30 IP. Statcast expected stats are noisy below that. If below threshold, emit confidence: 0.3 and label too-early.
Sign convention for pitchers: (FIP - ERA) x 50 gives a raw number. A POSITIVE raw number means ERA < FIP → the pitcher has been LUCKY → negate to produce a NEGATIVE regression_index (SELL signal). A NEGATIVE raw number means ERA > FIP → UNLUCKY → negate to POSITIVE (BUY). Always negate. See methodology for the worked example. Hitters use the raw sign directly.
Clamp to ±100: Extreme xwOBA gaps (>.200) are almost always small-sample artifacts. Clamp hard. Do not report regression_index of +140.
Park and platoon adjustments: xwOBA from Savant is park-neutral but does not adjust for platoon splits. For a LHH playing in a tough-for-lefties park (e.g., Oracle) with many LHP starts ahead, the regression may be slower than the index suggests. Note as a caveat; do not recompute.
BABIP is secondary, not primary: A high BABIP alone is not a SELL signal if underlying contact quality is elite. Hitters with extreme exit velocity (e.g., Judge, Ohtani) sustain BABIPs above .340 for full seasons. Always cross-check BABIP against Hard-Hit % before flagging.
HR/FB normalization for pitchers: League-average HR/FB is ~12%. Pitchers regress toward their career HR/FB, not league average. A flyball pitcher's "normal" HR/FB is 14%; a groundball pitcher's is 9%. Use career or 3-year average when available; fall back to 12% only if no history.
Web-search every fact (per CLAUDE.md): Every number that enters the formula must come from a live web search with the URL recorded in source_urls:. Never fill in from memory or training data. If a search fails, degrade to confidence: 0.3 and flag in the red-team pass.
Translate for the beginner: The user has zero baseball knowledge. Every user-facing sentence that uses "xwOBA", "FIP", "BABIP" must include an inline translation on first use. See CLAUDE.md rule #5 and the Example above.
Core formulas:
HITTER: regression_index = clamp((xwOBA - wOBA) x 500, -100, +100)
positive = UNLUCKY (BUY); negative = LUCKY (SELL)
PITCHER: raw = (FIP - ERA) x 50
regression_index = clamp(-raw, -100, +100)
positive = UNLUCKY (BUY); negative = LUCKY (SELL)
SECONDARY (hitter): BABIP > .370 → lucky flag; BABIP < .240 → unlucky flag
SECONDARY (pitcher): HR/FB < 8% → lucky flag; HR/FB > 17% → unlucky flag
Action thresholds:
| regression_index | Direction | Fantasy action |
|---|---|---|
| ≥ +30 | Very unlucky | Aggressive BUY / ADD / high FAAB bid |
| +15 to +29 | Mildly unlucky | BUY if available cheaply |
| -14 to +14 | Noise | HOLD / no action |
| -29 to -15 | Mildly lucky | SELL if offered market-rate |
| ≤ -30 | Very lucky | Aggressive SELL / DROP / decline offers |
Data source map (per data-sources.md):
| Stat | Primary source | URL pattern | |---|---|---| | xwOBA, xBA, xSLG, Barrel %, Hard-Hit % | Baseball Savant | https://baseballsavant.mlb.com/savant-player/{slug}-{id} | | wOBA, BABIP (hitter) | FanGraphs | https://www.fangraphs.com/players/{slug}/{id} | | ERA, FIP, xFIP, HR/FB, LOB% | FanGraphs | same | | xERA | Baseball Savant | same |
Key resources:
Inputs required:
context/league-config.md and mlb-category-state-analyzer signals)Outputs produced:
regression_index per player (-100 to +100)BUY / SELL / HOLD / WATCHLISTsignals/YYYY-MM-DD-regression.mdtesting
--- name: advisory-edit description: A strict advisory-only editing discipline for a writer who dictates ("speaks out") essays and wants help WITHOUT having their voice changed. The editor directs structure, flags grammar, and suggests strategic language — but never modifies the writer's text unless the writer explicitly says "apply" / "make that change" / "rewrite this." Produces a line-referenced, suggestion-only critique where every item is marked the writer's call. Four passes: structural, l
testing
Provides the house style for analyst-grade strategist writing — third-person register with sparing first-person, no em dashes, no "not X, not Y, not Z" negation cascades, numbered footnote citations rather than inline source parentheticals, specific opinion-signaling phrases, and topic-forward paragraph structure modeled on voice patterns observed in Damodaran's Musings on Markets and Thompson's Stratechery. Use when consolidating working notes into a finished long-form strategist or analyst report that must read as written by a senior human analyst rather than an AI assistant.
testing
Renders a markdown report to a PDF using pandoc with xelatex (11pt serif body, 1-inch margins, numbered footnotes, formal heading hierarchy). Requires a one-time install of pandoc and a LaTeX engine on the user's machine — basictex on macOS or texlive-xetex on Linux. Does not attempt automatic install. Fails loudly with the exact install commands if pandoc or xelatex is missing on the user's PATH. Use when producing a finished strategist or analyst report PDF from a polished markdown source.
testing
Produces step-by-step computational walkthroughs of vector and matrix operations as a sequence of numbered "frames", showing the explicit state at each step. The text-equivalent of a 3Blue1Brown animation — each frame shows what changed and why, so the learner can re-trace the operation by hand. Use when the learner needs to *see* a computation unfold (eigenvalue computation, attention with 3 tokens, gradient descent step, SVD on a 2×2, layer norm on a 3-vector, softmax of a small input), when an explanation has been given but the learner needs to ground it in a worked example, or when introducing an operation that's intimidating in symbol form but trivial in pencil-and-paper form.