Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

wenmin-wu/timeseries-recursive-multistep-forecasting

Name: timeseries-recursive-multistep-forecasting
Author: wenmin-wu

skills/timeseries/recursive-multistep-forecasting/SKILL.md

npx skillsauth add wenmin-wu/ds-skills timeseries-recursive-multistep-forecasting

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Overview

Direct multi-step forecasting (one model per horizon day) is expensive: 28 days = 28 models. Recursive forecasting trains one one-step model and reuses it for every horizon day by feeding its own predictions back as inputs. The trick is that lag features (sales.shift(7)) and rolling features (sales.rolling(28).mean()) at horizon day H+k depend on predictions from days H..H+k-1, so you must recompute them inside the prediction loop, not once before. Done correctly, you get 28-day forecasts with the same model that scored well on 1-step validation. Done wrong (precomputed features that don't see the predictions), you get garbage.

Quick Start

import pandas as pd
from datetime import timedelta

base_test = build_panel_with_unknown_target()   # rows for entire horizon, sales=NaN

for h in range(1, 29):                          # 28-day horizon
    day = first_forecast_day + timedelta(days=h - 1)

    # window includes max_lags days before today so we can recompute features
    window = base_test[
        (base_test.date >= day - timedelta(days=max_lags)) &
        (base_test.date <= day)
    ].copy()

    create_features(window)                     # lags + rollings using up-to-day data

    today = window.loc[window.date == day, train_cols]
    yhat = alpha * model.predict(today)         # alpha = bias-correction multiplier

    base_test.loc[base_test.date == day, 'sales'] = yhat

Workflow

Build a panel that already contains the future horizon rows with sales=NaN
Loop h over the horizon days
Slice a sliding window covering [day - max_lags, day] so feature creation has enough history
Recompute lag and rolling features on the window — this is the step everyone misses
Predict for the current day, optionally apply a bias-correction multiplier (alpha ≈ 1.02-1.03 for Poisson)
Write the prediction back into base_test so the next iteration's features see it
After the loop, the entire horizon column is filled

Key Decisions

Recompute features inside the loop: precomputing lag-7 once before the loop means day 8's "lag-7" is actually a previously-predicted day-1 — but you need that prediction to exist before you compute it. The only correct order is predict → write → recompute → predict.
Window slice for speed: don't recompute features on the entire panel — just on [day - max_lags, day]. The rest is irrelevant for today's prediction.
Bias-correction multiplier alpha: Poisson and Tweedie LightGBM consistently underpredict by 2-3%; multiply by ~1.02-1.03 (tuned on validation) before writing back, otherwise the bias compounds across the horizon.
Single-store batches: if you train per-store models, group by store inside the loop to amortize the model load.
vs. direct multi-output: direct is more accurate for short horizons, recursive scales better and uses one model.

References

M5 First Public Notebook Under 0.50
M5 - Three shades of Dark: Darker magic

wenmin-wu/timeseries-recursive-multistep-forecasting

skills/timeseries/recursive-multistep-forecasting/SKILL.md

Forecast a multi-step horizon by predicting one day ahead, writing the prediction back into the panel as the new "actual", recomputing all lag and rolling features that depend on it, then predicting the next day — turns a one-step LightGBM regressor into a 28-day forecaster without changing the model

31 stars

data-ai

Updated Apr 22, 2026

$ install --global

skillsauth

npx skillsauth add wenmin-wu/ds-skills timeseries-recursive-multistep-forecasting

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 22, 2026, 10:55 AM188.4s1 file scanned

SKILL.md

name:: timeseries-recursive-multistep-forecasting
description:: Forecast a multi-step horizon by predicting one day ahead, writing the prediction back into the panel as the new "actual", recomputing all lag and rolling features that depend on it, then predicting the next day — turns a one-step LightGBM regressor into a 28-day forecaster without changing the model

Overview

Quick Start

import pandas as pd
from datetime import timedelta

base_test = build_panel_with_unknown_target()   # rows for entire horizon, sales=NaN

for h in range(1, 29):                          # 28-day horizon
    day = first_forecast_day + timedelta(days=h - 1)

    # window includes max_lags days before today so we can recompute features
    window = base_test[
        (base_test.date >= day - timedelta(days=max_lags)) &
        (base_test.date <= day)
    ].copy()

    create_features(window)                     # lags + rollings using up-to-day data

    today = window.loc[window.date == day, train_cols]
    yhat = alpha * model.predict(today)         # alpha = bias-correction multiplier

    base_test.loc[base_test.date == day, 'sales'] = yhat

Workflow

Build a panel that already contains the future horizon rows with sales=NaN
Loop h over the horizon days
Slice a sliding window covering [day - max_lags, day] so feature creation has enough history
Recompute lag and rolling features on the window — this is the step everyone misses
Predict for the current day, optionally apply a bias-correction multiplier (alpha ≈ 1.02-1.03 for Poisson)
Write the prediction back into base_test so the next iteration's features see it
After the loop, the entire horizon column is filled

Key Decisions

Recompute features inside the loop: precomputing lag-7 once before the loop means day 8's "lag-7" is actually a previously-predicted day-1 — but you need that prediction to exist before you compute it. The only correct order is predict → write → recompute → predict.
Window slice for speed: don't recompute features on the entire panel — just on [day - max_lags, day]. The rest is irrelevant for today's prediction.
Bias-correction multiplier alpha: Poisson and Tweedie LightGBM consistently underpredict by 2-3%; multiply by ~1.02-1.03 (tuned on validation) before writing back, otherwise the bias compounds across the horizon.
Single-store batches: if you train per-store models, group by store inside the loop to amortize the model load.
vs. direct multi-output: direct is more accurate for short horizons, recursive scales better and uses one model.

References

M5 First Public Notebook Under 0.50
M5 - Three shades of Dark: Darker magic

Related Skills

wenmin-wu/timeseries-scaled-pinball-loss

data-ai

VerifiedTrustedCommunity

Scaled Pinball Loss (SPL) metric for evaluating quantile forecasts, normalized by mean absolute successive differences of training data

31SKILL.mdUpdated Apr 23, 2026

wenmin-wu/timeseries-scaled-pinball-loss

wenmin-wu/timeseries-retroactive-outlier-rescaling

data-ai

VerifiedTrustedCommunity

Walk backward through a time series and multiplicatively rescale segments when jumps exceed a fraction of the running mean to correct data collection anomalies

31SKILL.mdUpdated Apr 23, 2026

wenmin-wu/timeseries-retroactive-outlier-rescaling

wenmin-wu/timeseries-ratio-target-for-smape

testing

VerifiedTrustedCommunity

Transform forecasting target to next/current ratio minus one so that optimizing MAE or squared error implicitly minimizes SMAPE

31SKILL.mdUpdated Apr 23, 2026

wenmin-wu/timeseries-ratio-target-for-smape

wenmin-wu/timeseries-quantile-ratio-scaling

tools

VerifiedTrustedCommunity

Convert point forecasts to prediction intervals by scaling with logit-transformed quantile ratios passed through a Normal CDF

31SKILL.mdUpdated Apr 23, 2026

wenmin-wu/timeseries-quantile-ratio-scaling

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/wenmin-wu/ds-skills.git

# Copy into Claude Code skills folder (global)
cp -r ds-skills/skills/timeseries/recursive-multistep-forecasting ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

wenmin-wu/ds-skills

31 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT