skills/nlp/validation-finetune-continuation/SKILL.md
After training on the primary split, continues fine-tuning on the validation set to adapt the model to the target distribution before inference.
npx skillsauth add wenmin-wu/ds-skills nlp-validation-finetune-continuationInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
In cross-lingual or domain-shift competitions, training data comes from one distribution (e.g., English) and test data from another (e.g., multilingual). After standard training, continue fine-tuning on the validation set (which shares the test distribution) for a few extra epochs. This squeezes out extra performance by exposing the model to the target domain's patterns before submission.
# Phase 1: train on primary data
EPOCHS = 3
n_steps = len(x_train) // BATCH_SIZE
model.fit(train_dataset, steps_per_epoch=n_steps,
validation_data=valid_dataset, epochs=EPOCHS)
# Phase 2: continue on validation data (target distribution)
n_steps = len(x_valid) // BATCH_SIZE
model.fit(valid_dataset.repeat(),
steps_per_epoch=n_steps,
epochs=EPOCHS)
.repeat() if the validation set is small (avoids incomplete epochs)data-ai
Scaled Pinball Loss (SPL) metric for evaluating quantile forecasts, normalized by mean absolute successive differences of training data
data-ai
Walk backward through a time series and multiplicatively rescale segments when jumps exceed a fraction of the running mean to correct data collection anomalies
testing
Transform forecasting target to next/current ratio minus one so that optimizing MAE or squared error implicitly minimizes SMAPE
tools
Convert point forecasts to prediction intervals by scaling with logit-transformed quantile ratios passed through a Normal CDF