skills/nlp/cross-model-candidate-pool/SKILL.md
Merge candidate pools from multiple independent seq2seq models before MBR selection to reduce shared failure modes
npx skillsauth add wenmin-wu/ds-skills nlp-cross-model-candidate-poolInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Single-model MBR candidates share the same biases. Run 2+ independent models (different architectures or checkpoints), merge their candidate pools, then apply MBR on the combined pool. Reduces correlated errors and improves consensus quality.
def cross_model_mbr(models, tokenizer, input_ids, attn_mask, mbr_fn):
combined_pool = []
for model in models:
candidates = generate_diverse_candidates(model, tokenizer, input_ids, attn_mask)
combined_pool.extend(candidates)
# Unload model to free GPU memory
model.cpu()
del model
torch.cuda.empty_cache()
return mbr_fn(combined_pool)
data-ai
Scaled Pinball Loss (SPL) metric for evaluating quantile forecasts, normalized by mean absolute successive differences of training data
data-ai
Walk backward through a time series and multiplicatively rescale segments when jumps exceed a fraction of the running mean to correct data collection anomalies
testing
Transform forecasting target to next/current ratio minus one so that optimizing MAE or squared error implicitly minimizes SMAPE
tools
Convert point forecasts to prediction intervals by scaling with logit-transformed quantile ratios passed through a Normal CDF