skills/nlp/short-to-long-span-promotion/SKILL.md
Promotes a predicted short answer span to its enclosing long-answer candidate by matching token boundaries against pre-extracted document structure.
npx skillsauth add wenmin-wu/ds-skills nlp-short-to-long-span-promotionInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
In Natural Questions-style QA, models predict both a short answer (exact span) and a long answer (enclosing paragraph/table). Rather than predicting long answers independently, promote the short answer to its enclosing top-level candidate. This leverages the model's more precise short-answer prediction while deriving the long answer from document structure — consistently more accurate than direct long-answer prediction.
def promote_to_long_answer(short_start, short_end, candidates):
"""Find the smallest top-level candidate enclosing the short span.
Args:
short_start: start token index of short answer
short_end: end token index of short answer
candidates: list of dicts with start_token, end_token, top_level
Returns:
(long_start, long_end) or None if no enclosing candidate
"""
best = None
best_len = float("inf")
for c in candidates:
if not c.get("top_level", False):
continue
if c["start_token"] <= short_start and c["end_token"] >= short_end:
span_len = c["end_token"] - c["start_token"]
if span_len < best_len:
best = (c["start_token"], c["end_token"])
best_len = span_len
return best
# Example usage
short_span = (142, 158) # predicted short answer
long_span = promote_to_long_answer(short_span[0], short_span[1], doc_candidates)
data-ai
Scaled Pinball Loss (SPL) metric for evaluating quantile forecasts, normalized by mean absolute successive differences of training data
data-ai
Walk backward through a time series and multiplicatively rescale segments when jumps exceed a fraction of the running mean to correct data collection anomalies
testing
Transform forecasting target to next/current ratio minus one so that optimizing MAE or squared error implicitly minimizes SMAPE
tools
Convert point forecasts to prediction intervals by scaling with logit-transformed quantile ratios passed through a Normal CDF