skills/cv/pairwise-tracking-feature-merge/SKILL.md
Double left-join on tracking data to create pairwise features (positions, velocities, distance) for both entities in an interaction pair
npx skillsauth add wenmin-wu/ds-skills cv-pairwise-tracking-feature-mergeInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Interaction detection tasks (contact, collision, handoff) require features for both entities in each candidate pair. When tracking data has one row per entity per timestep, a double left-join merges entity 1's features first, then entity 2's features, producing a single row with both sets of kinematics. The resulting pairwise features (positions, speeds, accelerations, relative distance) serve as tabular input alongside visual features.
import numpy as np
import pandas as pd
def merge_pairwise_features(pairs_df, tracking_df, merge_col='step',
feature_cols=None):
"""Merge tracking features for both entities in each pair."""
if feature_cols is None:
feature_cols = ['x_position', 'y_position', 'speed', 'direction']
track = tracking_df[['game_play', merge_col, 'nfl_player_id'] + feature_cols]
df = (pairs_df
.merge(track, left_on=['game_play', merge_col, 'player_id_1'],
right_on=['game_play', merge_col, 'nfl_player_id'], how='left')
.rename(columns={c: f'{c}_1' for c in feature_cols})
.drop('nfl_player_id', axis=1)
.merge(track, left_on=['game_play', merge_col, 'player_id_2'],
right_on=['game_play', merge_col, 'nfl_player_id'], how='left')
.rename(columns={c: f'{c}_2' for c in feature_cols})
.drop('nfl_player_id', axis=1))
df['distance'] = np.sqrt(
(df['x_position_1'] - df['x_position_2'])**2
+ (df['y_position_1'] - df['y_position_2'])**2)
return df
df_pairs = merge_pairwise_features(contact_pairs, tracking, merge_col='step')
_1_2_1/_2 suffixesdata-ai
Scaled Pinball Loss (SPL) metric for evaluating quantile forecasts, normalized by mean absolute successive differences of training data
data-ai
Walk backward through a time series and multiplicatively rescale segments when jumps exceed a fraction of the running mean to correct data collection anomalies
testing
Transform forecasting target to next/current ratio minus one so that optimizing MAE or squared error implicitly minimizes SMAPE
tools
Convert point forecasts to prediction intervals by scaling with logit-transformed quantile ratios passed through a Normal CDF