src/datapro/data/skills/machine-learning-lite/SKILL.md
Tactical and highly interpretable Machine Learning. Use for: (1) Extracting Feature Importance via Random Forest, (2) Running Permutation Tests, (3) Handling Imbalanced Data (SMOTE).
npx skillsauth add pablodiegoo/data-pro-skill machine-learning-liteInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
This skill restricts the agent to using simple, highly interpretable Machine Learning methods focused on inference rather than production deployment. Deep learning or black-box predictions are strictly out of scope.
permutation_feature_importance.py: Calculates the true model importance of variables by randomly shuffling them.permutation_test_utilities.py: Rigorous statistical testing without assuming underlying data distributions.references/imbalanced_data_strategies.md.testing
Comprehensive time-series validation and analysis suite. Handles backtesting of trading and non-trading strategies with support for walk-forward validation (training vs testing windows), performance metric calculation (Sharpe, Drawdown, Win Rate), and event-driven resource allocation simulation. Use for: (1) Validating sequential logic on time-series data, (2) Calculating risk-adjusted performance, (3) Simulating constraints in resource distribution, (4) Detecting look-ahead bias through walk-forward testing.
tools
Core statistical analysis and pipeline automation for survey datasets. Use for: (1) Running standard Crosstabs, NPS, Top-Box calculations, (2) Generating complete EDA or Analytics notebooks, (3) Quantitative and qualitative processing of questionnaire data.
development
Business-level frameworks and actionable reporting for executives. Use for: (1) Plotting Priority Matrices, (2) Generating Pain Curves, (3) Conversion Funnels, (4) Removing Halo Effects to uncover true sentiment.
testing
Normalization and state-mapping of municipal data and generation of professional choropleth maps of Brazil (UF/State level). Use for: (1) Detecting city/state strings and normalizing names, (2) Attaching regional metadata, (3) Generating professional maps integrating survey data with shapefiles.