bundled/skills/preprocessing-data-with-automated-pipelines/SKILL.md
Design and implement repeatable preprocessing pipelines for cleaning, encoding, transforming, and validating ML input data. In governed ML routing this skill is a stage assistant: it helps on preprocessing-heavy steps after the main route owner is chosen, and should not take over the whole ML workflow by itself.
npx skillsauth add foryourhealth111-pixel/vco-skills-codex preprocessing-data-with-automated-pipelinesInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
In governed ML routing, treat this skill as a stage assistant. It is for preprocessing-heavy execution after the pack owner is chosen.
Use this skill when:
scikit-learn, ml-pipeline-workflow, or training-machine-learning-modelsml-data-leakage-guardscientific-data-preprocessingml-data-leakage-guard before trusting fitted preprocessing stepssplitting-datasets when the next narrow problem is partition strategydevelopment
Model interpretability and explainability using SHAP (SHapley Additive exPlanations). Use this skill when explaining machine learning model predictions, computing feature importance, generating SHAP plots (waterfall, beeswarm, bar, scatter, force, heatmap), debugging models, analyzing model bias or fairness, comparing models, or implementing explainable AI. Works with tree-based models (XGBoost, LightGBM, Random Forest), deep learning (TensorFlow, PyTorch), linear models, and any black-box model.
development
Use when the user asks to inspect Sentry issues or events, summarize recent production errors, or pull basic Sentry health data via the Sentry API; perform read-only queries with the bundled script and require `SENTRY_AUTH_TOKEN`.
development
World-class prompt engineering skill for LLM optimization, prompt patterns, structured outputs, and AI product development. Expertise in Claude, GPT-4, prompt design patterns, few-shot learning, chain-of-thought, and AI evaluation. Includes RAG optimization, agent design, and LLM system architecture. Use when building AI products, optimizing LLM performance, designing agentic systems, or implementing advanced prompting techniques.
development
World-class ML engineering skill for productionizing ML models, MLOps, and building scalable ML systems. Expertise in PyTorch, TensorFlow, model deployment, feature stores, model monitoring, and ML infrastructure. Includes LLM integration, fine-tuning, RAG systems, and agentic AI. Use when deploying ML models, building ML platforms, implementing MLOps, or integrating LLMs into production systems.