Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

foryourhealth111-pixel/preprocessing-data-with-automated-pipelines

Name: preprocessing-data-with-automated-pipelines
Author: foryourhealth111-pixel

bundled/skills/preprocessing-data-with-automated-pipelines/SKILL.md

npx skillsauth add foryourhealth111-pixel/vco-skills-codex preprocessing-data-with-automated-pipelines

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Data Preprocessing Pipeline

Positioning

Use this skill as the direct owner for ML input-preparation pipelines.

It covers preprocessing-heavy tasks where the requested deliverable is a repeatable pipeline for cleaning, encoding, transforming, and validating input data.

When to Use

Use this skill when:

Prepare raw data for machine learning models.
Automate data cleaning and transformation processes.
Implement a robust ETL (Extract, Transform, Load) pipeline.

Not For / Boundaries

Whole-task ML ownership: use scikit-learn or ml-pipeline-workflow
Leakage and prediction-time auditing: use ml-data-leakage-guard
Grouped scientific preprocessing with stronger methodological constraints: use scientific-data-preprocessing

Typical Outputs

A preprocessing pipeline plan or implementation sketch
Clear sequencing for clean, encode, transform, and validate steps
Notes that identify where leakage review, training, or evaluation should be run next

Related Skills

ml-data-leakage-guard before trusting fitted preprocessing steps
splitting-datasets when the next narrow problem is partition strategy

foryourhealth111-pixel/preprocessing-data-with-automated-pipelines

bundled/skills/preprocessing-data-with-automated-pipelines/SKILL.md

Design and implement repeatable preprocessing pipelines for cleaning, encoding, transforming, and validating ML input data.

2,393 stars

devops

Updated Jul 19, 2026

$ install --global

skillsauth

npx skillsauth add foryourhealth111-pixel/vco-skills-codex preprocessing-data-with-automated-pipelines

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Jul 19, 2026, 3:23 AM126.6s8 files scanned

SKILL.md

name:: preprocessing-data-with-automated-pipelines
description:: |
allowed-tools:: Read, Write, Edit, Grep, Glob, Bash(cmd:*)
version:: 1.0.0
author:: Jeremy Longshore <[email protected]>
license:: MIT

Data Preprocessing Pipeline

Positioning

Use this skill as the direct owner for ML input-preparation pipelines.

It covers preprocessing-heavy tasks where the requested deliverable is a repeatable pipeline for cleaning, encoding, transforming, and validating input data.

When to Use

Use this skill when:

Prepare raw data for machine learning models.
Automate data cleaning and transformation processes.
Implement a robust ETL (Extract, Transform, Load) pipeline.

Not For / Boundaries

Whole-task ML ownership: use scikit-learn or ml-pipeline-workflow
Leakage and prediction-time auditing: use ml-data-leakage-guard
Grouped scientific preprocessing with stronger methodological constraints: use scientific-data-preprocessing

Typical Outputs

A preprocessing pipeline plan or implementation sketch
Clear sequencing for clean, encode, transform, and validate steps
Notes that identify where leakage review, training, or evaluation should be run next

Related Skills

ml-data-leakage-guard before trusting fitted preprocessing steps
splitting-datasets when the next narrow problem is partition strategy

Related Skills

foryourhealth111-pixel/zarr-python

development

VerifiedTrustedCommunity

Chunked N-D arrays for cloud storage. Compressed arrays, parallel I/O, S3/GCS integration, NumPy/Dask/Xarray compatible, for large-scale scientific computing pipelines.

2,438SKILL.mdUpdated Jul 23, 2026

foryourhealth111-pixel/zarr-python

foryourhealth111-pixel/yeet

tools

VerifiedTrustedCommunity

Use only when the user explicitly asks to stage, commit, push, and open a GitHub pull request in one flow using the GitHub CLI (`gh`).

2,438SKILL.mdUpdated Jul 23, 2026

foryourhealth111-pixel/yeet

foryourhealth111-pixel/xlsx

tools

VerifiedTrustedCommunity

Spreadsheet toolkit (.xlsx/.csv). Create/edit with formulas/formatting, analyze data, visualization, recalculate formulas, for spreadsheet processing and analysis.

2,438SKILL.mdUpdated Jul 23, 2026

foryourhealth111-pixel/xlsx

foryourhealth111-pixel/xan

tools

VerifiedTrustedCommunity

High-performance CSV processing with xan CLI for large tabular datasets, streaming transformations, and low-memory pipelines.

2,438SKILL.mdUpdated Jul 23, 2026

foryourhealth111-pixel/xan

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/foryourhealth111-pixel/vco-skills-codex.git

# Copy into Claude Code skills folder (global)
cp -r vco-skills-codex/bundled/skills/preprocessing-data-with-automated-pipelines ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

foryourhealth111-pixel/vco-skills-codex

2,393 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT