
harsh040506/mlops-pipeline

Name: mlops-pipeline
Author: harsh040506

engineering/advanced-ml-engineering/skills/mlops-pipeline/SKILL.md

npx skillsauth add harsh040506/claude-code-unified-skill-plugin-library mlops-pipeline


MLOps Pipeline — End-to-End ML Lifecycle Management

Provides systematic guidance for operationalizing machine learning models: from data ingestion and feature engineering through experiment tracking, deployment, and production monitoring. Directly addresses the problem described in "Hidden Technical Debt in Machine Learning Systems" (Sculley et al.) by making the surrounding infrastructure as rigorous as the model itself.

The MLOps Stack

Data Sources → Feature Store → Training Pipeline → Model Registry → Serving Infrastructure → Monitoring

Each stage must be reproducible, versioned, and auditable.

Data Ingestion and Quality

Schema inference is the first gate:

Detect column types automatically: numeric, categorical, datetime, text, target
Compute: null rates, cardinality, value distributions, duplicate rows
Block pipeline on: > 20% missing values in target, > 5% duplicate rows, or schema drift from the expected schema
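The blocking rules above can be sketched as a small gate function. This is a minimal illustration using only the standard library, not the skill's own implementation: the function name `quality_gate`, the list-of-dicts row format, and the use of `None` as the null marker are all assumptions made for the example.

```python
from typing import Any


def quality_gate(rows: list[dict[str, Any]], target: str,
                 expected_columns: set[str]) -> list[str]:
    """Return blocking violations; an empty list means the gate passes.

    Hypothetical helper: thresholds mirror the 20% / 5% rules in the text.
    """
    if not rows:
        return ["dataset is empty"]
    violations = []
    n = len(rows)

    # Schema drift: any row whose column set differs from the expected one.
    if any(set(row) != expected_columns for row in rows):
        violations.append("schema drift from expected columns")

    # Missing target rate (None stands in for a null value in this sketch).
    missing = sum(1 for row in rows if row.get(target) is None)
    if missing / n > 0.20:
        violations.append(f"target missing rate {missing / n:.1%} exceeds 20%")

    # Duplicate rows: every repeat of an already-seen row counts once.
    seen = set()
    duplicates = 0
    for row in rows:
        key = tuple(sorted((k, repr(v)) for k, v in row.items()))
        if key in seen:
            duplicates += 1
        else:
            seen.add(key)
    if duplicates / n > 0.05:
        violations.append(f"duplicate row rate {duplicates / n:.1%} exceeds 5%")

    return violations
```

A pipeline step would typically stop when the returned list is non-empty and surface the messages in its logs.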

Outlier detection (run before any feature engineering):

Numeric: IQR method (flag values below Q1 − 1.5×IQR or above Q3 + 1.5×IQR as outliers, and values more than 3×IQR beyond the quartiles as extreme outliers)
Categorical: flag categories with < 0.1% frequency (likely data entry errors)
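As a rough sketch of both checks, the following uses the standard library only. The helper names `iqr_flags` and `rare_categories` are hypothetical, and the example reads the 1.5×IQR and 3×IQR rules as distances beyond the first and third quartiles, which is an interpretation of the text above rather than something it states explicitly.

```python
import statistics
from collections import Counter


def iqr_flags(values: list[float]) -> list[str]:
    """Label each value 'ok', 'outlier', or 'extreme' using IQR fences."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    labels = []
    for v in values:
        # Distance outside the [q1, q3] box; 0 when the value lies inside it.
        distance = max(q1 - v, v - q3, 0.0)
        if distance > 3 * iqr:
            labels.append("extreme")
        elif distance > 1.5 * iqr:
            labels.append("outlier")
        else:
            labels.append("ok")
    return labels


def rare_categories(values: list[str], min_frequency: float = 0.001) -> set[str]:
    """Return categories whose relative frequency is below min_frequency (0.1%)."""
    counts = Counter(values)
    n = len(values)
    return {cat for cat, k in counts.items() if k / n < min_frequency}
```

Note that `statistics.quantiles` defaults to the "exclusive" method, so quartiles may differ slightly from those reported by other libraries on small samples.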

See references/data-engineering.md for full data quality checks, validation schemas, and data contract patterns.

Feature Engineering

Normalization (always record the fitted transformation parameters so that inference applies exactly the same transform as training):

Standard scaling: x' = (x − μ) / σ — save μ, σ from training set; apply at inference
Min-max: x' = (x − x_min) / (x_max − x_min), with x_min and x_max saved from the training set
Log1p: x' = log(1 + x), for right-skewed non-negative features (income, counts, prices)
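To illustrate why the parameters are saved, here is a minimal standard-library sketch of fitting a standard scaler on training data, persisting μ and σ as JSON, and reusing them at inference. The function names and the JSON layout are assumptions for this example; the population standard deviation is used, which is also an assumption about the intended convention.

```python
import json
import statistics


def fit_standard_scaler(train: list[float]) -> dict[str, float]:
    """Compute mu and sigma on the training set only."""
    return {"mu": statistics.fmean(train), "sigma": statistics.pstdev(train)}


def apply_standard_scaler(x: float, params: dict[str, float]) -> float:
    """Apply x' = (x - mu) / sigma using previously saved parameters."""
    sigma = params["sigma"] or 1.0  # guard against a constant feature
    return (x - params["mu"]) / sigma


def dump_params(params: dict[str, float]) -> str:
    """Serialize parameters so the serving code can load the same values."""
    return json.dumps(params)


def load_params(payload: str) -> dict[str, float]:
    return json.loads(payload)
```

At inference time the service would call `load_params` on the stored payload and never recompute μ or σ from live traffic.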

Encoding:

Low-cardinality categorical (< 30): one-hot encoding
High-cardinality categorical (30–1000): target encoding (mean of target per category)
Very high cardinality (> 1000): entity embeddings (learned dense representations)
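The cardinality thresholds and the target-encoding rule can be sketched as follows. The names `choose_encoding`, `fit_target_encoding`, and `apply_target_encoding` are hypothetical, and falling back to the global target mean for unseen categories is an assumption not stated in the text. In practice the mapping should be fitted on training data only, since computing it on rows that are later scored leaks the target.

```python
from collections import defaultdict


def choose_encoding(cardinality: int) -> str:
    """Pick an encoding strategy from the thresholds listed above."""
    if cardinality < 30:
        return "one-hot"
    if cardinality <= 1000:
        return "target"
    return "embedding"


def fit_target_encoding(categories: list[str],
                        targets: list[float]) -> tuple[dict[str, float], float]:
    """Return (per-category target mean, global target mean)."""
    sums: dict[str, float] = defaultdict(float)
    counts: dict[str, int] = defaultdict(int)
    for cat, y in zip(categories, targets):
        sums[cat] += y
        counts[cat] += 1
    mapping = {cat: sums[cat] / counts[cat] for cat in sums}
    global_mean = sum(targets) / len(targets)
    return mapping, global_mean


def apply_target_encoding(category: str, mapping: dict[str, float],
                          global_mean: float) -> float:
    """Encode one value; unseen categories fall back to the global mean (assumption)."""
    return mapping.get(category, global_mean)
```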

Temporal features: extract as cyclical sin/cos pairs to preserve periodicity:

hour_sin = sin(2π · hour / 24), hour_cos = cos(2π · hour / 24)
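A short sketch of the formula above, generalized to any period (24 for hours, 7 for weekdays, 12 for months). The function name `cyclical_encode` is an assumption for illustration.

```python
import math


def cyclical_encode(value: float, period: float) -> tuple[float, float]:
    """Map a periodic value to a (sin, cos) pair on the unit circle."""
    angle = 2 * math.pi * value / period
    return math.sin(angle), math.cos(angle)
```

With this encoding, hour 23 and hour 0 end up as close to each other as hour 0 and hour 1, which a raw integer hour feature would not capture.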

See references/feature-stores.md for feature store architecture, point-in-time correct joins, and online/offline serving patterns.

Experiment Tracking

Every training run must record:

| Artifact | Purpose |
|---|---|
| Git commit SHA | Code reproducibility |
| Dataset hash (MD5/SHA256) | Data reproducibility |
| Full hyperparameter config | Experiment reproducibility |
| Random seed | Run reproducibility |
| Environment (Python + library versions) | Dependency reproducibility |
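The artifacts listed above can be gathered into a single record before handing them to a tracker. The sketch below uses only the standard library; `build_run_record` and its field names are hypothetical, and the Git lookup simply returns `None` when the code is not run inside a repository.

```python
import hashlib
import platform
import subprocess
from importlib import metadata
from typing import Any, Optional


def file_sha256(path: str, chunk_size: int = 1 << 20) -> str:
    """Hash a dataset file in chunks so large files do not fill memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def git_commit_sha() -> Optional[str]:
    """Return the current commit SHA, or None if git is unavailable."""
    try:
        result = subprocess.run(["git", "rev-parse", "HEAD"],
                                capture_output=True, text=True, check=True)
        return result.stdout.strip()
    except (OSError, subprocess.CalledProcessError):
        return None


def build_run_record(dataset_path: str, hyperparams: dict[str, Any],
                     seed: int, packages: tuple[str, ...] = ()) -> dict[str, Any]:
    """Collect the reproducibility artifacts for one training run."""
    versions: dict[str, Optional[str]] = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None  # package not installed in this environment
    return {
        "git_sha": git_commit_sha(),
        "dataset_sha256": file_sha256(dataset_path),
        "hyperparameters": hyperparams,
        "seed": seed,
        "python": platform.python_version(),
        "packages": versions,
    }
```

The resulting dictionary is JSON-serializable, so it could be stored as a run artifact in whichever tracking tool is in use.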

Use MLflow or Weights & Biases for automatic artifact logging.

Model Deployment Patterns

Dev: Local FastAPI endpoint for integration testing
Staging: Docker container → Kubernetes (namespace: staging) + smoke tests
Production: Blue-green or canary deployment (see self-healing-models skill)

Model serialization formats:

model.pt — PyTorch, for fine-tuning and retraining
model.onnx — runtime-agnostic, for cross-platform serving
model.pkl — Scikit-learn pipeline including preprocessing steps

Always include the preprocessing pipeline in the serialized model artifact to prevent training-serving skew.

See references/monitoring.md for production monitoring setup, alerting rules, and dashboard templates.

Quality Gates

Before promoting any model to production:

Primary metric exceeds configured threshold
Fairness checks pass (if sensitive attributes present)
Latency P99 < configured SLA (e.g., 50ms)
No statistically significant regression vs. current champion
Model card completed and reviewed
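One way to make these gates executable is a function that returns the list of reasons a candidate cannot be promoted. This is a sketch under assumptions: the name `promotion_blockers` and its parameters are hypothetical, a higher primary metric is assumed to be better, and the statistical comparison against the champion is assumed to be computed elsewhere and passed in as a boolean.

```python
from typing import Optional


def promotion_blockers(*, primary_metric: float, metric_threshold: float,
                       latency_p99_ms: float, latency_sla_ms: float,
                       significant_regression: bool,
                       model_card_reviewed: bool,
                       fairness_passed: Optional[bool] = None) -> list[str]:
    """Return the failed gates; an empty list means the model may be promoted.

    fairness_passed is None when no sensitive attributes are present.
    """
    blockers = []
    if not primary_metric > metric_threshold:
        blockers.append("primary metric does not exceed threshold")
    if fairness_passed is False:
        blockers.append("fairness checks failed")
    if not latency_p99_ms < latency_sla_ms:
        blockers.append("P99 latency is not below the SLA")
    if significant_regression:
        blockers.append("statistically significant regression vs. champion")
    if not model_card_reviewed:
        blockers.append("model card not completed and reviewed")
    return blockers
```

Returning every failed gate, rather than stopping at the first, gives reviewers the full picture in a single CI run.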


This skill should be used when the user asks about "MLOps", "ML pipeline", "data pipeline", "feature engineering", "feature store", "data preprocessing", "model deployment", "model serving", "model registry", "experiment tracking", "MLflow", "Weights and Biases", "model versioning", "CI/CD for ML", "model monitoring", "data quality", "schema validation", "reproducibility", "technical debt in ML", or when operationalizing a machine learning model for production.

2 stars

testing

Updated Apr 5, 2026

Install this skill globally with one command (works with Claude Code, Cursor, and Windsurf):

npx skillsauth add harsh040506/claude-code-unified-skill-plugin-library mlops-pipeline

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

| Scanner | Description | Result |
|---|---|---|
| Trivy | Container and dependency vulnerability scanner | Clean, 95% |
| Semgrep | Static code analysis for vulnerabilities | Clean, 95% |
| mcp-scan (Snyk) | Model Context Protocol security validation | Clean, 95% |
| Snyk (dep) | Open source security scanning | Skipped, 50% |
| Socket.dev | Supply chain security analysis | Skipped, 50% |
| VirusTotal | Multi-engine malware detection | Skipped, 50% |
| CrowdStrike | Advanced threat intelligence | Skipped, 50% |
| OSV-Scanner | Open Source Vulnerability database check | Skipped, 50% |
| OWASP Dep-Check | | Skipped, 50% |

Last scanned: Apr 5, 2026, 5:10 PM; 4.3s; 4 files scanned

SKILL.md

name: mlops-pipeline
description: This skill should be used when the user asks about "MLOps", "ML pipeline", "data pipeline", "feature engineering", "feature store", "data preprocessing", "model deployment", "model serving", "model registry", "experiment tracking", "MLflow", "Weights and Biases", "model versioning", "CI/CD for ML", "model monitoring", "data quality", "schema validation", "reproducibility", "technical debt in ML", or when operationalizing a machine learning model for production.
version: 1.0.0




Install manually


# Clone the repo
git clone https://github.com/harsh040506/claude-code-unified-skill-plugin-library.git

# Copy into Claude Code skills folder (global), creating the folder if it does not exist
mkdir -p ~/.claude/skills
cp -r claude-code-unified-skill-plugin-library/engineering/advanced-ml-engineering/skills/mlops-pipeline ~/.claude/skills/


Repository: harsh040506/claude-code-unified-skill-plugin-library (2 stars)

Compatible with: Claude Code, OpenAI Codex CLI, ChatGPT