Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

harsh040506/ml-theory

Name: ml-theory
Author: harsh040506

engineering/advanced-ml-engineering/skills/ml-theory/SKILL.md

npx skillsauth add harsh040506/claude-code-unified-skill-plugin-library ml-theory

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

ML Theory — Mathematical Foundations

Provides the rigorous mathematical theory underlying all machine learning components in this plugin. Covers loss function derivation, optimization dynamics, and statistical learning guarantees that inform architectural and training decisions.

Core Domains

Loss Function Taxonomy

Loss functions define what a model is optimizing. The choice of loss encodes all assumptions about the output distribution:

Regression: MSE (assumes Gaussian noise), MAE (assumes Laplacian noise), Huber (robust to outliers)
Classification: Cross-Entropy (maximizes log-likelihood of correct class), Focal Loss (down-weights easy examples)
Generation: ELBO (variational lower bound on log p(x)), Score Matching (denoising objective)
Ranking: Pairwise hinge loss, ListNet, LambdaRank

See references/loss-functions.md for complete derivations and PyTorch implementations.

Optimization Theory

All training is an instance of empirical risk minimization: find θ* = argmin_θ (1/n) Σ L(f_θ(xᵢ), yᵢ).

Key optimization concepts:

Gradient Descent: θ ← θ − η·∇_θ L
SGD with Momentum: combines gradient history to dampen oscillations
Adam: adaptive per-parameter learning rates using first and second moment estimates
Learning Rate Schedules: warmup + cosine annealing prevents loss spikes and aids convergence

See references/optimization-theory.md for convergence proofs, saddle point analysis, and loss landscape geometry.

Statistical Learning Theory

Theory that bounds generalization error — the gap between training loss and test loss:

Bias-Variance Decomposition: E[MSE] = Bias² + Variance + Irreducible Noise
VC Dimension: upper bounds on hypothesis class complexity
PAC Learning: conditions under which learnability is guaranteed
Double Descent: phenomenon where test error can decrease again in the overparameterized regime

See references/statistical-learning.md for formal bounds, regularization theory, and empirical verification methods.

When to Apply Theory

Apply ML theory concepts when:

Choosing a loss function: match the loss to the output distribution assumption
Diagnosing training failure: use theory to distinguish underfitting vs. overfitting vs. optimization failure
Justifying model complexity: use VC dimension / scaling laws to reason about needed parameter count
Setting regularization strength: use bias-variance analysis to choose λ in L1/L2 regularization

Notation Reference

| Symbol | Meaning | |---|---| | θ | Model parameters | | η | Learning rate | | L(·) | Loss function | | f_θ(x) | Model prediction | | μ, σ | Mean, standard deviation | | ∇_θ | Gradient with respect to θ | | ᾱ_t | Cumulative noise schedule product (diffusion) | | γ | Discount factor (RL) |

harsh040506/ml-theory

engineering/advanced-ml-engineering/skills/ml-theory/SKILL.md

This skill should be used when the user asks about "loss functions", "gradient descent", "backpropagation", "vanishing gradients", "regularization", "bias-variance tradeoff", "statistical learning theory", "PAC learning", "VC dimension", "overfitting", "underfitting", "cross-entropy loss", "KL divergence", "ELBO", "maximum likelihood estimation", "Bayesian inference", or any foundational mathematical concepts underlying machine learning models and optimization.

2 stars

data-ai

Updated Apr 5, 2026

$ install --global

skillsauth

npx skillsauth add harsh040506/claude-code-unified-skill-plugin-library ml-theory

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 5, 2026, 5:10 PM4.4s4 files scanned

SKILL.md

name:: ml-theory
description:: This skill should be used when the user asks about "loss functions", "gradient descent", "backpropagation", "vanishing gradients", "regularization", "bias-variance tradeoff", "statistical learning theory", "PAC learning", "VC dimension", "overfitting", "underfitting", "cross-entropy loss", "KL divergence", "ELBO", "maximum likelihood estimation", "Bayesian inference", or any foundational mathematical concepts underlying machine learning models and optimization.
version:: 1.0.0

ML Theory — Mathematical Foundations

Core Domains

Loss Function Taxonomy

Loss functions define what a model is optimizing. The choice of loss encodes all assumptions about the output distribution:

Regression: MSE (assumes Gaussian noise), MAE (assumes Laplacian noise), Huber (robust to outliers)
Classification: Cross-Entropy (maximizes log-likelihood of correct class), Focal Loss (down-weights easy examples)
Generation: ELBO (variational lower bound on log p(x)), Score Matching (denoising objective)
Ranking: Pairwise hinge loss, ListNet, LambdaRank

See references/loss-functions.md for complete derivations and PyTorch implementations.

Optimization Theory

All training is an instance of empirical risk minimization: find θ* = argmin_θ (1/n) Σ L(f_θ(xᵢ), yᵢ).

Key optimization concepts:

Gradient Descent: θ ← θ − η·∇_θ L
SGD with Momentum: combines gradient history to dampen oscillations
Adam: adaptive per-parameter learning rates using first and second moment estimates
Learning Rate Schedules: warmup + cosine annealing prevents loss spikes and aids convergence

See references/optimization-theory.md for convergence proofs, saddle point analysis, and loss landscape geometry.

Statistical Learning Theory

Theory that bounds generalization error — the gap between training loss and test loss:

Bias-Variance Decomposition: E[MSE] = Bias² + Variance + Irreducible Noise
VC Dimension: upper bounds on hypothesis class complexity
PAC Learning: conditions under which learnability is guaranteed
Double Descent: phenomenon where test error can decrease again in the overparameterized regime

See references/statistical-learning.md for formal bounds, regularization theory, and empirical verification methods.

When to Apply Theory

Apply ML theory concepts when:

Choosing a loss function: match the loss to the output distribution assumption
Diagnosing training failure: use theory to distinguish underfitting vs. overfitting vs. optimization failure
Justifying model complexity: use VC dimension / scaling laws to reason about needed parameter count
Setting regularization strength: use bias-variance analysis to choose λ in L1/L2 regularization

Notation Reference

Related Skills

harsh040506/single-cell-rna-qc

testing

VerifiedTrustedCommunity

Performs quality control on single-cell RNA-seq data (.h5ad or .h5 files) using scverse best practices with MAD-based filtering and comprehensive visualizations. Use when users request QC analysis, filtering low-quality cells, assessing data quality, or following scverse/scanpy best practices for single-cell analysis.

2SKILL.mdUpdated Apr 5, 2026

harsh040506/single-cell-rna-qc

harsh040506/scvi-tools

tools

VerifiedTrustedCommunity

Deep learning for single-cell analysis using scvi-tools. This skill should be used when users need (1) data integration and batch correction with scVI/scANVI, (2) ATAC-seq analysis with PeakVI, (3) CITE-seq multi-modal analysis with totalVI, (4) multiome RNA+ATAC analysis with MultiVI, (5) spatial transcriptomics deconvolution with DestVI, (6) label transfer and reference mapping with scANVI/scArches, (7) RNA velocity with veloVI, or (8) any deep learning-based single-cell method. Triggers include mentions of scVI, scANVI, totalVI, PeakVI, MultiVI, DestVI, veloVI, sysVI, scArches, variational autoencoder, VAE, batch correction, data integration, multi-modal, CITE-seq, multiome, reference mapping, latent space.

2SKILL.mdUpdated Apr 5, 2026

harsh040506/scvi-tools

harsh040506/scientific-problem-selection

testing

VerifiedTrustedCommunity

This skill should be used when scientists need help with research problem selection, project ideation, troubleshooting stuck projects, or strategic scientific decisions. Use this skill when users ask to pitch a new research idea, work through a project problem, evaluate project risks, plan research strategy, navigate decision trees, or get help choosing what scientific problem to work on. Typical requests include "I have an idea for a project", "I'm stuck on my research", "help me evaluate this project", "what should I work on", or "I need strategic advice about my research".

2SKILL.mdUpdated Apr 5, 2026

harsh040506/scientific-problem-selection

harsh040506/nextflow-development

development

VerifiedTrustedCommunity

Run nf-core bioinformatics pipelines (rnaseq, sarek, atacseq) on sequencing data. Use when analyzing RNA-seq, WGS/WES, or ATAC-seq data—either local FASTQs or public datasets from GEO/SRA. Triggers on nf-core, Nextflow, FASTQ analysis, variant calling, gene expression, differential expression, GEO reanalysis, GSE/GSM/SRR accessions, or samplesheet creation.

2SKILL.mdUpdated Apr 5, 2026

harsh040506/nextflow-development

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/harsh040506/claude-code-unified-skill-plugin-library.git

# Copy into Claude Code skills folder (global)
cp -r claude-code-unified-skill-plugin-library/engineering/advanced-ml-engineering/skills/ml-theory ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

harsh040506/claude-code-unified-skill-plugin-library

2 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT