Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

harsh040506/generative-models

Name: generative-models
Author: harsh040506

engineering/advanced-ml-engineering/skills/generative-models/SKILL.md

npx skillsauth add harsh040506/claude-code-unified-skill-plugin-library generative-models

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Generative Models — Diffusion, GANs, and Transformer LMs

Provides architecture patterns, loss function derivations, training stability techniques, and evaluation protocols for diffusion models, GANs, and large-scale language models.

Diffusion Models

Mathematical Foundation:

Forward process (noising): q(x_t | x_{t-1}) = N(x_t; √(1−β_t)·x_{t-1}, β_t·I)
Closed-form marginal: q(x_t | x_0) = N(x_t; √ᾱ_t·x_0, (1−ᾱ_t)·I)
Reverse process (denoising): learn p_θ(x_{t-1} | x_t) by predicting ε_θ(x_t, t) ≈ ε

Training Objective (simplified ELBO): L_simple = E_{t, x_0, ε}[||ε − ε_θ(√ᾱ_t·x_0 + √(1−ᾱ_t)·ε, t)||²]

Architecture: U-Net with:

Residual blocks at each resolution
Self-attention at low resolutions (32×32 and 16×16)
Sinusoidal time-step embedding conditioning (d = 256)
GroupNorm (32 groups) for training stability

Noise Schedules: linear (DDPM), cosine (improved DDPM), or sigmoid (more uniform SNR)

Sampling Acceleration: DDIM (50 steps), DPM-Solver (10–20 steps) vs. DDPM (1000 steps)

See references/diffusion-models.md for full derivation, implementation, and latent diffusion (LDM) patterns.

Generative Adversarial Networks

Minimax Objective: min_G max_D V(D,G) = E_{x~p_data}[log D(x)] + E_{z~p_z}[log(1 − D(G(z)))]

Critical Stabilization Techniques (apply all for stable training):

Spectral normalization on D: constrains Lipschitz constant, prevents discriminator from becoming too powerful
Label smoothing: real labels = 0.9 (not 1.0), fake labels = 0.1 (not 0.0)
WGAN-GP gradient penalty: λ·E[(||∇D(x̂)||₂ − 1)²], λ = 10, eliminates mode collapse risk
TTUR: use LR_D = 4e-4, LR_G = 1e-4 (different timescales stabilize training)
MiniBatch discrimination: prevents G from always generating the same output

Evaluation Metric: FID (Fréchet Inception Distance) — lower is better; FID < 10 is state-of-the-art for faces.

See references/gans.md for architecture variants (DCGAN → StyleGAN2 → ProjectedGAN) and CTGAN for tabular data.

Transformer Language Models

Core Architecture (decoder-only, GPT-style):

Token embedding (vocab_size → d_model) + positional encoding (RoPE preferred)
N × blocks: [LayerNorm → MHA → residual] → [LayerNorm → FFN(4×d) → residual]
Attention: Attention(Q,K,V) = softmax(QK^T / √d_k)·V
Output: linear projection to vocab_size → softmax → token probabilities

Scaling Guidance (Chinchilla-optimal, Hoffmann et al. 2022):

Optimal tokens T for N parameters: T_opt = 20·N
Equal scaling of model size and data is key — do not over- or under-train

Training Techniques for LLMs:

Mixed precision BF16 (A100+), FP16 + GradScaler (V100)
FlashAttention-2 for memory-efficient O(n) attention
Gradient accumulation to achieve large effective batch sizes without large VRAM
Learning rate: warmup to peak over 1% of steps, then cosine decay to 10% of peak

See references/transformer-lms.md for full implementation, tokenization, and RLHF fine-tuning patterns.

Quality Evaluation by Model Type

| Model Type | Primary Metric | Secondary Metrics | |---|---|---| | Diffusion (images) | FID (↓) | IS (↑), CLIP score, human preference | | GAN (images) | FID (↓) | Precision, Recall, coverage | | Language Model | Perplexity (↓) | BLEU, ROUGE, BERTScore, task benchmarks | | Conditional generation | Alignment score | Diversity, fidelity vs. reference |

harsh040506/generative-models

engineering/advanced-ml-engineering/skills/generative-models/SKILL.md

This skill should be used when the user asks about "generative models", "diffusion models", "DDPM", "DDIM", "stable diffusion", "GANs", "GAN training", "generator", "discriminator", "mode collapse", "FID score", "language models", "LLM", "GPT", "transformer", "text generation", "image generation", "ELBO", "score matching", "latent diffusion", "variational autoencoder", "VAE", "CLIP", "multimodal generation", or when building any system that generates novel data samples.

2 stars

tools

Updated Apr 5, 2026

$ install --global

skillsauth

npx skillsauth add harsh040506/claude-code-unified-skill-plugin-library generative-models

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 5, 2026, 5:10 PM5.1s4 files scanned

SKILL.md

name:: generative-models
description:: This skill should be used when the user asks about "generative models", "diffusion models", "DDPM", "DDIM", "stable diffusion", "GANs", "GAN training", "generator", "discriminator", "mode collapse", "FID score", "language models", "LLM", "GPT", "transformer", "text generation", "image generation", "ELBO", "score matching", "latent diffusion", "variational autoencoder", "VAE", "CLIP", "multimodal generation", or when building any system that generates novel data samples.
version:: 1.0.0

Generative Models — Diffusion, GANs, and Transformer LMs

Provides architecture patterns, loss function derivations, training stability techniques, and evaluation protocols for diffusion models, GANs, and large-scale language models.

Diffusion Models

Mathematical Foundation:

Forward process (noising): q(x_t | x_{t-1}) = N(x_t; √(1−β_t)·x_{t-1}, β_t·I)
Closed-form marginal: q(x_t | x_0) = N(x_t; √ᾱ_t·x_0, (1−ᾱ_t)·I)
Reverse process (denoising): learn p_θ(x_{t-1} | x_t) by predicting ε_θ(x_t, t) ≈ ε

Training Objective (simplified ELBO): L_simple = E_{t, x_0, ε}[||ε − ε_θ(√ᾱ_t·x_0 + √(1−ᾱ_t)·ε, t)||²]

Architecture: U-Net with:

Residual blocks at each resolution
Self-attention at low resolutions (32×32 and 16×16)
Sinusoidal time-step embedding conditioning (d = 256)
GroupNorm (32 groups) for training stability

Noise Schedules: linear (DDPM), cosine (improved DDPM), or sigmoid (more uniform SNR)

Sampling Acceleration: DDIM (50 steps), DPM-Solver (10–20 steps) vs. DDPM (1000 steps)

See references/diffusion-models.md for full derivation, implementation, and latent diffusion (LDM) patterns.

Generative Adversarial Networks

Minimax Objective: min_G max_D V(D,G) = E_{x~p_data}[log D(x)] + E_{z~p_z}[log(1 − D(G(z)))]

Critical Stabilization Techniques (apply all for stable training):

Spectral normalization on D: constrains Lipschitz constant, prevents discriminator from becoming too powerful
Label smoothing: real labels = 0.9 (not 1.0), fake labels = 0.1 (not 0.0)
WGAN-GP gradient penalty: λ·E[(||∇D(x̂)||₂ − 1)²], λ = 10, eliminates mode collapse risk
TTUR: use LR_D = 4e-4, LR_G = 1e-4 (different timescales stabilize training)
MiniBatch discrimination: prevents G from always generating the same output

Evaluation Metric: FID (Fréchet Inception Distance) — lower is better; FID < 10 is state-of-the-art for faces.

See references/gans.md for architecture variants (DCGAN → StyleGAN2 → ProjectedGAN) and CTGAN for tabular data.

Transformer Language Models

Core Architecture (decoder-only, GPT-style):

Token embedding (vocab_size → d_model) + positional encoding (RoPE preferred)
N × blocks: [LayerNorm → MHA → residual] → [LayerNorm → FFN(4×d) → residual]
Attention: Attention(Q,K,V) = softmax(QK^T / √d_k)·V
Output: linear projection to vocab_size → softmax → token probabilities

Scaling Guidance (Chinchilla-optimal, Hoffmann et al. 2022):

Optimal tokens T for N parameters: T_opt = 20·N
Equal scaling of model size and data is key — do not over- or under-train

Training Techniques for LLMs:

Mixed precision BF16 (A100+), FP16 + GradScaler (V100)
FlashAttention-2 for memory-efficient O(n) attention
Gradient accumulation to achieve large effective batch sizes without large VRAM
Learning rate: warmup to peak over 1% of steps, then cosine decay to 10% of peak

See references/transformer-lms.md for full implementation, tokenization, and RLHF fine-tuning patterns.

Quality Evaluation by Model Type

Related Skills

harsh040506/single-cell-rna-qc

testing

VerifiedTrustedCommunity

Performs quality control on single-cell RNA-seq data (.h5ad or .h5 files) using scverse best practices with MAD-based filtering and comprehensive visualizations. Use when users request QC analysis, filtering low-quality cells, assessing data quality, or following scverse/scanpy best practices for single-cell analysis.

2SKILL.mdUpdated Apr 5, 2026

harsh040506/single-cell-rna-qc

harsh040506/scvi-tools

tools

VerifiedTrustedCommunity

Deep learning for single-cell analysis using scvi-tools. This skill should be used when users need (1) data integration and batch correction with scVI/scANVI, (2) ATAC-seq analysis with PeakVI, (3) CITE-seq multi-modal analysis with totalVI, (4) multiome RNA+ATAC analysis with MultiVI, (5) spatial transcriptomics deconvolution with DestVI, (6) label transfer and reference mapping with scANVI/scArches, (7) RNA velocity with veloVI, or (8) any deep learning-based single-cell method. Triggers include mentions of scVI, scANVI, totalVI, PeakVI, MultiVI, DestVI, veloVI, sysVI, scArches, variational autoencoder, VAE, batch correction, data integration, multi-modal, CITE-seq, multiome, reference mapping, latent space.

2SKILL.mdUpdated Apr 5, 2026

harsh040506/scvi-tools

harsh040506/scientific-problem-selection

testing

VerifiedTrustedCommunity

This skill should be used when scientists need help with research problem selection, project ideation, troubleshooting stuck projects, or strategic scientific decisions. Use this skill when users ask to pitch a new research idea, work through a project problem, evaluate project risks, plan research strategy, navigate decision trees, or get help choosing what scientific problem to work on. Typical requests include "I have an idea for a project", "I'm stuck on my research", "help me evaluate this project", "what should I work on", or "I need strategic advice about my research".

2SKILL.mdUpdated Apr 5, 2026

harsh040506/scientific-problem-selection

harsh040506/nextflow-development

development

VerifiedTrustedCommunity

Run nf-core bioinformatics pipelines (rnaseq, sarek, atacseq) on sequencing data. Use when analyzing RNA-seq, WGS/WES, or ATAC-seq data—either local FASTQs or public datasets from GEO/SRA. Triggers on nf-core, Nextflow, FASTQ analysis, variant calling, gene expression, differential expression, GEO reanalysis, GSE/GSM/SRR accessions, or samplesheet creation.

2SKILL.mdUpdated Apr 5, 2026

harsh040506/nextflow-development

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/harsh040506/claude-code-unified-skill-plugin-library.git

# Copy into Claude Code skills folder (global)
cp -r claude-code-unified-skill-plugin-library/engineering/advanced-ml-engineering/skills/generative-models ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

harsh040506/claude-code-unified-skill-plugin-library

2 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT