engineering/advanced-ml-engineering/skills/generative-models/SKILL.md
This skill should be used when the user asks about "generative models", "diffusion models", "DDPM", "DDIM", "stable diffusion", "GANs", "GAN training", "generator", "discriminator", "mode collapse", "FID score", "language models", "LLM", "GPT", "transformer", "text generation", "image generation", "ELBO", "score matching", "latent diffusion", "variational autoencoder", "VAE", "CLIP", "multimodal generation", or when building any system that generates novel data samples.
npx skillsauth add harsh040506/claude-code-unified-skill-plugin-library generative-modelsInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Provides architecture patterns, loss function derivations, training stability techniques, and evaluation protocols for diffusion models, GANs, and large-scale language models.
Mathematical Foundation:
Training Objective (simplified ELBO): L_simple = E_{t, x_0, ε}[||ε − ε_θ(√ᾱ_t·x_0 + √(1−ᾱ_t)·ε, t)||²]
Architecture: U-Net with:
Noise Schedules: linear (DDPM), cosine (improved DDPM), or sigmoid (more uniform SNR)
Sampling Acceleration: DDIM (50 steps), DPM-Solver (10–20 steps) vs. DDPM (1000 steps)
See references/diffusion-models.md for full derivation, implementation, and latent diffusion (LDM) patterns.
Minimax Objective: min_G max_D V(D,G) = E_{x~p_data}[log D(x)] + E_{z~p_z}[log(1 − D(G(z)))]
Critical Stabilization Techniques (apply all for stable training):
Evaluation Metric: FID (Fréchet Inception Distance) — lower is better; FID < 10 is state-of-the-art for faces.
See references/gans.md for architecture variants (DCGAN → StyleGAN2 → ProjectedGAN) and CTGAN for tabular data.
Core Architecture (decoder-only, GPT-style):
Scaling Guidance (Chinchilla-optimal, Hoffmann et al. 2022):
Training Techniques for LLMs:
See references/transformer-lms.md for full implementation, tokenization, and RLHF fine-tuning patterns.
| Model Type | Primary Metric | Secondary Metrics | |---|---|---| | Diffusion (images) | FID (↓) | IS (↑), CLIP score, human preference | | GAN (images) | FID (↓) | Precision, Recall, coverage | | Language Model | Perplexity (↓) | BLEU, ROUGE, BERTScore, task benchmarks | | Conditional generation | Alignment score | Diversity, fidelity vs. reference |
testing
Performs quality control on single-cell RNA-seq data (.h5ad or .h5 files) using scverse best practices with MAD-based filtering and comprehensive visualizations. Use when users request QC analysis, filtering low-quality cells, assessing data quality, or following scverse/scanpy best practices for single-cell analysis.
tools
Deep learning for single-cell analysis using scvi-tools. This skill should be used when users need (1) data integration and batch correction with scVI/scANVI, (2) ATAC-seq analysis with PeakVI, (3) CITE-seq multi-modal analysis with totalVI, (4) multiome RNA+ATAC analysis with MultiVI, (5) spatial transcriptomics deconvolution with DestVI, (6) label transfer and reference mapping with scANVI/scArches, (7) RNA velocity with veloVI, or (8) any deep learning-based single-cell method. Triggers include mentions of scVI, scANVI, totalVI, PeakVI, MultiVI, DestVI, veloVI, sysVI, scArches, variational autoencoder, VAE, batch correction, data integration, multi-modal, CITE-seq, multiome, reference mapping, latent space.
testing
This skill should be used when scientists need help with research problem selection, project ideation, troubleshooting stuck projects, or strategic scientific decisions. Use this skill when users ask to pitch a new research idea, work through a project problem, evaluate project risks, plan research strategy, navigate decision trees, or get help choosing what scientific problem to work on. Typical requests include "I have an idea for a project", "I'm stuck on my research", "help me evaluate this project", "what should I work on", or "I need strategic advice about my research".
development
Run nf-core bioinformatics pipelines (rnaseq, sarek, atacseq) on sequencing data. Use when analyzing RNA-seq, WGS/WES, or ATAC-seq data—either local FASTQs or public datasets from GEO/SRA. Triggers on nf-core, Nextflow, FASTQ analysis, variant calling, gene expression, differential expression, GEO reanalysis, GSE/GSM/SRR accessions, or samplesheet creation.