skills/bioinformatics/SKILL.md
# Bioinformatics Analysis ## Overview Computational biology and genomics analysis pipelines. GENERAL: not locked to any specific tool — use Scanpy, Seurat, DESeq2, or any appropriate package. ## Common Workflows ### RNA-seq Analysis 1. Quality control (FastQC, MultiQC) 2. Alignment (STAR, HISAT2) or pseudo-alignment (Salmon, kallisto) 3. Quantification (featureCounts, Salmon quant) 4. Normalization (DESeq2 vst/rlog, edgeR TMM) 5. Differential expression (DESeq2, edgeR, limma-voom) 6. Visualiz
npx skillsauth add Zaoqu-Liu/ScienceClaw skills/bioinformaticsInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Computational biology and genomics analysis pipelines. GENERAL: not locked to any specific tool — use Scanpy, Seurat, DESeq2, or any appropriate package.
| Format | Content | Tools | |--------|---------|-------| | FASTQ | Raw sequencing reads | FastQC, Trimmomatic | | BAM/SAM | Aligned reads | samtools, IGV | | VCF | Variant calls | bcftools, GATK | | h5ad | AnnData (scRNA-seq) | Scanpy | | RDS | R object | Seurat, DESeq2 | | BED | Genomic regions | bedtools |
testing
Therapeutics Data Commons. AI-ready drug discovery datasets (ADME, toxicity, DTI), benchmarks, scaffold splits, molecular oracles, for therapeutic ML and pharmacological prediction.
tools
Genomic file toolkit. Read/write SAM/BAM/CRAM alignments, VCF/BCF variants, FASTA/FASTQ sequences, extract regions, calculate coverage, for NGS data processing pipelines.
development
Complete mass spectrometry analysis platform. Use for proteomics workflows feature detection, peptide identification, protein quantification, and complex LC-MS/MS pipelines. Supports extensive file formats and algorithms. Best for proteomics, comprehensive MS data processing. For simple spectral comparison and metabolite ID use matchms.
development
Multi-objective optimization framework. NSGA-II, NSGA-III, MOEA/D, Pareto fronts, constraint handling, benchmarks (ZDT, DTLZ), for engineering design and optimization problems.