skills/science/omics-tools/SKILL.md
Omics and single-cell workflow guide for AnnData, Scanpy-style dataset profiling, PyDESeq2-oriented count checks, pysam alignment inspection, and pyOpenMS mass-spectrometry summaries. Use when the user asks to inspect h5ad files, summarize BAM regions, profile omics count tables, or inventory mzML experiments before deeper modeling.
npx skillsauth add drugclaw/drugclaw omics-toolsInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Use this skill when the user asks to inspect or triage omics datasets before deeper modeling.
Typical triggers:
h5ad or annotated single-cell matriceswhich python3 || true
python3 - <<'PY'
mods = ["pandas", "numpy", "anndata", "pysam"]
extra = ["scanpy", "pydeseq2", "pyopenms", "skbio"]
for name in mods + extra:
try:
__import__(name)
print(f"{name}: ok")
except Exception as exc:
print(f"{name}: missing ({exc})")
PY
Do not claim single-cell, alignment, or mass-spec analysis ran if the required module is absent.
templates/single_cell_profile.pytemplates/pysam_region_profile.pytemplates/mzml_summary.pypython3 templates/single_cell_profile.py \
--input data/pbmc.h5ad \
--cell-type-column cell_type \
--group-column batch \
--group-column donor \
--output omics/pbmc_profile.csv \
--summary omics/pbmc_profile.json
Use this first for:
python3 templates/pysam_region_profile.py \
--bam alignments/sample.bam \
--region chr7:55019017-55211628 \
--region chr12:25205246-25250928 \
--output omics/sample_region_profile.csv \
--summary omics/sample_region_profile.json
Use this for:
python3 templates/mzml_summary.py \
--input proteomics/run01.mzML \
--output omics/run01_mzml_profile.csv \
--summary omics/run01_mzml_profile.json
Use this for:
This skill is for data profiling and workflow triage. It does not replace full differential-expression analysis, trajectory inference, peptide identification, or validated clinical interpretation.
Good answers should mention:
For general sequence analysis or command-line bioinformatics, activate bio-tools.
For remote biology APIs such as GEO, Ensembl, UniProt, PDB, or Reactome, activate bio-db-tools.
For transcription-factor network inference from processed expression matrices, activate grn-tools.
For statistical modeling or survival analysis on omics-derived tables, activate stat-modeling-tools or survival-analysis-tools.
For static or interactive omics figures, activate scientific-visualization-tools.
For chemistry, ADMET, QSAR, or structure-aware affinity, activate chem-tools.
tools
Survival and time-to-event workflow guide for Kaplan-Meier summaries, log-rank tests, and Cox proportional hazards models with reproducible outputs. Use when the user asks for time-to-event analysis, censored data summaries, hazard ratios, or survival-group comparison for research datasets.
tools
Statistical modeling workflow guide for hypothesis tests, effect-size reporting, statsmodels regression, diagnostics, and structured result export. Use when the user asks for statistical test selection, OLS or logistic regression, coefficient tables, inference, or reproducible statistical summaries for scientific datasets.
tools
Research-method workflow guide for hypothesis framing, peer-review style critique, reproducibility planning, study-design checks, and scientific-writing structure. Use when the user asks for manuscript critique, research-gap framing, hypothesis generation, reproducibility checklists, or study-planning support that should stay on the research side rather than patient-care decisions.
tools
Scientific visualization workflow guide for publication-ready static figures with seaborn or matplotlib and interactive figures with Plotly. Use when the user asks for scientific plots, cohort or assay figures, publication graphics, dashboards, or reusable plotting scripts for research datasets.