skills/tooluniverse-spatial-omics-analysis/SKILL.md
Spatial multi-omics interpretation pipeline. Transforms spatially variable genes (SVGs), domain annotations, and tissue context into biological insights via domain-by-domain characterization, cell-type composition, spatial gene expression patterns, RNA+protein+metabolite integration. Use for Visium, MERFISH, seqFISH, Slide-seq, spatial proteomics, and spatial multi-omics interpretation. Goes beyond statistics to disease mechanisms and therapeutic opportunities.
npx skillsauth add mims-harvard/tooluniverse tooluniverse-spatial-omics-analysisInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Comprehensive biological interpretation of spatial omics data. Transforms spatially variable genes (SVGs), domain annotations, and tissue context into actionable biological insights.
KEY PRINCIPLES:
When uncertain about any scientific fact, SEARCH databases first (PubMed, UniProt, ChEMBL, ClinVar, etc.) rather than reasoning from memory. A database-verified answer is always more reliable than a guess.
When analysis requires computation (statistics, data processing, scoring, enrichment), write and run Python code via Bash. Don't describe what you would do — execute it and report actual results. Use ToolUniverse tools to retrieve data, then Python (pandas, scipy, statsmodels, matplotlib) to analyze it.
Apply when users:
NOT for: Single gene interpretation (use target-research), variant interpretation, drug safety, bulk RNA-seq, GWAS analysis.
| Parameter | Required | Description | Example |
|-----------|----------|-------------|---------|
| svgs | Yes | Spatially variable genes | ['EGFR', 'CDH1', 'VIM', 'MYC', 'CD3E'] |
| tissue_type | Yes | Tissue/organ type | brain, liver, lung, breast |
| technology | No | Spatial omics platform | 10x Visium, MERFISH, DBiTplus |
| disease_context | No | Disease if applicable | breast cancer, Alzheimer disease |
| spatial_domains | No | Domain -> marker genes dict | {'Tumor core': ['MYC','EGFR']} |
| cell_types | No | Cell types from deconvolution | ['Epithelial', 'T cell'] |
| proteins | No | Proteins detected (multi-modal) | ['CD3', 'PD-L1', 'Ki67'] |
| metabolites | No | Metabolites (SpatialMETA) | ['glutamine', 'lactate'] |
Data Completeness (0-30): SVGs (5), Disease context (5), Spatial domains (5), Cell types (5), Multi-modal (5), Literature (5)
Biological Insight (0-40): Pathway enrichment FDR<0.05 (10), Cell-cell interactions (10), Disease mechanism (10), Druggable targets (10)
Evidence Quality (0-30): Cross-database validation 3+ DBs (10), Clinical validation (10), Literature support (10)
| Score | Tier | Interpretation | |-------|------|----------------| | 80-100 | Excellent | Comprehensive characterization, strong insights, druggable targets | | 60-79 | Good | Good pathway/interaction analysis, some therapeutic context | | 40-59 | Moderate | Basic enrichment, limited domain comparison | | 0-39 | Limited | Minimal data, gene-level annotation only |
| Tier | Criteria | Examples | |------|----------|----------| | [T1] | Direct human/clinical evidence | FDA-approved drug, validated biomarker | | [T2] | Experimental evidence | Validated spatial pattern, known L-R pair | | [T3] | Computational/database evidence | PPI prediction, pathway enrichment | | [T4] | Annotation/prediction only | GO annotation, text-mined association |
Resolve tissue/disease identifiers, establish analysis context. Get MONDO/EFO IDs for disease queries.
OpenTargets_get_disease_id_description_by_name, OpenTargets_get_disease_description_by_efoId, HPA_search_genes_by_queryResolve gene IDs, annotate functions, tissue specificity, subcellular localization.
MyGene_query_genes, UniProt_get_function_by_accession, HPA_get_subcellular_location, HPA_get_rna_expression_by_source, HPA_get_comprehensive_gene_details_by_ensembl_id, HPA_get_cancer_prognostics_by_gene, UniProtIDMap_gene_to_uniprotIdentify enriched pathways globally and per-domain. Filter FDR < 0.05.
STRING_functional_enrichment (PRIMARY), ReactomeAnalysis_pathway_enrichment, GO_get_annotations_for_gene, kegg_search_pathway, WikiPathways_searchCharacterize each domain biologically, assign cell types from markers, compare domains.
HPA_get_biological_processes_by_gene, HPA_get_protein_interactions_by_genePredict communication from spatial patterns. Check ligand-receptor pairs across domains.
STRING_get_interaction_partners, STRING_get_protein_interactions, intact_search_interactions, Reactome_get_interactor, DGIdb_get_drug_gene_interactionsConnect to disease mechanisms, identify druggable targets, find clinical trials.
OpenTargets_get_associated_targets_by_disease_efoId, OpenTargets_get_target_tractability_by_ensemblID, OpenTargets_get_associated_drugs_by_target_ensemblID, search_clinical_trials, DGIdb_get_gene_druggability, civic_search_genesIntegrate protein/RNA/metabolite data. Compare spatial RNA with protein detection.
HPA_get_subcellular_location, HPA_get_rna_expression_in_specific_tissues, Reactome_map_uniprot_to_pathways, kegg_get_pathway_infoClassify immune cells, check checkpoint expression, assess Hot vs Cold vs Excluded patterns.
STRING_functional_enrichment, OpenTargets_get_target_tractability_by_ensemblID, iedb_search_epitopesSearch published evidence, suggest validation experiments (smFISH, IHC, PLA).
PubMed_search_articles, openalex_literature_searchUse HuBMAP tools to find published spatial biology reference datasets for comparison, validation, or cross-study analysis.
| Tool | Purpose | Key Parameters |
|------|---------|----------------|
| HuBMAP_search_datasets | Search published spatial datasets by organ/assay/keyword | organ (code: "LK"=Kidney, "BR"=Brain, "LU"=Lung, etc.), dataset_type ("RNAseq", "CODEX", "MALDI"), query, limit |
| HuBMAP_list_organs | List all available organs with codes and UBERON IDs | (no required params) |
| HuBMAP_get_dataset | Get detailed metadata for a specific HuBMAP dataset | hubmap_id (e.g. "HBM626.FHJD.938") |
When to use: Phase 0 (find reference datasets for the tissue), Phase 8 (cross-reference findings with published HuBMAP atlas data).
See phase-procedures.md for detailed workflows, decision logic, and tool parameter specifications per phase.
Create file: {tissue}_{disease}_spatial_omics_report.md
# Spatial Multi-Omics Analysis Report: {Tissue Type}
**Report Generated**: {date} | **Technology**: {platform}
**Tissue**: {tissue_type} | **Disease**: {disease or "Normal tissue"}
**Total SVGs**: {count} | **Spatial Domains**: {count}
**Spatial Omics Integration Score**: (calculated after analysis)
## Executive Summary
## 1. Tissue & Disease Context
## 2. Spatially Variable Gene Characterization
- 2.1 Gene ID Resolution
- 2.2 Tissue Expression Patterns
- 2.3 Subcellular Localization
- 2.4 Disease Associations
## 3. Pathway Enrichment Analysis
- 3.1 STRING, 3.2 Reactome, 3.3-3.5 GO (BP, MF, CC)
## 4. Spatial Domain Characterization (per-domain + comparison)
## 5. Cell-Cell Interaction Inference
- 5.1 PPI, 5.2 Ligand-Receptor, 5.3 Signaling Pathways
## 6. Disease & Therapeutic Context
- 6.1 Disease Gene Overlap, 6.2 Druggable Targets, 6.3 Drug Mechanisms, 6.4 Trials
## 7. Multi-Modal Integration (if data available)
## 8. Immune Microenvironment (if relevant)
## 9. Literature & Validation Context
## Spatial Omics Integration Score (breakdown table)
## Completeness Checklist
## References (tools used, database versions)
See report-template.md for full template with table structures.
Spatial Multi-Omics Analysis provides:
Outputs: Markdown report with Spatial Omics Integration Score (0-100) Uses: 70+ ToolUniverse tools across 9 analysis phases Time: ~10-20 minutes depending on gene list size
tools
Post-market safety surveillance and recall/adverse-event RETRIEVAL across the full spectrum of FDA-regulated products that are NOT covered by the drug-AE signal skills: medical devices, food / dietary supplements / cosmetics, veterinary drugs, and drug supply (shortages). Orchestrates openFDA endpoints (MAUDE device adverse events + device recalls + 510(k), CAERS food/supplement/ cosmetic adverse events, veterinary adverse events, drug shortages, and cross-product enforcement/recall reports). USE WHEN the user asks: "are there adverse events for [device / pacemaker / infusion pump / insulin pump]", "device recalls for [firm/product]", "supplement / vitamin / cosmetic adverse reactions", "is [drug] in shortage", "what injectables are on shortage", "veterinary / animal adverse events for [drug] in [dog/cat/horse]", "food recall for listeria", "MAUDE report for [device]", "CAERS reactions for [brand]". DO NOT USE for drug adverse-event SIGNAL detection or disproportionality (PRR / ROR / IC) or drug-AE association scoring — that is `tooluniverse-pharmacovigilance` / `tooluniverse-adverse-event-detection`. This skill is multi-product surveillance and retrieval, not drug-AE statistical signal mining.
tools
--- name: tooluniverse-phewas description: Cross-ancestry / cross-biobank phenome-wide association (PheWAS) and replication. Given ONE variant (rsID) or ONE gene, look up every phenotype it associates with across European/UK (UKB-TOPMed), Finnish (FinnGen), Japanese (BioBank Japan), and Taiwanese (TPMI) biobanks, plus exome-wide gene-burden PheWAS (Genebass), then judge whether an association replicates across ancestries or is population-specific. Use whenever the user asks "what else is this va
tools
Dereplicate a putative natural product and assign its chemical taxonomy. Use to answer "is [compound] a known natural product", "what microbe/organism produces [compound]", "what chemical class is [compound]", "dereplicate this metabolite (by formula/exact mass/InChIKey/SMILES)", or "classify this molecule into ChemOnt". Searches NPAtlas for known microbial natural products (producing organism + literature reference), assigns the ChemOnt kingdom→superclass→class→subclass hierarchy via ClassyFire, resolves systematic IUPAC names to structure via OPSIN, and cross-references identity in PubChem. NOT for general drug/compound identity or ADMET (use tooluniverse-chemical-compound-retrieval / tooluniverse-small-molecule-discovery) and NOT for metabolomics pathway/enrichment analysis (use tooluniverse-metabolomics skills).
tools
Genome-ASSEMBLY discovery, QC, and replicon mapping for any organism (bacteria, archaea, fungi, and beyond) using NCBI Datasets. Resolves an organism name or taxid to assemblies, picks the reference/representative or best-quality assembly, pulls assembly QC metrics (total length, contig/scaffold N50, contig count, GC%, assembly level, RefSeq category), enumerates chromosomes and plasmids via per-replicon sequence reports, and compares candidate assemblies on quality. Use for "what genomes are available for [organism]", "assembly stats / N50 / GC content for [GCF_/GCA_ accession]", "how many plasmids does [strain] have", "compare assemblies for [species]", "find the reference genome for [taxon]", "is this assembly Complete Genome or just contigs". NOT for gene-level orthology/synteny (use tooluniverse-comparative-genomics), plant gene structure (use tooluniverse-plant-genomics), de novo assembly from raw reads (no tool exists), or taxonomy-only name/lineage lookups.