plugin/skills/tooluniverse-multiomic-disease-characterization/SKILL.md
Comprehensive disease characterization across genomics, transcriptomics, proteomics, and pathways for systems-level understanding. Identifies therapeutic opportunities and biomarker candidates by integrating multi-layer molecular data. Use for full-omics disease deep-dive reports, mechanism mapping, and biomarker-and-target identification from multi-omics data.
npx skillsauth add mims-harvard/tooluniverse tooluniverse-multiomic-disease-characterizationInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Characterize diseases across multiple molecular layers (genomics, transcriptomics, proteomics, pathways) to provide systems-level understanding of disease mechanisms, identify therapeutic opportunities, and discover biomarker candidates.
KEY PRINCIPLES:
Multi-omics disease characterization asks: what molecular layers are dysregulated? Genomic mutations → transcriptomic changes → proteomic effects → metabolomic consequences. Concordance across layers strengthens the finding. Discordance reveals regulatory complexity.
When uncertain about any scientific fact, SEARCH databases first rather than reasoning from memory. A database-verified answer is always more reliable than a guess.
When analysis requires computation (statistics, data processing, scoring, enrichment), write and run Python code via Bash. Don't describe what you would do — execute it and report actual results. Use ToolUniverse tools to retrieve data, then Python (pandas, scipy, statsmodels, matplotlib) to analyze it.
Apply when users:
NOT for (use other skills instead):
tooluniverse-drug-target-validationtooluniverse-adverse-event-detectiontooluniverse-disease-researchtooluniverse-variant-interpretationtooluniverse-gwas-* skillstooluniverse-systems-biology| Parameter | Required | Description | Example |
|-----------|----------|-------------|---------|
| disease | Yes | Disease name, OMIM ID, EFO ID, or MONDO ID | Alzheimer disease, MONDO_0004975 |
| tissue | No | Tissue/organ of interest | brain, liver, blood |
| focus_layers | No | Specific omics layers to emphasize | genomics, transcriptomics, pathways |
The pipeline runs 9 phases sequentially. Each phase uses specific tools documented in detail in tool-reference.md.
Resolve disease to standard identifiers (MONDO/EFO) for all downstream queries.
OpenTargets_get_disease_id_description_by_nameMONDO_0004975), NOT colonIdentify genetic variants, GWAS associations, and genetically implicated genes.
gwas_search_associations (use efo_id for precision, not free-text disease_trait), gwas_get_snps_for_gene, ClinVar, OpenTargets associated targetsgnomad_get_gene_constraints — gene constraint metrics (pLI, oe_lof) to interpret whether LoF variants are tolerated vs. haploinsufficientIdentify differentially expressed genes, tissue-specific expression, and expression-based biomarkers.
GTEx_get_expression_summary — baseline expression across 54 tissues (accepts gene_symbol directly)Map protein-protein interactions, identify hub genes, and characterize interaction networks.
UniProt_get_function_by_accession — protein function narrative (essential for mechanistic context)STRING_get_network (param: identifiers, species=9606), intact_get_interactions, HumanBaseIdentify enriched biological pathways and cross-pathway connections.
ReactomeAnalysis_pathway_enrichment — identifiers are newline-separated (\n), NOT space-separatedenrichr_gene_enrichment_analysis — param: gene_list (array), libs (array). NOTE: data field is a JSON string that needs parsingkegg_search_pathway — pathway keyword searchCharacterize biological processes, molecular functions, and cellular components.
Map approved drugs, druggable targets, repurposing opportunities, and clinical trials.
DGIdb_get_drug_gene_interactions — drug interactions by gene (param: genes as array). Often more comprehensive than OpenTargets for drug-gene data.EFO_0000384 for Crohn's, not MONDO — MONDO IDs may return null for drug queries)search_clinical_trials — query_term is REQUIREDIntegrate findings across all layers. See integration-scoring.md for full details.
Write executive summary, calculate confidence score, verify completeness.
integration-scoring.md for quality checklist and scoring formulaThese are the most common parameter pitfalls:
OpenTargets disease IDs: underscore format (MONDO_0004975), NOT colonSTRING protein_ids: must be array (['APOE']), not stringenrichr libs: must be array (['KEGG_2021_Human'])HPA_get_rna_expression_by_source: ALL 3 params required (gene_name, source_type, source_name)humanbase_ppi_analysis: ALL params required (gene_list, tissue, max_node, interaction, string_mode)expression_atlas_disease_target_score: pageSize is REQUIREDsearch_clinical_trials: query_term is REQUIRED even if condition is providedFor full tool parameters and per-phase workflows, see tool-reference.md.
All detailed content is in reference files in this directory:
| File | Contents |
|------|----------|
| tool-reference.md | Full tool parameters, inputs/outputs, per-phase workflows, quick reference table |
| report-template.md | Complete report markdown template with all sections and checklists |
| integration-scoring.md | Confidence score formula (0-100), evidence grading (T1-T4), integration procedures, quality checklist |
| response-formats.md | Verified JSON response structures for key tools |
| use-patterns.md | Common use patterns, edge case handling, fallback strategies |
tools
Post-market safety surveillance and recall/adverse-event RETRIEVAL across the full spectrum of FDA-regulated products that are NOT covered by the drug-AE signal skills: medical devices, food / dietary supplements / cosmetics, veterinary drugs, and drug supply (shortages). Orchestrates openFDA endpoints (MAUDE device adverse events + device recalls + 510(k), CAERS food/supplement/ cosmetic adverse events, veterinary adverse events, drug shortages, and cross-product enforcement/recall reports). USE WHEN the user asks: "are there adverse events for [device / pacemaker / infusion pump / insulin pump]", "device recalls for [firm/product]", "supplement / vitamin / cosmetic adverse reactions", "is [drug] in shortage", "what injectables are on shortage", "veterinary / animal adverse events for [drug] in [dog/cat/horse]", "food recall for listeria", "MAUDE report for [device]", "CAERS reactions for [brand]". DO NOT USE for drug adverse-event SIGNAL detection or disproportionality (PRR / ROR / IC) or drug-AE association scoring — that is `tooluniverse-pharmacovigilance` / `tooluniverse-adverse-event-detection`. This skill is multi-product surveillance and retrieval, not drug-AE statistical signal mining.
tools
--- name: tooluniverse-phewas description: Cross-ancestry / cross-biobank phenome-wide association (PheWAS) and replication. Given ONE variant (rsID) or ONE gene, look up every phenotype it associates with across European/UK (UKB-TOPMed), Finnish (FinnGen), Japanese (BioBank Japan), and Taiwanese (TPMI) biobanks, plus exome-wide gene-burden PheWAS (Genebass), then judge whether an association replicates across ancestries or is population-specific. Use whenever the user asks "what else is this va
tools
Dereplicate a putative natural product and assign its chemical taxonomy. Use to answer "is [compound] a known natural product", "what microbe/organism produces [compound]", "what chemical class is [compound]", "dereplicate this metabolite (by formula/exact mass/InChIKey/SMILES)", or "classify this molecule into ChemOnt". Searches NPAtlas for known microbial natural products (producing organism + literature reference), assigns the ChemOnt kingdom→superclass→class→subclass hierarchy via ClassyFire, resolves systematic IUPAC names to structure via OPSIN, and cross-references identity in PubChem. NOT for general drug/compound identity or ADMET (use tooluniverse-chemical-compound-retrieval / tooluniverse-small-molecule-discovery) and NOT for metabolomics pathway/enrichment analysis (use tooluniverse-metabolomics skills).
tools
Genome-ASSEMBLY discovery, QC, and replicon mapping for any organism (bacteria, archaea, fungi, and beyond) using NCBI Datasets. Resolves an organism name or taxid to assemblies, picks the reference/representative or best-quality assembly, pulls assembly QC metrics (total length, contig/scaffold N50, contig count, GC%, assembly level, RefSeq category), enumerates chromosomes and plasmids via per-replicon sequence reports, and compares candidate assemblies on quality. Use for "what genomes are available for [organism]", "assembly stats / N50 / GC content for [GCF_/GCA_ accession]", "how many plasmids does [strain] have", "compare assemblies for [species]", "find the reference genome for [taxon]", "is this assembly Complete Genome or just contigs". NOT for gene-level orthology/synteny (use tooluniverse-comparative-genomics), plant gene structure (use tooluniverse-plant-genomics), de novo assembly from raw reads (no tool exists), or taxonomy-only name/lineage lookups.