Adoption

Agent Skills are supported by leading AI development tools.

VS Code, Gemini CLI, GitHub, Goose, Amp, Cursor, Claude Code, Letta, OpenCode, Claude, OpenAI Codex, Factory

mims-harvard/tooluniverse-multiomic-disease-characterization

Name: tooluniverse-multiomic-disease-characterization
Author: mims-harvard

plugin/skills/tooluniverse-multiomic-disease-characterization/SKILL.md

npx skillsauth add mims-harvard/tooluniverse tooluniverse-multiomic-disease-characterization


Multi-Omics Disease Characterization Pipeline

Characterize diseases across multiple molecular layers (genomics, transcriptomics, proteomics, pathways) to provide systems-level understanding of disease mechanisms, identify therapeutic opportunities, and discover biomarker candidates.

KEY PRINCIPLES:

Report-first approach - Create report file FIRST, then populate progressively
Disease disambiguation FIRST - Resolve all identifiers before omics analysis
Layer-by-layer analysis - Systematically cover all omics layers
Cross-layer integration - Identify genes/targets appearing in multiple layers
Evidence grading - Grade all evidence as T1 (human/clinical) to T4 (computational)
Tissue context - Emphasize disease-relevant tissues/organs
Quantitative scoring - Multi-Omics Confidence Score (0-100)
Druggable focus - Prioritize targets with therapeutic potential
Biomarker identification - Highlight diagnostic/prognostic markers
Mechanistic synthesis - Generate testable hypotheses
Source references - Every statement must cite tool/database
Completeness checklist - Mandatory section showing analysis coverage
English-first queries - Always use English terms in tool calls, but respond in the user's language

Multi-omics disease characterization asks which molecular layers are dysregulated, tracing the chain from genomic mutations to transcriptomic changes, proteomic effects, and metabolomic consequences. Concordance across layers strengthens a finding; discordance points to regulatory complexity.

LOOK UP, DON'T GUESS

When uncertain about any scientific fact, SEARCH databases first rather than reasoning from memory. A database-verified answer is always more reliable than a guess.

COMPUTE, DON'T DESCRIBE

When analysis requires computation (statistics, data processing, scoring, enrichment), write and run Python code via Bash. Don't describe what you would do — execute it and report actual results. Use ToolUniverse tools to retrieve data, then Python (pandas, scipy, statsmodels, matplotlib) to analyze it.
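As an illustration of computing rather than describing, the sketch below calculates an over-representation p-value with a one-sided hypergeometric test using only the standard library. The function name and the example counts are hypothetical, and this is not necessarily the statistic that the ToolUniverse enrichment tools report; in practice scipy or statsmodels would likely be used instead.

```python
from math import comb


def hypergeom_upper_tail(overlap, gene_set_size, query_size, background_size):
    """Return P(X >= overlap) for a hypergeometric draw.

    background_size: number of genes in the background universe
    gene_set_size:   number of background genes annotated to the pathway
    query_size:      number of genes in the query list
    overlap:         number of query genes annotated to the pathway
    """
    if not 0 <= gene_set_size <= background_size:
        raise ValueError("gene_set_size must lie within the background")
    if not 0 <= query_size <= background_size:
        raise ValueError("query_size must lie within the background")

    lowest = max(overlap, 0)
    highest = min(query_size, gene_set_size)
    # Count the draws with at least `overlap` annotated genes (exact integers).
    favorable = sum(
        comb(gene_set_size, i) * comb(background_size - gene_set_size, query_size - i)
        for i in range(lowest, highest + 1)
    )
    return favorable / comb(background_size, query_size)


# Made-up counts for illustration only (not results from any database).
p_value = hypergeom_upper_tail(overlap=5, gene_set_size=5, query_size=5, background_size=10)
print(f"p = {p_value:.6f}")
```

Working with exact integers until the final division avoids floating-point underflow for large backgrounds.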

When to Use This Skill

Apply when users:

Ask about disease mechanisms across omics layers
Need multi-omics characterization of a disease
Want to understand disease at the systems biology level
Ask "What pathways/genes/proteins are involved in [disease]?"
Need biomarker discovery for a disease
Want to identify druggable targets from disease profiling
Ask for integrated genomics + transcriptomics + proteomics analysis
Need cross-layer concordance analysis
Ask about disease network biology / hub genes

NOT for (use other skills instead):

Single gene/target validation -> Use tooluniverse-drug-target-validation
Drug safety profiling -> Use tooluniverse-adverse-event-detection
General disease overview -> Use tooluniverse-disease-research
Variant interpretation -> Use tooluniverse-variant-interpretation
GWAS-specific analysis -> Use tooluniverse-gwas-* skills
Pathway-only analysis -> Use tooluniverse-systems-biology

Input Parameters

| Parameter | Required | Description | Example |
|-----------|----------|-------------|---------|
| disease | Yes | Disease name, OMIM ID, EFO ID, or MONDO ID | Alzheimer disease, MONDO_0004975 |
| tissue | No | Tissue/organ of interest | brain, liver, blood |
| focus_layers | No | Specific omics layers to emphasize | genomics, transcriptomics, pathways |

Pipeline Overview

The pipeline runs 9 phases sequentially. Each phase uses specific tools documented in detail in tool-reference.md.

Phase 0: Disease Disambiguation (ALWAYS FIRST)

Resolve disease to standard identifiers (MONDO/EFO) for all downstream queries.

Primary tool: OpenTargets_get_disease_id_description_by_name
Get description, synonyms, therapeutic areas, disease hierarchy, cross-references
CRITICAL: Disease IDs use underscore format (e.g., MONDO_0004975), NOT colon
If ambiguous, present top 3-5 options and ask user to select
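The underscore requirement above can be enforced before any downstream call. The helper below is a minimal sketch: the function name is hypothetical, and it assumes only MONDO and EFO prefixes need handling, since those are the identifiers this phase resolves to.

```python
import re

# Assumption: only MONDO and EFO identifiers are normalized here.
_ID_PATTERN = re.compile(r"^(MONDO|EFO)[:_](\d+)$", re.IGNORECASE)


def to_underscore_id(raw_id: str) -> str:
    """Convert 'MONDO:0004975' (or 'mondo_0004975') to 'MONDO_0004975'."""
    match = _ID_PATTERN.match(raw_id.strip())
    if match is None:
        raise ValueError(f"Not a recognized MONDO/EFO identifier: {raw_id!r}")
    prefix, number = match.groups()
    return f"{prefix.upper()}_{number}"


print(to_underscore_id("MONDO:0004975"))
```

Raising on unrecognized input, rather than passing it through, makes a malformed ID fail early instead of producing an empty result several phases later.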

Phase 1: Genomics Layer

Identify genetic variants, GWAS associations, and genetically implicated genes.

Tools: gwas_search_associations (use efo_id for precision, not free-text disease_trait), gwas_get_snps_for_gene, ClinVar, OpenTargets associated targets
gnomad_get_gene_constraints — gene constraint metrics (pLI, oe_lof) to interpret whether LoF variants are tolerated vs. haploinsufficient
Get top 10-15 genes with genetic evidence scores; track Ensembl IDs for downstream phases

Phase 2: Transcriptomics Layer

Identify differentially expressed genes, tissue-specific expression, and expression-based biomarkers.

GTEx_get_expression_summary — baseline expression across 54 tissues (accepts gene_symbol directly)
Tools: Expression Atlas, HPA (tissue expression), EuropePMC scores
Check expression in disease-relevant tissues for top genes from Phase 1

Phase 3: Proteomics & Interaction Layer

Map protein-protein interactions, identify hub genes, and characterize interaction networks.

UniProt_get_function_by_accession — protein function narrative (essential for mechanistic context)
Tools: STRING_get_network (param: identifiers, species=9606), intact_get_interactions, HumanBase
Build PPI network from top 15-20 genes; identify hub genes by degree centrality
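Hub identification by degree centrality can be sketched in a few lines once the interaction edges are in hand. The function names are hypothetical and the edge list uses placeholder gene labels, not real interaction data; the actual response format of the interaction tools is not assumed here.

```python
def degree_centrality(edges):
    """Return {gene: degree / (n - 1)} for an undirected edge list."""
    neighbors = {}
    for gene_a, gene_b in edges:
        if gene_a == gene_b:
            continue  # ignore self-interactions
        neighbors.setdefault(gene_a, set()).add(gene_b)
        neighbors.setdefault(gene_b, set()).add(gene_a)
    node_count = len(neighbors)
    if node_count < 2:
        return {gene: 0.0 for gene in neighbors}
    return {gene: len(adj) / (node_count - 1) for gene, adj in neighbors.items()}


def top_hubs(edges, k=3):
    """Return the k highest-centrality genes, ties broken alphabetically."""
    scores = degree_centrality(edges)
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))[:k]


# Placeholder network for illustration only.
example_edges = [
    ("GENE_A", "GENE_B"),
    ("GENE_A", "GENE_C"),
    ("GENE_A", "GENE_D"),
    ("GENE_B", "GENE_E"),
    ("GENE_B", "GENE_C"),
]
print(top_hubs(example_edges, k=2))
```

Storing neighbors as sets means duplicate edges returned by different sources do not inflate a gene's degree.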

Phase 4: Pathway & Network Layer

Identify enriched biological pathways and cross-pathway connections.

ReactomeAnalysis_pathway_enrichment — identifiers are newline-separated (\n), NOT space-separated
enrichr_gene_enrichment_analysis — param: gene_list (array), libs (array). NOTE: data field is a JSON string that needs parsing
kegg_search_pathway — pathway keyword search
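The two formatting notes above (newline-separated identifiers, and a data field that arrives as a JSON string) can be handled with small helpers. This is a sketch under assumptions: the helper names are hypothetical, and the mock response below only mimics the "data is a JSON string" behavior described here, not the real payload layout.

```python
import json


def reactome_identifiers(genes):
    """Join gene symbols with newlines, dropping blanks and duplicates."""
    cleaned = (gene.strip() for gene in genes)
    unique = dict.fromkeys(gene for gene in cleaned if gene)  # keeps input order
    return "\n".join(unique)


def parse_enrichr_data(response):
    """Decode the 'data' field when it is a JSON string; pass it through otherwise."""
    data = response.get("data")
    return json.loads(data) if isinstance(data, str) else data


identifiers = reactome_identifiers(["APOE", " APP ", "APOE"])
print(repr(identifiers))

# Hypothetical response shape, for illustration only.
mock_response = {"data": '{"KEGG_2021_Human": []}'}
print(parse_enrichr_data(mock_response))
```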

Phase 5: Gene Ontology & Functional Annotation

Characterize biological processes, molecular functions, and cellular components.

Tools: Enrichr (GO libraries), QuickGO, GO annotations, OpenTargets GO
Run GO enrichment for all 3 aspects (BP, MF, CC)

Phase 6: Therapeutic Landscape

Map approved drugs, druggable targets, repurposing opportunities, and clinical trials.

DGIdb_get_drug_gene_interactions — drug interactions by gene (param: genes as array). Often more comprehensive than OpenTargets for drug-gene data.
OpenTargets drugs/tractability (use EFO IDs like EFO_0000384 for Crohn's, not MONDO — MONDO IDs may return null for drug queries)
search_clinical_trials — query_term is REQUIRED

Phase 7: Multi-Omics Integration

Integrate findings across all layers. See integration-scoring.md for full details.

Cross-layer gene concordance: count layers per gene, score multi-layer hub genes
Direction concordance: genetics + expression agreement
Biomarker identification: diagnostic, prognostic, predictive
Mechanistic hypothesis generation
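Counting layers per gene, the first integration step listed above, might look like the following. The function names, layer labels, and gene placeholders are all hypothetical, and the sketch deliberately stops at counting: the scoring formula itself lives in integration-scoring.md and is not reproduced here.

```python
def layer_support(layer_hits):
    """Map each gene to the sorted list of layers in which it appears.

    layer_hits: {layer_name: iterable of gene symbols}
    """
    support = {}
    for layer, genes in layer_hits.items():
        for gene in set(genes):
            support.setdefault(gene, set()).add(layer)
    return {gene: sorted(layers) for gene, layers in support.items()}


def multi_layer_genes(layer_hits, min_layers=2):
    """Return (gene, layers) pairs seen in at least min_layers layers, most-supported first."""
    ranked = sorted(
        layer_support(layer_hits).items(),
        key=lambda item: (-len(item[1]), item[0]),
    )
    return [(gene, layers) for gene, layers in ranked if len(layers) >= min_layers]


# Placeholder inputs for illustration only.
example_hits = {
    "genomics": ["GENE_A", "GENE_B", "GENE_C"],
    "transcriptomics": ["GENE_A", "GENE_C"],
    "proteomics": ["GENE_A", "GENE_D"],
}
for gene, layers in multi_layer_genes(example_hits):
    print(gene, len(layers), layers)
```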

Phase 8: Report Finalization

Write executive summary, calculate confidence score, verify completeness.

See integration-scoring.md for quality checklist and scoring formula

Key Tool Parameter Notes

These are the most common parameter pitfalls:

OpenTargets disease IDs: underscore format (MONDO_0004975), NOT colon
STRING protein_ids: must be array (['APOE']), not string
enrichr libs: must be array (['KEGG_2021_Human'])
HPA_get_rna_expression_by_source: ALL 3 params required (gene_name, source_type, source_name)
humanbase_ppi_analysis: ALL params required (gene_list, tissue, max_node, interaction, string_mode)
expression_atlas_disease_target_score: pageSize is REQUIRED
search_clinical_trials: query_term is REQUIRED even if condition is provided

For full tool parameters and per-phase workflows, see tool-reference.md.
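Several of the pitfalls listed above can be caught with a pre-flight check before a tool is called. The sketch below encodes only the required and array-typed parameters named in this document; the function name and the way calls are represented (a tool name plus a dict of parameters) are assumptions, not part of ToolUniverse.

```python
# Required parameters, taken from the pitfalls listed in this document.
REQUIRED_PARAMS = {
    "HPA_get_rna_expression_by_source": {"gene_name", "source_type", "source_name"},
    "humanbase_ppi_analysis": {"gene_list", "tissue", "max_node", "interaction", "string_mode"},
    "expression_atlas_disease_target_score": {"pageSize"},
    "search_clinical_trials": {"query_term"},
}

# Parameters that must be passed as arrays rather than plain strings.
ARRAY_PARAMS = {
    "enrichr_gene_enrichment_analysis": {"gene_list", "libs"},
    "DGIdb_get_drug_gene_interactions": {"genes"},
}


def check_call(tool_name, params):
    """Return a list of human-readable problems; an empty list means no known issue."""
    problems = []
    for name in sorted(REQUIRED_PARAMS.get(tool_name, set()) - set(params)):
        problems.append(f"missing required parameter: {name}")
    for name in sorted(ARRAY_PARAMS.get(tool_name, set()) & set(params)):
        if not isinstance(params[name], (list, tuple)):
            problems.append(f"parameter must be an array: {name}")
    return problems


print(check_call("search_clinical_trials", {"condition": "Alzheimer disease"}))
```

Tools that are not in either table pass through without complaint, so the check only flags the cases documented here.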

Reference Files

All detailed content is in reference files in this directory:

| File | Contents |
|------|----------|
| tool-reference.md | Full tool parameters, inputs/outputs, per-phase workflows, quick reference table |
| report-template.md | Complete report markdown template with all sections and checklists |
| integration-scoring.md | Confidence score formula (0-100), evidence grading (T1-T4), integration procedures, quality checklist |
| response-formats.md | Verified JSON response structures for key tools |
| use-patterns.md | Common use patterns, edge case handling, fallback strategies |


Comprehensive disease characterization across genomics, transcriptomics, proteomics, and pathways for systems-level understanding. Identifies therapeutic opportunities and biomarker candidates by integrating multi-layer molecular data. Use for full-omics disease deep-dive reports, mechanism mapping, and biomarker-and-target identification from multi-omics data.

1,429 stars

tools

Updated Jun 7, 2026

Install globally

npx skillsauth add mims-harvard/tooluniverse tooluniverse-multiomic-disease-characterization

Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

| Scanner | Description | Status | Reported % |
|---------|-------------|--------|------------|
| Trivy | Container and dependency vulnerability scanner | Clean | 95% |
| Semgrep | Static code analysis for vulnerabilities | Clean | 95% |
| mcp-scan (Snyk) | Model Context Protocol security validation | Clean | 95% |
| Snyk (dep) | Open source security scanning | Skipped | 50% |
| Socket.dev | Supply chain security analysis | Skipped | 50% |
| VirusTotal | Multi-engine malware detection | Skipped | 50% |
| CrowdStrike | Advanced threat intelligence | Skipped | 50% |
| OSV-Scanner | Open Source Vulnerability database check | Skipped | 50% |
| OWASP Dep-Check | | Skipped | 50% |

Last scanned: Jun 7, 2026, 6:22 AM · 156.0s · 6 files scanned

SKILL.md

name: tooluniverse-multiomic-disease-characterization
description: Comprehensive disease characterization across genomics, transcriptomics, proteomics, and pathways for systems-level understanding. Identifies therapeutic opportunities and biomarker candidates by integrating multi-layer molecular data. Use for full-omics disease deep-dive reports, mechanism mapping, and biomarker-and-target identification from multi-omics data.
disable-model-invocation: true


Related Skills

mims-harvard/tooluniverse-self-review

tools

Verified · Trusted · Community

Generate the success criteria for a task or question, then review work against them. Given a task, goal, or open-ended question, decompose it into scenarios, evaluation perspectives, and fine-grained weighted YES/NO criteria using the Recursive Expansion Tree (RET) method; if work is supplied, score it criterion-by-criterion and surface what is missing or could be better. Use when asked to self-review or check your own work, judge whether a task is done well or completely, build a definition-of-done or completeness checklist, create an evaluation rubric or grading criteria, score or grade answers to a question, set up an LLM-as-judge rubric, or when the user mentions self-review, completeness check, success criteria, evaluation criteria, scoring rubric, Qworld, or the RET algorithm.

1,583 · SKILL.md · Updated Jul 22, 2026

mims-harvard/tooluniverse-peptide-target-deorphanization

tools

Verified · Trusted · Community

Find the real protein target(s) of a peptide from its sequence — peptide target deorphanization / off-target identification, for ANY target class (GPCR, ion channel, protease, cytokine/growth-factor receptor, enzyme, integrin), not only GPCRs. Use when a peptide has a phenotype but does not bind its hypothesized target, when a peptide binds a target in one species or assay but not another, or to screen candidate targets for an orphan peptide. A target-class router steers a multi-route keyless pipeline (PROSITE/ELM motif, BLAST homology, HGNC/InterPro/GPCRdb/GtoPdb target-family enumeration, OpenTargets phenotype anchor, EnsemblCompara/Alliance cross-species reconciliation) plus optional NVIDIA-NIM co-folding (Boltz2, AlphaFold2-Multimer, OpenFold3) for structural confirmation.

1,583 · SKILL.md · Updated Jul 22, 2026

mims-harvard/tooluniverse-cs-setup

tools

Verified · Trusted · Community

Install or update ToolUniverse in Claude Science — create the conda env, install the tooluniverse pip package, and (re)build the tooluniverse-research skill by fetching the current workflow library from GitHub. Use for first-time setup, upgrading the ToolUniverse version, refreshing the bundled workflows after an upstream release, or reinstalling on a new machine.

1,583 · SKILL.md · Updated Jul 22, 2026

mims-harvard/tooluniverse-codex-plugin

tools

Verified · Trusted · Community

Install, set up, verify, update, pin, uninstall, or troubleshoot the ToolUniverse plugin on OpenAI Codex. ALWAYS consult this skill for any of those — don't answer from memory, because the exact marketplace name (mims-harvard/ToolUniverse), the "codex plugin marketplace add" then "codex plugin add -m tooluniverse" flow, Codex's startup auto-upgrade behavior, the uvx tooluniverse MCP server, and the API-key env vars are easy to get wrong. Use it whenever someone wants to get ToolUniverse (or "the 1000+ scientific tools" / "the harvard tools") working on Codex, says the Codex plugin or its tools/skills won't load, hits a uvx or MCP-server startup error, asks how Codex updates it, wants to pin or remove it, or finds it running an old tool version — even if they never say the word "plugin". Not for the Claude Code plugin (use tooluniverse-claude-code-plugin), for running research with the tools, or for authoring new tools or skills.

1,583 · SKILL.md · Updated Jul 22, 2026


Install manually


# Clone the repo
git clone https://github.com/mims-harvard/tooluniverse.git

# Make sure the global Claude Code skills folder exists, then copy the skill into it
mkdir -p ~/.claude/skills
cp -r tooluniverse/plugin/skills/tooluniverse-multiomic-disease-characterization ~/.claude/skills/


Compatible with: Claude Code, OpenAI Codex CLI, ChatGPT