plugin/skills/tooluniverse-metabolomics/SKILL.md
Metabolomics research — metabolite identification, study analysis, and database searches across HMDB, MetaboLights, Metabolomics Workbench, KEGG. Use for annotating mass-spec features to known metabolites, finding metabolomics studies of a disease, and structured metabolomics research reports with metabolite-pathway mapping.
npx skillsauth add mims-harvard/tooluniverse tooluniverse-metabolomicsInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Comprehensive metabolomics research skill that identifies metabolites, analyzes studies, and searches metabolomics databases. Generates structured research reports with annotated metabolite information, study details, and database statistics.
Use this skill when asked to:
Example queries:
Primary metabolite databases:
The skill executes a 4-phase analysis pipeline:
For each metabolite in the input list:
For provided study IDs:
For keyword searches:
Always included in reports:
Input:
Output report includes:
Input:
Output report includes:
Input:
Output report includes:
Input:
Output report includes:
List of metabolite names to identify and annotate.
["glucose"], ["lactate", "pyruvate", "acetate"]MetaboLights or Metabolomics Workbench study identifier.
"MTBLS1", "ST000001"Keyword to search metabolomics studies.
"diabetes", "glucose metabolism", "LC-MS"Target organism for study filtering.
"Homo sapiens""Mus musculus", "Saccharomyces cerevisiae"Path for the generated markdown report.
"my_analysis.md", "metabolomics_report.md"All analyses generate a structured markdown report with:
Header section:
Phase sections:
Database overview:
Error handling:
HMDB tools are SOAP-based and require special parameter handling:
HMDB_search: Requires operation="search" parameterHMDB_get_metabolite: Requires operation="get_metabolite" parameterendpoint or method parameters (not applicable to SOAP)Tools return different response formats - handle all three:
{status: "success", data: [...], metadata: {...}}[...] (e.g., metabolights_list_studies){field1: ..., field2: ...} (e.g., some detail endpoints)Always check response type with isinstance() before accessing fields.
Follow this hierarchy for robustness:
Write report incrementally to avoid memory issues:
The skill automatically discovers and uses these tools from ToolUniverse:
HMDB Tools:
HMDB_search: Search metabolites by nameHMDB_get_metabolite: Get detailed metabolite informationMetaboLights Tools:
metabolights_list_studies: List available studiesmetabolights_search_studies: Search studies by keywordmetabolights_get_study: Get study details by IDMetabolomics Workbench Tools:
MetabolomicsWorkbench_get_study: Get study informationMetabolomicsWorkbench_search_compound_by_name: Search compoundsPubChem Tools:
PubChem_get_CID_by_compound_name: Get PubChem CIDPubChem_get_compound_properties_by_CID: Get chemical propertiesNo manual tool configuration required - all tools loaded automatically.
Cause: HMDB search returned empty results or index error accessing first result Solution: This is expected for uncommon metabolites; PubChem fallback will be attempted
Cause: Study ID not found or API unavailable Solution: Verify study ID format (MTBLS* or ST*), check if study is public
Cause: Missing API keys for some databases
Solution: Check .env.template, add required API keys to .env file (most metabolomics tools work without keys)
Cause: Pipeline queries each metabolite individually Solution: Reports limit to first 10 metabolites; consider batching for >20 metabolites
The Metabolomics Research skill provides comprehensive metabolomics analysis through a 4-phase pipeline that:
Key Features:
operation parameter)Best for:
Metabolite identification starts with the mass spectrum. LOOK UP DON'T GUESS — always search HMDB/PubChem with the calculated neutral mass rather than guessing identity from m/z alone.
Metabolite identification: HMDB IDs provide the strongest annotation when paired with experimental validation. A PubChem-only match (fallback) indicates the metabolite is chemically characterized but may lack biological context (pathways, disease associations). Always report the identification confidence level.
Pathway enrichment strategy: When multiple metabolites map to the same KEGG or HMDB pathway, enrichment is meaningful only if the input list is unbiased (not pre-selected for that pathway). Report hits vs. pathway size (3/5 detected is more informative than 3/500). LOOK UP DON'T GUESS — use HMDB_get_metabolite to get pathway annotations for each metabolite rather than assuming pathway membership from names alone.
Biomarker discovery reasoning: A candidate biomarker should show: (1) consistent direction of change across samples (fold-change > 1.5), (2) statistical significance (FDR-adjusted p < 0.05), (3) biological plausibility — LOOK UP the metabolite's known disease associations via HMDB, and (4) reproducibility in an independent cohort. Single-study HMDB associations are hypothesis-generating, not confirmatory. Check MetaboLights/Metabolomics Workbench for independent validation datasets.
A complete metabolomics report should answer:
Limitations:
See QUICK_START.md for Python SDK examples, MCP integration, and step-by-step tutorials.
tools
PCR / qPCR primer and oligo design — design forward/reverse primers for a target region (SantaLucia nearest-neighbor thermodynamics), compute melting temperature (Tm) and annealing temperature (Ta), check GC content, and screen an oligo for hairpins and primer-dimers. Use when you need primers for a sequence, want to QC an existing primer pair, or need the Tm of an oligo. Covers the primer-design rules (Tm matching, GC clamp, 3'-end, length) and the tools' constraint quirks.
tools
Pharmacokinetic (PK) analysis of concentration-time data — non-compartmental analysis (NCA) for Cmax, Tmax, AUC (0-t and 0-∞), terminal half-life, clearance (CL), volume of distribution (Vd), MRT, and absolute bioavailability (F). Also one-compartment fitting. Use when you have plasma/serum drug concentrations over time after a dose and need PK parameters, or to compute bioavailability from IV + oral AUCs. NOT for ADMET property prediction from structure (use tooluniverse-admet-prediction).
tools
Molecular cloning assembly design — Gibson Assembly (overlap design for seamless multi-fragment joining) and Golden Gate Assembly (Type IIS / BsaI / BbsI design with unique 4-bp fusion overhangs). Use when you need to plan how to join DNA fragments into a construct, design assembly overlaps/overhangs, or decide between cloning methods. Covers the domestication (internal-site removal), overhang-uniqueness, and overlap-Tm rules. For PCR primers to generate the fragments, see tooluniverse-primer-design.
tools
Meta-analysis / evidence synthesis — pool effect sizes across studies (odds ratios, risk ratios, hazard ratios, mean differences, correlations, GWAS betas) with fixed- or random-effects models, quantify heterogeneity (Q, I², τ²), and build a forest plot. Use when you have results from MULTIPLE studies and need a single pooled estimate, or to synthesize evidence from a systematic review / multiple GWAS / replicated experiments. Handles the error-prone effect-size + standard-error preparation (converting OR/HR/CI, two-group means±SD, proportions, and correlations into the (effect, SE) the pooling step needs).