plugin/skills/tooluniverse-plant-genomics/SKILL.md
Plant genomics and biology research — PlantReactome pathways, Ensembl Plants gene structure, POWO species taxonomy, UniProt annotation, KEGG plant pathways. Handles polyploidy (wheat hexaploidy etc.) and homeologous gene copies. Use for crop-gene annotation, plant secondary metabolism queries, and plant-disease/stress-response biology.
npx skillsauth add mims-harvard/tooluniverse tooluniverse-plant-genomicsInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Pipeline for investigating plant genes, metabolic pathways, species taxonomy, and comparative plant biology using ToolUniverse tools.
Plant genomes are large (wheat is ~17 Gb, vs. 3 Gb for human) and often polyploid — wheat is hexaploid (AABBDD), meaning there are three homeologous copies of most genes. When comparing plant genes to Arabidopsis, always account for whole-genome duplications: a single Arabidopsis gene may have 2–4 paralogs in a crop species, all potentially with diverged functions. Gene families are massively expanded in plants relative to animals (e.g., receptor-like kinases, cytochrome P450s, transcription factors) — a BLAST hit does not mean functional equivalence. Arabidopsis thaliana is the primary model, but its small genome and rapid life cycle mean some features (wood formation, nitrogen fixation symbiosis, C4 photosynthesis) are absent and must be studied in other species.
LOOK UP DON'T GUESS: Do not assume gene function by sequence similarity alone in polyploid species; look up functional validation evidence via UniProt (reviewed entries) or PlantReactome. Do not assume KEGG organism codes — use the table or query kegg_search_pathway with the species name to confirm availability.
Key principles:
Not this skill: For general pathway analysis (human/mouse), use tooluniverse-systems-biology. For phylogenetics, use tooluniverse-phylogenetics.
| Tool | Use For |
|------|---------|
| PlantReactome_search_pathways | Search plant-specific pathways by keyword |
| PlantReactome_get_pathway | Get pathway details (genes, reactions, species) |
| PlantReactome_list_species | List all species covered by PlantReactome |
| POWO_search_plants | Search Plants of the World Online (taxonomy, distribution) |
| USDA_plants_get_profile | US plant profile by PLANTS symbol (e.g., symbol="ABBA") — taxonomy, growth habit, duration, native status; use for North American flora |
| USDA_plants_get_characteristics | Morphology/physiology trait records for a PLANTS symbol — use when a question needs growth-form or physiological traits |
| ensembl_lookup_gene | Gene lookup — use with plant species (e.g., species="arabidopsis_thaliana") |
| kegg_search_pathway | Search KEGG pathways (use plant organism codes: ath, osa, zma) |
| KEGG_get_pathway_genes | Get genes in a plant pathway (e.g., pathway_id="ath00941" for flavonoid in Arabidopsis) |
| UniProt_search | Search plant protein sequences (add taxonomy_id:3702 for Arabidopsis) |
| UniProt_get_function_by_accession | Get protein function annotation |
| PubMed_search_articles | Plant biology literature |
| EnsemblCompara_get_orthologues | Cross-species plant gene comparison |
Phase 0: Species & Gene Identification
Species name → POWO taxonomy; Gene symbol → Ensembl/UniProt IDs
|
Phase 1: Gene Function & Annotation
UniProt function, Ensembl annotation, InterPro domains
|
Phase 2: Pathway Analysis
PlantReactome → plant-specific pathways; KEGG → metabolism
|
Phase 3: Cross-Species Comparison
Ensembl Compara → orthologs in other plant species
|
Phase 4: Literature & Report
PubMed → published studies; synthesis
# Look up an Arabidopsis gene
ensembl_lookup_gene(gene_symbol="CHS", species="arabidopsis_thaliana")
# Get protein function
UniProt_search(query="CHS AND taxonomy_id:3702 AND reviewed:true")
Key plant-specific KEGG pathways:
| Pathway | KEGG ID (Arabidopsis) | Biological Significance | |---------|----------------------|------------------------| | Photosynthesis | ath00195 | Light reactions, electron transport | | Carbon fixation (Calvin cycle) | ath00710 | CO2 → sugar | | Flavonoid biosynthesis | ath00941 | UV protection, pigmentation, defense | | Carotenoid biosynthesis | ath00906 | Photoprotection, vitamin A precursors | | Auxin signaling | ath04075 | Growth, tropisms | | Brassinosteroid signaling | ath04712 | Cell elongation, stress response | | Circadian rhythm (plant) | ath04712 | Photoperiod, flowering time | | Terpenoid backbone | ath00900 | Secondary metabolite precursors | | Starch/sucrose metabolism | ath00500 | Carbon partitioning | | Nitrogen metabolism | ath00910 | Nitrogen assimilation |
# Search PlantReactome for flavonoid pathway
PlantReactome_search_pathways(query="flavonoid")
# Get genes in Arabidopsis flavonoid biosynthesis
KEGG_get_pathway_genes(pathway_id="ath00941")
KEGG organism codes for major crops:
| Species | Code | Common Name | |---------|------|-------------| | Arabidopsis thaliana | ath | Thale cress (model plant) | | Oryza sativa | osa | Rice | | Zea mays | zma | Maize/corn | | Triticum aestivum | tae | Wheat | | Glycine max | gmx | Soybean | | Solanum lycopersicum | sly | Tomato | | Nicotiana tabacum | nta | Tobacco | | Medicago truncatula | mtr | Barrel medic (legume model) |
Evidence grading: T1 = mutant phenotype confirms function; T2 = expression/localization data; T3 = ortholog has validated function in model species; T4 = computational annotation only (domain/GO term). Prioritize T1/T2 evidence; treat T3/T4 as hypotheses requiring further validation.
tools
Post-market safety surveillance and recall/adverse-event RETRIEVAL across the full spectrum of FDA-regulated products that are NOT covered by the drug-AE signal skills: medical devices, food / dietary supplements / cosmetics, veterinary drugs, and drug supply (shortages). Orchestrates openFDA endpoints (MAUDE device adverse events + device recalls + 510(k), CAERS food/supplement/ cosmetic adverse events, veterinary adverse events, drug shortages, and cross-product enforcement/recall reports). USE WHEN the user asks: "are there adverse events for [device / pacemaker / infusion pump / insulin pump]", "device recalls for [firm/product]", "supplement / vitamin / cosmetic adverse reactions", "is [drug] in shortage", "what injectables are on shortage", "veterinary / animal adverse events for [drug] in [dog/cat/horse]", "food recall for listeria", "MAUDE report for [device]", "CAERS reactions for [brand]". DO NOT USE for drug adverse-event SIGNAL detection or disproportionality (PRR / ROR / IC) or drug-AE association scoring — that is `tooluniverse-pharmacovigilance` / `tooluniverse-adverse-event-detection`. This skill is multi-product surveillance and retrieval, not drug-AE statistical signal mining.
tools
--- name: tooluniverse-phewas description: Cross-ancestry / cross-biobank phenome-wide association (PheWAS) and replication. Given ONE variant (rsID) or ONE gene, look up every phenotype it associates with across European/UK (UKB-TOPMed), Finnish (FinnGen), Japanese (BioBank Japan), and Taiwanese (TPMI) biobanks, plus exome-wide gene-burden PheWAS (Genebass), then judge whether an association replicates across ancestries or is population-specific. Use whenever the user asks "what else is this va
tools
Dereplicate a putative natural product and assign its chemical taxonomy. Use to answer "is [compound] a known natural product", "what microbe/organism produces [compound]", "what chemical class is [compound]", "dereplicate this metabolite (by formula/exact mass/InChIKey/SMILES)", or "classify this molecule into ChemOnt". Searches NPAtlas for known microbial natural products (producing organism + literature reference), assigns the ChemOnt kingdom→superclass→class→subclass hierarchy via ClassyFire, resolves systematic IUPAC names to structure via OPSIN, and cross-references identity in PubChem. NOT for general drug/compound identity or ADMET (use tooluniverse-chemical-compound-retrieval / tooluniverse-small-molecule-discovery) and NOT for metabolomics pathway/enrichment analysis (use tooluniverse-metabolomics skills).
tools
Genome-ASSEMBLY discovery, QC, and replicon mapping for any organism (bacteria, archaea, fungi, and beyond) using NCBI Datasets. Resolves an organism name or taxid to assemblies, picks the reference/representative or best-quality assembly, pulls assembly QC metrics (total length, contig/scaffold N50, contig count, GC%, assembly level, RefSeq category), enumerates chromosomes and plasmids via per-replicon sequence reports, and compares candidate assemblies on quality. Use for "what genomes are available for [organism]", "assembly stats / N50 / GC content for [GCF_/GCA_ accession]", "how many plasmids does [strain] have", "compare assemblies for [species]", "find the reference genome for [taxon]", "is this assembly Complete Genome or just contigs". NOT for gene-level orthology/synteny (use tooluniverse-comparative-genomics), plant gene structure (use tooluniverse-plant-genomics), de novo assembly from raw reads (no tool exists), or taxonomy-only name/lineage lookups.