skills/tooluniverse-precision-medicine-stratification/SKILL.md
Patient stratification for precision medicine — integrate genomic, clinical, and therapeutic data to split patients into responder/non-responder groups, risk tiers, or treatment-decision groups. Use for stratification-by-biomarker, treatment-selection logic, and personalized therapeutic strategy reports per patient subgroup.
npx skillsauth add mims-harvard/tooluniverse tooluniverse-precision-medicine-stratificationInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Transform patient genomic and clinical profiles into actionable risk stratification, treatment recommendations, and personalized therapeutic strategies.
Stratification means splitting patients into groups that respond differently to a treatment or have different prognoses. Ask these questions before running any tools:
Route to the correct Phase 3 path BEFORE running Phase 2 tools — cancer, metabolic, CVD, rare disease, and autoimmune pipelines require different stratifiers.
LOOK UP DON'T GUESS: Never assume a variant is pathogenic, never assume a gene is relevant to a disease, never assign metabolizer status without PharmGKB or CPIC evidence.
KEY PRINCIPLES:
Reference files (same directory):
TOOLS_REFERENCE.md - Tool parameters, response formats, phase-by-phase tool listsSCORING_REFERENCE.md - Scoring matrices, risk tiers, pathogenicity tables, PGx tablesREPORT_TEMPLATE.md - Output report template, treatment algorithms, completeness requirementsEXAMPLES.md - Six worked examples (cancer, metabolic, NSCLC, CVD, rare, neuro)QUICK_START.md - Sample prompts and output summaryWhen analysis requires computation (statistics, data processing, scoring, enrichment), write and run Python code via Bash. Don't describe what you would do — execute it and report actual results. Use ToolUniverse tools to retrieve data, then Python (pandas, scipy, statsmodels, matplotlib) to analyze it.
Apply when user asks about patient risk stratification, treatment selection, prognosis prediction, or personalized therapeutic strategy for any disease with genomic/clinical data.
NOT for (use other skills instead):
tooluniverse-variant-interpretationtooluniverse-immunotherapy-response-predictiontooluniverse-adverse-event-detectiontooluniverse-drug-target-validationtooluniverse-clinical-trial-matchingtooluniverse-drug-drug-interactiontooluniverse-polygenic-risk-scoreClassify into one category (determines Phase 3 routing):
| Category | Examples | |----------|----------| | CANCER | Breast, lung, colorectal, melanoma | | METABOLIC | Type 2 diabetes, obesity, NAFLD | | CARDIOVASCULAR | CAD, heart failure, AF | | NEUROLOGICAL | Alzheimer, Parkinson, epilepsy | | RARE/MONOGENIC | Marfan, CF, sickle cell, Huntington | | AUTOIMMUNE | RA, lupus, MS, Crohn's |
See TOOLS_REFERENCE.md for full details. Key gotchas:
query (NOT q)variant_id (NOT rsid)species='homo_sapiens'query, case_sensitive, exact_match, limitgene_list is a STRING (space-separated), not array{articles: [...]}limit=1000 for all results{data: {entity: {field: ...}}} structurePhase 1: Disease Disambiguation & Profile Standardization
Phase 2: Genetic Risk Assessment
Phase 3: Disease-Specific Molecular Stratification (routes by disease type)
Phase 4: Pharmacogenomic Profiling
Phase 5: Comorbidity & Drug Interaction Risk
Phase 6: Molecular Pathway Analysis
Phase 7: Clinical Evidence & Guidelines
Phase 8: Clinical Trial Matching
Phase 9: Integrated Scoring & Recommendations
OpenTargets_get_disease_id_description_by_nameMyGene_query_genes to get Ensembl/Entrez IDsClinVar_search_variants, EnsemblVEP_annotate_rsid/_hgvsOpenTargets_target_disease_evidencegwas_get_associations_for_trait, OpenTargets_search_gwas_studies_by_diseasegnomad_get_variantgnomad_get_gene_constraints (pLI, LOEUF scores)Scoring: See SCORING_REFERENCE.md for genetic risk score component (0-35 points).
cBioPortal_get_mutations, HPA_get_cancer_prognostics_by_genefda_pharmacogenomic_biomarkers for FDA cutoffsGWAS_search_associations_by_gene, OpenTargets_target_disease_evidenceClinVar_search_variants for LDLR, APOB, PCSK9PharmGKB_get_clinical_annotations for SLCO1B1ClinVar_search_variantsUniProt_get_disease_variants_by_accessionScoring: See SCORING_REFERENCE.md for disease-specific tables.
PharmGKB_get_clinical_annotations, PharmGKB_get_dosing_guidelinesfda_pharmacogenomic_biomarkers (use limit=1000)PharmGKB_get_drug_detailsScoring: See SCORING_REFERENCE.md for PGx risk score (0-10 points).
OpenTargets_get_associated_targets_by_disease_efoIddrugbank_get_drug_interactions_by_drug_name_or_id, FDA_get_drug_interactions_by_drug_nameenrichr_gene_enrichment_analysis (libs: KEGG_2021_Human, Reactome_2022, GO_Biological_Process_2023)ReactomeAnalysis_pathway_enrichment, Reactome_map_uniprot_to_pathwaysSTRING_get_interaction_partners, STRING_functional_enrichmentOpenTargets_get_target_tractability_by_ensemblIDPubMed_Guidelines_Search (fallback: PubMed_search_articles)OpenTargets_get_associated_drugs_by_disease_efoId, FDA_get_indications_by_drug_namecivic_search_evidence_items, civic_search_assertionssearch_clinical_trials with condition + interventionsearch_clinical_trials for basket/umbrella trials| Score | Tier | Management | |-------|------|------------| | 75-100 | VERY HIGH | Intensive treatment, subspecialty referral, clinical trial | | 50-74 | HIGH | Aggressive treatment, close monitoring | | 25-49 | INTERMEDIATE | Standard guideline-based care, PGx-guided dosing | | 0-24 | LOW | Surveillance, prevention, risk factor modification |
Generate report per REPORT_TEMPLATE.md. See SCORING_REFERENCE.md for detailed scoring matrices.
See EXAMPLES.md for six detailed worked examples:
tools
Post-market safety surveillance and recall/adverse-event RETRIEVAL across the full spectrum of FDA-regulated products that are NOT covered by the drug-AE signal skills: medical devices, food / dietary supplements / cosmetics, veterinary drugs, and drug supply (shortages). Orchestrates openFDA endpoints (MAUDE device adverse events + device recalls + 510(k), CAERS food/supplement/ cosmetic adverse events, veterinary adverse events, drug shortages, and cross-product enforcement/recall reports). USE WHEN the user asks: "are there adverse events for [device / pacemaker / infusion pump / insulin pump]", "device recalls for [firm/product]", "supplement / vitamin / cosmetic adverse reactions", "is [drug] in shortage", "what injectables are on shortage", "veterinary / animal adverse events for [drug] in [dog/cat/horse]", "food recall for listeria", "MAUDE report for [device]", "CAERS reactions for [brand]". DO NOT USE for drug adverse-event SIGNAL detection or disproportionality (PRR / ROR / IC) or drug-AE association scoring — that is `tooluniverse-pharmacovigilance` / `tooluniverse-adverse-event-detection`. This skill is multi-product surveillance and retrieval, not drug-AE statistical signal mining.
tools
--- name: tooluniverse-phewas description: Cross-ancestry / cross-biobank phenome-wide association (PheWAS) and replication. Given ONE variant (rsID) or ONE gene, look up every phenotype it associates with across European/UK (UKB-TOPMed), Finnish (FinnGen), Japanese (BioBank Japan), and Taiwanese (TPMI) biobanks, plus exome-wide gene-burden PheWAS (Genebass), then judge whether an association replicates across ancestries or is population-specific. Use whenever the user asks "what else is this va
tools
Dereplicate a putative natural product and assign its chemical taxonomy. Use to answer "is [compound] a known natural product", "what microbe/organism produces [compound]", "what chemical class is [compound]", "dereplicate this metabolite (by formula/exact mass/InChIKey/SMILES)", or "classify this molecule into ChemOnt". Searches NPAtlas for known microbial natural products (producing organism + literature reference), assigns the ChemOnt kingdom→superclass→class→subclass hierarchy via ClassyFire, resolves systematic IUPAC names to structure via OPSIN, and cross-references identity in PubChem. NOT for general drug/compound identity or ADMET (use tooluniverse-chemical-compound-retrieval / tooluniverse-small-molecule-discovery) and NOT for metabolomics pathway/enrichment analysis (use tooluniverse-metabolomics skills).
tools
Genome-ASSEMBLY discovery, QC, and replicon mapping for any organism (bacteria, archaea, fungi, and beyond) using NCBI Datasets. Resolves an organism name or taxid to assemblies, picks the reference/representative or best-quality assembly, pulls assembly QC metrics (total length, contig/scaffold N50, contig count, GC%, assembly level, RefSeq category), enumerates chromosomes and plasmids via per-replicon sequence reports, and compares candidate assemblies on quality. Use for "what genomes are available for [organism]", "assembly stats / N50 / GC content for [GCF_/GCA_ accession]", "how many plasmids does [strain] have", "compare assemblies for [species]", "find the reference genome for [taxon]", "is this assembly Complete Genome or just contigs". NOT for gene-level orthology/synteny (use tooluniverse-comparative-genomics), plant gene structure (use tooluniverse-plant-genomics), de novo assembly from raw reads (no tool exists), or taxonomy-only name/lineage lookups.