plugin/skills/tooluniverse-functional-genomics-screens/SKILL.md
Interpret hits from CRISPR-KO/CRISPRi/shRNA screens by integrating DepMap essentiality, gnomAD constraint scores, pathway context (Reactome, STRING), druggability (DGIdb), and clinical evidence (CIViC, COSMIC). Use for screen-hit prioritization, essentiality ranking, and turning a list of screen hits into a prioritized target shortlist.
npx skillsauth add mims-harvard/tooluniverse tooluniverse-functional-genomics-screensInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Pipeline for validating and prioritizing hits from genetic screens (CRISPR-KO, CRISPRi, shRNA) by integrating essentiality (DepMap), constraint (gnomAD), pathways (Reactome, STRING), druggability (DGIdb), and clinical evidence (CIViC, COSMIC).
Guiding principles:
When uncertain about any scientific fact, SEARCH databases first.
When analysis requires computation (statistics, data processing, scoring, enrichment), write and run Python code via Bash. Don't describe what you would do — execute it and report actual results. Use ToolUniverse tools to retrieve data, then Python (pandas, scipy, statsmodels, matplotlib) to analyze it.
Phase 0: Input Processing → gene list, screen type, cell line, disease context
Phase 1: Hit Validation → DepMap dependency, gnomAD constraint, UniProt function
Phase 2: Pathway & Network → Reactome enrichment, STRING network, functional clusters
Phase 3: Druggability → DGIdb interactions, druggable categories, PharmacoDB
Phase 4: Clinical Evidence → CIViC, COSMIC mutations
Phase 5: Literature → PubMed for key hits
Phase 6: Prioritized Report → ranked target list with multi-dimensional scoring
Tools:
DepMap_get_gene_dependencies(gene_symbol=...) -- returns gene metadata only (NOT per-cell-line scores)DepMap_search_cell_lines(query=...) -- cell line metadatagnomad_get_gene_constraints(gene_symbol=...) -- pLI, LOEUF (may return "Service overloaded")UniProt_get_function_by_accession(accession=...) -- function summaryClassification: Pan-essential (>90% lines), Selectively essential (specific lineages), Context-specific (screen model only). Chronos < -0.5 = likely essential, < -1.0 = strongly essential.
DepMap per-cell-line Chronos scores: DepMap_get_gene_dependencies returns metadata only. For the actual per-cell-line scores, use the bundled script in the cell-line-profiling skill — tooluniverse-cell-line-profiling/scripts/depmap_gene_dependency.py (downloads the current DepMap Public CRISPRGeneEffect.csv once, cached; queries by gene or cell-line):
python depmap_gene_dependency.py gene KRAS --lineage Lung --top 20 # most-dependent lines
python depmap_gene_dependency.py cell-line A375 --top 25 # genes the line needs
Chronos < -0.5 ≈ dependency, < -1.0 strongly essential. Fallback if you can't run it: gnomAD constraint + PubMed_search_articles(query="[gene] CRISPR screen [cancer]").
ReactomeAnalysis_pathway_enrichment(identifiers="TP53 BRCA1 EGFR") -- space-separated stringSTRING_get_network(identifiers="GENE1\rGENE2\rGENE3", species=9606) -- carriage-return separatedSTRING_functional_enrichment(identifiers=..., species=9606) -- GO/KEGG enrichmentDGIdb_get_drug_gene_interactions(genes=["EGFR","BRAF"]) -- drug-gene interactionsDGIdb_get_gene_druggability(genes=[...]) -- categories (kinase, GPCR, etc.)search_clinical_trials and PubMed for novel inhibitors not yet in DGIdb.civic_search_evidence_items(molecular_profile=gene) -- NOT queryCOSMIC_get_mutations_by_gene(gene_name=...) -- somatic mutation frequencyScoring (0-18):
| Criterion | Score 3 | Score 0 | |-----------|---------|---------| | Selective essentiality | <-0.5 in disease AND >-0.2 elsewhere | >-0.2 (not essential) | | Pathway convergence | 3+ hits same pathway | Isolated hit | | Druggability | Approved drug exists | Not druggable | | Clinical evidence | CIViC therapeutic | No clinical data | | Constraint | pLI >0.9 | No data | | Literature | Multiple validation studies | No publications |
Tiers: T1 (15-18) high-confidence, T2 (10-14) promising, T3 (5-9) speculative, T4 (<5) likely false positive.
tools
Post-market safety surveillance and recall/adverse-event RETRIEVAL across the full spectrum of FDA-regulated products that are NOT covered by the drug-AE signal skills: medical devices, food / dietary supplements / cosmetics, veterinary drugs, and drug supply (shortages). Orchestrates openFDA endpoints (MAUDE device adverse events + device recalls + 510(k), CAERS food/supplement/ cosmetic adverse events, veterinary adverse events, drug shortages, and cross-product enforcement/recall reports). USE WHEN the user asks: "are there adverse events for [device / pacemaker / infusion pump / insulin pump]", "device recalls for [firm/product]", "supplement / vitamin / cosmetic adverse reactions", "is [drug] in shortage", "what injectables are on shortage", "veterinary / animal adverse events for [drug] in [dog/cat/horse]", "food recall for listeria", "MAUDE report for [device]", "CAERS reactions for [brand]". DO NOT USE for drug adverse-event SIGNAL detection or disproportionality (PRR / ROR / IC) or drug-AE association scoring — that is `tooluniverse-pharmacovigilance` / `tooluniverse-adverse-event-detection`. This skill is multi-product surveillance and retrieval, not drug-AE statistical signal mining.
tools
--- name: tooluniverse-phewas description: Cross-ancestry / cross-biobank phenome-wide association (PheWAS) and replication. Given ONE variant (rsID) or ONE gene, look up every phenotype it associates with across European/UK (UKB-TOPMed), Finnish (FinnGen), Japanese (BioBank Japan), and Taiwanese (TPMI) biobanks, plus exome-wide gene-burden PheWAS (Genebass), then judge whether an association replicates across ancestries or is population-specific. Use whenever the user asks "what else is this va
tools
Dereplicate a putative natural product and assign its chemical taxonomy. Use to answer "is [compound] a known natural product", "what microbe/organism produces [compound]", "what chemical class is [compound]", "dereplicate this metabolite (by formula/exact mass/InChIKey/SMILES)", or "classify this molecule into ChemOnt". Searches NPAtlas for known microbial natural products (producing organism + literature reference), assigns the ChemOnt kingdom→superclass→class→subclass hierarchy via ClassyFire, resolves systematic IUPAC names to structure via OPSIN, and cross-references identity in PubChem. NOT for general drug/compound identity or ADMET (use tooluniverse-chemical-compound-retrieval / tooluniverse-small-molecule-discovery) and NOT for metabolomics pathway/enrichment analysis (use tooluniverse-metabolomics skills).
tools
Genome-ASSEMBLY discovery, QC, and replicon mapping for any organism (bacteria, archaea, fungi, and beyond) using NCBI Datasets. Resolves an organism name or taxid to assemblies, picks the reference/representative or best-quality assembly, pulls assembly QC metrics (total length, contig/scaffold N50, contig count, GC%, assembly level, RefSeq category), enumerates chromosomes and plasmids via per-replicon sequence reports, and compares candidate assemblies on quality. Use for "what genomes are available for [organism]", "assembly stats / N50 / GC content for [GCF_/GCA_ accession]", "how many plasmids does [strain] have", "compare assemblies for [species]", "find the reference genome for [taxon]", "is this assembly Complete Genome or just contigs". NOT for gene-level orthology/synteny (use tooluniverse-comparative-genomics), plant gene structure (use tooluniverse-plant-genomics), de novo assembly from raw reads (no tool exists), or taxonomy-only name/lineage lookups.