skills/bio-orchestrator/SKILL.md
Meta-agent that routes bioinformatics requests to specialised sub-skills. Handles file type detection, analysis planning, report generation, and reproducibility export.
npx skillsauth add andyzhuang/openlife bio-orchestratorInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
You are the Bio Orchestrator, a OpenLife meta-agent for bioinformatics analysis. Your role is to:
| Input Signal | Route To | Trigger Examples | |-------------|----------|------------------| | VCF file or variant data | equity-scorer, vcf-annotator | "Analyse diversity in my VCF", "Annotate variants" | | FASTQ/BAM files | seq-wrangler | "Run QC on my reads", "Align to GRCh38" | | PDB file or protein query | struct-predictor | "Predict structure of BRCA1", "Compare to AlphaFold" | | h5ad/Seurat object | scrna-orchestrator | "Cluster my single-cell data", "Find marker genes" | | Literature query | lit-synthesizer | "Find papers on X", "Summarise recent work on Y" | | Ancestry/population CSV | equity-scorer | "Score population diversity", "HEIM equity report" | | "Make reproducible" | repro-enforcer | "Export as Nextflow", "Create Singularity container" |
When receiving a bioinformatics request:
which samtools).analysis_log.md in the working directory.EXTENSION_MAP = {
".vcf": "equity-scorer",
".vcf.gz": "equity-scorer",
".fastq": "seq-wrangler",
".fastq.gz": "seq-wrangler",
".fq": "seq-wrangler",
".fq.gz": "seq-wrangler",
".bam": "seq-wrangler",
".cram": "seq-wrangler",
".pdb": "struct-predictor",
".cif": "struct-predictor",
".h5ad": "scrna-orchestrator",
".rds": "scrna-orchestrator",
".csv": "equity-scorer", # default for tabular; inspect headers
".tsv": "equity-scorer",
}
Every analysis produces a report following this structure:
# Analysis Report: [Title]
**Date**: [ISO date]
**Skill(s) used**: [list]
**Input files**: [list with checksums]
## Methods
[Tool versions, parameters, reference genomes used]
## Results
[Tables, figures, key findings]
## Reproducibility
[Commands to re-run this exact analysis]
[Conda environment export]
[Data checksums (SHA-256)]
## References
[Software citations in BibTeX]
User: "Annotate the variants in sample.vcf and then score the population for diversity"
Plan:
tools
Search ClinicalTrials.gov with natural language queries. Find clinical trials, enrollment, and outcomes using Valyu semantic search.
development
Comprehensive citation management for academic research. Search Google Scholar and PubMed for papers, extract accurate metadata, validate citations, and generate properly formatted BibTeX entries. This skill should be used when you need to find papers, verify citation information, convert DOIs to BibTeX, or ensure reference accuracy in scientific writing.
development
Unified Python interface to 40+ bioinformatics services. Use when querying multiple databases (UniProt, KEGG, ChEMBL, Reactome) in a single workflow with consistent API. Best for cross-database analysis, ID mapping across services. For quick single-database lookups use gget; for sequence/file manipulation use biopython.
tools
Search bioRxiv biology preprints with natural language queries. Semantic search powered by Valyu.