Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

mims-harvard/tooluniverse-gwas-study-explorer

Name: tooluniverse-gwas-study-explorer
Author: mims-harvard

skills/tooluniverse-gwas-study-explorer/SKILL.md

npx skillsauth add mims-harvard/tooluniverse tooluniverse-gwas-study-explorer

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

GWAS Study Deep Dive & Meta-Analysis

Compare GWAS studies, perform meta-analyses, and assess replication across cohorts

Overview

The GWAS Study Deep Dive & Meta-Analysis skill enables comprehensive comparison of genome-wide association studies (GWAS) for the same trait, meta-analysis of genetic loci across studies, and systematic assessment of replication and study quality. It integrates data from the NHGRI-EBI GWAS Catalog and Open Targets Genetics to provide a complete picture of the genetic architecture of complex traits.

Key Capabilities

Study Comparison: Compare all GWAS studies for a trait, assessing sample sizes, ancestries, and platforms
Meta-Analysis: Aggregate effect sizes across studies and calculate heterogeneity statistics
Replication Assessment: Identify replicated vs novel findings across discovery and replication cohorts
Quality Evaluation: Assess statistical power, ancestry diversity, and data availability

COMPUTE, DON'T DESCRIBE

When analysis requires computation (statistics, data processing, scoring, enrichment), write and run Python code via Bash. Don't describe what you would do — execute it and report actual results. Use ToolUniverse tools to retrieve data, then Python (pandas, scipy, statsmodels, matplotlib) to analyze it.

Domain Reasoning: Comparing Studies for the Same Trait

When comparing GWAS studies for the same trait, ask: do they replicate? The same lead SNPs appearing in independent studies is strong evidence of a true association. Different lead SNPs at the same locus may reflect LD differences between populations — they may tag the same causal variant. Different loci entirely may reflect different study designs, phenotype definitions, or population ancestry. Before concluding that a finding failed to replicate, check whether the SNP was even genotyped or imputed in the replication cohort.

LOOK UP DON'T GUESS: effect sizes, p-values, allele frequencies, and LD structure for specific loci. Do not assume a SNP present in one study is present in another — use gwas_get_associations_for_snp to retrieve cross-study data. Do not infer LD blocks from genomic proximity; use credible sets from Open Targets for fine-mapping results.

Use Cases

1. Comprehensive Trait Analysis

Scenario: "I want to understand all available GWAS data for type 2 diabetes"

Workflow:

Search for all T2D studies in GWAS Catalog
Filter by sample size and ancestry
Extract top associations from each study
Identify consistently replicated loci
Assess ancestry-specific effects

Outcome: Complete landscape of T2D genetics with replicated findings and population-specific signals

2. Locus-Specific Meta-Analysis

Scenario: "Is the TCF7L2 association with T2D consistent across all studies?"

Workflow:

Retrieve all TCF7L2 (rs7903146) associations for T2D
Calculate combined effect size and p-value
Assess heterogeneity (I² statistic)
Generate forest plot data
Interpret heterogeneity level

Outcome: Quantitative assessment of effect size consistency with heterogeneity interpretation

Honesty rule (important): A real inverse-variance meta-analysis needs each study's beta + 95% CI. python_implementation.py parses these from the GWAS Catalog beta/or_value + range fields and only then pools effect sizes and computes Cochran's-Q I². When the matched associations don't report usable effect sizes (common), it returns method="descriptive", combined_beta=None, heterogeneity_i2=None, and combined_p_value = the smallest reported p (not a pooled p) — do NOT present a descriptive result as a formal meta-analysis or invent an I².

3. Replication Analysis

Scenario: "Which findings from the discovery cohort replicated in the independent sample?"

Workflow:

Get top hits from discovery study
Check for presence and significance in replication study
Assess direction consistency
Calculate replication rate
Identify novel vs failed replication

Outcome: Systematic replication report with success rates and failed findings

4. Multi-Ancestry Comparison

Scenario: "Are T2D loci consistent across European and East Asian populations?"

Workflow:

Filter studies by ancestry
Compare top associations between populations
Identify shared vs population-specific loci
Assess allele frequency differences
Evaluate transferability of genetic risk scores

Outcome: Ancestry-specific genetic architecture with transferability assessment

Statistical Methods

Meta-Analysis Approach

This skill implements standard GWAS meta-analysis methods:

Fixed-Effects Model:

Used when heterogeneity is low (I² < 25%)
Weights studies by inverse variance
Assumes true effect size is the same across studies

Random-Effects Model (recommended when I² > 50%):

Accounts for between-study variation
More conservative than fixed-effects
Better for diverse ancestries or methodologies

Heterogeneity Assessment:

The I² statistic measures the percentage of variance due to between-study heterogeneity:

I² = [(Q - df) / Q] × 100%

where Q = Cochran's Q statistic
      df = degrees of freedom (n_studies - 1)

Interpretation Guidelines:

I² < 25%: Low heterogeneity → fixed-effects appropriate
I² = 25-50%: Moderate heterogeneity → investigate sources
I² = 50-75%: Substantial heterogeneity → random-effects preferred
I² > 75%: Considerable heterogeneity → meta-analysis may not be appropriate

Sources of Heterogeneity

Common reasons for high I²:

Ancestry differences: Different allele frequencies and LD structure
Phenotype heterogeneity: Trait definition varies across studies
Platform differences: Imputation quality and coverage
Winner's curse: Discovery studies overestimate effect sizes
Cohort characteristics: Age, sex, environmental factors

Recommendations:

Perform subgroup analysis by ancestry
Use meta-regression to investigate sources
Consider excluding outlier studies
Apply genomic control correction

Study Quality Assessment

Quality Metrics

The skill evaluates studies based on:

1. Sample Size:

Power to detect associations (80% power requires n > 10,000 for OR=1.2)
Precision of effect size estimates
Ability to detect modest effects

2. Ancestry Diversity:

Single-ancestry vs multi-ancestry
Population stratification control
Transferability of findings

3. Data Availability:

Summary statistics available for meta-analysis
Individual-level data vs summary-level
Imputation quality scores

4. Genotyping Quality:

Platform density and coverage
Imputation reference panel
Quality control measures

5. Statistical Rigor:

Genome-wide significance threshold (p < 5×10⁻⁸)
Multiple testing correction
Replication in independent cohort

Quality Tiers

Tier 1 (High Quality):

n ≥ 50,000
Summary statistics available
Multi-ancestry or large single-ancestry
Imputed to high-quality reference
Independent replication

Tier 2 (Moderate Quality):

n ≥ 10,000
Standard GWAS platform
Adequate power for common variants
Some data availability

Tier 3 (Limited):

n < 10,000
Limited power
May miss modest effects
Use with caution

Best Practices

Before Meta-Analysis

Check phenotype consistency: Ensure studies measure the same trait
Verify ancestry overlap: High heterogeneity expected if ancestries differ
Harmonize alleles: Align effect alleles across studies
Quality control: Exclude low-quality studies or associations

Interpreting Results

Genome-wide significance: p < 5×10⁻⁸ (Bonferroni for ~1M independent tests)
Replication threshold: p < 0.05 in independent cohort
Direction consistency: Effect should be same direction across studies
Heterogeneity: I² > 50% suggests caution in interpretation

Common Pitfalls

❌ Don't:

Meta-analyze without checking heterogeneity
Ignore ancestry differences
Over-interpret nominal p-values
Assume replication failure means false positive

✅ Do:

Always report I² statistic
Perform sensitivity analyses
Consider ancestry-stratified analysis
Account for winner's curse in discovery studies

Limitations & Caveats

Data Limitations

Incomplete Overlap: Studies may analyze different SNPs
Cohort Overlap: Some cohorts participate in multiple studies (inflates significance)
Publication Bias: Significant findings more likely to be published
Winner's Curse: Discovery studies overestimate effect sizes
Imputation Quality: Varies across studies and populations

Statistical Limitations

Heterogeneity: High I² may preclude meaningful meta-analysis
Sample Size Differences: Large studies dominate fixed-effects models
Allele Frequency Differences: Same variant has different effects across ancestries
Linkage Disequilibrium: Fine-mapping needed to identify causal variants
Gene-Environment Interactions: Not captured in standard meta-analysis

Interpretation Guidelines

When I² > 75%:

Meta-analysis results should be interpreted with extreme caution
Investigate sources of heterogeneity systematically
Consider ancestry-specific or subgroup analyses
Descriptive comparison may be more appropriate than meta-analysis

When Studies Conflict:

Check for methodological differences
Verify phenotype definitions match
Investigate population stratification
Consider conditional analysis

Tools Used

GWAS Catalog API

gwas_search_studies: Find studies by trait
gwas_get_study_by_id: Get detailed study metadata
gwas_get_associations_for_study: Retrieve study associations
gwas_get_associations_for_snp: Get SNP associations across studies
gwas_search_associations: Search associations by trait

Open Targets Genetics GraphQL API

OpenTargets_search_gwas_studies_by_disease: Disease-based study search
OpenTargets_get_gwas_study: Detailed study information with LD populations
OpenTargets_get_variant_credible_sets: Fine-mapped loci for variant
OpenTargets_get_study_credible_sets: All credible sets for study
OpenTargets_get_variant_info: Variant annotation and allele frequencies

Glossary

Credible Set: Set of variants likely to contain the causal variant (from fine-mapping)

L2G (Locus-to-Gene): Score predicting which gene is affected by a GWAS locus License: Open source (MIT)

mims-harvard/tooluniverse-gwas-study-explorer

skills/tooluniverse-gwas-study-explorer/SKILL.md

Compare GWAS studies, perform meta-analyses across cohorts, and assess signal replication. Uses GWAS Catalog metadata, study-level statistics, and cross-cohort comparison. Use for evaluating GWAS reproducibility for a trait, meta-analysis sample size and effect-size aggregation, and detecting study heterogeneity (population, design, ancestry).

1,429 stars

tools

Updated Jun 7, 2026

$ install --global

skillsauth

npx skillsauth add mims-harvard/tooluniverse tooluniverse-gwas-study-explorer

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Jun 7, 2026, 6:28 AM165.6s3 files scanned

SKILL.md

name:: tooluniverse-gwas-study-explorer
description:: Compare GWAS studies, perform meta-analyses across cohorts, and assess signal replication. Uses GWAS Catalog metadata, study-level statistics, and cross-cohort comparison. Use for evaluating GWAS reproducibility for a trait, meta-analysis sample size and effect-size aggregation, and detecting study heterogeneity (population, design, ancestry).
disable-model-invocation:: true

GWAS Study Deep Dive & Meta-Analysis

Compare GWAS studies, perform meta-analyses, and assess replication across cohorts

Overview

Key Capabilities

Study Comparison: Compare all GWAS studies for a trait, assessing sample sizes, ancestries, and platforms
Meta-Analysis: Aggregate effect sizes across studies and calculate heterogeneity statistics
Replication Assessment: Identify replicated vs novel findings across discovery and replication cohorts
Quality Evaluation: Assess statistical power, ancestry diversity, and data availability

COMPUTE, DON'T DESCRIBE

Domain Reasoning: Comparing Studies for the Same Trait

Use Cases

1. Comprehensive Trait Analysis

Scenario: "I want to understand all available GWAS data for type 2 diabetes"

Workflow:

Search for all T2D studies in GWAS Catalog
Filter by sample size and ancestry
Extract top associations from each study
Identify consistently replicated loci
Assess ancestry-specific effects

Outcome: Complete landscape of T2D genetics with replicated findings and population-specific signals

2. Locus-Specific Meta-Analysis

Scenario: "Is the TCF7L2 association with T2D consistent across all studies?"

Workflow:

Retrieve all TCF7L2 (rs7903146) associations for T2D
Calculate combined effect size and p-value
Assess heterogeneity (I² statistic)
Generate forest plot data
Interpret heterogeneity level

Outcome: Quantitative assessment of effect size consistency with heterogeneity interpretation

Honesty rule (important): A real inverse-variance meta-analysis needs each study's beta + 95% CI. python_implementation.py parses these from the GWAS Catalog beta/or_value + range fields and only then pools effect sizes and computes Cochran's-Q I². When the matched associations don't report usable effect sizes (common), it returns method="descriptive", combined_beta=None, heterogeneity_i2=None, and combined_p_value = the smallest reported p (not a pooled p) — do NOT present a descriptive result as a formal meta-analysis or invent an I².

3. Replication Analysis

Scenario: "Which findings from the discovery cohort replicated in the independent sample?"

Workflow:

Get top hits from discovery study
Check for presence and significance in replication study
Assess direction consistency
Calculate replication rate
Identify novel vs failed replication

Outcome: Systematic replication report with success rates and failed findings

4. Multi-Ancestry Comparison

Scenario: "Are T2D loci consistent across European and East Asian populations?"

Workflow:

Filter studies by ancestry
Compare top associations between populations
Identify shared vs population-specific loci
Assess allele frequency differences
Evaluate transferability of genetic risk scores

Outcome: Ancestry-specific genetic architecture with transferability assessment

Statistical Methods

Meta-Analysis Approach

This skill implements standard GWAS meta-analysis methods:

Fixed-Effects Model:

Used when heterogeneity is low (I² < 25%)
Weights studies by inverse variance
Assumes true effect size is the same across studies

Random-Effects Model (recommended when I² > 50%):

Accounts for between-study variation
More conservative than fixed-effects
Better for diverse ancestries or methodologies

Heterogeneity Assessment:

The I² statistic measures the percentage of variance due to between-study heterogeneity:

I² = [(Q - df) / Q] × 100%

where Q = Cochran's Q statistic
      df = degrees of freedom (n_studies - 1)

Interpretation Guidelines:

I² < 25%: Low heterogeneity → fixed-effects appropriate
I² = 25-50%: Moderate heterogeneity → investigate sources
I² = 50-75%: Substantial heterogeneity → random-effects preferred
I² > 75%: Considerable heterogeneity → meta-analysis may not be appropriate

Sources of Heterogeneity

Common reasons for high I²:

Ancestry differences: Different allele frequencies and LD structure
Phenotype heterogeneity: Trait definition varies across studies
Platform differences: Imputation quality and coverage
Winner's curse: Discovery studies overestimate effect sizes
Cohort characteristics: Age, sex, environmental factors

Recommendations:

Perform subgroup analysis by ancestry
Use meta-regression to investigate sources
Consider excluding outlier studies
Apply genomic control correction

Study Quality Assessment

Quality Metrics

The skill evaluates studies based on:

1. Sample Size:

Power to detect associations (80% power requires n > 10,000 for OR=1.2)
Precision of effect size estimates
Ability to detect modest effects

2. Ancestry Diversity:

Single-ancestry vs multi-ancestry
Population stratification control
Transferability of findings

3. Data Availability:

Summary statistics available for meta-analysis
Individual-level data vs summary-level
Imputation quality scores

4. Genotyping Quality:

Platform density and coverage
Imputation reference panel
Quality control measures

5. Statistical Rigor:

Genome-wide significance threshold (p < 5×10⁻⁸)
Multiple testing correction
Replication in independent cohort

Quality Tiers

Tier 1 (High Quality):

n ≥ 50,000
Summary statistics available
Multi-ancestry or large single-ancestry
Imputed to high-quality reference
Independent replication

Tier 2 (Moderate Quality):

n ≥ 10,000
Standard GWAS platform
Adequate power for common variants
Some data availability

Tier 3 (Limited):

n < 10,000
Limited power
May miss modest effects
Use with caution

Best Practices

Before Meta-Analysis

Check phenotype consistency: Ensure studies measure the same trait
Verify ancestry overlap: High heterogeneity expected if ancestries differ
Harmonize alleles: Align effect alleles across studies
Quality control: Exclude low-quality studies or associations

Interpreting Results

Genome-wide significance: p < 5×10⁻⁸ (Bonferroni for ~1M independent tests)
Replication threshold: p < 0.05 in independent cohort
Direction consistency: Effect should be same direction across studies
Heterogeneity: I² > 50% suggests caution in interpretation

Common Pitfalls

❌ Don't:

Meta-analyze without checking heterogeneity
Ignore ancestry differences
Over-interpret nominal p-values
Assume replication failure means false positive

✅ Do:

Always report I² statistic
Perform sensitivity analyses
Consider ancestry-stratified analysis
Account for winner's curse in discovery studies

Limitations & Caveats

Data Limitations

Incomplete Overlap: Studies may analyze different SNPs
Cohort Overlap: Some cohorts participate in multiple studies (inflates significance)
Publication Bias: Significant findings more likely to be published
Winner's Curse: Discovery studies overestimate effect sizes
Imputation Quality: Varies across studies and populations

Statistical Limitations

Heterogeneity: High I² may preclude meaningful meta-analysis
Sample Size Differences: Large studies dominate fixed-effects models
Allele Frequency Differences: Same variant has different effects across ancestries
Linkage Disequilibrium: Fine-mapping needed to identify causal variants
Gene-Environment Interactions: Not captured in standard meta-analysis

Interpretation Guidelines

When I² > 75%:

Meta-analysis results should be interpreted with extreme caution
Investigate sources of heterogeneity systematically
Consider ancestry-specific or subgroup analyses
Descriptive comparison may be more appropriate than meta-analysis

When Studies Conflict:

Check for methodological differences
Verify phenotype definitions match
Investigate population stratification
Consider conditional analysis

Tools Used

GWAS Catalog API

gwas_search_studies: Find studies by trait
gwas_get_study_by_id: Get detailed study metadata
gwas_get_associations_for_study: Retrieve study associations
gwas_get_associations_for_snp: Get SNP associations across studies
gwas_search_associations: Search associations by trait

Open Targets Genetics GraphQL API

OpenTargets_search_gwas_studies_by_disease: Disease-based study search
OpenTargets_get_gwas_study: Detailed study information with LD populations
OpenTargets_get_variant_credible_sets: Fine-mapped loci for variant
OpenTargets_get_study_credible_sets: All credible sets for study
OpenTargets_get_variant_info: Variant annotation and allele frequencies

Glossary

Credible Set: Set of variants likely to contain the causal variant (from fine-mapping)

L2G (Locus-to-Gene): Score predicting which gene is affected by a GWAS locus License: Open source (MIT)

Related Skills

mims-harvard/tooluniverse-self-review

tools

VerifiedTrustedCommunity

Generate the success criteria for a task or question, then review work against them. Given a task, goal, or open-ended question, decompose it into scenarios, evaluation perspectives, and fine-grained weighted YES/NO criteria using the Recursive Expansion Tree (RET) method; if work is supplied, score it criterion-by-criterion and surface what is missing or could be better. Use when asked to self-review or check your own work, judge whether a task is done well or completely, build a definition-of-done or completeness checklist, create an evaluation rubric or grading criteria, score or grade answers to a question, set up an LLM-as-judge rubric, or when the user mentions self-review, completeness check, success criteria, evaluation criteria, scoring rubric, Qworld, or the RET algorithm.

1,583SKILL.mdUpdated Jul 22, 2026

mims-harvard/tooluniverse-self-review

mims-harvard/tooluniverse-peptide-target-deorphanization

tools

VerifiedTrustedCommunity

Find the real protein target(s) of a peptide from its sequence — peptide target deorphanization / off-target identification, for ANY target class (GPCR, ion channel, protease, cytokine/growth-factor receptor, enzyme, integrin), not only GPCRs. Use when a peptide has a phenotype but does not bind its hypothesized target, when a peptide binds a target in one species or assay but not another, or to screen candidate targets for an orphan peptide. A target-class router steers a multi-route keyless pipeline (PROSITE/ELM motif, BLAST homology, HGNC/InterPro/GPCRdb/GtoPdb target-family enumeration, OpenTargets phenotype anchor, EnsemblCompara/Alliance cross-species reconciliation) plus optional NVIDIA-NIM co-folding (Boltz2, AlphaFold2-Multimer, OpenFold3) for structural confirmation.

1,583SKILL.mdUpdated Jul 22, 2026

mims-harvard/tooluniverse-peptide-target-deorphanization

mims-harvard/tooluniverse-cs-setup

tools

VerifiedTrustedCommunity

Install or update ToolUniverse in Claude Science — create the conda env, install the tooluniverse pip package, and (re)build the tooluniverse-research skill by fetching the current workflow library from GitHub. Use for first-time setup, upgrading the ToolUniverse version, refreshing the bundled workflows after an upstream release, or reinstalling on a new machine.

1,583SKILL.mdUpdated Jul 22, 2026

mims-harvard/tooluniverse-cs-setup

mims-harvard/tooluniverse-codex-plugin

tools

VerifiedTrustedCommunity

Install, set up, verify, update, pin, uninstall, or troubleshoot the ToolUniverse plugin on OpenAI Codex. ALWAYS consult this skill for any of those — don't answer from memory, because the exact marketplace name (mims-harvard/ToolUniverse), the "codex plugin marketplace add" then "codex plugin add -m tooluniverse" flow, Codex's startup auto-upgrade behavior, the uvx tooluniverse MCP server, and the API-key env vars are easy to get wrong. Use it whenever someone wants to get ToolUniverse (or "the 1000+ scientific tools" / "the harvard tools") working on Codex, says the Codex plugin or its tools/skills won't load, hits a uvx or MCP-server startup error, asks how Codex updates it, wants to pin or remove it, or finds it running an old tool version — even if they never say the word "plugin". Not for the Claude Code plugin (use tooluniverse-claude-code-plugin), for running research with the tools, or for authoring new tools or skills.

1,583SKILL.mdUpdated Jul 22, 2026

mims-harvard/tooluniverse-codex-plugin

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/mims-harvard/tooluniverse.git

# Copy into Claude Code skills folder (global)
cp -r tooluniverse/skills/tooluniverse-gwas-study-explorer ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

mims-harvard/tooluniverse

1,429 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT