
aipoch/pdb-database

Name: pdb-database
Author: aipoch

scientific-skills/Evidence Insights/pdb-database/SKILL.md

npx skillsauth add aipoch/medical-research-skills pdb-database


When to Use

Use this skill when you need to:

- Find protein/nucleic acid 3D structures by keywords, organism, experimental method, or resolution.
- Identify related structures via sequence similarity (e.g., homolog search for modeling).
- Identify related structures via 3D structure similarity (e.g., fold-level comparisons).
- Download coordinates (PDB/mmCIF) for downstream analysis, visualization, docking, or modeling.
- Run batch retrieval of metadata/coordinates to feed pipelines in drug discovery, protein engineering, or structural bioinformatics.

Key Features

- Text and attribute-based search over RCSB PDB entries.
- Sequence similarity search with configurable thresholds (e-value, identity).
- Structure similarity search using an existing entry as a query.
- Programmatic metadata retrieval via the RCSB Data API (schema-based or GraphQL).
- Direct coordinate downloads in PDB and mmCIF formats.
- Batch processing patterns for multiple PDB IDs.

Dependencies

- rcsb-api (latest recommended; provides rcsbapi.search and rcsbapi.data)
- requests>=2.0 (HTTP downloads)
- biopython>=1.80 (optional; parsing/analyzing PDB coordinates)

Install (example):

uv pip install rcsb-api requests biopython
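Because biopython is listed as optional, a script can check which modules are importable before it starts. The following is a minimal sketch using only the standard library. The helper name `missing_modules` is hypothetical, and the import names (`rcsbapi`, `requests`, `Bio`) are assumed to be the top-level modules these packages install.

```python
from importlib.util import find_spec


def missing_modules(names):
    """Return the top-level module names that cannot be found for import."""
    return [name for name in names if find_spec(name) is None]


# Import names, not package names: rcsb-api -> rcsbapi, biopython -> Bio (assumed).
REQUIRED = ["rcsbapi", "requests"]
OPTIONAL = ["Bio"]

if __name__ == "__main__":
    for name in missing_modules(REQUIRED):
        print(f"missing required module: {name}")
    for name in missing_modules(OPTIONAL):
        print(f"missing optional module: {name} (structure parsing unavailable)")
```

Checking with `find_spec` avoids importing the packages just to see whether they are installed.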

Example Usage

The following script runs end to end: it searches for a target, fetches its metadata, downloads the coordinates, and parses the structure. It needs network access and writes its output to pdb_out/.

#!/usr/bin/env python3
import pathlib
from itertools import islice

import requests

from rcsbapi.search import TextQuery, AttributeQuery
from rcsbapi.data import fetch, Schema

from Bio.PDB import PDBParser


def download_text(url: str, out_path: pathlib.Path) -> None:
    """Download a text file to out_path; raises on HTTP errors."""
    r = requests.get(url, timeout=60)
    r.raise_for_status()
    out_path.write_text(r.text, encoding="utf-8")


def main():
    out_dir = pathlib.Path("pdb_out")
    out_dir.mkdir(exist_ok=True)

    # 1) Search: hemoglobin entries with resolution < 2.0 Å
    q_text = TextQuery("hemoglobin")
    q_res = AttributeQuery(
        attribute="rcsb_entry_info.resolution_combined",  # full attribute path as a string
        operator="less",
        value=2.0,
    )
    query = q_text & q_res

    # Take only the first five hits instead of materializing every result
    pdb_ids = list(islice(query(), 5))
    if not pdb_ids:
        raise SystemExit("No results found.")
    pdb_id = pdb_ids[0]
    print(f"Selected PDB ID: {pdb_id}")

    # 2) Fetch entry metadata; each section may be missing or null, hence "or"
    entry = fetch(pdb_id, schema=Schema.ENTRY)
    title = (entry.get("struct") or {}).get("title")
    method = (entry.get("exptl") or [{}])[0].get("method")
    # resolution_combined is reported as a list of values, not a single number
    resolution = (entry.get("rcsb_entry_info") or {}).get("resolution_combined")
    deposit_date = (entry.get("rcsb_accession_info") or {}).get("deposit_date")

    print("Metadata:")
    print(f"  Title: {title}")
    print(f"  Method: {method}")
    print(f"  Resolution: {resolution}")
    print(f"  Deposit date: {deposit_date}")

    # 3) Download coordinates (PDB and mmCIF)
    pdb_path = out_dir / f"{pdb_id}.pdb"
    cif_path = out_dir / f"{pdb_id}.cif"

    download_text(f"https://files.rcsb.org/download/{pdb_id}.pdb", pdb_path)
    download_text(f"https://files.rcsb.org/download/{pdb_id}.cif", cif_path)
    print(f"Downloaded: {pdb_path} and {cif_path}")

    # 4) Parse PDB coordinates (example: count atoms)
    parser = PDBParser(QUIET=True)
    structure = parser.get_structure(pdb_id, str(pdb_path))

    atom_count = sum(1 for _ in structure.get_atoms())
    chain_ids = sorted({chain.id for chain in structure.get_chains()})
    print("Parsed structure:")
    print(f"  Chains: {chain_ids}")
    print(f"  Atom count: {atom_count}")


if __name__ == "__main__":
    main()

Implementation Details

Search Modes and Query Composition

- Text search uses free-text matching over entry annotations (titles, keywords, descriptions).
- Attribute search filters by structured fields (e.g., organism, method, resolution).
- Sequence similarity search typically supports:
  - evalue_cutoff: lower is more stringent (fewer, more confident hits).
  - identity_cutoff: fraction identity threshold (e.g., 0.9 for near-identical).
- Structure similarity search uses an existing structure (e.g., an entry_id) as the geometric reference.
- Queries can be combined with boolean logic:
  - query1 & query2 (AND)
  - query1 | query2 (OR)
  - ~query (NOT), where supported by the client
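To make the composition rules concrete, the sketch below models query nodes as small Python classes and serializes them to a nested JSON request. It is not the rcsb-api client: the class names (`FullText`, `Attr`, `Sequence`, `Group`) and `build_request` are hypothetical, and the payload shape (`terminal`/`group` nodes, `full_text`/`text`/`sequence` services, `negation`, `sequence_type`) is an assumption modeled on the RCSB Search API request format, so check it against the official schema before sending it anywhere.

```python
import json
from dataclasses import dataclass


class Node:
    """Base class: every query node supports & (AND) and | (OR)."""

    def __and__(self, other):
        return Group("and", [self, other])

    def __or__(self, other):
        return Group("or", [self, other])


@dataclass
class FullText(Node):
    value: str

    def to_dict(self):
        return {"type": "terminal", "service": "full_text",
                "parameters": {"value": self.value}}


@dataclass
class Attr(Node):
    attribute: str
    operator: str
    value: object
    negation: bool = False

    def __invert__(self):
        # NOT is modeled as flipping the negation flag on one attribute condition.
        return Attr(self.attribute, self.operator, self.value, not self.negation)

    def to_dict(self):
        return {"type": "terminal", "service": "text",
                "parameters": {"attribute": self.attribute,
                               "operator": self.operator,
                               "value": self.value,
                               "negation": self.negation}}


@dataclass
class Sequence(Node):
    value: str
    evalue_cutoff: float = 0.1    # lower keeps fewer, more confident hits
    identity_cutoff: float = 0.0  # fraction identity, e.g. 0.9 for near-identical

    def to_dict(self):
        return {"type": "terminal", "service": "sequence",
                "parameters": {"value": self.value,
                               "sequence_type": "protein",
                               "evalue_cutoff": self.evalue_cutoff,
                               "identity_cutoff": self.identity_cutoff}}


@dataclass
class Group(Node):
    logical_operator: str
    nodes: list

    def to_dict(self):
        return {"type": "group", "logical_operator": self.logical_operator,
                "nodes": [node.to_dict() for node in self.nodes]}


def build_request(query, return_type="entry"):
    """Wrap a query tree in the top-level request object."""
    return {"query": query.to_dict(), "return_type": return_type}


if __name__ == "__main__":
    query = FullText("hemoglobin") & Attr(
        "rcsb_entry_info.resolution_combined", "less", 2.0)
    print(json.dumps(build_request(query), indent=2))
```

Chained operators nest groups (`a & b & c` becomes an AND group containing another AND group), which keeps the sketch short at the cost of a deeper tree.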

Data Retrieval (Schema vs GraphQL)

- Schema-based fetch (e.g., Schema.ENTRY, Schema.POLYMER_ENTITY) is convenient for common objects and stable access patterns.
- GraphQL fetch is best when you need a custom selection of fields in one request, which means fewer round-trips and a smaller payload.
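For comparison, entry-level and entity-level objects can also be requested over plain HTTP. The sketch below assumes the public REST base URL `https://data.rcsb.org/rest/v1/core` with `entry/{entry_id}` and `polymer_entity/{entry_id}/{entity_id}` routes. Neither the URLs nor the helper names `rest_url` and `fetch_json` come from this skill, so verify them against the RCSB Data API documentation.

```python
import json
from urllib.request import urlopen

REST_BASE = "https://data.rcsb.org/rest/v1/core"  # assumed base URL


def rest_url(kind, entry_id, sub_id=None):
    """Build a REST URL for an entry-level or entity-level object."""
    parts = [REST_BASE, kind, entry_id.upper()]
    if sub_id is not None:
        parts.append(str(sub_id))
    return "/".join(parts)


def fetch_json(url, opener=urlopen, timeout=60):
    """GET a URL and decode the JSON body; opener is injectable for testing."""
    with opener(url, timeout=timeout) as resp:
        return json.load(resp)


if __name__ == "__main__":
    print(rest_url("entry", "4hhb"))
    print(rest_url("polymer_entity", "4hhb", 1))
```

Passing the opener as a parameter keeps the URL logic testable without network access.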

Example GraphQL pattern:

from rcsbapi.data import fetch

query = """
{
  entry(entry_id: "4HHB") {
    struct { title }
    exptl { method }
    rcsb_entry_info { resolution_combined deposited_atom_count }
  }
}
"""
data = fetch(query_type="graphql", query=query)
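The same kind of query can be sent without the client library, which is useful for seeing what travels over the wire. This sketch assumes the GraphQL endpoint `https://data.rcsb.org/graphql` and an `entries(entry_ids: [...])` root field for requesting several entries at once. Both are assumptions not stated in this skill, and the helper names are hypothetical.

```python
import json
from urllib.request import Request, urlopen

GRAPHQL_URL = "https://data.rcsb.org/graphql"  # assumed endpoint


def build_entries_query(entry_ids, selection):
    """Return a GraphQL document selecting `selection` from several entries."""
    ids = json.dumps([entry_id.upper() for entry_id in entry_ids])
    return "{ entries(entry_ids: %s) { %s } }" % (ids, selection)


def run_graphql(query, opener=urlopen, timeout=60):
    """POST the query as JSON and return the `data` object."""
    body = json.dumps({"query": query}).encode("utf-8")
    req = Request(GRAPHQL_URL, data=body,
                  headers={"Content-Type": "application/json"})
    with opener(req, timeout=timeout) as resp:
        payload = json.load(resp)
    if payload.get("errors"):
        # GraphQL reports query problems in the body, not via the HTTP status.
        raise RuntimeError(payload["errors"])
    return payload["data"]


if __name__ == "__main__":
    print(build_entries_query(["4hhb", "1crn"], "rcsb_id struct { title }"))
```

Selecting only the fields you need for a list of IDs is what lets one GraphQL request replace several per-entry calls.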

Coordinate Downloads and Formats

- PDB: legacy text format; widely supported but less expressive for large/complex structures.
- mmCIF (PDBx): modern standard; preferred for completeness and large structures.

Direct download endpoints:

https://files.rcsb.org/download/{PDB_ID}.pdb
https://files.rcsb.org/download/{PDB_ID}.cif
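A small wrapper around these endpoints might look like the sketch below. It defaults to mmCIF, validates the ID before building the URL, and writes the response as raw bytes. The helper names are hypothetical, and the ID pattern only covers classic four-character IDs, which is an assumption.

```python
import re
from pathlib import Path
from urllib.request import urlopen

DOWNLOAD_BASE = "https://files.rcsb.org/download"
# Assumption: classic 4-character IDs (a digit followed by three alphanumerics).
_PDB_ID = re.compile(r"^[0-9][A-Za-z0-9]{3}$")


def coordinate_url(pdb_id, fmt="cif"):
    """Return the download URL for a PDB ID in 'pdb' or 'cif' format."""
    if not _PDB_ID.match(pdb_id):
        raise ValueError(f"not a 4-character PDB ID: {pdb_id!r}")
    if fmt not in ("pdb", "cif"):
        raise ValueError(f"unsupported format: {fmt!r}")
    return f"{DOWNLOAD_BASE}/{pdb_id.upper()}.{fmt}"


def download_coordinates(pdb_id, out_dir, fmt="cif", opener=urlopen, timeout=60):
    """Download one coordinate file and return the path it was written to."""
    url = coordinate_url(pdb_id, fmt)
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    out_path = out_dir / f"{pdb_id.upper()}.{fmt}"
    with opener(url, timeout=timeout) as resp:
        out_path.write_bytes(resp.read())  # bytes avoid any re-encoding of the file
    return out_path


if __name__ == "__main__":
    print(coordinate_url("4hhb", "pdb"))
```

Validating the ID first means a typo fails locally with a clear message instead of as an HTTP error.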

Batch Processing Pattern

For batch metadata retrieval, iterate over the IDs and call fetch(pdb_id, schema=Schema.ENTRY) for each one. Catch exceptions per ID so that one failed entry does not abort the whole pipeline. For large batches, rate-limit requests and cache results to avoid repeated downloads.
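The pattern described above can be sketched independently of any particular client. In the hypothetical helper below, `fetch_one` is any callable that takes a PDB ID and returns its metadata (for example, a wrapper around the fetch call shown earlier). The in-memory dict cache and fixed delay are simplifying assumptions.

```python
import time


def fetch_many(ids, fetch_one, cache=None, min_interval=0.2, sleep=time.sleep):
    """Fetch each ID at most once; return (results, errors) keyed by upper-case ID."""
    cache = {} if cache is None else cache
    results, errors = {}, {}
    for raw_id in ids:
        pdb_id = raw_id.upper()
        if pdb_id in cache:
            # Cache hit: reuse the stored record and skip the network call.
            results[pdb_id] = cache[pdb_id]
            continue
        try:
            cache[pdb_id] = results[pdb_id] = fetch_one(pdb_id)
        except Exception as exc:
            # Record the failure for this ID and keep processing the rest.
            errors[pdb_id] = exc
        sleep(min_interval)  # crude rate limit between calls


    return results, errors


if __name__ == "__main__":
    demo = fetch_many(["4hhb", "4HHB"], lambda pdb_id: {"id": pdb_id}, min_interval=0)
    print(demo)
```

Returning errors alongside results lets the caller decide whether to retry, log, or drop the failed IDs. Passing a persistent mapping as `cache` would extend the same idea across runs.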

Reference Documentation

If present in this repository, consult:

references/api_reference.md for advanced endpoint usage, query patterns, schema notes, rate limits, and troubleshooting.


Access the RCSB Protein Data Bank (PDB) to search, download, and programmatically retrieve 3D macromolecular structures and metadata; use when you need structure discovery (text/sequence/3D similarity) or automated structural data ingestion for structural biology and drug discovery workflows.

37 stars

data-ai

Updated Mar 26, 2026


npx skillsauth add aipoch/medical-research-skills pdb-database

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners in report

- Clean: Trivy (container and dependency vulnerability scanner), 95%
- Clean: Semgrep (static code analysis for vulnerabilities), 95%
- Clean: mcp-scan (Snyk) (Model Context Protocol security validation), 95%
- Skipped: Snyk (dep) (open source security scanning), 50%
- Skipped: Socket.dev (supply chain security analysis), 50%
- Skipped: VirusTotal (multi-engine malware detection), 70%
- Skipped: CrowdStrike (advanced threat intelligence), 50%
- Skipped: OSV-Scanner (Open Source Vulnerability database check), 50%
- Skipped: OWASP Dep-Check, 50%

Last scanned: Apr 4, 2026, 8:51 AM (212.7 s, 1 file scanned)

SKILL.md

name: pdb-database
description: Access the RCSB Protein Data Bank (PDB) to search, download, and programmatically retrieve 3D macromolecular structures and metadata; use when you need structure discovery (text/sequence/3D similarity) or automated structural data ingestion for structural biology and drug discovery workflows.
license: MIT
skill-author: AIPOCH




Install manually


# Clone the repo
git clone https://github.com/aipoch/medical-research-skills.git

# Copy into Claude Code skills folder (global)
# The source path contains a space, so it must be quoted
cp -r "medical-research-skills/scientific-skills/Evidence Insights/pdb-database" ~/.claude/skills/



Compatible with: Claude Code, OpenAI Codex CLI, ChatGPT