Adoption

Agent Skills are supported by leading AI development tools.

VS Code, Gemini CLI, GitHub, Goose, Amp, Cursor, Claude Code, Letta, OpenCode, Claude, OpenAI Codex, Factory

lamm-mit/skills/foldseek

Name: skills/foldseek
Author: lamm-mit

skills/foldseek/SKILL.md

npx skillsauth add lamm-mit/scienceclaw skills/foldseek


Foldseek Structure Similarity Search

Ultra-fast protein structure similarity search. Finds structural homologs in PDB, AlphaFold Database, and custom databases orders of magnitude faster than DALI or TM-align.

Installation

# Conda (recommended)
conda install -c conda-forge -c bioconda foldseek

# Binary download
wget https://mmseqs.com/foldseek/foldseek-linux-avx2.tar.gz
tar xvzf foldseek-linux-avx2.tar.gz
export PATH=$(pwd)/foldseek/bin:$PATH

# Docker
docker pull ghcr.io/steineggerlab/foldseek

Download Databases

# PDB (experimentally determined structures, ~200K)
foldseek databases PDB pdb_db tmp/

# AlphaFold Database (200M+ predicted structures)
foldseek databases Alphafold/UniProt afdb_db tmp/

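# AlphaFold Swiss-Prot (predictions for reviewed Swiss-Prot entries, ~500K)
foldseek databases Alphafold/Swiss-Prot afdb_swissprot tmp/

# ESM Atlas clustered at 30% sequence identity (ESMFold predictions)
# Run `foldseek databases` without arguments to list the names your version supports.
foldseek databases ESMAtlas30 esmatlas_db tmp/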

Basic Search

# Search query structure against PDB
foldseek easy-search query.pdb pdb_db results.tsv tmp/ \
    --format-output "query,target,pident,alnlen,evalue,bits,prob,lddt,lddtfull,taxid,taxname,qlen,tlen,nident"

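# Against AlphaFold database
# --exhaustive-search 1 skips the prefilter and aligns the query to every target;
# this is more sensitive but slow on a database of this size, so drop the flag
# if the default prefiltered search is sufficient.
foldseek easy-search query.pdb afdb_db results.tsv tmp/ \
    --exhaustive-search 1 \
    --format-output "query,target,pident,evalue,bits,prob,alntmscore,taxname"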
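
The result TSV has no header row, so the column order is whatever was passed to --format-output. The following is a minimal sketch of reading such a file with the Python standard library, assuming the field list is kept next to the command. The helper name `parse_hits` and the sample row are made up for illustration and are not part of Foldseek.

```python
import csv
import io

# Assumed to match the --format-output string used for the search.
FIELDS = "query,target,pident,evalue,bits,prob,alntmscore,taxname".split(",")
# Columns converted to float; everything else stays a string.
NUMERIC = {"pident", "evalue", "bits", "prob", "alntmscore"}


def parse_hits(tsv_text: str, fields=FIELDS) -> list:
    """Turn header-less Foldseek TSV text into a list of dicts keyed by field name."""
    hits = []
    reader = csv.reader(io.StringIO(tsv_text), delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if not row:
            continue  # skip blank lines
        if len(row) != len(fields):
            raise ValueError(f"expected {len(fields)} columns, got {len(row)}")
        hit = dict(zip(fields, row))
        for name in NUMERIC & set(fields):
            hit[name] = float(hit[name])
        hits.append(hit)
    return hits


# Made-up example row, not real Foldseek output.
sample = "query.pdb\t1abc_A\t42.5\t1e-10\t250\t0.99\t0.81\tHomo sapiens\n"
hits = parse_hits(sample)
print(hits[0]["target"], hits[0]["alntmscore"])
```

Keeping the field list in one constant means the command and the parser cannot drift apart silently; a column-count mismatch raises instead of mislabeling values.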

Python API

import subprocess
import pandas as pd

def foldseek_search(query_pdb: str, database: str, tmp_dir: str = "tmp/",
                    e_value: float = 1e-3, max_hits: int = 100) -> pd.DataFrame:
    """Run Foldseek and return results as DataFrame."""
    out_tsv = "foldseek_results.tsv"

    cols = "query,target,pident,alnlen,evalue,bits,prob,alntmscore,taxname"
    cmd = [
        "foldseek", "easy-search",
        query_pdb, database, out_tsv, tmp_dir,
        "-e", str(e_value),
        "--max-seqs", str(max_hits),
        "--format-output", cols
    ]
    subprocess.run(cmd, check=True)

    df = pd.read_csv(out_tsv, sep="\t", names=cols.split(","))
    return df.sort_values("alntmscore", ascending=False)

# Usage
results = foldseek_search("designed_binder.pdb", "pdb_db")
print(results[["target", "pident", "alntmscore", "evalue", "taxname"]].head(20))
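
The wrapper above always writes to foldseek_results.tsv, so back-to-back calls overwrite each other's output. One possible workaround, sketched here under assumptions, is to derive the output path from the query and database names. The function `build_easy_search_cmd` and its naming scheme are hypothetical; the command-line flags are the same ones used above, and nothing is executed.

```python
import tempfile
from pathlib import Path

DEFAULT_COLS = "query,target,pident,alnlen,evalue,bits,prob,alntmscore,taxname"


def build_easy_search_cmd(query_pdb: str, database: str, out_dir: str,
                          tmp_dir: str = "tmp/", e_value: float = 1e-3,
                          max_hits: int = 100, cols: str = DEFAULT_COLS):
    """Return (argv, output_path) for one easy-search run without executing it."""
    # Hypothetical naming scheme: <query stem>__<database name>.tsv
    out_tsv = Path(out_dir) / f"{Path(query_pdb).stem}__{Path(database).name}.tsv"
    cmd = [
        "foldseek", "easy-search",
        query_pdb, database, str(out_tsv), tmp_dir,
        "-e", str(e_value),
        "--max-seqs", str(max_hits),
        "--format-output", cols,
    ]
    return cmd, out_tsv


with tempfile.TemporaryDirectory() as out_dir:
    cmd, out_tsv = build_easy_search_cmd("designed_binder.pdb", "pdb_db", out_dir)
    print(" ".join(cmd))
    print(out_tsv.name)
```

The returned list can be passed to subprocess.run(cmd, check=True) exactly as in the function above.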

Key Output Fields

| Field | Description | Threshold |
|-------|-------------|-----------|
| alntmscore | TM-score of alignment (0–1) | >0.5 = same fold |
| pident | Sequence identity (%) | varies |
| prob | Probability of homology (0–1) | >0.5 = likely homolog |
| evalue | E-value | <0.001 = significant |
| lddt | Local distance difference test | >0.7 = good local similarity |
| taxname | Source organism | — |
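
The thresholds in the table can be applied mechanically to a parsed hit. The sketch below assumes each hit is a dict with numeric values; the function name `annotate_hit`, the flag names, and the example values are illustrative only, and the cutoffs are the rule-of-thumb values listed above rather than hard limits.

```python
def annotate_hit(hit: dict) -> dict:
    """Return a copy of hit with a boolean flag for each threshold it can evaluate."""
    flags = dict(hit)
    if "alntmscore" in hit:
        flags["same_fold"] = hit["alntmscore"] > 0.5
    if "prob" in hit:
        flags["likely_homolog"] = hit["prob"] > 0.5
    if "evalue" in hit:
        flags["significant"] = hit["evalue"] < 1e-3
    if "lddt" in hit:
        flags["good_local_similarity"] = hit["lddt"] > 0.7
    return flags


# Made-up values for illustration, not real search output.
example = {"target": "hypothetical_hit", "alntmscore": 0.62,
           "prob": 0.40, "evalue": 5e-4, "lddt": 0.71}
flagged = annotate_hit(example)
print(flagged)
```

Fields that were not requested in --format-output are simply skipped, so the same helper works for any column selection.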

Multi-Query / Batch Search

# Create query database from multiple PDB files
foldseek createdb query_structures/ query_db

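# Search all vs. PDB
# -a stores the alignment backtrace, which convertalis uses for
# alignment-derived columns such as alntmscore
foldseek search query_db pdb_db result_db tmp/ -e 1e-3 -a
foldseek convertalis query_db pdb_db result_db results.tsv \
    --format-output "query,target,alntmscore,evalue,taxname"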
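
A batch run puts hits for all queries in one TSV, so a common next step is to reduce it to the best hit per query. This is a minimal standard-library sketch assuming the five columns requested above; `best_hit_per_query` and the sample rows are made up for illustration.

```python
import csv
import io

# Assumed to match the --format-output string passed to convertalis.
FIELDS = ["query", "target", "alntmscore", "evalue", "taxname"]


def best_hit_per_query(tsv_text: str) -> dict:
    """Map each query name to its hit with the highest alntmscore."""
    best = {}
    reader = csv.reader(io.StringIO(tsv_text), delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if not row:
            continue  # skip blank lines
        hit = dict(zip(FIELDS, row))
        hit["alntmscore"] = float(hit["alntmscore"])
        hit["evalue"] = float(hit["evalue"])
        current = best.get(hit["query"])
        if current is None or hit["alntmscore"] > current["alntmscore"]:
            best[hit["query"]] = hit
    return best


# Made-up rows for illustration, not real search output.
sample = (
    "design_1\t1abc_A\t0.71\t1e-8\tEscherichia coli\n"
    "design_1\t2xyz_B\t0.55\t1e-4\tHomo sapiens\n"
    "design_2\t3def_C\t0.48\t2e-3\tMus musculus\n"
)
best = best_hit_per_query(sample)
for query, hit in best.items():
    print(query, hit["target"], hit["alntmscore"])
```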

Use Cases

Check Design Novelty

results = foldseek_search("new_design.pdb", "pdb_db")
top = results[results["alntmscore"] > 0.5]

if len(top) == 0:
    print("Novel fold — no PDB structural homologs found")
else:
    print("Similar to known structures:")
    print(top[["target", "alntmscore", "pident"]].head(5))

Find Templates for Homology Modeling

results = foldseek_search("target.pdb", "pdb_db", e_value=0.01)
templates = results[
    (results["alntmscore"] > 0.6) &
    (results["pident"] > 30)  # Enough sequence identity for modeling
]

Cluster Designs by Structure

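# All-vs-all structural comparison in TM-align mode (--alignment-type 1).
# Per the Foldseek README, hits in this mode are ranked by TM-score and the
# evalue column reports the TM-score rather than an E-value.
foldseek createdb designs/ designs_db
foldseek search designs_db designs_db result_db tmp/ \
    --alignment-type 1 -e 1e-3
# foldseek cluster runs its own search and does not read result_db.
# -c 0.8 requires 80% alignment coverage; it is not a TM-score cutoff
# (add --tmscore-threshold to require a minimum TM-score as well).
foldseek cluster designs_db cluster_db tmp/ \
    --min-seq-id 0 -c 0.8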
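
To inspect the clusters in Python, the cluster database first has to be exported as text. The sketch below assumes a two-column TSV of representative and member names, which is the layout a `foldseek createtsv` export is expected to produce; that export step is an assumption and is not shown in the commands above. The helper `read_clusters` and the sample names are hypothetical.

```python
from collections import defaultdict


def read_clusters(tsv_text: str) -> dict:
    """Group member names under their cluster representative."""
    clusters = defaultdict(list)
    for line in tsv_text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        representative, member = line.split("\t")[:2]
        clusters[representative].append(member)
    return dict(clusters)


# Made-up cluster assignments for illustration.
sample = (
    "design_01\tdesign_01\n"
    "design_01\tdesign_07\n"
    "design_02\tdesign_02\n"
)
clusters = read_clusters(sample)
# Largest clusters first.
for rep, members in sorted(clusters.items(), key=lambda kv: -len(kv[1])):
    print(rep, len(members), members)
```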

Foldseek vs TM-align vs DALI

| Tool | Speed (10K vs PDB) | Accuracy |
|------|-------------------|---------|
| Foldseek | ~1 min | ~95% of TM-align |
| TM-align | ~20 hours | Reference |
| DALI | ~48 hours | Reference |
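
As a quick check of the "orders of magnitude" claim, the approximate runtimes in the table can be converted to speedup factors. The numbers below are taken from the table as-is and are rough estimates, not new measurements.

```python
import math

# Approximate runtimes from the table above, in minutes.
runtimes_min = {"Foldseek": 1, "TM-align": 20 * 60, "DALI": 48 * 60}

# Speedup of Foldseek relative to each of the other tools.
speedup = {
    tool: minutes / runtimes_min["Foldseek"]
    for tool, minutes in runtimes_min.items()
    if tool != "Foldseek"
}

for tool, factor in speedup.items():
    # log10 of the factor gives the number of orders of magnitude.
    print(f"Foldseek vs {tool}: {factor:.0f}x (about 10^{math.log10(factor):.1f})")
```

Both factors come out at roughly three orders of magnitude under these estimates.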


119 stars

devops

Updated Apr 6, 2026


npx skillsauth add lamm-mit/scienceclaw skills/foldseek

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Trivy: container and dependency vulnerability scanner. Clean (95%)
Semgrep: static code analysis for vulnerabilities. Clean (95%)
mcp-scan (Snyk): Model Context Protocol security validation. Clean (95%)
Snyk (dep): open source security scanning. Skipped (50%)
Socket.dev: supply chain security analysis. Skipped (50%)
VirusTotal: multi-engine malware detection. Skipped (50%)
CrowdStrike: advanced threat intelligence. Skipped (50%)
OSV-Scanner: Open Source Vulnerability database check. Skipped (50%)
OWASP Dep-Check: Skipped (50%)

Last scanned: Apr 6, 2026, 12:39 PM · 4.6s · 1 file scanned


Related Skills

lamm-mit/paperclip (tools)
Verified · Trusted · Community
Onboard and manage Paperclip AI for research-paper knowledge and agent orchestration
203 · SKILL.md · Updated May 22, 2026

lamm-mit/perplexity-search (development)
Verified · Trusted · Community
Perform AI-powered web searches with real-time information using Perplexity models via LiteLLM and OpenRouter. This skill should be used when conducting web searches for current information, finding recent scientific literature, getting grounded answers with source citations, or accessing information beyond the model knowledge cutoff. Provides access to multiple Perplexity models including Sonar Pro, Sonar Pro Search (advanced agentic search), and Sonar Reasoning Pro through a single OpenRouter API key.
184 · SKILL.md · Updated Apr 6, 2026

lamm-mit/scientific-report-pdf (testing)
Verified · Trusted · Community
Generate a structured scientific PDF report from a JSON description. Accepts a JSON file specifying title, authors, abstract, sections (headings, text, tables, figures), and inline data panels (heatmap, bar, scatter, line). Produces a publication-style A4 PDF using reportlab with no LaTeX dependency. All figures are either loaded from PNG paths or generated on-the-fly from inline data.
161 · SKILL.md · Updated Apr 16, 2026

lamm-mit/python-exec (development)
Verified · Trusted · Community
Execute arbitrary Python code and return stdout. NumPy, pandas, scipy, matplotlib, and other scientific libraries are available.
161 · SKILL.md · Updated Apr 16, 2026

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.


Install manually


# Clone the repo
git clone https://github.com/lamm-mit/scienceclaw.git

# Copy into Claude Code skills folder (global)
cp -r scienceclaw/skills/foldseek ~/.claude/skills/


Repository

lamm-mit/scienceclaw

119 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT