skills/binder-design/SKILL.md
Guidance for choosing the right protein binder design tool. Use this skill when: (1) Deciding between BoltzGen, BindCraft, or RFdiffusion, (2) Planning a binder design campaign, (3) Understanding trade-offs between different approaches, (4) Selecting tools for specific target types. For specific tool parameters, use the individual tool skills (boltzgen, bindcraft, rfdiffusion, etc.).
npx skillsauth add adaptyvbio/protein-design-skills binder-designInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
De novo binder design?
│
├─ Standard target → BoltzGen (recommended)
│ All-atom output (no separate ProteinMPNN step needed)
│ Better for ligand/small molecule binding
│ Single-step design (backbone + sequence + side chains)
│
├─ Need diversity/exploration → RFdiffusion + ProteinMPNN
│ Maximum backbone diversity
│ Two-step: backbone then sequence
│
├─ Integrated validation → BindCraft
│ Built-in AF2 validation
│ End-to-end pipeline
│
├─ Ligand binding → BoltzGen ✓
│ All-atom diffusion handles ligand context
│
├─ Peptide/nanobody → Germinal
│ VHH/nanobody design
│ Germline-aware optimization
│
└─ Antibody/Nanobody
+-- VHH design --> germinal skill
| Tool | Strengths | Weaknesses | Best For | |------|-----------|------------|----------| | BoltzGen | All-atom, single-step, ligand-aware | Higher GPU requirement | Standard (recommended) | | BindCraft | End-to-end, built-in AF2 validation | Less diverse | Production campaigns | | RFdiffusion | High diversity, fast | Requires ProteinMPNN | Exploration, diversity | | Germinal | Nanobody/VHH design | Specialized | Antibody optimization |
BoltzGen provides all-atom design with built-in side-chain packing:
Target → BoltzGen → Validate → Filter
(pdb) (all-atom) (chai) (qc)
# Fetch structure from PDB
# Use pdb skill for guidance
First, create a YAML config file (e.g., binder.yaml):
entities:
- protein:
id: B
sequence: 70..100
- file:
path: target.cif
include:
- chain:
id: A
binding_types:
- chain:
id: A
binding: 45,67,89
Then run:
modal run modal_boltzgen.py \
--input-yaml binder.yaml \
--protocol protein-anything \
--num-designs 50
Why BoltzGen?
For maximum diversity or when backbone-only is preferred:
# Step 1: Backbone generation
modal run modal_rfdiffusion.py \
--pdb target.pdb \
--contigs "A1-150/0 70-100" \
--hotspot "A45,A67,A89" \
--num-designs 500
# Step 2: Sequence design
modal run modal_ligandmpnn.py \
--pdb-path backbone.pdb \
--num-seq-per-target 16 \
--sampling-temp 0.1
modal run modal_chai1.py \
--input-faa sequences.fasta \
--out-dir predictions/
Apply standard thresholds:
See protein-qc skill for details.
| Stage | Count | Purpose | |-------|-------|---------| | Backbone generation | 500-1000 | Diversity | | Sequences per backbone | 8-16 | Sequence space | | AF2 predictions | All | Validation | | After filtering | 50-200 | Candidates | | Experimental testing | 10-50 | Final selection |
| Step | Compute Time | |------|--------------| | RFdiffusion (500 designs) | 2-4 hours | | ProteinMPNN (8000 sequences) | 1-2 hours | | AF2 prediction (8000 sequences) | 12-24 hours | | Filtering and analysis | 1-2 hours |
Total: 1-2 days of compute
testing
Access UniProt for protein sequence and annotation retrieval. Use this skill when: (1) Looking up protein sequences by accession, (2) Finding functional annotations, (3) Getting domain boundaries, (4) Finding homologs and variants, (5) Cross-referencing to PDB structures. For structure retrieval, use pdb. For sequence design, use proteinmpnn.
development
Solubility-optimized protein sequence design using SolubleMPNN. Use this skill when: (1) Designing for E. coli expression, (2) Optimizing solubility of designed proteins, (3) Reducing aggregation propensity, (4) Need high-yield expression, (5) Avoiding inclusion body formation. For standard design, use proteinmpnn. For ligand-aware design, use ligandmpnn.
tools
First-time setup for protein design tools. Use this skill when: (1) User is new and hasn't run any tools yet, (2) Commands fail with "file not found" or "modal: command not found", (3) Modal authentication errors occur, (4) User asks how to get started or set up the environment, (5) biomodals directory is missing or tools aren't working.
development
Generate protein backbones using RFdiffusion, a diffusion-based generative model for de novo protein structure generation. Use this skill when: (1) Designing binder scaffolds for a target protein, (2) Generating novel protein backbones from scratch, (3) Scaffolding functional motifs into new proteins, (4) Specifying hotspot residues for interface design, (5) Creating symmetric oligomers. For sequence design after backbone generation, use proteinmpnn. For structure validation, use alphafold or chai. For QC thresholds, use protein-qc.