skills/protein-design-workflow/SKILL.md
End-to-end guidance for protein design pipelines. Use this skill when: (1) Starting a new protein design project, (2) Need step-by-step workflow guidance, (3) Understanding the full design pipeline, (4) Planning compute resources and timelines, (5) Integrating multiple design tools. For tool selection, use binder-design. For QC thresholds, use protein-qc.
npx skillsauth add adaptyvbio/protein-design-skills protein-design-workflowInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Target Preparation --> Backbone Generation --> Sequence Design
| | |
v v v
(pdb skill) (rfdiffusion) (proteinmpnn)
| |
v v
Structure Validation --> Filtering
| |
v v
(alphafold/chai) (protein-qc)
# Download from PDB
curl -o target.pdb "https://files.rcsb.org/download/XXXX.pdb"
# Extract target chain
# Remove waters, ligands if needed
# Trim to binding region + 10A buffer
Output: target_prepared.pdb, hotspot list
# RFdiffusion runs from the official repo, not biomodals
python run_inference.py \
inference.input_pdb=target_prepared.pdb \
contigmap.contigs=[A1-150/0 70-100] \
ppi.hotspot_res=[A45,A67,A89] \
inference.num_designs=500
modal run modal_bindcraft.py \
--input-pdb target_prepared.pdb \
--target-hotspot-residues "45,67,89" \
--number-of-final-designs 100
Output: 100-500 backbone PDBs
for backbone in backbones/*.pdb; do
modal run modal_ligandmpnn.py \
--input-pdb "$backbone" \
--params-str "--number_of_batches 8 --temperature 0.1"
done
Output: 8 sequences per backbone (800-4000 total)
# Prepare FASTA with binder + target
# binder:target format for multimer
modal run modal_alphafold.py \
--input-fasta all_sequences.fasta \
--out-dir predictions/
Output: AF2 predictions with pLDDT, ipTM, PAE
import pandas as pd
# Load metrics
designs = pd.read_csv('all_metrics.csv')
# Filter
filtered = designs[
(designs['pLDDT'] > 0.85) &
(designs['ipTM'] > 0.50) &
(designs['PAE_interface'] < 10) &
(designs['scRMSD'] < 2.0) &
(designs['esm2_pll'] > 0.0)
]
# Rank by composite score
filtered['score'] = (
0.3 * filtered['pLDDT'] +
0.3 * filtered['ipTM'] +
0.2 * (1 - filtered['PAE_interface'] / 20) +
0.2 * filtered['esm2_pll']
)
top_designs = filtered.nlargest(50, 'score')
Output: 50-200 filtered candidates
| Stage | GPU | Time (100 designs) | |-------|-----|-------------------| | RFdiffusion | A10G | 30 min | | ProteinMPNN | T4 | 15 min | | Chai / AlphaFold | A100 | 4-8 hours | | Filtering | CPU | 15 min |
| Problem | Solution | |---------|----------| | Low ipTM | Check hotspots, increase designs | | Poor diversity | Higher temperature, more backbones | | High scRMSD | Backbone may be unusual | | Low pLDDT | Check design quality |
data-ai
Structure prediction with Protenix, an open AlphaFold3 reproduction. Use this skill when: (1) Predicting complex structures with an AF3-class model, (2) Wanting an open alternative to AF3 alongside Boltz and Chai, (3) Validating designed binder-target complexes. For QC thresholds, use protein-qc. For ipSAE ranking, use ipsae.
devops
Multi-objective, gradient-based protein binder design with Mosaic. Use this skill when: (1) Composing several structure or sequence models into one design objective, (2) Optimizing binders against a custom loss rather than a fixed pipeline, (3) Wanting gradient descent over sequence space in the style of ColabDesign, RSO, or BindCraft but with interchangeable predictors, (4) Letting the optimizer choose the epitope instead of fixing hotspots. For an end-to-end binder pipeline with default filters, use bindcraft. For all-atom diffusion design, use boltzgen. For backbone-only generation, use rfdiffusion.
development
De novo antibody and nanobody (VHH) design with Germinal. Use this skill when: (1) Designing epitope-targeted nanobodies or scFvs, (2) Needing CDR design on a fixed framework, (3) Working on antibody-format binders rather than miniproteins. For miniprotein binders, use binder-design (BoltzGen, BindCraft, RFdiffusion, Mosaic). For structure validation, use boltz or chai.
testing
Access UniProt for protein sequence and annotation retrieval. Use this skill when: (1) Looking up protein sequences by accession, (2) Finding functional annotations, (3) Getting domain boundaries, (4) Finding homologs and variants, (5) Cross-referencing to PDB structures. For structure retrieval, use pdb. For sequence design, use proteinmpnn.