ESMFold is a GPU-accelerated protein structure prediction model that infers atomic-level 3D coordinates directly from amino acid sequences using ESM-2 language model representations, without MSAs or templates. The API supports single chains and multimeric complexes with up to 4 chains encoded as colon-separated sequences, each request batching up to 2 items and sequences up to 768 residues. It returns PDB structures with mean pLDDT and pTM confidence scores, enabling high-throughput workflows in protein engineering, metagenomics, and structural biology.
Predict¶
Predict 3D structure for one or more protein chains (separated by colons) using ESMFold
- POST /api/v3/esmfold/predict/¶
Predict endpoint for ESMFold.
- Request Headers:
Content-Type – application/json
Authorization – Token YOUR_API_KEY
Request
params (object, optional) — Configuration parameters:
batch_size (int, default: 2) — Maximum number of items processed per request
max_sequence_len (int, default: 768) — Maximum number of amino acid residues per chain in a sequence
max_n_multimers (int, default: 4) — Maximum number of chains allowed in a multimer sequence
items (array of objects, min: 1, max: 2) — Input sequences:
sequence (string, min length: 1, max length: 771, required) — Protein sequence using the extended amino acid alphabet with “:” as chain separator, allowing up to 3 non-consecutive “:” characters
Example request:
- Status Codes:
200 OK – Successful response
400 Bad Request – Invalid input
500 Internal Server Error – Internal server error
Response
results (array of objects) — One result per input item, in the order requested:
pdb (string) — Predicted protein structure in PDB format, including coordinates and standard PDB records
mean_plddt (float) — Mean predicted Local Distance Difference Test (pLDDT) confidence score over all residues
ptm (float) — Predicted Template Modeling (pTM) score estimating global fold accuracy
Example response:
Performance¶
ESMFold runs inference on NVIDIA A10G GPUs with 4 vCPUs and 16 GB RAM, using mixed-precision computation, chunked attention, and optional CPU offloading to efficiently handle single- or multi-chain inputs up to 768 residues total.
Inference speed is tuned for rapid turnaround: for a 384-residue monomer, network forward time is around 14 seconds on a single GPU, typically ~6× faster than a single AlphaFold2 model and >60× faster than AlphaFold2 on short sequences (<200 residues) when MSA search is considered.
Structural accuracy is competitive with other single-sequence and MSA-based predictors: on CASP14 targets, ESMFold reaches mean LDDT ≈ 0.68 (vs. ≈ 0.85 for AlphaFold2 and ≈ 0.81 for RoseTTAFold). On the CAMEO benchmark, mean TM-scores are ≈ 0.90 (easy), 0.79 (medium), and 0.45 (hard), slightly below AlphaFold2 but close to RoseTTAFold on easy and medium targets.
Confidence estimates (mean pLDDT and pTM) are well calibrated: mean pLDDT strongly correlates with realized accuracy (LDDT/TM-score), enabling downstream pipelines to filter or prioritize BioLM ESMFold predictions by quality when trading off against slower, more accurate models such as AlphaFold2 or against nanobody-specialized models such as NanobodyBuilder.
Applications¶
Rapid single- and few-chain structure prediction in protein engineering pipelines, enabling teams to assess structural viability of designed variants in seconds without MSA generation; accelerates iterative design for therapeutics or industrial biocatalysts within the API limits of up to 4 chains and 768 residues total.
High-throughput structural screening of protein variant or metagenomic libraries by ranking candidates using mean pLDDT and pTM scores from the API response; useful for prioritizing stable folds and well-packed cores for downstream experimental validation; less suitable for detailed modeling of large complexes or interfaces beyond four chains.
Structural annotation of orphan or low-homology proteins in discovery programs, providing predicted 3D coordinates (PDB output) and confidence metrics directly from sequence; supports target selection and domain boundary assessment when experimental structures or deep MSAs are unavailable; accuracy decreases on very hard, novel folds, so predictions should be combined with additional evidence.
Computational ranking of protein design candidates, combining ESMFold mean pLDDT and pTM with other design scores to focus wet-lab screening on structurally plausible designs; reduces cost by deprioritizing clearly misfolded or disordered variants; not recommended as the sole criterion for functional optimization, as activity and specificity still require additional modeling or experiments.
Limitations¶
Maximum Sequence Length: The
sequenceinput (a single chain or multiple chains separated by:) must not exceed768amino acids in total. Longer proteins must be truncated or split into separate API calls.Batch Size: The
itemslist inESMFoldPredictRequestsupports at mostbatch_size=2sequences per request. Larger sets of sequences must be split across multiple requests.Multimeric Input: Multimer predictions are supported for up to
4chains persequence, separated by:characters. Complexes with more than 4 chains must be decomposed into smaller sub-complexes or modeled chain-wise.Protein Complexes: ESMFold was trained only on single-chain structures. Multimer predictions are therefore out-of-distribution and typically less accurate than specialized multimer models (for example AlphaFold-Multimer), especially for detailed interface geometry.
Accuracy vs. Speed: ESMFold is much faster than AlphaFold2 but usually less accurate on difficult targets (e.g. low-MSA-depth CASP/CAMEO-style benchmarks). For final ranking of a small number of critical designs, higher-accuracy MSA-based models are often preferable.
Confidence Estimation (pLDDT / pTM): The response fields
mean_plddtandptmcorrelate with model accuracy, but low-confidence predictions (mean_plddt< 70) or highly novel/orphan sequences should be treated cautiously and ideally cross-checked with additional models or experimental data.
How We Use It¶
ESMFold enables rapid, high-throughput prediction of protein structures directly from sequence data, accelerating protein design and engineering workflows by removing the need for multiple sequence alignment searches. Within BioLM-driven design cycles, ESMFold predictions feed standardized, API-level structural features (coordinates, pLDDT, pTM) into downstream thermodynamic, biophysical, and sequence-based models, allowing teams to quickly prioritize variants for synthesis and testing in enzyme optimization, antibody maturation, and targeted protein modification campaigns.
Supports iterative protein engineering by quickly assessing structural impact of sequence changes across single chains and small multimers (up to 4 chains in one request).
Integrates with BioLM scoring and filtering models to focus experimental resources on candidates with favorable predicted structure, stability, and developability profiles.
References¶
Lin, Z., et al. (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, 379(6637), 1123–1130.
