ESM-1v is a GPU-accelerated protein language model for zero-shot inference of mutational effects on protein function, predicting variant impact directly from amino acid sequences without task-specific training or MSAs. It returns log-odds variant scores reflecting functional constraints learned from 98 million evolutionary sequences. ESM-1v supports high-throughput variant evaluation, protein engineering, antibody optimization, and enzyme design workflows, providing predictions correlated with deep mutational scanning experiments.
Predict¶
Predict properties or scores for input sequences
- POST /api/v3/esm1v-all/predict/¶
Predict endpoint for ESM-1v.
- Request Headers:
Content-Type – application/json
Authorization – Token YOUR_API_KEY
Request
params (object, optional) — Configuration parameters:
model_number (string, enum: [“n1”, “n2”, “n3”, “n4”, “n5”, “all”], default: “all”) — Identifier specifying which ESM1v model variant to use for prediction
items (array of objects, min: 1, max: 5) — Input sequences:
sequence (string, min length: 1, max length: 512, required) — Protein sequence containing exactly one “<mask>” token; valid characters include standard amino acids, extended amino acids, and the special token “<mask>”
Example request:
- Status Codes:
200 OK – Successful response
400 Bad Request – Invalid input
500 Internal Server Error – Internal server error
Response
results (array of objects) — One result per input item, in the order requested:
esm1v-n[1-5] (array of objects) — Predictions from individual ESM-1v model variants (n1 through n5):
token (int) — Amino acid token index (0-32, inclusive)
token_str (string) — Amino acid single-letter code (20 standard amino acids, plus special tokens)
score (float, range: 0.0-1.0) — Predicted probability score for the amino acid at the masked position
sequence (string, length: 1-512 amino acids) — Protein sequence with the predicted amino acid replacing the “<mask>” token
esm1v-n[1-5] (array length: up to 33) — One prediction per possible amino acid substitution at the masked position, sorted by descending probability score
esm1v-n[1-5] (object keys) — Exactly one key per model variant (“esm1v-n1”, “esm1v-n2”, “esm1v-n3”, “esm1v-n4”, “esm1v-n5”)
esm1v-n[1-5] (array dimensions) — [num_predictions, 4], where num_predictions ≤ 33 (20 standard amino acids + special tokens)
esm1v-n[1-5] (score range) — Probability scores normalized across all possible amino acids at masked position (sum to 1.0)
esm1v-n[1-5] (sequence length) — Matches input sequence length (1-512 amino acids), with “<mask>” replaced by predicted amino acid
esm1v-n[1-5] (token indices) — Integer encoding of amino acids: 0-19 standard amino acids, 20-32 special tokens (including gap, mask, padding)
esm1v-n[1-5] (token_str values) — Single-letter amino acid codes (“A”, “C”, “D”, “E”, “F”, “G”, “H”, “I”, “K”, “L”, “M”, “N”, “P”, “Q”, “R”, “S”, “T”, “V”, “W”, “Y”) and special tokens (“<mask>”, “<pad>”, “<gap>”, etc.)
esm1v-n[1-5] (sequence constraints) — Exactly one “<mask>” token replaced per sequence
esm1v-n[1-5] (score precision) — Floating-point precision (float32)
esm1v-n[1-5] (token_str constraints) — Single-character strings for standard amino acids, multi-character strings for special tokens (“<mask>”, “<pad>”, “<gap>”, etc.)
esm1v-n[1-5] (token constraints) — Integer values correspond exactly to ESM-1v vocabulary indices (0-32 inclusive)
esm1v-n[1-5] (sequence format) — Protein sequence represented as a single continuous string of amino acid characters (no whitespace or delimiters)
esm1v-n[1-5] (sequence length constraints) — Matches input sequence length exactly (1-512 amino acids)
esm1v-n[1-5] (prediction ordering) — Predictions ordered by descending probability score (highest confidence first)
esm1v-n[1-5] (probability distribution) — Scores represent normalized probability distribution at masked position (sum exactly to 1.0)
esm1v-n[1-5] (special tokens) — Special tokens (“<mask>”, “<pad>”, “<gap>”, etc.) included in predictions with associated probability scores
esm1v-n[1-5] (model variant) — Predictions provided separately for each model variant (n1, n2, n3, n4, n5)
esm1v-n[1-5] (ensemble predictions) — When “all” variant selected, predictions from all five model variants provided separately under respective keys (“esm1v-n1” through “esm1v-n5”)
Example response:
Performance¶
GPU-accelerated inference using NVIDIA T4 GPUs, enabling rapid, scalable predictions.
Typical inference latency for single-sequence mutation effect prediction: approximately 2-4 seconds per sequence.
Zero-shot mutation effect prediction accuracy (Spearman ρ = 0.51) comparable to state-of-the-art unsupervised methods (e.g., DeepSequence ρ = 0.52, EVMutation ρ = 0.50), without requiring task-specific model training or multiple sequence alignments (MSAs).
Fine-tuning ESM-1v on protein-specific MSAs boosts predictive accuracy (Spearman ρ = 0.54), surpassing EVMutation and approaching DeepSequence, while still offering faster inference.
ESM-1v significantly outperforms earlier protein language models provided by BioLM (e.g., ESM-1b Spearman ρ = 0.46, ProtBERT Spearman ρ = 0.43) in zero-shot mutational effect prediction tasks.
Predictive performance is robust across diverse protein families, with ESM-1v outperforming DeepSequence on 17 out of 41 benchmarked deep mutational scanning datasets.
Inference efficiency significantly exceeds models requiring explicit multiple sequence alignments (e.g., MSA Transformer), as ESM-1v requires only a single forward pass per sequence.
BioLM’s optimized deployment enables efficient parallel processing and GPU utilization, ensuring consistent inference speed even at scale.
Applications¶
Predicting functional impacts of protein mutations to accelerate protein engineering workflows, enabling rapid filtering and prioritization of candidate variants without costly experimental screening; valuable for biotech companies engineering industrial enzymes, therapeutic proteins, or agricultural proteins; limited accuracy for highly novel mutations lacking evolutionary context.
Rapid computational screening of protein variants to identify stabilizing mutations or improve thermostability, enabling biotech firms to efficiently engineer proteins that function under harsh industrial conditions such as high temperature or extreme pH; less suitable for proteins with minimal evolutionary sequence data available.
Identifying critical binding-site residues to guide targeted mutagenesis experiments, enabling protein engineers to focus experimental resources on residues most likely to impact ligand binding affinity or specificity; valuable for companies optimizing therapeutic proteins or biosensors; may not accurately predict impacts of mutations distant from conserved functional sites.
Computational assessment of variant pathogenicity for clinical diagnostics, enabling genetic testing companies to rapidly classify novel protein-coding variants as benign or deleterious based on evolutionary constraints; useful for interpreting patient genomic data but less reliable for variants occurring in poorly conserved regions or proteins with limited evolutionary data.
Predicting protein stability changes due to mutations to guide formulation and manufacturing decisions, enabling biopharma companies to proactively identify sequence variants that may negatively impact protein yield, shelf-life, or aggregation propensity; beneficial for biologics manufacturing but not optimal for assessing mutations affecting post-translational modifications or glycosylation patterns.
Limitations¶
Maximum Sequence Length: Input sequences must be no longer than
512
amino acids. Longer sequences must be truncated or split into smaller segments for processing.Batch Size: Requests are limited to a maximum of
5
sequences per batch, which may impact throughput for large-scale analyses.GPU Type: GPU acceleration (
T4
) is only available when using the ensemble model (all
). Individual models (n1
ton5
) run exclusively on CPU, potentially resulting in slower inference times.ESM-1v is optimized for zero-shot prediction of mutational effects on protein function. It does not directly predict protein structure or provide embeddings for clustering or visualization tasks.
While ESM-1v performs well for initial screening and ranking of large libraries of protein variants, it may not be optimal for precise structural validation or late-stage ranking tasks, where specialized structural models (e.g., AlphaFold2, NanobodyBuilder) are more appropriate.
Predictions are based solely on sequence context learned from evolutionary data. Novel sequences with limited evolutionary representation may yield less accurate predictions.
How We Use It¶
BioLM integrates ESM-1v into protein engineering pipelines to enable rapid assessment of mutational effects on protein functionality, accelerating research cycles and reducing experimental overhead. Through zero-shot predictions and optional fine-tuning with protein family-specific datasets, ESM-1v efficiently informs antibody maturation, enzyme optimization, and general protein design tasks. This integration complements other BioLM workflows, such as embedding-based clustering, predictive modeling, and 3D structure-informed ranking, allowing comprehensive and scalable analysis of protein sequence designs.
Enables rapid prioritization and filtering of candidate protein sequences prior to lab synthesis and testing
Integrates seamlessly with embedding-based models and 3D structure analysis pipelines to inform iterative design cycles
References¶
Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. bioRxiv.