Cargando…

Prediction of antibody binding to SARS-CoV-2 RBDs

SUMMARY: The ability to predict antibody–antigen binding is essential for computational models of antibody affinity maturation and protein design. While most models aim to predict binding for arbitrary antigens and antibodies, the global impact of SARS-CoV-2 on public health and the availability of...

Descripción completa

Detalles Bibliográficos
Autor principal: Wang, Eric
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9868522/
https://www.ncbi.nlm.nih.gov/pubmed/36698760
http://dx.doi.org/10.1093/bioadv/vbac103
_version_ 1784876557636993024
author Wang, Eric
author_facet Wang, Eric
author_sort Wang, Eric
collection PubMed
description SUMMARY: The ability to predict antibody–antigen binding is essential for computational models of antibody affinity maturation and protein design. While most models aim to predict binding for arbitrary antigens and antibodies, the global impact of SARS-CoV-2 on public health and the availability of associated data suggest that a SARS-CoV-2-specific model would be highly beneficial. In this work, we present a neural network model, trained on ∼315 000 datapoints from deep mutational scanning experiments, that predicts escape fractions of SARS-CoV-2 RBDs binding to arbitrary antibodies. The antibody embeddings within the model constitute an effective sequence space, which correlates with the Hamming distance, suggesting that these embeddings may be useful for downstream tasks such as binding prediction. Indeed, the model achieves Spearman correlation coefficients of 0.46 and 0.52 on two held-out test sets. By comparison, correlation coefficients calculated using existing structure and sequence-based models do not exceed 0.28. The correlation coefficient against dissociation constants of antibodies binding to SARS-CoV-2 RBD variants is 0.46. Additionally, the residue-level escapes are highest in the antibody epitope, correlating well with experimentally measured escapes. We further study the effect of antibody chain use, embedding dimension size and feed-forward and convolutional architectures on the model results. Lastly, we find that the inference time of our model is significantly faster than previous models, suggesting that it could be a useful tool for the accurate and rapid prediction of antibodies binding to SARS-CoV-2 RBDs. AVAILABILITY AND IMPLEMENTATION: The model and associated code are available for download at https://github.com/ericzwang/RBD_AB. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online.
format Online
Article
Text
id pubmed-9868522
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-98685222023-01-24 Prediction of antibody binding to SARS-CoV-2 RBDs Wang, Eric Bioinform Adv Original Paper SUMMARY: The ability to predict antibody–antigen binding is essential for computational models of antibody affinity maturation and protein design. While most models aim to predict binding for arbitrary antigens and antibodies, the global impact of SARS-CoV-2 on public health and the availability of associated data suggest that a SARS-CoV-2-specific model would be highly beneficial. In this work, we present a neural network model, trained on ∼315 000 datapoints from deep mutational scanning experiments, that predicts escape fractions of SARS-CoV-2 RBDs binding to arbitrary antibodies. The antibody embeddings within the model constitute an effective sequence space, which correlates with the Hamming distance, suggesting that these embeddings may be useful for downstream tasks such as binding prediction. Indeed, the model achieves Spearman correlation coefficients of 0.46 and 0.52 on two held-out test sets. By comparison, correlation coefficients calculated using existing structure and sequence-based models do not exceed 0.28. The correlation coefficient against dissociation constants of antibodies binding to SARS-CoV-2 RBD variants is 0.46. Additionally, the residue-level escapes are highest in the antibody epitope, correlating well with experimentally measured escapes. We further study the effect of antibody chain use, embedding dimension size and feed-forward and convolutional architectures on the model results. Lastly, we find that the inference time of our model is significantly faster than previous models, suggesting that it could be a useful tool for the accurate and rapid prediction of antibodies binding to SARS-CoV-2 RBDs. AVAILABILITY AND IMPLEMENTATION: The model and associated code are available for download at https://github.com/ericzwang/RBD_AB. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online. Oxford University Press 2023-01-02 /pmc/articles/PMC9868522/ /pubmed/36698760 http://dx.doi.org/10.1093/bioadv/vbac103 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Wang, Eric
Prediction of antibody binding to SARS-CoV-2 RBDs
title Prediction of antibody binding to SARS-CoV-2 RBDs
title_full Prediction of antibody binding to SARS-CoV-2 RBDs
title_fullStr Prediction of antibody binding to SARS-CoV-2 RBDs
title_full_unstemmed Prediction of antibody binding to SARS-CoV-2 RBDs
title_short Prediction of antibody binding to SARS-CoV-2 RBDs
title_sort prediction of antibody binding to sars-cov-2 rbds
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9868522/
https://www.ncbi.nlm.nih.gov/pubmed/36698760
http://dx.doi.org/10.1093/bioadv/vbac103
work_keys_str_mv AT wangeric predictionofantibodybindingtosarscov2rbds