Cargando…
Using the antibody-antigen binding interface to train image-based deep neural networks for antibody-epitope classification
High-throughput B-cell sequencing has opened up new avenues for investigating complex mechanisms underlying our adaptive immune response. These technological advances drive data generation and the need to mine and analyze the information contained in these large datasets, in particular the identific...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8032195/ https://www.ncbi.nlm.nih.gov/pubmed/33780441 http://dx.doi.org/10.1371/journal.pcbi.1008864 |
_version_ | 1783676185242238976 |
---|---|
author | Ripoll, Daniel R. Chaudhury, Sidhartha Wallqvist, Anders |
author_facet | Ripoll, Daniel R. Chaudhury, Sidhartha Wallqvist, Anders |
author_sort | Ripoll, Daniel R. |
collection | PubMed |
description | High-throughput B-cell sequencing has opened up new avenues for investigating complex mechanisms underlying our adaptive immune response. These technological advances drive data generation and the need to mine and analyze the information contained in these large datasets, in particular the identification of therapeutic antibodies (Abs) or those associated with disease exposure and protection. Here, we describe our efforts to use artificial intelligence (AI)-based image-analyses for prospective classification of Abs based solely on sequence information. We hypothesized that Abs recognizing the same part of an antigen share a limited set of features at the binding interface, and that the binding site regions of these Abs share share common structure and physicochemical property patterns that can serve as a “fingerprint” to recognize uncharacterized Abs. We combined large-scale sequence-based protein-structure predictions to generate ensembles of 3-D Ab models, reduced the Ab binding interface to a 2-D image (fingerprint), used pre-trained convolutional neural networks to extract features, and trained deep neural networks (DNNs) to classify Abs. We evaluated this approach using Ab sequences derived from human HIV and Ebola viral infections to differentiate between two Abs, Abs belonging to specific B-cell family lineages, and Abs with different epitope preferences. In addition, we explored a different type of DNN method to detect one class of Abs from a larger pool of Abs. Testing on Ab sets that had been kept aside during model training, we achieved average prediction accuracies ranging from 71–96% depending on the complexity of the classification task. The high level of accuracies reached during these classification tests suggests that the DNN models were able to learn a series of structural patterns shared by Abs belonging to the same class. The developed methodology provides a means to apply AI-based image recognition techniques to analyze high-throughput B-cell sequencing datasets (repertoires) for Ab classification. |
format | Online Article Text |
id | pubmed-8032195 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-80321952021-04-15 Using the antibody-antigen binding interface to train image-based deep neural networks for antibody-epitope classification Ripoll, Daniel R. Chaudhury, Sidhartha Wallqvist, Anders PLoS Comput Biol Research Article High-throughput B-cell sequencing has opened up new avenues for investigating complex mechanisms underlying our adaptive immune response. These technological advances drive data generation and the need to mine and analyze the information contained in these large datasets, in particular the identification of therapeutic antibodies (Abs) or those associated with disease exposure and protection. Here, we describe our efforts to use artificial intelligence (AI)-based image-analyses for prospective classification of Abs based solely on sequence information. We hypothesized that Abs recognizing the same part of an antigen share a limited set of features at the binding interface, and that the binding site regions of these Abs share share common structure and physicochemical property patterns that can serve as a “fingerprint” to recognize uncharacterized Abs. We combined large-scale sequence-based protein-structure predictions to generate ensembles of 3-D Ab models, reduced the Ab binding interface to a 2-D image (fingerprint), used pre-trained convolutional neural networks to extract features, and trained deep neural networks (DNNs) to classify Abs. We evaluated this approach using Ab sequences derived from human HIV and Ebola viral infections to differentiate between two Abs, Abs belonging to specific B-cell family lineages, and Abs with different epitope preferences. In addition, we explored a different type of DNN method to detect one class of Abs from a larger pool of Abs. Testing on Ab sets that had been kept aside during model training, we achieved average prediction accuracies ranging from 71–96% depending on the complexity of the classification task. The high level of accuracies reached during these classification tests suggests that the DNN models were able to learn a series of structural patterns shared by Abs belonging to the same class. The developed methodology provides a means to apply AI-based image recognition techniques to analyze high-throughput B-cell sequencing datasets (repertoires) for Ab classification. Public Library of Science 2021-03-29 /pmc/articles/PMC8032195/ /pubmed/33780441 http://dx.doi.org/10.1371/journal.pcbi.1008864 Text en https://creativecommons.org/publicdomain/zero/1.0/This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication. |
spellingShingle | Research Article Ripoll, Daniel R. Chaudhury, Sidhartha Wallqvist, Anders Using the antibody-antigen binding interface to train image-based deep neural networks for antibody-epitope classification |
title | Using the antibody-antigen binding interface to train image-based deep neural networks for antibody-epitope classification |
title_full | Using the antibody-antigen binding interface to train image-based deep neural networks for antibody-epitope classification |
title_fullStr | Using the antibody-antigen binding interface to train image-based deep neural networks for antibody-epitope classification |
title_full_unstemmed | Using the antibody-antigen binding interface to train image-based deep neural networks for antibody-epitope classification |
title_short | Using the antibody-antigen binding interface to train image-based deep neural networks for antibody-epitope classification |
title_sort | using the antibody-antigen binding interface to train image-based deep neural networks for antibody-epitope classification |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8032195/ https://www.ncbi.nlm.nih.gov/pubmed/33780441 http://dx.doi.org/10.1371/journal.pcbi.1008864 |
work_keys_str_mv | AT ripolldanielr usingtheantibodyantigenbindinginterfacetotrainimagebaseddeepneuralnetworksforantibodyepitopeclassification AT chaudhurysidhartha usingtheantibodyantigenbindinginterfacetotrainimagebaseddeepneuralnetworksforantibodyepitopeclassification AT wallqvistanders usingtheantibodyantigenbindinginterfacetotrainimagebaseddeepneuralnetworksforantibodyepitopeclassification |