Cargando…

Using the antibody-antigen binding interface to train image-based deep neural networks for antibody-epitope classification

High-throughput B-cell sequencing has opened up new avenues for investigating complex mechanisms underlying our adaptive immune response. These technological advances drive data generation and the need to mine and analyze the information contained in these large datasets, in particular the identific...

Descripción completa

Detalles Bibliográficos
Autores principales: Ripoll, Daniel R., Chaudhury, Sidhartha, Wallqvist, Anders
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8032195/
https://www.ncbi.nlm.nih.gov/pubmed/33780441
http://dx.doi.org/10.1371/journal.pcbi.1008864
_version_ 1783676185242238976
author Ripoll, Daniel R.
Chaudhury, Sidhartha
Wallqvist, Anders
author_facet Ripoll, Daniel R.
Chaudhury, Sidhartha
Wallqvist, Anders
author_sort Ripoll, Daniel R.
collection PubMed
description High-throughput B-cell sequencing has opened up new avenues for investigating complex mechanisms underlying our adaptive immune response. These technological advances drive data generation and the need to mine and analyze the information contained in these large datasets, in particular the identification of therapeutic antibodies (Abs) or those associated with disease exposure and protection. Here, we describe our efforts to use artificial intelligence (AI)-based image-analyses for prospective classification of Abs based solely on sequence information. We hypothesized that Abs recognizing the same part of an antigen share a limited set of features at the binding interface, and that the binding site regions of these Abs share share common structure and physicochemical property patterns that can serve as a “fingerprint” to recognize uncharacterized Abs. We combined large-scale sequence-based protein-structure predictions to generate ensembles of 3-D Ab models, reduced the Ab binding interface to a 2-D image (fingerprint), used pre-trained convolutional neural networks to extract features, and trained deep neural networks (DNNs) to classify Abs. We evaluated this approach using Ab sequences derived from human HIV and Ebola viral infections to differentiate between two Abs, Abs belonging to specific B-cell family lineages, and Abs with different epitope preferences. In addition, we explored a different type of DNN method to detect one class of Abs from a larger pool of Abs. Testing on Ab sets that had been kept aside during model training, we achieved average prediction accuracies ranging from 71–96% depending on the complexity of the classification task. The high level of accuracies reached during these classification tests suggests that the DNN models were able to learn a series of structural patterns shared by Abs belonging to the same class. The developed methodology provides a means to apply AI-based image recognition techniques to analyze high-throughput B-cell sequencing datasets (repertoires) for Ab classification.
format Online
Article
Text
id pubmed-8032195
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-80321952021-04-15 Using the antibody-antigen binding interface to train image-based deep neural networks for antibody-epitope classification Ripoll, Daniel R. Chaudhury, Sidhartha Wallqvist, Anders PLoS Comput Biol Research Article High-throughput B-cell sequencing has opened up new avenues for investigating complex mechanisms underlying our adaptive immune response. These technological advances drive data generation and the need to mine and analyze the information contained in these large datasets, in particular the identification of therapeutic antibodies (Abs) or those associated with disease exposure and protection. Here, we describe our efforts to use artificial intelligence (AI)-based image-analyses for prospective classification of Abs based solely on sequence information. We hypothesized that Abs recognizing the same part of an antigen share a limited set of features at the binding interface, and that the binding site regions of these Abs share share common structure and physicochemical property patterns that can serve as a “fingerprint” to recognize uncharacterized Abs. We combined large-scale sequence-based protein-structure predictions to generate ensembles of 3-D Ab models, reduced the Ab binding interface to a 2-D image (fingerprint), used pre-trained convolutional neural networks to extract features, and trained deep neural networks (DNNs) to classify Abs. We evaluated this approach using Ab sequences derived from human HIV and Ebola viral infections to differentiate between two Abs, Abs belonging to specific B-cell family lineages, and Abs with different epitope preferences. In addition, we explored a different type of DNN method to detect one class of Abs from a larger pool of Abs. Testing on Ab sets that had been kept aside during model training, we achieved average prediction accuracies ranging from 71–96% depending on the complexity of the classification task. The high level of accuracies reached during these classification tests suggests that the DNN models were able to learn a series of structural patterns shared by Abs belonging to the same class. The developed methodology provides a means to apply AI-based image recognition techniques to analyze high-throughput B-cell sequencing datasets (repertoires) for Ab classification. Public Library of Science 2021-03-29 /pmc/articles/PMC8032195/ /pubmed/33780441 http://dx.doi.org/10.1371/journal.pcbi.1008864 Text en https://creativecommons.org/publicdomain/zero/1.0/This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication.
spellingShingle Research Article
Ripoll, Daniel R.
Chaudhury, Sidhartha
Wallqvist, Anders
Using the antibody-antigen binding interface to train image-based deep neural networks for antibody-epitope classification
title Using the antibody-antigen binding interface to train image-based deep neural networks for antibody-epitope classification
title_full Using the antibody-antigen binding interface to train image-based deep neural networks for antibody-epitope classification
title_fullStr Using the antibody-antigen binding interface to train image-based deep neural networks for antibody-epitope classification
title_full_unstemmed Using the antibody-antigen binding interface to train image-based deep neural networks for antibody-epitope classification
title_short Using the antibody-antigen binding interface to train image-based deep neural networks for antibody-epitope classification
title_sort using the antibody-antigen binding interface to train image-based deep neural networks for antibody-epitope classification
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8032195/
https://www.ncbi.nlm.nih.gov/pubmed/33780441
http://dx.doi.org/10.1371/journal.pcbi.1008864
work_keys_str_mv AT ripolldanielr usingtheantibodyantigenbindinginterfacetotrainimagebaseddeepneuralnetworksforantibodyepitopeclassification
AT chaudhurysidhartha usingtheantibodyantigenbindinginterfacetotrainimagebaseddeepneuralnetworksforantibodyepitopeclassification
AT wallqvistanders usingtheantibodyantigenbindinginterfacetotrainimagebaseddeepneuralnetworksforantibodyepitopeclassification