Cargando…

Quantifying the nativeness of antibody sequences using long short-term memory networks

Antibodies often undergo substantial engineering en route to the generation of a therapeutic candidate with good developability properties. Characterization of antibody libraries has shown that retaining native-like sequence improves the overall quality of the library. Motivated by recent advances i...

Descripción completa

Detalles Bibliográficos
Autores principales: Wollacott, Andrew M, Xue, Chonghua, Qin, Qiuyuan, Hua, June, Bohnuud, Tanggis, Viswanathan, Karthik, Kolachalama, Vijaya B
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7372931/
https://www.ncbi.nlm.nih.gov/pubmed/31504835
http://dx.doi.org/10.1093/protein/gzz031
_version_ 1783561410154856448
author Wollacott, Andrew M
Xue, Chonghua
Qin, Qiuyuan
Hua, June
Bohnuud, Tanggis
Viswanathan, Karthik
Kolachalama, Vijaya B
author_facet Wollacott, Andrew M
Xue, Chonghua
Qin, Qiuyuan
Hua, June
Bohnuud, Tanggis
Viswanathan, Karthik
Kolachalama, Vijaya B
author_sort Wollacott, Andrew M
collection PubMed
description Antibodies often undergo substantial engineering en route to the generation of a therapeutic candidate with good developability properties. Characterization of antibody libraries has shown that retaining native-like sequence improves the overall quality of the library. Motivated by recent advances in deep learning, we developed a bi-directional long short-term memory (LSTM) network model to make use of the large amount of available antibody sequence information, and use this model to quantify the nativeness of antibody sequences. The model scores sequences for their similarity to naturally occurring antibodies, which can be used as a consideration during design and engineering of libraries. We demonstrate the performance of this approach by training a model on human antibody sequences and show that our method outperforms other approaches at distinguishing human antibodies from those of other species. We show the applicability of this method for the evaluation of synthesized antibody libraries and humanization of mouse antibodies.
format Online
Article
Text
id pubmed-7372931
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-73729312020-07-23 Quantifying the nativeness of antibody sequences using long short-term memory networks Wollacott, Andrew M Xue, Chonghua Qin, Qiuyuan Hua, June Bohnuud, Tanggis Viswanathan, Karthik Kolachalama, Vijaya B Protein Eng Des Sel Original Article Antibodies often undergo substantial engineering en route to the generation of a therapeutic candidate with good developability properties. Characterization of antibody libraries has shown that retaining native-like sequence improves the overall quality of the library. Motivated by recent advances in deep learning, we developed a bi-directional long short-term memory (LSTM) network model to make use of the large amount of available antibody sequence information, and use this model to quantify the nativeness of antibody sequences. The model scores sequences for their similarity to naturally occurring antibodies, which can be used as a consideration during design and engineering of libraries. We demonstrate the performance of this approach by training a model on human antibody sequences and show that our method outperforms other approaches at distinguishing human antibodies from those of other species. We show the applicability of this method for the evaluation of synthesized antibody libraries and humanization of mouse antibodies. Oxford University Press 2019-12 2019-08-28 /pmc/articles/PMC7372931/ /pubmed/31504835 http://dx.doi.org/10.1093/protein/gzz031 Text en © The Author(s) 2019. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Article
Wollacott, Andrew M
Xue, Chonghua
Qin, Qiuyuan
Hua, June
Bohnuud, Tanggis
Viswanathan, Karthik
Kolachalama, Vijaya B
Quantifying the nativeness of antibody sequences using long short-term memory networks
title Quantifying the nativeness of antibody sequences using long short-term memory networks
title_full Quantifying the nativeness of antibody sequences using long short-term memory networks
title_fullStr Quantifying the nativeness of antibody sequences using long short-term memory networks
title_full_unstemmed Quantifying the nativeness of antibody sequences using long short-term memory networks
title_short Quantifying the nativeness of antibody sequences using long short-term memory networks
title_sort quantifying the nativeness of antibody sequences using long short-term memory networks
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7372931/
https://www.ncbi.nlm.nih.gov/pubmed/31504835
http://dx.doi.org/10.1093/protein/gzz031
work_keys_str_mv AT wollacottandrewm quantifyingthenativenessofantibodysequencesusinglongshorttermmemorynetworks
AT xuechonghua quantifyingthenativenessofantibodysequencesusinglongshorttermmemorynetworks
AT qinqiuyuan quantifyingthenativenessofantibodysequencesusinglongshorttermmemorynetworks
AT huajune quantifyingthenativenessofantibodysequencesusinglongshorttermmemorynetworks
AT bohnuudtanggis quantifyingthenativenessofantibodysequencesusinglongshorttermmemorynetworks
AT viswanathankarthik quantifyingthenativenessofantibodysequencesusinglongshorttermmemorynetworks
AT kolachalamavijayab quantifyingthenativenessofantibodysequencesusinglongshorttermmemorynetworks