Cargando…
Quantifying the nativeness of antibody sequences using long short-term memory networks
Antibodies often undergo substantial engineering en route to the generation of a therapeutic candidate with good developability properties. Characterization of antibody libraries has shown that retaining native-like sequence improves the overall quality of the library. Motivated by recent advances i...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7372931/ https://www.ncbi.nlm.nih.gov/pubmed/31504835 http://dx.doi.org/10.1093/protein/gzz031 |
_version_ | 1783561410154856448 |
---|---|
author | Wollacott, Andrew M Xue, Chonghua Qin, Qiuyuan Hua, June Bohnuud, Tanggis Viswanathan, Karthik Kolachalama, Vijaya B |
author_facet | Wollacott, Andrew M Xue, Chonghua Qin, Qiuyuan Hua, June Bohnuud, Tanggis Viswanathan, Karthik Kolachalama, Vijaya B |
author_sort | Wollacott, Andrew M |
collection | PubMed |
description | Antibodies often undergo substantial engineering en route to the generation of a therapeutic candidate with good developability properties. Characterization of antibody libraries has shown that retaining native-like sequence improves the overall quality of the library. Motivated by recent advances in deep learning, we developed a bi-directional long short-term memory (LSTM) network model to make use of the large amount of available antibody sequence information, and use this model to quantify the nativeness of antibody sequences. The model scores sequences for their similarity to naturally occurring antibodies, which can be used as a consideration during design and engineering of libraries. We demonstrate the performance of this approach by training a model on human antibody sequences and show that our method outperforms other approaches at distinguishing human antibodies from those of other species. We show the applicability of this method for the evaluation of synthesized antibody libraries and humanization of mouse antibodies. |
format | Online Article Text |
id | pubmed-7372931 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-73729312020-07-23 Quantifying the nativeness of antibody sequences using long short-term memory networks Wollacott, Andrew M Xue, Chonghua Qin, Qiuyuan Hua, June Bohnuud, Tanggis Viswanathan, Karthik Kolachalama, Vijaya B Protein Eng Des Sel Original Article Antibodies often undergo substantial engineering en route to the generation of a therapeutic candidate with good developability properties. Characterization of antibody libraries has shown that retaining native-like sequence improves the overall quality of the library. Motivated by recent advances in deep learning, we developed a bi-directional long short-term memory (LSTM) network model to make use of the large amount of available antibody sequence information, and use this model to quantify the nativeness of antibody sequences. The model scores sequences for their similarity to naturally occurring antibodies, which can be used as a consideration during design and engineering of libraries. We demonstrate the performance of this approach by training a model on human antibody sequences and show that our method outperforms other approaches at distinguishing human antibodies from those of other species. We show the applicability of this method for the evaluation of synthesized antibody libraries and humanization of mouse antibodies. Oxford University Press 2019-12 2019-08-28 /pmc/articles/PMC7372931/ /pubmed/31504835 http://dx.doi.org/10.1093/protein/gzz031 Text en © The Author(s) 2019. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Original Article Wollacott, Andrew M Xue, Chonghua Qin, Qiuyuan Hua, June Bohnuud, Tanggis Viswanathan, Karthik Kolachalama, Vijaya B Quantifying the nativeness of antibody sequences using long short-term memory networks |
title | Quantifying the nativeness of antibody sequences using long short-term memory networks |
title_full | Quantifying the nativeness of antibody sequences using long short-term memory networks |
title_fullStr | Quantifying the nativeness of antibody sequences using long short-term memory networks |
title_full_unstemmed | Quantifying the nativeness of antibody sequences using long short-term memory networks |
title_short | Quantifying the nativeness of antibody sequences using long short-term memory networks |
title_sort | quantifying the nativeness of antibody sequences using long short-term memory networks |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7372931/ https://www.ncbi.nlm.nih.gov/pubmed/31504835 http://dx.doi.org/10.1093/protein/gzz031 |
work_keys_str_mv | AT wollacottandrewm quantifyingthenativenessofantibodysequencesusinglongshorttermmemorynetworks AT xuechonghua quantifyingthenativenessofantibodysequencesusinglongshorttermmemorynetworks AT qinqiuyuan quantifyingthenativenessofantibodysequencesusinglongshorttermmemorynetworks AT huajune quantifyingthenativenessofantibodysequencesusinglongshorttermmemorynetworks AT bohnuudtanggis quantifyingthenativenessofantibodysequencesusinglongshorttermmemorynetworks AT viswanathankarthik quantifyingthenativenessofantibodysequencesusinglongshorttermmemorynetworks AT kolachalamavijayab quantifyingthenativenessofantibodysequencesusinglongshorttermmemorynetworks |