Cargando…

Machine Learning Enables Accurate Prediction of Asparagine Deamidation Probability and Rate

The spontaneous conversion of asparagine residues to aspartic acid or iso-aspartic acid, via deamidation, is a major pathway of protein degradation and is often seriously disruptive to biological systems. Deamidation has been shown to negatively affect both in vitro stability and in vivo biological...

Descripción completa

Detalles Bibliográficos
Autores principales: Delmar, Jared A., Wang, Jihong, Choi, Seo Woo, Martins, Jason A., Mikhail, John P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society of Gene & Cell Therapy 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6923510/
https://www.ncbi.nlm.nih.gov/pubmed/31890727
http://dx.doi.org/10.1016/j.omtm.2019.09.008
_version_ 1783481550708408320
author Delmar, Jared A.
Wang, Jihong
Choi, Seo Woo
Martins, Jason A.
Mikhail, John P.
author_facet Delmar, Jared A.
Wang, Jihong
Choi, Seo Woo
Martins, Jason A.
Mikhail, John P.
author_sort Delmar, Jared A.
collection PubMed
description The spontaneous conversion of asparagine residues to aspartic acid or iso-aspartic acid, via deamidation, is a major pathway of protein degradation and is often seriously disruptive to biological systems. Deamidation has been shown to negatively affect both in vitro stability and in vivo biological function of diverse classes of proteins. During protein therapeutics development, deamidation liabilities that are overlooked necessitate expensive and time-consuming remediation strategies, sometimes leading to termination of the project. In this paper, we apply machine learning to a large (n = 776) liquid chromatography-tandem mass spectrometry (LC-MS/MS) dataset of monoclonal antibody peptides to create computational models for the post-translational modification asparagine deamidation, using the random decision forest method. We show that our categorical model predicts antibody deamidation with nearly 5% increased accuracy and 0.2 MCC over the best currently available models. Surprisingly, our model also paces or outperforms advanced and conventional models on an independent non-antibody dataset. In addition to deamidation probability, we are able to accurately predict deamidation rate (R(2) = 0.963 and Q2 = 0.822), a capability with no peer in current models. This method should enable significant improvement in protein candidate selection, especially in biopharmaceutical development, and can be applied with similar accuracy to enzymes, monoclonal antibodies, next-generation formats, vaccine component antigens, and gene therapy vectors such as adeno-associated virus.
format Online
Article
Text
id pubmed-6923510
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher American Society of Gene & Cell Therapy
record_format MEDLINE/PubMed
spelling pubmed-69235102019-12-30 Machine Learning Enables Accurate Prediction of Asparagine Deamidation Probability and Rate Delmar, Jared A. Wang, Jihong Choi, Seo Woo Martins, Jason A. Mikhail, John P. Mol Ther Methods Clin Dev Article The spontaneous conversion of asparagine residues to aspartic acid or iso-aspartic acid, via deamidation, is a major pathway of protein degradation and is often seriously disruptive to biological systems. Deamidation has been shown to negatively affect both in vitro stability and in vivo biological function of diverse classes of proteins. During protein therapeutics development, deamidation liabilities that are overlooked necessitate expensive and time-consuming remediation strategies, sometimes leading to termination of the project. In this paper, we apply machine learning to a large (n = 776) liquid chromatography-tandem mass spectrometry (LC-MS/MS) dataset of monoclonal antibody peptides to create computational models for the post-translational modification asparagine deamidation, using the random decision forest method. We show that our categorical model predicts antibody deamidation with nearly 5% increased accuracy and 0.2 MCC over the best currently available models. Surprisingly, our model also paces or outperforms advanced and conventional models on an independent non-antibody dataset. In addition to deamidation probability, we are able to accurately predict deamidation rate (R(2) = 0.963 and Q2 = 0.822), a capability with no peer in current models. This method should enable significant improvement in protein candidate selection, especially in biopharmaceutical development, and can be applied with similar accuracy to enzymes, monoclonal antibodies, next-generation formats, vaccine component antigens, and gene therapy vectors such as adeno-associated virus. American Society of Gene & Cell Therapy 2019-10-01 /pmc/articles/PMC6923510/ /pubmed/31890727 http://dx.doi.org/10.1016/j.omtm.2019.09.008 Text en © 2019 The Author(s) http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Delmar, Jared A.
Wang, Jihong
Choi, Seo Woo
Martins, Jason A.
Mikhail, John P.
Machine Learning Enables Accurate Prediction of Asparagine Deamidation Probability and Rate
title Machine Learning Enables Accurate Prediction of Asparagine Deamidation Probability and Rate
title_full Machine Learning Enables Accurate Prediction of Asparagine Deamidation Probability and Rate
title_fullStr Machine Learning Enables Accurate Prediction of Asparagine Deamidation Probability and Rate
title_full_unstemmed Machine Learning Enables Accurate Prediction of Asparagine Deamidation Probability and Rate
title_short Machine Learning Enables Accurate Prediction of Asparagine Deamidation Probability and Rate
title_sort machine learning enables accurate prediction of asparagine deamidation probability and rate
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6923510/
https://www.ncbi.nlm.nih.gov/pubmed/31890727
http://dx.doi.org/10.1016/j.omtm.2019.09.008
work_keys_str_mv AT delmarjareda machinelearningenablesaccuratepredictionofasparaginedeamidationprobabilityandrate
AT wangjihong machinelearningenablesaccuratepredictionofasparaginedeamidationprobabilityandrate
AT choiseowoo machinelearningenablesaccuratepredictionofasparaginedeamidationprobabilityandrate
AT martinsjasona machinelearningenablesaccuratepredictionofasparaginedeamidationprobabilityandrate
AT mikhailjohnp machinelearningenablesaccuratepredictionofasparaginedeamidationprobabilityandrate