Cargando…

Protein pK(a) Prediction with Machine Learning

[Image: see text] Protein pK(a) prediction is essential for the investigation of the pH-associated relationship between protein structure and function. In this work, we introduce a deep learning-based protein pK(a) predictor DeepKa, which is trained and validated with the pK(a) values derived from c...

Descripción completa

Detalles Bibliográficos
Autores principales: Cai, Zhitao, Luo, Fangfang, Wang, Yongxian, Li, Enling, Huang, Yandong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2021
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8697405/
https://www.ncbi.nlm.nih.gov/pubmed/34963965
http://dx.doi.org/10.1021/acsomega.1c05440
_version_ 1784620039831289856
author Cai, Zhitao
Luo, Fangfang
Wang, Yongxian
Li, Enling
Huang, Yandong
author_facet Cai, Zhitao
Luo, Fangfang
Wang, Yongxian
Li, Enling
Huang, Yandong
author_sort Cai, Zhitao
collection PubMed
description [Image: see text] Protein pK(a) prediction is essential for the investigation of the pH-associated relationship between protein structure and function. In this work, we introduce a deep learning-based protein pK(a) predictor DeepKa, which is trained and validated with the pK(a) values derived from continuous constant-pH molecular dynamics (CpHMD) simulations of 279 soluble proteins. Here, the CpHMD implemented in the Amber molecular dynamics package has been employed ( Y. HuangJ. Chem. Inf. Model.2018, 58, 1372−138329949356). Notably, to avoid discontinuities at the boundary, grid charges are proposed to represent protein electrostatics. We show that the prediction accuracy by DeepKa is close to that by CpHMD benchmarking simulations, validating DeepKa as an efficient protein pK(a) predictor. In addition, the training and validation sets created in this study can be applied to the development of machine learning-based protein pK(a) predictors in the future. Finally, the grid charge representation is general and applicable to other topics, such as the protein–ligand binding affinity prediction.
format Online
Article
Text
id pubmed-8697405
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-86974052021-12-27 Protein pK(a) Prediction with Machine Learning Cai, Zhitao Luo, Fangfang Wang, Yongxian Li, Enling Huang, Yandong ACS Omega [Image: see text] Protein pK(a) prediction is essential for the investigation of the pH-associated relationship between protein structure and function. In this work, we introduce a deep learning-based protein pK(a) predictor DeepKa, which is trained and validated with the pK(a) values derived from continuous constant-pH molecular dynamics (CpHMD) simulations of 279 soluble proteins. Here, the CpHMD implemented in the Amber molecular dynamics package has been employed ( Y. HuangJ. Chem. Inf. Model.2018, 58, 1372−138329949356). Notably, to avoid discontinuities at the boundary, grid charges are proposed to represent protein electrostatics. We show that the prediction accuracy by DeepKa is close to that by CpHMD benchmarking simulations, validating DeepKa as an efficient protein pK(a) predictor. In addition, the training and validation sets created in this study can be applied to the development of machine learning-based protein pK(a) predictors in the future. Finally, the grid charge representation is general and applicable to other topics, such as the protein–ligand binding affinity prediction. American Chemical Society 2021-12-07 /pmc/articles/PMC8697405/ /pubmed/34963965 http://dx.doi.org/10.1021/acsomega.1c05440 Text en © 2021 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by-nc-nd/4.0/Permits non-commercial access and re-use, provided that author attribution and integrity are maintained; but does not permit creation of adaptations or other derivative works (https://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Cai, Zhitao
Luo, Fangfang
Wang, Yongxian
Li, Enling
Huang, Yandong
Protein pK(a) Prediction with Machine Learning
title Protein pK(a) Prediction with Machine Learning
title_full Protein pK(a) Prediction with Machine Learning
title_fullStr Protein pK(a) Prediction with Machine Learning
title_full_unstemmed Protein pK(a) Prediction with Machine Learning
title_short Protein pK(a) Prediction with Machine Learning
title_sort protein pk(a) prediction with machine learning
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8697405/
https://www.ncbi.nlm.nih.gov/pubmed/34963965
http://dx.doi.org/10.1021/acsomega.1c05440
work_keys_str_mv AT caizhitao proteinpkapredictionwithmachinelearning
AT luofangfang proteinpkapredictionwithmachinelearning
AT wangyongxian proteinpkapredictionwithmachinelearning
AT lienling proteinpkapredictionwithmachinelearning
AT huangyandong proteinpkapredictionwithmachinelearning