Cargando…

Prediction of protein pK(a) with representation learning

The behavior of proteins is closely related to the protonation states of the residues. Therefore, prediction and measurement of pK(a) are essential to understand the basic functions of proteins. In this work, we develop a new empirical scheme for protein pK(a) prediction that is based on deep repres...

Descripción completa

Detalles Bibliográficos
Autores principales: Gokcan, Hatice, Isayev, Olexandr
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society of Chemistry 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8864681/
https://www.ncbi.nlm.nih.gov/pubmed/35310485
http://dx.doi.org/10.1039/d1sc05610g
_version_ 1784655506227331072
author Gokcan, Hatice
Isayev, Olexandr
author_facet Gokcan, Hatice
Isayev, Olexandr
author_sort Gokcan, Hatice
collection PubMed
description The behavior of proteins is closely related to the protonation states of the residues. Therefore, prediction and measurement of pK(a) are essential to understand the basic functions of proteins. In this work, we develop a new empirical scheme for protein pK(a) prediction that is based on deep representation learning. It combines machine learning with atomic environment vector (AEV) and learned quantum mechanical representation from ANI-2x neural network potential (J. Chem. Theory Comput. 2020, 16, 4192). The scheme requires only the coordinate information of a protein as the input and separately estimates the pK(a) for all five titratable amino acid types. The accuracy of the approach was analyzed with both cross-validation and an external test set of proteins. Obtained results were compared with the widely used empirical approach PROPKA. The new empirical model provides accuracy with MAEs below 0.5 for all amino acid types. It surpasses the accuracy of PROPKA and performs significantly better than the null model. Our model is also sensitive to the local conformational changes and molecular interactions.
format Online
Article
Text
id pubmed-8864681
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher The Royal Society of Chemistry
record_format MEDLINE/PubMed
spelling pubmed-88646812022-03-17 Prediction of protein pK(a) with representation learning Gokcan, Hatice Isayev, Olexandr Chem Sci Chemistry The behavior of proteins is closely related to the protonation states of the residues. Therefore, prediction and measurement of pK(a) are essential to understand the basic functions of proteins. In this work, we develop a new empirical scheme for protein pK(a) prediction that is based on deep representation learning. It combines machine learning with atomic environment vector (AEV) and learned quantum mechanical representation from ANI-2x neural network potential (J. Chem. Theory Comput. 2020, 16, 4192). The scheme requires only the coordinate information of a protein as the input and separately estimates the pK(a) for all five titratable amino acid types. The accuracy of the approach was analyzed with both cross-validation and an external test set of proteins. Obtained results were compared with the widely used empirical approach PROPKA. The new empirical model provides accuracy with MAEs below 0.5 for all amino acid types. It surpasses the accuracy of PROPKA and performs significantly better than the null model. Our model is also sensitive to the local conformational changes and molecular interactions. The Royal Society of Chemistry 2022-02-01 /pmc/articles/PMC8864681/ /pubmed/35310485 http://dx.doi.org/10.1039/d1sc05610g Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by/3.0/
spellingShingle Chemistry
Gokcan, Hatice
Isayev, Olexandr
Prediction of protein pK(a) with representation learning
title Prediction of protein pK(a) with representation learning
title_full Prediction of protein pK(a) with representation learning
title_fullStr Prediction of protein pK(a) with representation learning
title_full_unstemmed Prediction of protein pK(a) with representation learning
title_short Prediction of protein pK(a) with representation learning
title_sort prediction of protein pk(a) with representation learning
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8864681/
https://www.ncbi.nlm.nih.gov/pubmed/35310485
http://dx.doi.org/10.1039/d1sc05610g
work_keys_str_mv AT gokcanhatice predictionofproteinpkawithrepresentationlearning
AT isayevolexandr predictionofproteinpkawithrepresentationlearning