Cargando…

B-factor prediction in proteins using a sequence-based deep learning model

B factors provide critical insight into protein dynamics. Predicting B factors of an atom in new proteins remains challenging as it is impacted by their neighbors in Euclidean space. Previous learning methods developed have resulted in low Pearson correlation coefficients beyond the training set due...

Descripción completa

Detalles Bibliográficos
Autores principales: Pandey, Akash, Liu, Elaine, Graham, Jacob, Chen, Wei, Keten, Sinan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10499862/
https://www.ncbi.nlm.nih.gov/pubmed/37720331
http://dx.doi.org/10.1016/j.patter.2023.100805
_version_ 1785105801163046912
author Pandey, Akash
Liu, Elaine
Graham, Jacob
Chen, Wei
Keten, Sinan
author_facet Pandey, Akash
Liu, Elaine
Graham, Jacob
Chen, Wei
Keten, Sinan
author_sort Pandey, Akash
collection PubMed
description B factors provide critical insight into protein dynamics. Predicting B factors of an atom in new proteins remains challenging as it is impacted by their neighbors in Euclidean space. Previous learning methods developed have resulted in low Pearson correlation coefficients beyond the training set due to their limited ability to capture the effect of neighboring atoms. With the advances in deep learning methods, we develop a sequence-based model that is tested on 2,442 proteins and outperforms the state-of-the-art models by 30%. We find that the model learns that the B factor of a site is prominently affected by atoms within a 12–15 Å radius, which is in excellent agreement with cutoffs from protein network models. The ablation study revealed that the B factor can largely be predicted from the primary sequence alone. Based on the abovementioned points, our model lays a foundation for predicting other properties that are correlated with the B factor.
format Online
Article
Text
id pubmed-10499862
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-104998622023-09-15 B-factor prediction in proteins using a sequence-based deep learning model Pandey, Akash Liu, Elaine Graham, Jacob Chen, Wei Keten, Sinan Patterns (N Y) Article B factors provide critical insight into protein dynamics. Predicting B factors of an atom in new proteins remains challenging as it is impacted by their neighbors in Euclidean space. Previous learning methods developed have resulted in low Pearson correlation coefficients beyond the training set due to their limited ability to capture the effect of neighboring atoms. With the advances in deep learning methods, we develop a sequence-based model that is tested on 2,442 proteins and outperforms the state-of-the-art models by 30%. We find that the model learns that the B factor of a site is prominently affected by atoms within a 12–15 Å radius, which is in excellent agreement with cutoffs from protein network models. The ablation study revealed that the B factor can largely be predicted from the primary sequence alone. Based on the abovementioned points, our model lays a foundation for predicting other properties that are correlated with the B factor. Elsevier 2023-08-04 /pmc/articles/PMC10499862/ /pubmed/37720331 http://dx.doi.org/10.1016/j.patter.2023.100805 Text en © 2023 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Pandey, Akash
Liu, Elaine
Graham, Jacob
Chen, Wei
Keten, Sinan
B-factor prediction in proteins using a sequence-based deep learning model
title B-factor prediction in proteins using a sequence-based deep learning model
title_full B-factor prediction in proteins using a sequence-based deep learning model
title_fullStr B-factor prediction in proteins using a sequence-based deep learning model
title_full_unstemmed B-factor prediction in proteins using a sequence-based deep learning model
title_short B-factor prediction in proteins using a sequence-based deep learning model
title_sort b-factor prediction in proteins using a sequence-based deep learning model
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10499862/
https://www.ncbi.nlm.nih.gov/pubmed/37720331
http://dx.doi.org/10.1016/j.patter.2023.100805
work_keys_str_mv AT pandeyakash bfactorpredictioninproteinsusingasequencebaseddeeplearningmodel
AT liuelaine bfactorpredictioninproteinsusingasequencebaseddeeplearningmodel
AT grahamjacob bfactorpredictioninproteinsusingasequencebaseddeeplearningmodel
AT chenwei bfactorpredictioninproteinsusingasequencebaseddeeplearningmodel
AT ketensinan bfactorpredictioninproteinsusingasequencebaseddeeplearningmodel