Cargando…

Accurate prediction of protein torsion angles using evolutionary signatures and recurrent neural network

The amino acid sequence of a protein contains all the necessary information to specify its shape, which dictates its biological activities. However, it is challenging and expensive to experimentally determine the three-dimensional structure of proteins. The backbone torsion angles play a critical ro...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Yong-Chang, ShangGuan, Tian-Jun, Ding, Xue-Ming, Cheung, Ngaam J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8548351/
https://www.ncbi.nlm.nih.gov/pubmed/34702851
http://dx.doi.org/10.1038/s41598-021-00477-2
_version_ 1784590554083885056
author Xu, Yong-Chang
ShangGuan, Tian-Jun
Ding, Xue-Ming
Cheung, Ngaam J.
author_facet Xu, Yong-Chang
ShangGuan, Tian-Jun
Ding, Xue-Ming
Cheung, Ngaam J.
author_sort Xu, Yong-Chang
collection PubMed
description The amino acid sequence of a protein contains all the necessary information to specify its shape, which dictates its biological activities. However, it is challenging and expensive to experimentally determine the three-dimensional structure of proteins. The backbone torsion angles play a critical role in protein structure prediction, and accurately predicting the angles can considerably advance the tertiary structure prediction by accelerating efficient sampling of the large conformational space for low energy structures. Here we first time propose evolutionary signatures computed from protein sequence profiles, and a novel recurrent architecture, termed ESIDEN, that adopts a straightforward architecture of recurrent neural networks with a small number of learnable parameters. The ESIDEN can capture efficient information from both the classic and new features benefiting from different recurrent architectures in processing information. On the other hand, compared to widely used classic features, the new features, especially the Ramachandran basin potential, provide statistical and evolutionary information to improve prediction accuracy. On four widely used benchmark datasets, the ESIDEN significantly improves the accuracy in predicting the torsion angles by comparison to the best-so-far methods. As demonstrated in the present study, the predicted angles can be used as structural constraints to accurately infer protein tertiary structures. Moreover, the proposed features would pave the way to improve machine learning-based methods in protein folding and structure prediction, as well as function prediction. The source code and data are available at the website https://kornmann.bioch.ox.ac.uk/leri/resources/download.html.
format Online
Article
Text
id pubmed-8548351
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-85483512021-10-27 Accurate prediction of protein torsion angles using evolutionary signatures and recurrent neural network Xu, Yong-Chang ShangGuan, Tian-Jun Ding, Xue-Ming Cheung, Ngaam J. Sci Rep Article The amino acid sequence of a protein contains all the necessary information to specify its shape, which dictates its biological activities. However, it is challenging and expensive to experimentally determine the three-dimensional structure of proteins. The backbone torsion angles play a critical role in protein structure prediction, and accurately predicting the angles can considerably advance the tertiary structure prediction by accelerating efficient sampling of the large conformational space for low energy structures. Here we first time propose evolutionary signatures computed from protein sequence profiles, and a novel recurrent architecture, termed ESIDEN, that adopts a straightforward architecture of recurrent neural networks with a small number of learnable parameters. The ESIDEN can capture efficient information from both the classic and new features benefiting from different recurrent architectures in processing information. On the other hand, compared to widely used classic features, the new features, especially the Ramachandran basin potential, provide statistical and evolutionary information to improve prediction accuracy. On four widely used benchmark datasets, the ESIDEN significantly improves the accuracy in predicting the torsion angles by comparison to the best-so-far methods. As demonstrated in the present study, the predicted angles can be used as structural constraints to accurately infer protein tertiary structures. Moreover, the proposed features would pave the way to improve machine learning-based methods in protein folding and structure prediction, as well as function prediction. The source code and data are available at the website https://kornmann.bioch.ox.ac.uk/leri/resources/download.html. Nature Publishing Group UK 2021-10-26 /pmc/articles/PMC8548351/ /pubmed/34702851 http://dx.doi.org/10.1038/s41598-021-00477-2 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Xu, Yong-Chang
ShangGuan, Tian-Jun
Ding, Xue-Ming
Cheung, Ngaam J.
Accurate prediction of protein torsion angles using evolutionary signatures and recurrent neural network
title Accurate prediction of protein torsion angles using evolutionary signatures and recurrent neural network
title_full Accurate prediction of protein torsion angles using evolutionary signatures and recurrent neural network
title_fullStr Accurate prediction of protein torsion angles using evolutionary signatures and recurrent neural network
title_full_unstemmed Accurate prediction of protein torsion angles using evolutionary signatures and recurrent neural network
title_short Accurate prediction of protein torsion angles using evolutionary signatures and recurrent neural network
title_sort accurate prediction of protein torsion angles using evolutionary signatures and recurrent neural network
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8548351/
https://www.ncbi.nlm.nih.gov/pubmed/34702851
http://dx.doi.org/10.1038/s41598-021-00477-2
work_keys_str_mv AT xuyongchang accuratepredictionofproteintorsionanglesusingevolutionarysignaturesandrecurrentneuralnetwork
AT shangguantianjun accuratepredictionofproteintorsionanglesusingevolutionarysignaturesandrecurrentneuralnetwork
AT dingxueming accuratepredictionofproteintorsionanglesusingevolutionarysignaturesandrecurrentneuralnetwork
AT cheungngaamj accuratepredictionofproteintorsionanglesusingevolutionarysignaturesandrecurrentneuralnetwork