Cargando…

Deep learning methods for protein torsion angle prediction

BACKGROUND: Deep learning is one of the most powerful machine learning methods that has achieved the state-of-the-art performance in many domains. Since deep learning was introduced to the field of bioinformatics in 2012, it has achieved success in a number of areas such as protein residue-residue c...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Haiou, Hou, Jie, Adhikari, Badri, Lyu, Qiang, Cheng, Jianlin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5604354/
https://www.ncbi.nlm.nih.gov/pubmed/28923002
http://dx.doi.org/10.1186/s12859-017-1834-2
_version_ 1783264848359981056
author Li, Haiou
Hou, Jie
Adhikari, Badri
Lyu, Qiang
Cheng, Jianlin
author_facet Li, Haiou
Hou, Jie
Adhikari, Badri
Lyu, Qiang
Cheng, Jianlin
author_sort Li, Haiou
collection PubMed
description BACKGROUND: Deep learning is one of the most powerful machine learning methods that has achieved the state-of-the-art performance in many domains. Since deep learning was introduced to the field of bioinformatics in 2012, it has achieved success in a number of areas such as protein residue-residue contact prediction, secondary structure prediction, and fold recognition. In this work, we developed deep learning methods to improve the prediction of torsion (dihedral) angles of proteins. RESULTS: We design four different deep learning architectures to predict protein torsion angles. The architectures including deep neural network (DNN) and deep restricted Boltzmann machine (DRBN), deep recurrent neural network (DRNN) and deep recurrent restricted Boltzmann machine (DReRBM) since the protein torsion angle prediction is a sequence related problem. In addition to existing protein features, two new features (predicted residue contact number and the error distribution of torsion angles extracted from sequence fragments) are used as input to each of the four deep learning architectures to predict phi and psi angles of protein backbone. The mean absolute error (MAE) of phi and psi angles predicted by DRNN, DReRBM, DRBM and DNN is about 20–21° and 29–30° on an independent dataset. The MAE of phi angle is comparable to the existing methods, but the MAE of psi angle is 29°, 2° lower than the existing methods. On the latest CASP12 targets, our methods also achieved the performance better than or comparable to a state-of-the art method. CONCLUSIONS: Our experiment demonstrates that deep learning is a valuable method for predicting protein torsion angles. The deep recurrent network architecture performs slightly better than deep feed-forward architecture, and the predicted residue contact number and the error distribution of torsion angles extracted from sequence fragments are useful features for improving prediction accuracy. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-017-1834-2) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5604354
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-56043542017-09-21 Deep learning methods for protein torsion angle prediction Li, Haiou Hou, Jie Adhikari, Badri Lyu, Qiang Cheng, Jianlin BMC Bioinformatics Methodology Article BACKGROUND: Deep learning is one of the most powerful machine learning methods that has achieved the state-of-the-art performance in many domains. Since deep learning was introduced to the field of bioinformatics in 2012, it has achieved success in a number of areas such as protein residue-residue contact prediction, secondary structure prediction, and fold recognition. In this work, we developed deep learning methods to improve the prediction of torsion (dihedral) angles of proteins. RESULTS: We design four different deep learning architectures to predict protein torsion angles. The architectures including deep neural network (DNN) and deep restricted Boltzmann machine (DRBN), deep recurrent neural network (DRNN) and deep recurrent restricted Boltzmann machine (DReRBM) since the protein torsion angle prediction is a sequence related problem. In addition to existing protein features, two new features (predicted residue contact number and the error distribution of torsion angles extracted from sequence fragments) are used as input to each of the four deep learning architectures to predict phi and psi angles of protein backbone. The mean absolute error (MAE) of phi and psi angles predicted by DRNN, DReRBM, DRBM and DNN is about 20–21° and 29–30° on an independent dataset. The MAE of phi angle is comparable to the existing methods, but the MAE of psi angle is 29°, 2° lower than the existing methods. On the latest CASP12 targets, our methods also achieved the performance better than or comparable to a state-of-the art method. CONCLUSIONS: Our experiment demonstrates that deep learning is a valuable method for predicting protein torsion angles. The deep recurrent network architecture performs slightly better than deep feed-forward architecture, and the predicted residue contact number and the error distribution of torsion angles extracted from sequence fragments are useful features for improving prediction accuracy. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-017-1834-2) contains supplementary material, which is available to authorized users. BioMed Central 2017-09-18 /pmc/articles/PMC5604354/ /pubmed/28923002 http://dx.doi.org/10.1186/s12859-017-1834-2 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Li, Haiou
Hou, Jie
Adhikari, Badri
Lyu, Qiang
Cheng, Jianlin
Deep learning methods for protein torsion angle prediction
title Deep learning methods for protein torsion angle prediction
title_full Deep learning methods for protein torsion angle prediction
title_fullStr Deep learning methods for protein torsion angle prediction
title_full_unstemmed Deep learning methods for protein torsion angle prediction
title_short Deep learning methods for protein torsion angle prediction
title_sort deep learning methods for protein torsion angle prediction
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5604354/
https://www.ncbi.nlm.nih.gov/pubmed/28923002
http://dx.doi.org/10.1186/s12859-017-1834-2
work_keys_str_mv AT lihaiou deeplearningmethodsforproteintorsionangleprediction
AT houjie deeplearningmethodsforproteintorsionangleprediction
AT adhikaribadri deeplearningmethodsforproteintorsionangleprediction
AT lyuqiang deeplearningmethodsforproteintorsionangleprediction
AT chengjianlin deeplearningmethodsforproteintorsionangleprediction