Cargando…

DeepACLSTM: deep asymmetric convolutional long short-term memory neural models for protein secondary structure prediction

BACKGROUND: Protein secondary structure (PSS) is critical to further predict the tertiary structure, understand protein function and design drugs. However, experimental techniques of PSS are time consuming and expensive, and thus it’s very urgent to develop efficient computational approaches for pre...

Descripción completa

Detalles Bibliográficos
Autores principales: Guo, Yanbu, Li, Weihua, Wang, Bingyi, Liu, Huiqing, Zhou, Dongming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6580607/
https://www.ncbi.nlm.nih.gov/pubmed/31208331
http://dx.doi.org/10.1186/s12859-019-2940-0
_version_ 1783428053979889664
author Guo, Yanbu
Li, Weihua
Wang, Bingyi
Liu, Huiqing
Zhou, Dongming
author_facet Guo, Yanbu
Li, Weihua
Wang, Bingyi
Liu, Huiqing
Zhou, Dongming
author_sort Guo, Yanbu
collection PubMed
description BACKGROUND: Protein secondary structure (PSS) is critical to further predict the tertiary structure, understand protein function and design drugs. However, experimental techniques of PSS are time consuming and expensive, and thus it’s very urgent to develop efficient computational approaches for predicting PSS based on sequence information alone. Moreover, the feature matrix of a protein contains two dimensions: the amino-acid residue dimension and the feature vector dimension. Existing deep learning based methods have achieved remarkable performances of PSS prediction, but the methods often utilize the features from the amino-acid dimension. Thus, there is still room to improve computational methods of PSS prediction. RESULTS: We propose a novel deep neural network method, called DeepACLSTM, to predict 8-category PSS from protein sequence features and profile features. Our method efficiently applies asymmetric convolutional neural networks (ACNNs) combined with bidirectional long short-term memory (BLSTM) neural networks to predict PSS, leveraging the feature vector dimension of the protein feature matrix. In DeepACLSTM, the ACNNs extract the complex local contexts of amino-acids; the BLSTM neural networks capture the long-distance interdependencies between amino-acids. Furthermore, the prediction module predicts the category of each amino-acid residue based on both local contexts and long-distance interdependencies. To evaluate performances of DeepACLSTM, we conduct experiments on three publicly available datasets: CB513, CASP10 and CASP12. Results indicate that the performance of our method is superior to the state-of-the-art baselines on three publicly datasets. CONCLUSIONS: Experiments demonstrate that DeepACLSTM is an efficient predication method for predicting 8-category PSS and has the ability to extract more complex sequence-structure relationships between amino-acid residues. Moreover, experiments also indicate the feature vector dimension contains the useful information for improving PSS prediction.
format Online
Article
Text
id pubmed-6580607
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-65806072019-06-24 DeepACLSTM: deep asymmetric convolutional long short-term memory neural models for protein secondary structure prediction Guo, Yanbu Li, Weihua Wang, Bingyi Liu, Huiqing Zhou, Dongming BMC Bioinformatics Research Article BACKGROUND: Protein secondary structure (PSS) is critical to further predict the tertiary structure, understand protein function and design drugs. However, experimental techniques of PSS are time consuming and expensive, and thus it’s very urgent to develop efficient computational approaches for predicting PSS based on sequence information alone. Moreover, the feature matrix of a protein contains two dimensions: the amino-acid residue dimension and the feature vector dimension. Existing deep learning based methods have achieved remarkable performances of PSS prediction, but the methods often utilize the features from the amino-acid dimension. Thus, there is still room to improve computational methods of PSS prediction. RESULTS: We propose a novel deep neural network method, called DeepACLSTM, to predict 8-category PSS from protein sequence features and profile features. Our method efficiently applies asymmetric convolutional neural networks (ACNNs) combined with bidirectional long short-term memory (BLSTM) neural networks to predict PSS, leveraging the feature vector dimension of the protein feature matrix. In DeepACLSTM, the ACNNs extract the complex local contexts of amino-acids; the BLSTM neural networks capture the long-distance interdependencies between amino-acids. Furthermore, the prediction module predicts the category of each amino-acid residue based on both local contexts and long-distance interdependencies. To evaluate performances of DeepACLSTM, we conduct experiments on three publicly available datasets: CB513, CASP10 and CASP12. Results indicate that the performance of our method is superior to the state-of-the-art baselines on three publicly datasets. CONCLUSIONS: Experiments demonstrate that DeepACLSTM is an efficient predication method for predicting 8-category PSS and has the ability to extract more complex sequence-structure relationships between amino-acid residues. Moreover, experiments also indicate the feature vector dimension contains the useful information for improving PSS prediction. BioMed Central 2019-06-17 /pmc/articles/PMC6580607/ /pubmed/31208331 http://dx.doi.org/10.1186/s12859-019-2940-0 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Guo, Yanbu
Li, Weihua
Wang, Bingyi
Liu, Huiqing
Zhou, Dongming
DeepACLSTM: deep asymmetric convolutional long short-term memory neural models for protein secondary structure prediction
title DeepACLSTM: deep asymmetric convolutional long short-term memory neural models for protein secondary structure prediction
title_full DeepACLSTM: deep asymmetric convolutional long short-term memory neural models for protein secondary structure prediction
title_fullStr DeepACLSTM: deep asymmetric convolutional long short-term memory neural models for protein secondary structure prediction
title_full_unstemmed DeepACLSTM: deep asymmetric convolutional long short-term memory neural models for protein secondary structure prediction
title_short DeepACLSTM: deep asymmetric convolutional long short-term memory neural models for protein secondary structure prediction
title_sort deepaclstm: deep asymmetric convolutional long short-term memory neural models for protein secondary structure prediction
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6580607/
https://www.ncbi.nlm.nih.gov/pubmed/31208331
http://dx.doi.org/10.1186/s12859-019-2940-0
work_keys_str_mv AT guoyanbu deepaclstmdeepasymmetricconvolutionallongshorttermmemoryneuralmodelsforproteinsecondarystructureprediction
AT liweihua deepaclstmdeepasymmetricconvolutionallongshorttermmemoryneuralmodelsforproteinsecondarystructureprediction
AT wangbingyi deepaclstmdeepasymmetricconvolutionallongshorttermmemoryneuralmodelsforproteinsecondarystructureprediction
AT liuhuiqing deepaclstmdeepasymmetricconvolutionallongshorttermmemoryneuralmodelsforproteinsecondarystructureprediction
AT zhoudongming deepaclstmdeepasymmetricconvolutionallongshorttermmemoryneuralmodelsforproteinsecondarystructureprediction