Cargando…

Deep Ensemble Learning with Atrous Spatial Pyramid Networks for Protein Secondary Structure Prediction

The secondary structure of proteins is significant for studying the three-dimensional structure and functions of proteins. Several models from image understanding and natural language modeling have been successfully adapted in the protein sequence study area, such as Long Short-term Memory (LSTM) ne...

Descripción completa

Detalles Bibliográficos
Autores principales:	Guo, Yuzhi, Wu, Jiaxiang, Ma, Hehuan, Wang, Sheng, Huang, Junzhou
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9221033/ https://www.ncbi.nlm.nih.gov/pubmed/35740899 http://dx.doi.org/10.3390/biom12060774

_version_	1784732520832565248
author	Guo, Yuzhi Wu, Jiaxiang Ma, Hehuan Wang, Sheng Huang, Junzhou
author_facet	Guo, Yuzhi Wu, Jiaxiang Ma, Hehuan Wang, Sheng Huang, Junzhou
author_sort	Guo, Yuzhi
collection	PubMed
description	The secondary structure of proteins is significant for studying the three-dimensional structure and functions of proteins. Several models from image understanding and natural language modeling have been successfully adapted in the protein sequence study area, such as Long Short-term Memory (LSTM) network and Convolutional Neural Network (CNN). Recently, Gated Convolutional Neural Network (GCNN) has been proposed for natural language processing. It has achieved high levels of sentence scoring, as well as reduced the latency. Conditionally Parameterized Convolution (CondConv) is another novel study which has gained great success in the image processing area. Compared with vanilla CNN, CondConv uses extra sample-dependant modules to conditionally adjust the convolutional network. In this paper, we propose a novel Conditionally Parameterized Convolutional network (CondGCNN) which utilizes the power of both CondConv and GCNN. CondGCNN leverages an ensemble encoder to combine the capabilities of both LSTM and CondGCNN to encode protein sequences by better capturing protein sequential features. In addition, we explore the similarity between the secondary structure prediction problem and the image segmentation problem, and propose an ASP network (Atrous Spatial Pyramid Pooling (ASPP) based network) to capture fine boundary details in secondary structure. Extensive experiments show that the proposed method can achieve higher performance on protein secondary structure prediction task than existing methods on CB513, Casp11, CASP12, CASP13, and CASP14 datasets. We also conducted ablation studies over each component to verify the effectiveness. Our method is expected to be useful for any protein related prediction tasks, which is not limited to protein secondary structure prediction.
format	Online Article Text
id	pubmed-9221033
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-92210332022-06-24 Deep Ensemble Learning with Atrous Spatial Pyramid Networks for Protein Secondary Structure Prediction Guo, Yuzhi Wu, Jiaxiang Ma, Hehuan Wang, Sheng Huang, Junzhou Biomolecules Article The secondary structure of proteins is significant for studying the three-dimensional structure and functions of proteins. Several models from image understanding and natural language modeling have been successfully adapted in the protein sequence study area, such as Long Short-term Memory (LSTM) network and Convolutional Neural Network (CNN). Recently, Gated Convolutional Neural Network (GCNN) has been proposed for natural language processing. It has achieved high levels of sentence scoring, as well as reduced the latency. Conditionally Parameterized Convolution (CondConv) is another novel study which has gained great success in the image processing area. Compared with vanilla CNN, CondConv uses extra sample-dependant modules to conditionally adjust the convolutional network. In this paper, we propose a novel Conditionally Parameterized Convolutional network (CondGCNN) which utilizes the power of both CondConv and GCNN. CondGCNN leverages an ensemble encoder to combine the capabilities of both LSTM and CondGCNN to encode protein sequences by better capturing protein sequential features. In addition, we explore the similarity between the secondary structure prediction problem and the image segmentation problem, and propose an ASP network (Atrous Spatial Pyramid Pooling (ASPP) based network) to capture fine boundary details in secondary structure. Extensive experiments show that the proposed method can achieve higher performance on protein secondary structure prediction task than existing methods on CB513, Casp11, CASP12, CASP13, and CASP14 datasets. We also conducted ablation studies over each component to verify the effectiveness. Our method is expected to be useful for any protein related prediction tasks, which is not limited to protein secondary structure prediction. MDPI 2022-06-02 /pmc/articles/PMC9221033/ /pubmed/35740899 http://dx.doi.org/10.3390/biom12060774 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Guo, Yuzhi Wu, Jiaxiang Ma, Hehuan Wang, Sheng Huang, Junzhou Deep Ensemble Learning with Atrous Spatial Pyramid Networks for Protein Secondary Structure Prediction
title	Deep Ensemble Learning with Atrous Spatial Pyramid Networks for Protein Secondary Structure Prediction
title_full	Deep Ensemble Learning with Atrous Spatial Pyramid Networks for Protein Secondary Structure Prediction
title_fullStr	Deep Ensemble Learning with Atrous Spatial Pyramid Networks for Protein Secondary Structure Prediction
title_full_unstemmed	Deep Ensemble Learning with Atrous Spatial Pyramid Networks for Protein Secondary Structure Prediction
title_short	Deep Ensemble Learning with Atrous Spatial Pyramid Networks for Protein Secondary Structure Prediction
title_sort	deep ensemble learning with atrous spatial pyramid networks for protein secondary structure prediction
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9221033/ https://www.ncbi.nlm.nih.gov/pubmed/35740899 http://dx.doi.org/10.3390/biom12060774
work_keys_str_mv	AT guoyuzhi deepensemblelearningwithatrousspatialpyramidnetworksforproteinsecondarystructureprediction AT wujiaxiang deepensemblelearningwithatrousspatialpyramidnetworksforproteinsecondarystructureprediction AT mahehuan deepensemblelearningwithatrousspatialpyramidnetworksforproteinsecondarystructureprediction AT wangsheng deepensemblelearningwithatrousspatialpyramidnetworksforproteinsecondarystructureprediction AT huangjunzhou deepensemblelearningwithatrousspatialpyramidnetworksforproteinsecondarystructureprediction

Deep Ensemble Learning with Atrous Spatial Pyramid Networks for Protein Secondary Structure Prediction

Ejemplares similares