Cargando…

Accurate Ab Initio and Template-Based Prediction of Short Intrinsically-Disordered Regions by Bidirectional Recurrent Neural Networks Trained on Large-Scale Datasets

Intrinsically-disordered regions lack a well-defined 3D structure, but play key roles in determining the function of many proteins. Although predictors of disorder have been shown to achieve relatively high rates of correct classification of these segments, improvements over the the years have been...

Descripción completa

Detalles Bibliográficos
Autores principales: Volpato, Viola, Alshomrani, Badr, Pollastri, Gianluca
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4581330/
https://www.ncbi.nlm.nih.gov/pubmed/26307973
http://dx.doi.org/10.3390/ijms160819868
_version_ 1782391545891127296
author Volpato, Viola
Alshomrani, Badr
Pollastri, Gianluca
author_facet Volpato, Viola
Alshomrani, Badr
Pollastri, Gianluca
author_sort Volpato, Viola
collection PubMed
description Intrinsically-disordered regions lack a well-defined 3D structure, but play key roles in determining the function of many proteins. Although predictors of disorder have been shown to achieve relatively high rates of correct classification of these segments, improvements over the the years have been slow, and accurate methods are needed that are capable of accommodating the ever-increasing amount of structurally-determined protein sequences to try to boost predictive performances. In this paper, we propose a predictor for short disordered regions based on bidirectional recurrent neural networks and tested by rigorous five-fold cross-validation on a large, non-redundant dataset collected from MobiDB, a new comprehensive source of protein disorder annotations. The system exploits sequence and structural information in the forms of frequency profiles, predicted secondary structure and solvent accessibility and direct disorder annotations from homologous protein structures (templates) deposited in the Protein Data Bank. The contributions of sequence, structure and homology information result in large improvements in predictive accuracy. Additionally, the large scale of the training set leads to low false positive rates, making our systems a robust and efficient way to address high-throughput disorder prediction.
format Online
Article
Text
id pubmed-4581330
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-45813302015-09-28 Accurate Ab Initio and Template-Based Prediction of Short Intrinsically-Disordered Regions by Bidirectional Recurrent Neural Networks Trained on Large-Scale Datasets Volpato, Viola Alshomrani, Badr Pollastri, Gianluca Int J Mol Sci Article Intrinsically-disordered regions lack a well-defined 3D structure, but play key roles in determining the function of many proteins. Although predictors of disorder have been shown to achieve relatively high rates of correct classification of these segments, improvements over the the years have been slow, and accurate methods are needed that are capable of accommodating the ever-increasing amount of structurally-determined protein sequences to try to boost predictive performances. In this paper, we propose a predictor for short disordered regions based on bidirectional recurrent neural networks and tested by rigorous five-fold cross-validation on a large, non-redundant dataset collected from MobiDB, a new comprehensive source of protein disorder annotations. The system exploits sequence and structural information in the forms of frequency profiles, predicted secondary structure and solvent accessibility and direct disorder annotations from homologous protein structures (templates) deposited in the Protein Data Bank. The contributions of sequence, structure and homology information result in large improvements in predictive accuracy. Additionally, the large scale of the training set leads to low false positive rates, making our systems a robust and efficient way to address high-throughput disorder prediction. MDPI 2015-08-21 /pmc/articles/PMC4581330/ /pubmed/26307973 http://dx.doi.org/10.3390/ijms160819868 Text en © 2015 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Volpato, Viola
Alshomrani, Badr
Pollastri, Gianluca
Accurate Ab Initio and Template-Based Prediction of Short Intrinsically-Disordered Regions by Bidirectional Recurrent Neural Networks Trained on Large-Scale Datasets
title Accurate Ab Initio and Template-Based Prediction of Short Intrinsically-Disordered Regions by Bidirectional Recurrent Neural Networks Trained on Large-Scale Datasets
title_full Accurate Ab Initio and Template-Based Prediction of Short Intrinsically-Disordered Regions by Bidirectional Recurrent Neural Networks Trained on Large-Scale Datasets
title_fullStr Accurate Ab Initio and Template-Based Prediction of Short Intrinsically-Disordered Regions by Bidirectional Recurrent Neural Networks Trained on Large-Scale Datasets
title_full_unstemmed Accurate Ab Initio and Template-Based Prediction of Short Intrinsically-Disordered Regions by Bidirectional Recurrent Neural Networks Trained on Large-Scale Datasets
title_short Accurate Ab Initio and Template-Based Prediction of Short Intrinsically-Disordered Regions by Bidirectional Recurrent Neural Networks Trained on Large-Scale Datasets
title_sort accurate ab initio and template-based prediction of short intrinsically-disordered regions by bidirectional recurrent neural networks trained on large-scale datasets
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4581330/
https://www.ncbi.nlm.nih.gov/pubmed/26307973
http://dx.doi.org/10.3390/ijms160819868
work_keys_str_mv AT volpatoviola accurateabinitioandtemplatebasedpredictionofshortintrinsicallydisorderedregionsbybidirectionalrecurrentneuralnetworkstrainedonlargescaledatasets
AT alshomranibadr accurateabinitioandtemplatebasedpredictionofshortintrinsicallydisorderedregionsbybidirectionalrecurrentneuralnetworkstrainedonlargescaledatasets
AT pollastrigianluca accurateabinitioandtemplatebasedpredictionofshortintrinsicallydisorderedregionsbybidirectionalrecurrentneuralnetworkstrainedonlargescaledatasets