Cargando…

ODiNPred: comprehensive prediction of protein order and disorder

Structural disorder is widespread in eukaryotic proteins and is vital for their function in diverse biological processes. It is therefore highly desirable to be able to predict the degree of order and disorder from amino acid sequence. It is, however, notoriously difficult to predict the degree of l...

Descripción completa

Detalles Bibliográficos
Autores principales: Dass, Rupashree, Mulder, Frans A. A., Nielsen, Jakob Toudahl
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7479119/
https://www.ncbi.nlm.nih.gov/pubmed/32901090
http://dx.doi.org/10.1038/s41598-020-71716-1
_version_ 1783580201321496576
author Dass, Rupashree
Mulder, Frans A. A.
Nielsen, Jakob Toudahl
author_facet Dass, Rupashree
Mulder, Frans A. A.
Nielsen, Jakob Toudahl
author_sort Dass, Rupashree
collection PubMed
description Structural disorder is widespread in eukaryotic proteins and is vital for their function in diverse biological processes. It is therefore highly desirable to be able to predict the degree of order and disorder from amino acid sequence. It is, however, notoriously difficult to predict the degree of local flexibility within structured domains and the presence and nuances of localized rigidity within intrinsically disordered regions. To identify such instances, we used the CheZOD database, which encompasses accurate, balanced, and continuous-valued quantification of protein (dis)order at amino acid resolution based on NMR chemical shifts. To computationally forecast the spectrum of protein disorder in the most comprehensive manner possible, we constructed the sequence-based protein order/disorder predictor ODiNPred, trained on an expanded version of CheZOD. ODiNPred applies a deep neural network comprising 157 unique sequence features to 1325 protein sequences together with the experimental NMR chemical shift data. Cross-validation for 117 protein sequences shows that ODiNPred better predicts the continuous variation in order along the protein sequence, suggesting that contemporary predictors are limited by the quality of training data. The inclusion of evolutionary features reduces the performance gap between ODiNPred and its peers, but analysis shows that it retains greater accuracy for the more challenging prediction of intermediate disorder.
format Online
Article
Text
id pubmed-7479119
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-74791192020-09-11 ODiNPred: comprehensive prediction of protein order and disorder Dass, Rupashree Mulder, Frans A. A. Nielsen, Jakob Toudahl Sci Rep Article Structural disorder is widespread in eukaryotic proteins and is vital for their function in diverse biological processes. It is therefore highly desirable to be able to predict the degree of order and disorder from amino acid sequence. It is, however, notoriously difficult to predict the degree of local flexibility within structured domains and the presence and nuances of localized rigidity within intrinsically disordered regions. To identify such instances, we used the CheZOD database, which encompasses accurate, balanced, and continuous-valued quantification of protein (dis)order at amino acid resolution based on NMR chemical shifts. To computationally forecast the spectrum of protein disorder in the most comprehensive manner possible, we constructed the sequence-based protein order/disorder predictor ODiNPred, trained on an expanded version of CheZOD. ODiNPred applies a deep neural network comprising 157 unique sequence features to 1325 protein sequences together with the experimental NMR chemical shift data. Cross-validation for 117 protein sequences shows that ODiNPred better predicts the continuous variation in order along the protein sequence, suggesting that contemporary predictors are limited by the quality of training data. The inclusion of evolutionary features reduces the performance gap between ODiNPred and its peers, but analysis shows that it retains greater accuracy for the more challenging prediction of intermediate disorder. Nature Publishing Group UK 2020-09-08 /pmc/articles/PMC7479119/ /pubmed/32901090 http://dx.doi.org/10.1038/s41598-020-71716-1 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Dass, Rupashree
Mulder, Frans A. A.
Nielsen, Jakob Toudahl
ODiNPred: comprehensive prediction of protein order and disorder
title ODiNPred: comprehensive prediction of protein order and disorder
title_full ODiNPred: comprehensive prediction of protein order and disorder
title_fullStr ODiNPred: comprehensive prediction of protein order and disorder
title_full_unstemmed ODiNPred: comprehensive prediction of protein order and disorder
title_short ODiNPred: comprehensive prediction of protein order and disorder
title_sort odinpred: comprehensive prediction of protein order and disorder
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7479119/
https://www.ncbi.nlm.nih.gov/pubmed/32901090
http://dx.doi.org/10.1038/s41598-020-71716-1
work_keys_str_mv AT dassrupashree odinpredcomprehensivepredictionofproteinorderanddisorder
AT mulderfransaa odinpredcomprehensivepredictionofproteinorderanddisorder
AT nielsenjakobtoudahl odinpredcomprehensivepredictionofproteinorderanddisorder