Cargando…
ODiNPred: comprehensive prediction of protein order and disorder
Structural disorder is widespread in eukaryotic proteins and is vital for their function in diverse biological processes. It is therefore highly desirable to be able to predict the degree of order and disorder from amino acid sequence. It is, however, notoriously difficult to predict the degree of l...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7479119/ https://www.ncbi.nlm.nih.gov/pubmed/32901090 http://dx.doi.org/10.1038/s41598-020-71716-1 |
_version_ | 1783580201321496576 |
---|---|
author | Dass, Rupashree Mulder, Frans A. A. Nielsen, Jakob Toudahl |
author_facet | Dass, Rupashree Mulder, Frans A. A. Nielsen, Jakob Toudahl |
author_sort | Dass, Rupashree |
collection | PubMed |
description | Structural disorder is widespread in eukaryotic proteins and is vital for their function in diverse biological processes. It is therefore highly desirable to be able to predict the degree of order and disorder from amino acid sequence. It is, however, notoriously difficult to predict the degree of local flexibility within structured domains and the presence and nuances of localized rigidity within intrinsically disordered regions. To identify such instances, we used the CheZOD database, which encompasses accurate, balanced, and continuous-valued quantification of protein (dis)order at amino acid resolution based on NMR chemical shifts. To computationally forecast the spectrum of protein disorder in the most comprehensive manner possible, we constructed the sequence-based protein order/disorder predictor ODiNPred, trained on an expanded version of CheZOD. ODiNPred applies a deep neural network comprising 157 unique sequence features to 1325 protein sequences together with the experimental NMR chemical shift data. Cross-validation for 117 protein sequences shows that ODiNPred better predicts the continuous variation in order along the protein sequence, suggesting that contemporary predictors are limited by the quality of training data. The inclusion of evolutionary features reduces the performance gap between ODiNPred and its peers, but analysis shows that it retains greater accuracy for the more challenging prediction of intermediate disorder. |
format | Online Article Text |
id | pubmed-7479119 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-74791192020-09-11 ODiNPred: comprehensive prediction of protein order and disorder Dass, Rupashree Mulder, Frans A. A. Nielsen, Jakob Toudahl Sci Rep Article Structural disorder is widespread in eukaryotic proteins and is vital for their function in diverse biological processes. It is therefore highly desirable to be able to predict the degree of order and disorder from amino acid sequence. It is, however, notoriously difficult to predict the degree of local flexibility within structured domains and the presence and nuances of localized rigidity within intrinsically disordered regions. To identify such instances, we used the CheZOD database, which encompasses accurate, balanced, and continuous-valued quantification of protein (dis)order at amino acid resolution based on NMR chemical shifts. To computationally forecast the spectrum of protein disorder in the most comprehensive manner possible, we constructed the sequence-based protein order/disorder predictor ODiNPred, trained on an expanded version of CheZOD. ODiNPred applies a deep neural network comprising 157 unique sequence features to 1325 protein sequences together with the experimental NMR chemical shift data. Cross-validation for 117 protein sequences shows that ODiNPred better predicts the continuous variation in order along the protein sequence, suggesting that contemporary predictors are limited by the quality of training data. The inclusion of evolutionary features reduces the performance gap between ODiNPred and its peers, but analysis shows that it retains greater accuracy for the more challenging prediction of intermediate disorder. Nature Publishing Group UK 2020-09-08 /pmc/articles/PMC7479119/ /pubmed/32901090 http://dx.doi.org/10.1038/s41598-020-71716-1 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Dass, Rupashree Mulder, Frans A. A. Nielsen, Jakob Toudahl ODiNPred: comprehensive prediction of protein order and disorder |
title | ODiNPred: comprehensive prediction of protein order and disorder |
title_full | ODiNPred: comprehensive prediction of protein order and disorder |
title_fullStr | ODiNPred: comprehensive prediction of protein order and disorder |
title_full_unstemmed | ODiNPred: comprehensive prediction of protein order and disorder |
title_short | ODiNPred: comprehensive prediction of protein order and disorder |
title_sort | odinpred: comprehensive prediction of protein order and disorder |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7479119/ https://www.ncbi.nlm.nih.gov/pubmed/32901090 http://dx.doi.org/10.1038/s41598-020-71716-1 |
work_keys_str_mv | AT dassrupashree odinpredcomprehensivepredictionofproteinorderanddisorder AT mulderfransaa odinpredcomprehensivepredictionofproteinorderanddisorder AT nielsenjakobtoudahl odinpredcomprehensivepredictionofproteinorderanddisorder |