Cargando…

Compositional Bias of Intrinsically Disordered Proteins and Regions and Their Predictions

Intrinsically disordered regions (IDRs) carry out many cellular functions and vary in length and placement in protein sequences. This diversity leads to variations in the underlying compositional biases, which were demonstrated for the short vs. long IDRs. We analyze compositional biases across four...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Bi, Kurgan, Lukasz
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9313023/
https://www.ncbi.nlm.nih.gov/pubmed/35883444
http://dx.doi.org/10.3390/biom12070888
_version_ 1784753977323159552
author Zhao, Bi
Kurgan, Lukasz
author_facet Zhao, Bi
Kurgan, Lukasz
author_sort Zhao, Bi
collection PubMed
description Intrinsically disordered regions (IDRs) carry out many cellular functions and vary in length and placement in protein sequences. This diversity leads to variations in the underlying compositional biases, which were demonstrated for the short vs. long IDRs. We analyze compositional biases across four classes of disorder: fully disordered proteins; short IDRs; long IDRs; and binding IDRs. We identify three distinct biases: for the fully disordered proteins, the short IDRs and the long and binding IDRs combined. We also investigate compositional bias for putative disorder produced by leading disorder predictors and find that it is similar to the bias of the native disorder. Interestingly, the accuracy of disorder predictions across different methods is correlated with the correctness of the compositional bias of their predictions highlighting the importance of the compositional bias. The predictive quality is relatively low for the disorder classes with compositional bias that is the most different from the “generic” disorder bias, while being much higher for the classes with the most similar bias. We discover that different predictors perform best across different classes of disorder. This suggests that no single predictor is universally best and motivates the development of new architectures that combine models that target specific disorder classes.
format Online
Article
Text
id pubmed-9313023
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-93130232022-07-26 Compositional Bias of Intrinsically Disordered Proteins and Regions and Their Predictions Zhao, Bi Kurgan, Lukasz Biomolecules Article Intrinsically disordered regions (IDRs) carry out many cellular functions and vary in length and placement in protein sequences. This diversity leads to variations in the underlying compositional biases, which were demonstrated for the short vs. long IDRs. We analyze compositional biases across four classes of disorder: fully disordered proteins; short IDRs; long IDRs; and binding IDRs. We identify three distinct biases: for the fully disordered proteins, the short IDRs and the long and binding IDRs combined. We also investigate compositional bias for putative disorder produced by leading disorder predictors and find that it is similar to the bias of the native disorder. Interestingly, the accuracy of disorder predictions across different methods is correlated with the correctness of the compositional bias of their predictions highlighting the importance of the compositional bias. The predictive quality is relatively low for the disorder classes with compositional bias that is the most different from the “generic” disorder bias, while being much higher for the classes with the most similar bias. We discover that different predictors perform best across different classes of disorder. This suggests that no single predictor is universally best and motivates the development of new architectures that combine models that target specific disorder classes. MDPI 2022-06-25 /pmc/articles/PMC9313023/ /pubmed/35883444 http://dx.doi.org/10.3390/biom12070888 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zhao, Bi
Kurgan, Lukasz
Compositional Bias of Intrinsically Disordered Proteins and Regions and Their Predictions
title Compositional Bias of Intrinsically Disordered Proteins and Regions and Their Predictions
title_full Compositional Bias of Intrinsically Disordered Proteins and Regions and Their Predictions
title_fullStr Compositional Bias of Intrinsically Disordered Proteins and Regions and Their Predictions
title_full_unstemmed Compositional Bias of Intrinsically Disordered Proteins and Regions and Their Predictions
title_short Compositional Bias of Intrinsically Disordered Proteins and Regions and Their Predictions
title_sort compositional bias of intrinsically disordered proteins and regions and their predictions
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9313023/
https://www.ncbi.nlm.nih.gov/pubmed/35883444
http://dx.doi.org/10.3390/biom12070888
work_keys_str_mv AT zhaobi compositionalbiasofintrinsicallydisorderedproteinsandregionsandtheirpredictions
AT kurganlukasz compositionalbiasofintrinsicallydisorderedproteinsandregionsandtheirpredictions