Cargando…
Proteus: a random forest classifier to predict disorder-to-order transitioning binding regions in intrinsically disordered proteins
The focus of the computational structural biology community has taken a dramatic shift over the past one-and-a-half decades from the classical protein structure prediction problem to the possible understanding of intrinsically disordered proteins (IDP) or proteins containing regions of disorder (IDP...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5429364/ https://www.ncbi.nlm.nih.gov/pubmed/28365882 http://dx.doi.org/10.1007/s10822-017-0020-y |
_version_ | 1783235999030050816 |
---|---|
author | Basu, Sankar Söderquist, Fredrik Wallner, Björn |
author_facet | Basu, Sankar Söderquist, Fredrik Wallner, Björn |
author_sort | Basu, Sankar |
collection | PubMed |
description | The focus of the computational structural biology community has taken a dramatic shift over the past one-and-a-half decades from the classical protein structure prediction problem to the possible understanding of intrinsically disordered proteins (IDP) or proteins containing regions of disorder (IDPR). The current interest lies in the unraveling of a disorder-to-order transitioning code embedded in the amino acid sequences of IDPs/IDPRs. Disordered proteins are characterized by an enormous amount of structural plasticity which makes them promiscuous in binding to different partners, multi-functional in cellular activity and atypical in folding energy landscapes resembling partially folded molten globules. Also, their involvement in several deadly human diseases (e.g. cancer, cardiovascular and neurodegenerative diseases) makes them attractive drug targets, and important for a biochemical understanding of the disease(s). The study of the structural ensemble of IDPs is rather difficult, in particular for transient interactions. When bound to a structured partner, an IDPR adapts an ordered conformation in the complex. The residues that undergo this disorder-to-order transition are called protean residues, generally found in short contiguous stretches and the first step in understanding the modus operandi of an IDP/IDPR would be to predict these residues. There are a few available methods which predict these protean segments from their amino acid sequences; however, their performance reported in the literature leaves clear room for improvement. With this background, the current study presents ‘Proteus’, a random forest classifier that predicts the likelihood of a residue undergoing a disorder-to-order transition upon binding to a potential partner protein. The prediction is based on features that can be calculated using the amino acid sequence alone. Proteus compares favorably with existing methods predicting twice as many true positives as the second best method (55 vs. 27%) with a much higher precision on an independent data set. The current study also sheds some light on a possible ‘disorder-to-order’ transitioning consensus, untangled, yet embedded in the amino acid sequence of IDPs. Some guidelines have also been suggested for proceeding with a real-life structural modeling involving an IDPR using Proteus. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s10822-017-0020-y) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5429364 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-54293642017-05-30 Proteus: a random forest classifier to predict disorder-to-order transitioning binding regions in intrinsically disordered proteins Basu, Sankar Söderquist, Fredrik Wallner, Björn J Comput Aided Mol Des Article The focus of the computational structural biology community has taken a dramatic shift over the past one-and-a-half decades from the classical protein structure prediction problem to the possible understanding of intrinsically disordered proteins (IDP) or proteins containing regions of disorder (IDPR). The current interest lies in the unraveling of a disorder-to-order transitioning code embedded in the amino acid sequences of IDPs/IDPRs. Disordered proteins are characterized by an enormous amount of structural plasticity which makes them promiscuous in binding to different partners, multi-functional in cellular activity and atypical in folding energy landscapes resembling partially folded molten globules. Also, their involvement in several deadly human diseases (e.g. cancer, cardiovascular and neurodegenerative diseases) makes them attractive drug targets, and important for a biochemical understanding of the disease(s). The study of the structural ensemble of IDPs is rather difficult, in particular for transient interactions. When bound to a structured partner, an IDPR adapts an ordered conformation in the complex. The residues that undergo this disorder-to-order transition are called protean residues, generally found in short contiguous stretches and the first step in understanding the modus operandi of an IDP/IDPR would be to predict these residues. There are a few available methods which predict these protean segments from their amino acid sequences; however, their performance reported in the literature leaves clear room for improvement. With this background, the current study presents ‘Proteus’, a random forest classifier that predicts the likelihood of a residue undergoing a disorder-to-order transition upon binding to a potential partner protein. The prediction is based on features that can be calculated using the amino acid sequence alone. Proteus compares favorably with existing methods predicting twice as many true positives as the second best method (55 vs. 27%) with a much higher precision on an independent data set. The current study also sheds some light on a possible ‘disorder-to-order’ transitioning consensus, untangled, yet embedded in the amino acid sequence of IDPs. Some guidelines have also been suggested for proceeding with a real-life structural modeling involving an IDPR using Proteus. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s10822-017-0020-y) contains supplementary material, which is available to authorized users. Springer International Publishing 2017-04-01 2017 /pmc/articles/PMC5429364/ /pubmed/28365882 http://dx.doi.org/10.1007/s10822-017-0020-y Text en © The Author(s) 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. |
spellingShingle | Article Basu, Sankar Söderquist, Fredrik Wallner, Björn Proteus: a random forest classifier to predict disorder-to-order transitioning binding regions in intrinsically disordered proteins |
title | Proteus: a random forest classifier to predict disorder-to-order transitioning binding regions in intrinsically disordered proteins |
title_full | Proteus: a random forest classifier to predict disorder-to-order transitioning binding regions in intrinsically disordered proteins |
title_fullStr | Proteus: a random forest classifier to predict disorder-to-order transitioning binding regions in intrinsically disordered proteins |
title_full_unstemmed | Proteus: a random forest classifier to predict disorder-to-order transitioning binding regions in intrinsically disordered proteins |
title_short | Proteus: a random forest classifier to predict disorder-to-order transitioning binding regions in intrinsically disordered proteins |
title_sort | proteus: a random forest classifier to predict disorder-to-order transitioning binding regions in intrinsically disordered proteins |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5429364/ https://www.ncbi.nlm.nih.gov/pubmed/28365882 http://dx.doi.org/10.1007/s10822-017-0020-y |
work_keys_str_mv | AT basusankar proteusarandomforestclassifiertopredictdisordertoordertransitioningbindingregionsinintrinsicallydisorderedproteins AT soderquistfredrik proteusarandomforestclassifiertopredictdisordertoordertransitioningbindingregionsinintrinsicallydisorderedproteins AT wallnerbjorn proteusarandomforestclassifiertopredictdisordertoordertransitioningbindingregionsinintrinsicallydisorderedproteins |