Cargando…

Proteus: a random forest classifier to predict disorder-to-order transitioning binding regions in intrinsically disordered proteins

The focus of the computational structural biology community has taken a dramatic shift over the past one-and-a-half decades from the classical protein structure prediction problem to the possible understanding of intrinsically disordered proteins (IDP) or proteins containing regions of disorder (IDP...

Descripción completa

Detalles Bibliográficos
Autores principales: Basu, Sankar, Söderquist, Fredrik, Wallner, Björn
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5429364/
https://www.ncbi.nlm.nih.gov/pubmed/28365882
http://dx.doi.org/10.1007/s10822-017-0020-y
_version_ 1783235999030050816
author Basu, Sankar
Söderquist, Fredrik
Wallner, Björn
author_facet Basu, Sankar
Söderquist, Fredrik
Wallner, Björn
author_sort Basu, Sankar
collection PubMed
description The focus of the computational structural biology community has taken a dramatic shift over the past one-and-a-half decades from the classical protein structure prediction problem to the possible understanding of intrinsically disordered proteins (IDP) or proteins containing regions of disorder (IDPR). The current interest lies in the unraveling of a disorder-to-order transitioning code embedded in the amino acid sequences of IDPs/IDPRs. Disordered proteins are characterized by an enormous amount of structural plasticity which makes them promiscuous in binding to different partners, multi-functional in cellular activity and atypical in folding energy landscapes resembling partially folded molten globules. Also, their involvement in several deadly human diseases (e.g. cancer, cardiovascular and neurodegenerative diseases) makes them attractive drug targets, and important for a biochemical understanding of the disease(s). The study of the structural ensemble of IDPs is rather difficult, in particular for transient interactions. When bound to a structured partner, an IDPR adapts an ordered conformation in the complex. The residues that undergo this disorder-to-order transition are called protean residues, generally found in short contiguous stretches and the first step in understanding the modus operandi of an IDP/IDPR would be to predict these residues. There are a few available methods which predict these protean segments from their amino acid sequences; however, their performance reported in the literature leaves clear room for improvement. With this background, the current study presents ‘Proteus’, a random forest classifier that predicts the likelihood of a residue undergoing a disorder-to-order transition upon binding to a potential partner protein. The prediction is based on features that can be calculated using the amino acid sequence alone. Proteus compares favorably with existing methods predicting twice as many true positives as the second best method (55 vs. 27%) with a much higher precision on an independent data set. The current study also sheds some light on a possible ‘disorder-to-order’ transitioning consensus, untangled, yet embedded in the amino acid sequence of IDPs. Some guidelines have also been suggested for proceeding with a real-life structural modeling involving an IDPR using Proteus. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s10822-017-0020-y) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5429364
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-54293642017-05-30 Proteus: a random forest classifier to predict disorder-to-order transitioning binding regions in intrinsically disordered proteins Basu, Sankar Söderquist, Fredrik Wallner, Björn J Comput Aided Mol Des Article The focus of the computational structural biology community has taken a dramatic shift over the past one-and-a-half decades from the classical protein structure prediction problem to the possible understanding of intrinsically disordered proteins (IDP) or proteins containing regions of disorder (IDPR). The current interest lies in the unraveling of a disorder-to-order transitioning code embedded in the amino acid sequences of IDPs/IDPRs. Disordered proteins are characterized by an enormous amount of structural plasticity which makes them promiscuous in binding to different partners, multi-functional in cellular activity and atypical in folding energy landscapes resembling partially folded molten globules. Also, their involvement in several deadly human diseases (e.g. cancer, cardiovascular and neurodegenerative diseases) makes them attractive drug targets, and important for a biochemical understanding of the disease(s). The study of the structural ensemble of IDPs is rather difficult, in particular for transient interactions. When bound to a structured partner, an IDPR adapts an ordered conformation in the complex. The residues that undergo this disorder-to-order transition are called protean residues, generally found in short contiguous stretches and the first step in understanding the modus operandi of an IDP/IDPR would be to predict these residues. There are a few available methods which predict these protean segments from their amino acid sequences; however, their performance reported in the literature leaves clear room for improvement. With this background, the current study presents ‘Proteus’, a random forest classifier that predicts the likelihood of a residue undergoing a disorder-to-order transition upon binding to a potential partner protein. The prediction is based on features that can be calculated using the amino acid sequence alone. Proteus compares favorably with existing methods predicting twice as many true positives as the second best method (55 vs. 27%) with a much higher precision on an independent data set. The current study also sheds some light on a possible ‘disorder-to-order’ transitioning consensus, untangled, yet embedded in the amino acid sequence of IDPs. Some guidelines have also been suggested for proceeding with a real-life structural modeling involving an IDPR using Proteus. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s10822-017-0020-y) contains supplementary material, which is available to authorized users. Springer International Publishing 2017-04-01 2017 /pmc/articles/PMC5429364/ /pubmed/28365882 http://dx.doi.org/10.1007/s10822-017-0020-y Text en © The Author(s) 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Article
Basu, Sankar
Söderquist, Fredrik
Wallner, Björn
Proteus: a random forest classifier to predict disorder-to-order transitioning binding regions in intrinsically disordered proteins
title Proteus: a random forest classifier to predict disorder-to-order transitioning binding regions in intrinsically disordered proteins
title_full Proteus: a random forest classifier to predict disorder-to-order transitioning binding regions in intrinsically disordered proteins
title_fullStr Proteus: a random forest classifier to predict disorder-to-order transitioning binding regions in intrinsically disordered proteins
title_full_unstemmed Proteus: a random forest classifier to predict disorder-to-order transitioning binding regions in intrinsically disordered proteins
title_short Proteus: a random forest classifier to predict disorder-to-order transitioning binding regions in intrinsically disordered proteins
title_sort proteus: a random forest classifier to predict disorder-to-order transitioning binding regions in intrinsically disordered proteins
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5429364/
https://www.ncbi.nlm.nih.gov/pubmed/28365882
http://dx.doi.org/10.1007/s10822-017-0020-y
work_keys_str_mv AT basusankar proteusarandomforestclassifiertopredictdisordertoordertransitioningbindingregionsinintrinsicallydisorderedproteins
AT soderquistfredrik proteusarandomforestclassifiertopredictdisordertoordertransitioningbindingregionsinintrinsicallydisorderedproteins
AT wallnerbjorn proteusarandomforestclassifiertopredictdisordertoordertransitioningbindingregionsinintrinsicallydisorderedproteins