Cargando…

DISOPRED3: precise disordered region predictions with annotated protein-binding activity

Motivation: A sizeable fraction of eukaryotic proteins contain intrinsically disordered regions (IDRs), which act in unfolded states or by undergoing transitions between structured and unstructured conformations. Over time, sequence-based classifiers of IDRs have become fairly accurate and currently...

Descripción completa

Detalles Bibliográficos
Autores principales: Jones, David T., Cozzetto, Domenico
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4380029/
https://www.ncbi.nlm.nih.gov/pubmed/25391399
http://dx.doi.org/10.1093/bioinformatics/btu744
_version_ 1782364277709996032
author Jones, David T.
Cozzetto, Domenico
author_facet Jones, David T.
Cozzetto, Domenico
author_sort Jones, David T.
collection PubMed
description Motivation: A sizeable fraction of eukaryotic proteins contain intrinsically disordered regions (IDRs), which act in unfolded states or by undergoing transitions between structured and unstructured conformations. Over time, sequence-based classifiers of IDRs have become fairly accurate and currently a major challenge is linking IDRs to their biological roles from the molecular to the systems level. Results: We describe DISOPRED3, which extends its predecessor with new modules to predict IDRs and protein-binding sites within them. Based on recent CASP evaluation results, DISOPRED3 can be regarded as state of the art in the identification of IDRs, and our self-assessment shows that it significantly improves over DISOPRED2 because its predictions are more specific across the whole board and more sensitive to IDRs longer than 20 amino acids. Predicted IDRs are annotated as protein binding through a novel SVM based classifier, which uses profile data and additional sequence-derived features. Based on benchmarking experiments with full cross-validation, we show that this predictor generates precise assignments of disordered protein binding regions and that it compares well with other publicly available tools. Availability and implementation: http://bioinf.cs.ucl.ac.uk/disopred Contact: d.t.jones@ucl.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-4380029
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-43800292015-04-15 DISOPRED3: precise disordered region predictions with annotated protein-binding activity Jones, David T. Cozzetto, Domenico Bioinformatics Original Papers Motivation: A sizeable fraction of eukaryotic proteins contain intrinsically disordered regions (IDRs), which act in unfolded states or by undergoing transitions between structured and unstructured conformations. Over time, sequence-based classifiers of IDRs have become fairly accurate and currently a major challenge is linking IDRs to their biological roles from the molecular to the systems level. Results: We describe DISOPRED3, which extends its predecessor with new modules to predict IDRs and protein-binding sites within them. Based on recent CASP evaluation results, DISOPRED3 can be regarded as state of the art in the identification of IDRs, and our self-assessment shows that it significantly improves over DISOPRED2 because its predictions are more specific across the whole board and more sensitive to IDRs longer than 20 amino acids. Predicted IDRs are annotated as protein binding through a novel SVM based classifier, which uses profile data and additional sequence-derived features. Based on benchmarking experiments with full cross-validation, we show that this predictor generates precise assignments of disordered protein binding regions and that it compares well with other publicly available tools. Availability and implementation: http://bioinf.cs.ucl.ac.uk/disopred Contact: d.t.jones@ucl.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2015-03-15 2014-11-12 /pmc/articles/PMC4380029/ /pubmed/25391399 http://dx.doi.org/10.1093/bioinformatics/btu744 Text en © The Author 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Jones, David T.
Cozzetto, Domenico
DISOPRED3: precise disordered region predictions with annotated protein-binding activity
title DISOPRED3: precise disordered region predictions with annotated protein-binding activity
title_full DISOPRED3: precise disordered region predictions with annotated protein-binding activity
title_fullStr DISOPRED3: precise disordered region predictions with annotated protein-binding activity
title_full_unstemmed DISOPRED3: precise disordered region predictions with annotated protein-binding activity
title_short DISOPRED3: precise disordered region predictions with annotated protein-binding activity
title_sort disopred3: precise disordered region predictions with annotated protein-binding activity
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4380029/
https://www.ncbi.nlm.nih.gov/pubmed/25391399
http://dx.doi.org/10.1093/bioinformatics/btu744
work_keys_str_mv AT jonesdavidt disopred3precisedisorderedregionpredictionswithannotatedproteinbindingactivity
AT cozzettodomenico disopred3precisedisorderedregionpredictionswithannotatedproteinbindingactivity