Cargando…

Unsupervised Integration of Multiple Protein Disorder Predictors: The Method and Evaluation on CASP7, CASP8 and CASP9 Data

BACKGROUND: Studies of intrinsically disordered proteins that lack a stable tertiary structure but still have important biological functions critically rely on computational methods that predict this property based on sequence information. Although a number of fairly successful models for prediction...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Ping, Obradovic, Zoran
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3289073/
https://www.ncbi.nlm.nih.gov/pubmed/22166115
http://dx.doi.org/10.1186/1477-5956-9-S1-S12
_version_ 1782224845022429184
author Zhang, Ping
Obradovic, Zoran
author_facet Zhang, Ping
Obradovic, Zoran
author_sort Zhang, Ping
collection PubMed
description BACKGROUND: Studies of intrinsically disordered proteins that lack a stable tertiary structure but still have important biological functions critically rely on computational methods that predict this property based on sequence information. Although a number of fairly successful models for prediction of protein disorder have been developed over the last decade, the quality of their predictions is limited by available cases of confirmed disorders. RESULTS: To more reliably estimate protein disorder from protein sequences, an iterative algorithm is proposed that integrates predictions of multiple disorder models without relying on any protein sequences with confirmed disorder annotation. The iterative method alternately provides the maximum a posterior (MAP) estimation of disorder prediction and the maximum-likelihood (ML) estimation of quality of multiple disorder predictors. Experiments on data used at CASP7, CASP8, and CASP9 have shown the effectiveness of the proposed algorithm. CONCLUSIONS: The proposed algorithm can potentially be used to predict protein disorder and provide helpful suggestions on choosing suitable disorder predictors for unknown protein sequences.
format Online
Article
Text
id pubmed-3289073
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32890732012-02-29 Unsupervised Integration of Multiple Protein Disorder Predictors: The Method and Evaluation on CASP7, CASP8 and CASP9 Data Zhang, Ping Obradovic, Zoran Proteome Sci Proceedings BACKGROUND: Studies of intrinsically disordered proteins that lack a stable tertiary structure but still have important biological functions critically rely on computational methods that predict this property based on sequence information. Although a number of fairly successful models for prediction of protein disorder have been developed over the last decade, the quality of their predictions is limited by available cases of confirmed disorders. RESULTS: To more reliably estimate protein disorder from protein sequences, an iterative algorithm is proposed that integrates predictions of multiple disorder models without relying on any protein sequences with confirmed disorder annotation. The iterative method alternately provides the maximum a posterior (MAP) estimation of disorder prediction and the maximum-likelihood (ML) estimation of quality of multiple disorder predictors. Experiments on data used at CASP7, CASP8, and CASP9 have shown the effectiveness of the proposed algorithm. CONCLUSIONS: The proposed algorithm can potentially be used to predict protein disorder and provide helpful suggestions on choosing suitable disorder predictors for unknown protein sequences. BioMed Central 2011-10-14 /pmc/articles/PMC3289073/ /pubmed/22166115 http://dx.doi.org/10.1186/1477-5956-9-S1-S12 Text en Copyright ©2011 Zhang and Obradovic; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Zhang, Ping
Obradovic, Zoran
Unsupervised Integration of Multiple Protein Disorder Predictors: The Method and Evaluation on CASP7, CASP8 and CASP9 Data
title Unsupervised Integration of Multiple Protein Disorder Predictors: The Method and Evaluation on CASP7, CASP8 and CASP9 Data
title_full Unsupervised Integration of Multiple Protein Disorder Predictors: The Method and Evaluation on CASP7, CASP8 and CASP9 Data
title_fullStr Unsupervised Integration of Multiple Protein Disorder Predictors: The Method and Evaluation on CASP7, CASP8 and CASP9 Data
title_full_unstemmed Unsupervised Integration of Multiple Protein Disorder Predictors: The Method and Evaluation on CASP7, CASP8 and CASP9 Data
title_short Unsupervised Integration of Multiple Protein Disorder Predictors: The Method and Evaluation on CASP7, CASP8 and CASP9 Data
title_sort unsupervised integration of multiple protein disorder predictors: the method and evaluation on casp7, casp8 and casp9 data
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3289073/
https://www.ncbi.nlm.nih.gov/pubmed/22166115
http://dx.doi.org/10.1186/1477-5956-9-S1-S12
work_keys_str_mv AT zhangping unsupervisedintegrationofmultipleproteindisorderpredictorsthemethodandevaluationoncasp7casp8andcasp9data
AT obradoviczoran unsupervisedintegrationofmultipleproteindisorderpredictorsthemethodandevaluationoncasp7casp8andcasp9data