Cargando…
Unsupervised Integration of Multiple Protein Disorder Predictors: The Method and Evaluation on CASP7, CASP8 and CASP9 Data
BACKGROUND: Studies of intrinsically disordered proteins that lack a stable tertiary structure but still have important biological functions critically rely on computational methods that predict this property based on sequence information. Although a number of fairly successful models for prediction...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3289073/ https://www.ncbi.nlm.nih.gov/pubmed/22166115 http://dx.doi.org/10.1186/1477-5956-9-S1-S12 |
_version_ | 1782224845022429184 |
---|---|
author | Zhang, Ping Obradovic, Zoran |
author_facet | Zhang, Ping Obradovic, Zoran |
author_sort | Zhang, Ping |
collection | PubMed |
description | BACKGROUND: Studies of intrinsically disordered proteins that lack a stable tertiary structure but still have important biological functions critically rely on computational methods that predict this property based on sequence information. Although a number of fairly successful models for prediction of protein disorder have been developed over the last decade, the quality of their predictions is limited by available cases of confirmed disorders. RESULTS: To more reliably estimate protein disorder from protein sequences, an iterative algorithm is proposed that integrates predictions of multiple disorder models without relying on any protein sequences with confirmed disorder annotation. The iterative method alternately provides the maximum a posterior (MAP) estimation of disorder prediction and the maximum-likelihood (ML) estimation of quality of multiple disorder predictors. Experiments on data used at CASP7, CASP8, and CASP9 have shown the effectiveness of the proposed algorithm. CONCLUSIONS: The proposed algorithm can potentially be used to predict protein disorder and provide helpful suggestions on choosing suitable disorder predictors for unknown protein sequences. |
format | Online Article Text |
id | pubmed-3289073 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-32890732012-02-29 Unsupervised Integration of Multiple Protein Disorder Predictors: The Method and Evaluation on CASP7, CASP8 and CASP9 Data Zhang, Ping Obradovic, Zoran Proteome Sci Proceedings BACKGROUND: Studies of intrinsically disordered proteins that lack a stable tertiary structure but still have important biological functions critically rely on computational methods that predict this property based on sequence information. Although a number of fairly successful models for prediction of protein disorder have been developed over the last decade, the quality of their predictions is limited by available cases of confirmed disorders. RESULTS: To more reliably estimate protein disorder from protein sequences, an iterative algorithm is proposed that integrates predictions of multiple disorder models without relying on any protein sequences with confirmed disorder annotation. The iterative method alternately provides the maximum a posterior (MAP) estimation of disorder prediction and the maximum-likelihood (ML) estimation of quality of multiple disorder predictors. Experiments on data used at CASP7, CASP8, and CASP9 have shown the effectiveness of the proposed algorithm. CONCLUSIONS: The proposed algorithm can potentially be used to predict protein disorder and provide helpful suggestions on choosing suitable disorder predictors for unknown protein sequences. BioMed Central 2011-10-14 /pmc/articles/PMC3289073/ /pubmed/22166115 http://dx.doi.org/10.1186/1477-5956-9-S1-S12 Text en Copyright ©2011 Zhang and Obradovic; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Zhang, Ping Obradovic, Zoran Unsupervised Integration of Multiple Protein Disorder Predictors: The Method and Evaluation on CASP7, CASP8 and CASP9 Data |
title | Unsupervised Integration of Multiple Protein Disorder Predictors: The Method and Evaluation on CASP7, CASP8 and CASP9 Data |
title_full | Unsupervised Integration of Multiple Protein Disorder Predictors: The Method and Evaluation on CASP7, CASP8 and CASP9 Data |
title_fullStr | Unsupervised Integration of Multiple Protein Disorder Predictors: The Method and Evaluation on CASP7, CASP8 and CASP9 Data |
title_full_unstemmed | Unsupervised Integration of Multiple Protein Disorder Predictors: The Method and Evaluation on CASP7, CASP8 and CASP9 Data |
title_short | Unsupervised Integration of Multiple Protein Disorder Predictors: The Method and Evaluation on CASP7, CASP8 and CASP9 Data |
title_sort | unsupervised integration of multiple protein disorder predictors: the method and evaluation on casp7, casp8 and casp9 data |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3289073/ https://www.ncbi.nlm.nih.gov/pubmed/22166115 http://dx.doi.org/10.1186/1477-5956-9-S1-S12 |
work_keys_str_mv | AT zhangping unsupervisedintegrationofmultipleproteindisorderpredictorsthemethodandevaluationoncasp7casp8andcasp9data AT obradoviczoran unsupervisedintegrationofmultipleproteindisorderpredictorsthemethodandevaluationoncasp7casp8andcasp9data |