Cargando…
DisoMCS: Accurately Predicting Protein Intrinsically Disordered Regions Using a Multi-Class Conservative Score Approach
The precise prediction of protein intrinsically disordered regions, which play a crucial role in biological procedures, is a necessary prerequisite to further the understanding of the principles and mechanisms of protein function. Here, we propose a novel predictor, DisoMCS, which is a more accurate...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4474717/ https://www.ncbi.nlm.nih.gov/pubmed/26090958 http://dx.doi.org/10.1371/journal.pone.0128334 |
_version_ | 1782377326436155392 |
---|---|
author | Wang, Zhiheng Yang, Qianqian Li, Tonghua Cong, Peisheng |
author_facet | Wang, Zhiheng Yang, Qianqian Li, Tonghua Cong, Peisheng |
author_sort | Wang, Zhiheng |
collection | PubMed |
description | The precise prediction of protein intrinsically disordered regions, which play a crucial role in biological procedures, is a necessary prerequisite to further the understanding of the principles and mechanisms of protein function. Here, we propose a novel predictor, DisoMCS, which is a more accurate predictor of protein intrinsically disordered regions. The DisoMCS bases on an original multi-class conservative score (MCS) obtained by sequence-order/disorder alignment. Initially, near-disorder regions are defined on fragments located at both the terminus of an ordered region connecting a disordered region. Then the multi-class conservative score is generated by sequence alignment against a known structure database and represented as order, near-disorder and disorder conservative scores. The MCS of each amino acid has three elements: order, near-disorder and disorder profiles. Finally, the MCS is exploited as features to identify disordered regions in sequences. DisoMCS utilizes a non-redundant data set as the training set, MCS and predicted secondary structure as features, and a conditional random field as the classification algorithm. In predicted near-disorder regions a residue is determined as an order or a disorder according to the optimized decision threshold. DisoMCS was evaluated by cross-validation, large-scale prediction, independent tests and CASP (Critical Assessment of Techniques for Protein Structure Prediction) tests. All results confirmed that DisoMCS was very competitive in terms of accuracy of prediction when compared with well-established publicly available disordered region predictors. It also indicated our approach was more accurate when a query has higher homologous with the knowledge database. AVAILABILITY: The DisoMCS is available at http://cal.tongji.edu.cn/disorder/. |
format | Online Article Text |
id | pubmed-4474717 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-44747172015-06-30 DisoMCS: Accurately Predicting Protein Intrinsically Disordered Regions Using a Multi-Class Conservative Score Approach Wang, Zhiheng Yang, Qianqian Li, Tonghua Cong, Peisheng PLoS One Research Article The precise prediction of protein intrinsically disordered regions, which play a crucial role in biological procedures, is a necessary prerequisite to further the understanding of the principles and mechanisms of protein function. Here, we propose a novel predictor, DisoMCS, which is a more accurate predictor of protein intrinsically disordered regions. The DisoMCS bases on an original multi-class conservative score (MCS) obtained by sequence-order/disorder alignment. Initially, near-disorder regions are defined on fragments located at both the terminus of an ordered region connecting a disordered region. Then the multi-class conservative score is generated by sequence alignment against a known structure database and represented as order, near-disorder and disorder conservative scores. The MCS of each amino acid has three elements: order, near-disorder and disorder profiles. Finally, the MCS is exploited as features to identify disordered regions in sequences. DisoMCS utilizes a non-redundant data set as the training set, MCS and predicted secondary structure as features, and a conditional random field as the classification algorithm. In predicted near-disorder regions a residue is determined as an order or a disorder according to the optimized decision threshold. DisoMCS was evaluated by cross-validation, large-scale prediction, independent tests and CASP (Critical Assessment of Techniques for Protein Structure Prediction) tests. All results confirmed that DisoMCS was very competitive in terms of accuracy of prediction when compared with well-established publicly available disordered region predictors. It also indicated our approach was more accurate when a query has higher homologous with the knowledge database. AVAILABILITY: The DisoMCS is available at http://cal.tongji.edu.cn/disorder/. Public Library of Science 2015-06-19 /pmc/articles/PMC4474717/ /pubmed/26090958 http://dx.doi.org/10.1371/journal.pone.0128334 Text en © 2015 Wang et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Wang, Zhiheng Yang, Qianqian Li, Tonghua Cong, Peisheng DisoMCS: Accurately Predicting Protein Intrinsically Disordered Regions Using a Multi-Class Conservative Score Approach |
title | DisoMCS: Accurately Predicting Protein Intrinsically Disordered Regions Using a Multi-Class Conservative Score Approach |
title_full | DisoMCS: Accurately Predicting Protein Intrinsically Disordered Regions Using a Multi-Class Conservative Score Approach |
title_fullStr | DisoMCS: Accurately Predicting Protein Intrinsically Disordered Regions Using a Multi-Class Conservative Score Approach |
title_full_unstemmed | DisoMCS: Accurately Predicting Protein Intrinsically Disordered Regions Using a Multi-Class Conservative Score Approach |
title_short | DisoMCS: Accurately Predicting Protein Intrinsically Disordered Regions Using a Multi-Class Conservative Score Approach |
title_sort | disomcs: accurately predicting protein intrinsically disordered regions using a multi-class conservative score approach |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4474717/ https://www.ncbi.nlm.nih.gov/pubmed/26090958 http://dx.doi.org/10.1371/journal.pone.0128334 |
work_keys_str_mv | AT wangzhiheng disomcsaccuratelypredictingproteinintrinsicallydisorderedregionsusingamulticlassconservativescoreapproach AT yangqianqian disomcsaccuratelypredictingproteinintrinsicallydisorderedregionsusingamulticlassconservativescoreapproach AT litonghua disomcsaccuratelypredictingproteinintrinsicallydisorderedregionsusingamulticlassconservativescoreapproach AT congpeisheng disomcsaccuratelypredictingproteinintrinsicallydisorderedregionsusingamulticlassconservativescoreapproach |