Cargando…

DisoMCS: Accurately Predicting Protein Intrinsically Disordered Regions Using a Multi-Class Conservative Score Approach

The precise prediction of protein intrinsically disordered regions, which play a crucial role in biological procedures, is a necessary prerequisite to further the understanding of the principles and mechanisms of protein function. Here, we propose a novel predictor, DisoMCS, which is a more accurate...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Zhiheng, Yang, Qianqian, Li, Tonghua, Cong, Peisheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4474717/
https://www.ncbi.nlm.nih.gov/pubmed/26090958
http://dx.doi.org/10.1371/journal.pone.0128334
_version_ 1782377326436155392
author Wang, Zhiheng
Yang, Qianqian
Li, Tonghua
Cong, Peisheng
author_facet Wang, Zhiheng
Yang, Qianqian
Li, Tonghua
Cong, Peisheng
author_sort Wang, Zhiheng
collection PubMed
description The precise prediction of protein intrinsically disordered regions, which play a crucial role in biological procedures, is a necessary prerequisite to further the understanding of the principles and mechanisms of protein function. Here, we propose a novel predictor, DisoMCS, which is a more accurate predictor of protein intrinsically disordered regions. The DisoMCS bases on an original multi-class conservative score (MCS) obtained by sequence-order/disorder alignment. Initially, near-disorder regions are defined on fragments located at both the terminus of an ordered region connecting a disordered region. Then the multi-class conservative score is generated by sequence alignment against a known structure database and represented as order, near-disorder and disorder conservative scores. The MCS of each amino acid has three elements: order, near-disorder and disorder profiles. Finally, the MCS is exploited as features to identify disordered regions in sequences. DisoMCS utilizes a non-redundant data set as the training set, MCS and predicted secondary structure as features, and a conditional random field as the classification algorithm. In predicted near-disorder regions a residue is determined as an order or a disorder according to the optimized decision threshold. DisoMCS was evaluated by cross-validation, large-scale prediction, independent tests and CASP (Critical Assessment of Techniques for Protein Structure Prediction) tests. All results confirmed that DisoMCS was very competitive in terms of accuracy of prediction when compared with well-established publicly available disordered region predictors. It also indicated our approach was more accurate when a query has higher homologous with the knowledge database. AVAILABILITY: The DisoMCS is available at http://cal.tongji.edu.cn/disorder/.
format Online
Article
Text
id pubmed-4474717
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-44747172015-06-30 DisoMCS: Accurately Predicting Protein Intrinsically Disordered Regions Using a Multi-Class Conservative Score Approach Wang, Zhiheng Yang, Qianqian Li, Tonghua Cong, Peisheng PLoS One Research Article The precise prediction of protein intrinsically disordered regions, which play a crucial role in biological procedures, is a necessary prerequisite to further the understanding of the principles and mechanisms of protein function. Here, we propose a novel predictor, DisoMCS, which is a more accurate predictor of protein intrinsically disordered regions. The DisoMCS bases on an original multi-class conservative score (MCS) obtained by sequence-order/disorder alignment. Initially, near-disorder regions are defined on fragments located at both the terminus of an ordered region connecting a disordered region. Then the multi-class conservative score is generated by sequence alignment against a known structure database and represented as order, near-disorder and disorder conservative scores. The MCS of each amino acid has three elements: order, near-disorder and disorder profiles. Finally, the MCS is exploited as features to identify disordered regions in sequences. DisoMCS utilizes a non-redundant data set as the training set, MCS and predicted secondary structure as features, and a conditional random field as the classification algorithm. In predicted near-disorder regions a residue is determined as an order or a disorder according to the optimized decision threshold. DisoMCS was evaluated by cross-validation, large-scale prediction, independent tests and CASP (Critical Assessment of Techniques for Protein Structure Prediction) tests. All results confirmed that DisoMCS was very competitive in terms of accuracy of prediction when compared with well-established publicly available disordered region predictors. It also indicated our approach was more accurate when a query has higher homologous with the knowledge database. AVAILABILITY: The DisoMCS is available at http://cal.tongji.edu.cn/disorder/. Public Library of Science 2015-06-19 /pmc/articles/PMC4474717/ /pubmed/26090958 http://dx.doi.org/10.1371/journal.pone.0128334 Text en © 2015 Wang et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Wang, Zhiheng
Yang, Qianqian
Li, Tonghua
Cong, Peisheng
DisoMCS: Accurately Predicting Protein Intrinsically Disordered Regions Using a Multi-Class Conservative Score Approach
title DisoMCS: Accurately Predicting Protein Intrinsically Disordered Regions Using a Multi-Class Conservative Score Approach
title_full DisoMCS: Accurately Predicting Protein Intrinsically Disordered Regions Using a Multi-Class Conservative Score Approach
title_fullStr DisoMCS: Accurately Predicting Protein Intrinsically Disordered Regions Using a Multi-Class Conservative Score Approach
title_full_unstemmed DisoMCS: Accurately Predicting Protein Intrinsically Disordered Regions Using a Multi-Class Conservative Score Approach
title_short DisoMCS: Accurately Predicting Protein Intrinsically Disordered Regions Using a Multi-Class Conservative Score Approach
title_sort disomcs: accurately predicting protein intrinsically disordered regions using a multi-class conservative score approach
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4474717/
https://www.ncbi.nlm.nih.gov/pubmed/26090958
http://dx.doi.org/10.1371/journal.pone.0128334
work_keys_str_mv AT wangzhiheng disomcsaccuratelypredictingproteinintrinsicallydisorderedregionsusingamulticlassconservativescoreapproach
AT yangqianqian disomcsaccuratelypredictingproteinintrinsicallydisorderedregionsusingamulticlassconservativescoreapproach
AT litonghua disomcsaccuratelypredictingproteinintrinsicallydisorderedregionsusingamulticlassconservativescoreapproach
AT congpeisheng disomcsaccuratelypredictingproteinintrinsicallydisorderedregionsusingamulticlassconservativescoreapproach