Cargando…

A Novel Approach for Predicting Disordered Regions in A Protein Sequence

OBJECTIVES: A number of published predictors are based on various algorithms and disordered protein sequence properties. Although many predictors have been published, the study of protein disordered region prediction is ongoing because different prediction methods can find different disordered regio...

Descripción completa

Detalles Bibliográficos
Autores principales:	Li, Meijing, Cho, Seong Beom, Ryu, Keun Ho
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	2014
Materias:	Original Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4215001/ https://www.ncbi.nlm.nih.gov/pubmed/25379372 http://dx.doi.org/10.1016/j.phrp.2014.06.006

_version_	1782342042810056704
author	Li, Meijing Cho, Seong Beom Ryu, Keun Ho
author_facet	Li, Meijing Cho, Seong Beom Ryu, Keun Ho
author_sort	Li, Meijing
collection	PubMed
description	OBJECTIVES: A number of published predictors are based on various algorithms and disordered protein sequence properties. Although many predictors have been published, the study of protein disordered region prediction is ongoing because different prediction methods can find different disordered regions in a protein sequence. METHODS: Therefore we have used a new approach to find the more varying disordered regions for more efficient and accurate prediction of protein structures. In this study, we propose a novel approach called “emerging subsequence (ES) mining” without using the characteristics of the disordered protein. We first adapted the approach to generate emerging protein subsequences on public protein sequence data. Second, the disordered and ordered regions in a protein sequence were predicted by searching the generated emerging protein subsequence with a sliding window, which tends to overlap. Third, the scores of the overlapping regions were calculated based on support and growthrate values in both classes. Finally, the score of predicted regions in the target class were compared with the score of the source class, and the class having a higher score was selected. RESULTS: In this experiment, disordered sequence data and ordered sequence data was extracted from DisProt 6.02 and PDB respectively and used as training data. The test data come from CASP 9 and CASP 10 where disordered and ordered regions are known. CONCLUSION: Comparing with several published predictors, the results of the experiment show higher accuracy rates than with other existing methods.
format	Online Article Text
id	pubmed-4215001
institution	National Center for Biotechnology Information
language	English
publishDate	2014
record_format	MEDLINE/PubMed
spelling	pubmed-42150012014-11-06 A Novel Approach for Predicting Disordered Regions in A Protein Sequence Li, Meijing Cho, Seong Beom Ryu, Keun Ho Osong Public Health Res Perspect Original Article OBJECTIVES: A number of published predictors are based on various algorithms and disordered protein sequence properties. Although many predictors have been published, the study of protein disordered region prediction is ongoing because different prediction methods can find different disordered regions in a protein sequence. METHODS: Therefore we have used a new approach to find the more varying disordered regions for more efficient and accurate prediction of protein structures. In this study, we propose a novel approach called “emerging subsequence (ES) mining” without using the characteristics of the disordered protein. We first adapted the approach to generate emerging protein subsequences on public protein sequence data. Second, the disordered and ordered regions in a protein sequence were predicted by searching the generated emerging protein subsequence with a sliding window, which tends to overlap. Third, the scores of the overlapping regions were calculated based on support and growthrate values in both classes. Finally, the score of predicted regions in the target class were compared with the score of the source class, and the class having a higher score was selected. RESULTS: In this experiment, disordered sequence data and ordered sequence data was extracted from DisProt 6.02 and PDB respectively and used as training data. The test data come from CASP 9 and CASP 10 where disordered and ordered regions are known. CONCLUSION: Comparing with several published predictors, the results of the experiment show higher accuracy rates than with other existing methods. 2014-07-01 2014-08 /pmc/articles/PMC4215001/ /pubmed/25379372 http://dx.doi.org/10.1016/j.phrp.2014.06.006 Text en © 2014 Published by Elsevier B.V. on behalf of Korea Centers for Disease Control and Prevention. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the CC-BY-NC License (http://creativecommons.org/licenses/by-nc/3.0).
spellingShingle	Original Article Li, Meijing Cho, Seong Beom Ryu, Keun Ho A Novel Approach for Predicting Disordered Regions in A Protein Sequence
title	A Novel Approach for Predicting Disordered Regions in A Protein Sequence
title_full	A Novel Approach for Predicting Disordered Regions in A Protein Sequence
title_fullStr	A Novel Approach for Predicting Disordered Regions in A Protein Sequence
title_full_unstemmed	A Novel Approach for Predicting Disordered Regions in A Protein Sequence
title_short	A Novel Approach for Predicting Disordered Regions in A Protein Sequence
title_sort	novel approach for predicting disordered regions in a protein sequence
topic	Original Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4215001/ https://www.ncbi.nlm.nih.gov/pubmed/25379372 http://dx.doi.org/10.1016/j.phrp.2014.06.006
work_keys_str_mv	AT limeijing anovelapproachforpredictingdisorderedregionsinaproteinsequence AT choseongbeom anovelapproachforpredictingdisorderedregionsinaproteinsequence AT ryukeunho anovelapproachforpredictingdisorderedregionsinaproteinsequence AT limeijing novelapproachforpredictingdisorderedregionsinaproteinsequence AT choseongbeom novelapproachforpredictingdisorderedregionsinaproteinsequence AT ryukeunho novelapproachforpredictingdisorderedregionsinaproteinsequence

A Novel Approach for Predicting Disordered Regions in A Protein Sequence

Ejemplares similares