Cargando…

Prediction of protein secondary structures with a novel kernel density estimation based classifier

BACKGROUND: Though prediction of protein secondary structures has been an active research issue in bioinformatics for quite a few years and many approaches have been proposed, a new challenge emerges as the sizes of contemporary protein structure databases continue to grow rapidly. The new challenge...

Descripción completa

Detalles Bibliográficos
Autores principales:	Chang, Darby Tien-Hao, Ou, Yu-Yen, Hung, Hao-Geng, Yang, Meng-Han, Chen, Chien-Yu, Oyang, Yen-Jen
Formato:	Texto
Lenguaje:	English
Publicado:	BioMed Central 2008
Materias:	Technical Note
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2527571/ https://www.ncbi.nlm.nih.gov/pubmed/18710504 http://dx.doi.org/10.1186/1756-0500-1-51

_version_	1782158820568465408
author	Chang, Darby Tien-Hao Ou, Yu-Yen Hung, Hao-Geng Yang, Meng-Han Chen, Chien-Yu Oyang, Yen-Jen
author_facet	Chang, Darby Tien-Hao Ou, Yu-Yen Hung, Hao-Geng Yang, Meng-Han Chen, Chien-Yu Oyang, Yen-Jen
author_sort	Chang, Darby Tien-Hao
collection	PubMed
description	BACKGROUND: Though prediction of protein secondary structures has been an active research issue in bioinformatics for quite a few years and many approaches have been proposed, a new challenge emerges as the sizes of contemporary protein structure databases continue to grow rapidly. The new challenge concerns how we can effectively exploit all the information implicitly deposited in the protein structure databases and deliver ever-improving prediction accuracy as the databases expand rapidly. FINDINGS: The new challenge is addressed in this article by proposing a predictor designed with a novel kernel density estimation algorithm. One main distinctive feature of the kernel density estimation based approach is that the average execution time taken by the training process is in the order of O(nlogn), where n is the number of instances in the training dataset. In the experiments reported in this article, the proposed predictor delivered an average Q(3 )(three-state prediction accuracy) score of 80.3% and an average SOV (segment overlap) score of 76.9% for a set of 27 benchmark protein chains extracted from the EVA server that are longer than 100 residues. CONCLUSION: The experimental results reported in this article reveal that we can continue to achieve higher prediction accuracy of protein secondary structures by effectively exploiting the structural information deposited in fast-growing protein structure databases. In this respect, the kernel density estimation based approach enjoys a distinctive advantage with its low time complexity for carrying out the training process.
format	Text
id	pubmed-2527571
institution	National Center for Biotechnology Information
language	English
publishDate	2008
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-25275712008-09-02 Prediction of protein secondary structures with a novel kernel density estimation based classifier Chang, Darby Tien-Hao Ou, Yu-Yen Hung, Hao-Geng Yang, Meng-Han Chen, Chien-Yu Oyang, Yen-Jen BMC Res Notes Technical Note BACKGROUND: Though prediction of protein secondary structures has been an active research issue in bioinformatics for quite a few years and many approaches have been proposed, a new challenge emerges as the sizes of contemporary protein structure databases continue to grow rapidly. The new challenge concerns how we can effectively exploit all the information implicitly deposited in the protein structure databases and deliver ever-improving prediction accuracy as the databases expand rapidly. FINDINGS: The new challenge is addressed in this article by proposing a predictor designed with a novel kernel density estimation algorithm. One main distinctive feature of the kernel density estimation based approach is that the average execution time taken by the training process is in the order of O(nlogn), where n is the number of instances in the training dataset. In the experiments reported in this article, the proposed predictor delivered an average Q(3 )(three-state prediction accuracy) score of 80.3% and an average SOV (segment overlap) score of 76.9% for a set of 27 benchmark protein chains extracted from the EVA server that are longer than 100 residues. CONCLUSION: The experimental results reported in this article reveal that we can continue to achieve higher prediction accuracy of protein secondary structures by effectively exploiting the structural information deposited in fast-growing protein structure databases. In this respect, the kernel density estimation based approach enjoys a distinctive advantage with its low time complexity for carrying out the training process. BioMed Central 2008-07-23 /pmc/articles/PMC2527571/ /pubmed/18710504 http://dx.doi.org/10.1186/1756-0500-1-51 Text en Copyright © 2008 Chang et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Technical Note Chang, Darby Tien-Hao Ou, Yu-Yen Hung, Hao-Geng Yang, Meng-Han Chen, Chien-Yu Oyang, Yen-Jen Prediction of protein secondary structures with a novel kernel density estimation based classifier
title	Prediction of protein secondary structures with a novel kernel density estimation based classifier
title_full	Prediction of protein secondary structures with a novel kernel density estimation based classifier
title_fullStr	Prediction of protein secondary structures with a novel kernel density estimation based classifier
title_full_unstemmed	Prediction of protein secondary structures with a novel kernel density estimation based classifier
title_short	Prediction of protein secondary structures with a novel kernel density estimation based classifier
title_sort	prediction of protein secondary structures with a novel kernel density estimation based classifier
topic	Technical Note
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2527571/ https://www.ncbi.nlm.nih.gov/pubmed/18710504 http://dx.doi.org/10.1186/1756-0500-1-51
work_keys_str_mv	AT changdarbytienhao predictionofproteinsecondarystructureswithanovelkerneldensityestimationbasedclassifier AT ouyuyen predictionofproteinsecondarystructureswithanovelkerneldensityestimationbasedclassifier AT hunghaogeng predictionofproteinsecondarystructureswithanovelkerneldensityestimationbasedclassifier AT yangmenghan predictionofproteinsecondarystructureswithanovelkerneldensityestimationbasedclassifier AT chenchienyu predictionofproteinsecondarystructureswithanovelkerneldensityestimationbasedclassifier AT oyangyenjen predictionofproteinsecondarystructureswithanovelkerneldensityestimationbasedclassifier

Prediction of protein secondary structures with a novel kernel density estimation based classifier

Ejemplares similares