Cargando…

Error margin analysis for feature gene extraction

BACKGROUND: Feature gene extraction is a fundamental issue in microarray-based biomarker discovery. It is normally treated as an optimization problem of finding the best predictive feature genes that can effectively and stably discriminate distinct types of disease conditions, e.g. tumors and normal...

Descripción completa

Detalles Bibliográficos
Autores principales:	Chow, Chi Kin, Zhu, Hai Long, Lacy, Jessica, Kuo, Winston P
Formato:	Texto
Lenguaje:	English
Publicado:	BioMed Central 2010
Materias:	Methodology article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2885372/ https://www.ncbi.nlm.nih.gov/pubmed/20459827 http://dx.doi.org/10.1186/1471-2105-11-241

_version_	1782182380703842304
author	Chow, Chi Kin Zhu, Hai Long Lacy, Jessica Kuo, Winston P
author_facet	Chow, Chi Kin Zhu, Hai Long Lacy, Jessica Kuo, Winston P
author_sort	Chow, Chi Kin
collection	PubMed
description	BACKGROUND: Feature gene extraction is a fundamental issue in microarray-based biomarker discovery. It is normally treated as an optimization problem of finding the best predictive feature genes that can effectively and stably discriminate distinct types of disease conditions, e.g. tumors and normals. Since gene microarray data normally involves thousands of genes at, tens or hundreds of samples, the gene extraction process may fall into local optimums if the gene set is optimized according to the maximization of classification accuracy of the classifier built from it. RESULTS: In this paper, we propose a novel gene extraction method of error margin analysis to optimize the feature genes. The proposed algorithm has been tested upon one synthetic dataset and two real microarray datasets. Meanwhile, it has been compared with five existing gene extraction algorithms on each dataset. On the synthetic dataset, the results show that the feature set extracted by our algorithm is the closest to the actual gene set. For the two real datasets, our algorithm is superior in terms of balancing the size and the validation accuracy of the resultant gene set when comparing to other algorithms. CONCLUSION: Because of its distinct features, error margin analysis method can stably extract the relevant feature genes from microarray data for high-performance classification.
format	Text
id	pubmed-2885372
institution	National Center for Biotechnology Information
language	English
publishDate	2010
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-28853722010-06-15 Error margin analysis for feature gene extraction Chow, Chi Kin Zhu, Hai Long Lacy, Jessica Kuo, Winston P BMC Bioinformatics Methodology article BACKGROUND: Feature gene extraction is a fundamental issue in microarray-based biomarker discovery. It is normally treated as an optimization problem of finding the best predictive feature genes that can effectively and stably discriminate distinct types of disease conditions, e.g. tumors and normals. Since gene microarray data normally involves thousands of genes at, tens or hundreds of samples, the gene extraction process may fall into local optimums if the gene set is optimized according to the maximization of classification accuracy of the classifier built from it. RESULTS: In this paper, we propose a novel gene extraction method of error margin analysis to optimize the feature genes. The proposed algorithm has been tested upon one synthetic dataset and two real microarray datasets. Meanwhile, it has been compared with five existing gene extraction algorithms on each dataset. On the synthetic dataset, the results show that the feature set extracted by our algorithm is the closest to the actual gene set. For the two real datasets, our algorithm is superior in terms of balancing the size and the validation accuracy of the resultant gene set when comparing to other algorithms. CONCLUSION: Because of its distinct features, error margin analysis method can stably extract the relevant feature genes from microarray data for high-performance classification. BioMed Central 2010-05-11 /pmc/articles/PMC2885372/ /pubmed/20459827 http://dx.doi.org/10.1186/1471-2105-11-241 Text en Copyright ©2010 Chow et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Methodology article Chow, Chi Kin Zhu, Hai Long Lacy, Jessica Kuo, Winston P Error margin analysis for feature gene extraction
title	Error margin analysis for feature gene extraction
title_full	Error margin analysis for feature gene extraction
title_fullStr	Error margin analysis for feature gene extraction
title_full_unstemmed	Error margin analysis for feature gene extraction
title_short	Error margin analysis for feature gene extraction
title_sort	error margin analysis for feature gene extraction
topic	Methodology article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2885372/ https://www.ncbi.nlm.nih.gov/pubmed/20459827 http://dx.doi.org/10.1186/1471-2105-11-241
work_keys_str_mv	AT chowchikin errormarginanalysisforfeaturegeneextraction AT zhuhailong errormarginanalysisforfeaturegeneextraction AT lacyjessica errormarginanalysisforfeaturegeneextraction AT kuowinstonp errormarginanalysisforfeaturegeneextraction

Error margin analysis for feature gene extraction

Ejemplares similares