Cargando…

A Hybrid Approach for Biomarker Discovery from Microarray Gene Expression Data for Cancer Classification

Microarrays allow researchers to monitor the gene expression patterns for tens of thousands of genes across a wide range of cellular responses, phenotype and conditions. Selecting a small subset of discriminate genes from thousands of genes is important for accurate classification of diseases and ph...

Descripción completa

Detalles Bibliográficos
Autores principales: Peng, Yanxiong, Li, Wenyuan, Liu, Ying
Formato: Texto
Lenguaje:English
Publicado: Libertas Academica 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2675487/
https://www.ncbi.nlm.nih.gov/pubmed/19458773
_version_ 1782166699152244736
author Peng, Yanxiong
Li, Wenyuan
Liu, Ying
author_facet Peng, Yanxiong
Li, Wenyuan
Liu, Ying
author_sort Peng, Yanxiong
collection PubMed
description Microarrays allow researchers to monitor the gene expression patterns for tens of thousands of genes across a wide range of cellular responses, phenotype and conditions. Selecting a small subset of discriminate genes from thousands of genes is important for accurate classification of diseases and phenotypes. Many methods have been proposed to find subsets of genes with maximum relevance and minimum redundancy, which can distinguish accurately between samples with different labels. To find the minimum subset of relevant genes is often referred as biomarker discovery. Two main approaches, filter and wrapper techniques, have been applied to biomarker discovery. In this paper, we conducted a comparative study of different biomarker discovery methods, including six filter methods and three wrapper methods. We then proposed a hybrid approach, FR-Wrapper, for biomarker discovery. The aim of this approach is to find an optimum balance between the precision of the biomarker discovery and the computation cost, by taking advantages of both filter method’s efficiency and wrapper method’s high accuracy. Our hybrid approach applies Fisher’s ratio, a simple method easy to understand and implement, to filter out most of the irrelevant genes, then a wrapper method is employed to reduce the redundancy. The performance of FR-Wrapper approach is evaluated over four widely used microarray datasets. Analysis of experimental results reveals that the hybrid approach can achieve the goal of maximum relevance with minimum redundancy.
format Text
id pubmed-2675487
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-26754872009-05-20 A Hybrid Approach for Biomarker Discovery from Microarray Gene Expression Data for Cancer Classification Peng, Yanxiong Li, Wenyuan Liu, Ying Cancer Inform Original Research Microarrays allow researchers to monitor the gene expression patterns for tens of thousands of genes across a wide range of cellular responses, phenotype and conditions. Selecting a small subset of discriminate genes from thousands of genes is important for accurate classification of diseases and phenotypes. Many methods have been proposed to find subsets of genes with maximum relevance and minimum redundancy, which can distinguish accurately between samples with different labels. To find the minimum subset of relevant genes is often referred as biomarker discovery. Two main approaches, filter and wrapper techniques, have been applied to biomarker discovery. In this paper, we conducted a comparative study of different biomarker discovery methods, including six filter methods and three wrapper methods. We then proposed a hybrid approach, FR-Wrapper, for biomarker discovery. The aim of this approach is to find an optimum balance between the precision of the biomarker discovery and the computation cost, by taking advantages of both filter method’s efficiency and wrapper method’s high accuracy. Our hybrid approach applies Fisher’s ratio, a simple method easy to understand and implement, to filter out most of the irrelevant genes, then a wrapper method is employed to reduce the redundancy. The performance of FR-Wrapper approach is evaluated over four widely used microarray datasets. Analysis of experimental results reveals that the hybrid approach can achieve the goal of maximum relevance with minimum redundancy. Libertas Academica 2007-02-22 /pmc/articles/PMC2675487/ /pubmed/19458773 Text en © 2006 The authors.
spellingShingle Original Research
Peng, Yanxiong
Li, Wenyuan
Liu, Ying
A Hybrid Approach for Biomarker Discovery from Microarray Gene Expression Data for Cancer Classification
title A Hybrid Approach for Biomarker Discovery from Microarray Gene Expression Data for Cancer Classification
title_full A Hybrid Approach for Biomarker Discovery from Microarray Gene Expression Data for Cancer Classification
title_fullStr A Hybrid Approach for Biomarker Discovery from Microarray Gene Expression Data for Cancer Classification
title_full_unstemmed A Hybrid Approach for Biomarker Discovery from Microarray Gene Expression Data for Cancer Classification
title_short A Hybrid Approach for Biomarker Discovery from Microarray Gene Expression Data for Cancer Classification
title_sort hybrid approach for biomarker discovery from microarray gene expression data for cancer classification
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2675487/
https://www.ncbi.nlm.nih.gov/pubmed/19458773
work_keys_str_mv AT pengyanxiong ahybridapproachforbiomarkerdiscoveryfrommicroarraygeneexpressiondataforcancerclassification
AT liwenyuan ahybridapproachforbiomarkerdiscoveryfrommicroarraygeneexpressiondataforcancerclassification
AT liuying ahybridapproachforbiomarkerdiscoveryfrommicroarraygeneexpressiondataforcancerclassification
AT pengyanxiong hybridapproachforbiomarkerdiscoveryfrommicroarraygeneexpressiondataforcancerclassification
AT liwenyuan hybridapproachforbiomarkerdiscoveryfrommicroarraygeneexpressiondataforcancerclassification
AT liuying hybridapproachforbiomarkerdiscoveryfrommicroarraygeneexpressiondataforcancerclassification