Cargando…

A voting approach to identify a small number of highly predictive genes using multiple classifiers

BACKGROUND: Microarray gene expression profiling has provided extensive datasets that can describe characteristics of cancer patients. An important challenge for this type of data is the discovery of gene sets which can be used as the basis of developing a clinical predictor for cancer. It is desira...

Descripción completa

Detalles Bibliográficos
Autores principales: Hassan, Md Rafiul, Hossain, M Maruf, Bailey, James, Macintyre, Geoff, Ho, Joshua WK, Ramamohanarao, Kotagiri
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2648737/
https://www.ncbi.nlm.nih.gov/pubmed/19208118
http://dx.doi.org/10.1186/1471-2105-10-S1-S19
_version_ 1782164976376479744
author Hassan, Md Rafiul
Hossain, M Maruf
Bailey, James
Macintyre, Geoff
Ho, Joshua WK
Ramamohanarao, Kotagiri
author_facet Hassan, Md Rafiul
Hossain, M Maruf
Bailey, James
Macintyre, Geoff
Ho, Joshua WK
Ramamohanarao, Kotagiri
author_sort Hassan, Md Rafiul
collection PubMed
description BACKGROUND: Microarray gene expression profiling has provided extensive datasets that can describe characteristics of cancer patients. An important challenge for this type of data is the discovery of gene sets which can be used as the basis of developing a clinical predictor for cancer. It is desirable that such gene sets be compact, give accurate predictions across many classifiers, be biologically relevant and have good biological process coverage. RESULTS: By using a new type of multiple classifier voting approach, we have identified gene sets that can predict breast cancer prognosis accurately, for a range of classification algorithms. Unlike a wrapper approach, our method is not specialised towards a single classification technique. Experimental analysis demonstrates higher prediction accuracies for our sets of genes compared to previous work in the area. Moreover, our sets of genes are generally more compact than those previously proposed. Taking a biological viewpoint, from the literature, most of the genes in our sets are known to be strongly related to cancer. CONCLUSION: We show that it is possible to obtain superior classification accuracy with our approach and obtain a compact gene set that is also biologically relevant and has good coverage of different biological processes.
format Text
id pubmed-2648737
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26487372009-03-03 A voting approach to identify a small number of highly predictive genes using multiple classifiers Hassan, Md Rafiul Hossain, M Maruf Bailey, James Macintyre, Geoff Ho, Joshua WK Ramamohanarao, Kotagiri BMC Bioinformatics Research BACKGROUND: Microarray gene expression profiling has provided extensive datasets that can describe characteristics of cancer patients. An important challenge for this type of data is the discovery of gene sets which can be used as the basis of developing a clinical predictor for cancer. It is desirable that such gene sets be compact, give accurate predictions across many classifiers, be biologically relevant and have good biological process coverage. RESULTS: By using a new type of multiple classifier voting approach, we have identified gene sets that can predict breast cancer prognosis accurately, for a range of classification algorithms. Unlike a wrapper approach, our method is not specialised towards a single classification technique. Experimental analysis demonstrates higher prediction accuracies for our sets of genes compared to previous work in the area. Moreover, our sets of genes are generally more compact than those previously proposed. Taking a biological viewpoint, from the literature, most of the genes in our sets are known to be strongly related to cancer. CONCLUSION: We show that it is possible to obtain superior classification accuracy with our approach and obtain a compact gene set that is also biologically relevant and has good coverage of different biological processes. BioMed Central 2009-01-30 /pmc/articles/PMC2648737/ /pubmed/19208118 http://dx.doi.org/10.1186/1471-2105-10-S1-S19 Text en Copyright © 2009 Hassan et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Hassan, Md Rafiul
Hossain, M Maruf
Bailey, James
Macintyre, Geoff
Ho, Joshua WK
Ramamohanarao, Kotagiri
A voting approach to identify a small number of highly predictive genes using multiple classifiers
title A voting approach to identify a small number of highly predictive genes using multiple classifiers
title_full A voting approach to identify a small number of highly predictive genes using multiple classifiers
title_fullStr A voting approach to identify a small number of highly predictive genes using multiple classifiers
title_full_unstemmed A voting approach to identify a small number of highly predictive genes using multiple classifiers
title_short A voting approach to identify a small number of highly predictive genes using multiple classifiers
title_sort voting approach to identify a small number of highly predictive genes using multiple classifiers
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2648737/
https://www.ncbi.nlm.nih.gov/pubmed/19208118
http://dx.doi.org/10.1186/1471-2105-10-S1-S19
work_keys_str_mv AT hassanmdrafiul avotingapproachtoidentifyasmallnumberofhighlypredictivegenesusingmultipleclassifiers
AT hossainmmaruf avotingapproachtoidentifyasmallnumberofhighlypredictivegenesusingmultipleclassifiers
AT baileyjames avotingapproachtoidentifyasmallnumberofhighlypredictivegenesusingmultipleclassifiers
AT macintyregeoff avotingapproachtoidentifyasmallnumberofhighlypredictivegenesusingmultipleclassifiers
AT hojoshuawk avotingapproachtoidentifyasmallnumberofhighlypredictivegenesusingmultipleclassifiers
AT ramamohanaraokotagiri avotingapproachtoidentifyasmallnumberofhighlypredictivegenesusingmultipleclassifiers
AT hassanmdrafiul votingapproachtoidentifyasmallnumberofhighlypredictivegenesusingmultipleclassifiers
AT hossainmmaruf votingapproachtoidentifyasmallnumberofhighlypredictivegenesusingmultipleclassifiers
AT baileyjames votingapproachtoidentifyasmallnumberofhighlypredictivegenesusingmultipleclassifiers
AT macintyregeoff votingapproachtoidentifyasmallnumberofhighlypredictivegenesusingmultipleclassifiers
AT hojoshuawk votingapproachtoidentifyasmallnumberofhighlypredictivegenesusingmultipleclassifiers
AT ramamohanaraokotagiri votingapproachtoidentifyasmallnumberofhighlypredictivegenesusingmultipleclassifiers