Cargando…

Recursive gene selection based on maximum margin criterion: a comparison with SVM-RFE

BACKGROUND: In class prediction problems using microarray data, gene selection is essential to improve the prediction accuracy and to identify potential marker genes for a disease. Among numerous existing methods for gene selection, support vector machine-based recursive feature elimination (SVM-RFE...

Descripción completa

Detalles Bibliográficos
Autores principales: Niijima, Satoshi, Kuhara, Satoru
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1790716/
https://www.ncbi.nlm.nih.gov/pubmed/17187691
http://dx.doi.org/10.1186/1471-2105-7-543
_version_ 1782132126822432768
author Niijima, Satoshi
Kuhara, Satoru
author_facet Niijima, Satoshi
Kuhara, Satoru
author_sort Niijima, Satoshi
collection PubMed
description BACKGROUND: In class prediction problems using microarray data, gene selection is essential to improve the prediction accuracy and to identify potential marker genes for a disease. Among numerous existing methods for gene selection, support vector machine-based recursive feature elimination (SVM-RFE) has become one of the leading methods and is being widely used. The SVM-based approach performs gene selection using the weight vector of the hyperplane constructed by the samples on the margin. However, the performance can be easily affected by noise and outliers, when it is applied to noisy, small sample size microarray data. RESULTS: In this paper, we propose a recursive gene selection method using the discriminant vector of the maximum margin criterion (MMC), which is a variant of classical linear discriminant analysis (LDA). To overcome the computational drawback of classical LDA and the problem of high dimensionality, we present efficient and stable algorithms for MMC-based RFE (MMC-RFE). The MMC-RFE algorithms naturally extend to multi-class cases. The performance of MMC-RFE was extensively compared with that of SVM-RFE using nine cancer microarray datasets, including four multi-class datasets. CONCLUSION: Our extensive comparison has demonstrated that for binary-class datasets MMC-RFE tends to show intermediate performance between hard-margin SVM-RFE and SVM-RFE with a properly chosen soft-margin parameter. Notably, MMC-RFE achieves significantly better performance with a smaller number of genes than SVM-RFE for multi-class datasets. The results suggest that MMC-RFE is less sensitive to noise and outliers due to the use of average margin, and thus may be useful for biomarker discovery from noisy data.
format Text
id pubmed-1790716
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-17907162007-02-05 Recursive gene selection based on maximum margin criterion: a comparison with SVM-RFE Niijima, Satoshi Kuhara, Satoru BMC Bioinformatics Research Article BACKGROUND: In class prediction problems using microarray data, gene selection is essential to improve the prediction accuracy and to identify potential marker genes for a disease. Among numerous existing methods for gene selection, support vector machine-based recursive feature elimination (SVM-RFE) has become one of the leading methods and is being widely used. The SVM-based approach performs gene selection using the weight vector of the hyperplane constructed by the samples on the margin. However, the performance can be easily affected by noise and outliers, when it is applied to noisy, small sample size microarray data. RESULTS: In this paper, we propose a recursive gene selection method using the discriminant vector of the maximum margin criterion (MMC), which is a variant of classical linear discriminant analysis (LDA). To overcome the computational drawback of classical LDA and the problem of high dimensionality, we present efficient and stable algorithms for MMC-based RFE (MMC-RFE). The MMC-RFE algorithms naturally extend to multi-class cases. The performance of MMC-RFE was extensively compared with that of SVM-RFE using nine cancer microarray datasets, including four multi-class datasets. CONCLUSION: Our extensive comparison has demonstrated that for binary-class datasets MMC-RFE tends to show intermediate performance between hard-margin SVM-RFE and SVM-RFE with a properly chosen soft-margin parameter. Notably, MMC-RFE achieves significantly better performance with a smaller number of genes than SVM-RFE for multi-class datasets. The results suggest that MMC-RFE is less sensitive to noise and outliers due to the use of average margin, and thus may be useful for biomarker discovery from noisy data. BioMed Central 2006-12-25 /pmc/articles/PMC1790716/ /pubmed/17187691 http://dx.doi.org/10.1186/1471-2105-7-543 Text en Copyright © 2006 Niijima and Kuhara; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Niijima, Satoshi
Kuhara, Satoru
Recursive gene selection based on maximum margin criterion: a comparison with SVM-RFE
title Recursive gene selection based on maximum margin criterion: a comparison with SVM-RFE
title_full Recursive gene selection based on maximum margin criterion: a comparison with SVM-RFE
title_fullStr Recursive gene selection based on maximum margin criterion: a comparison with SVM-RFE
title_full_unstemmed Recursive gene selection based on maximum margin criterion: a comparison with SVM-RFE
title_short Recursive gene selection based on maximum margin criterion: a comparison with SVM-RFE
title_sort recursive gene selection based on maximum margin criterion: a comparison with svm-rfe
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1790716/
https://www.ncbi.nlm.nih.gov/pubmed/17187691
http://dx.doi.org/10.1186/1471-2105-7-543
work_keys_str_mv AT niijimasatoshi recursivegeneselectionbasedonmaximummargincriterionacomparisonwithsvmrfe
AT kuharasatoru recursivegeneselectionbasedonmaximummargincriterionacomparisonwithsvmrfe