Cargando…
Recursive gene selection based on maximum margin criterion: a comparison with SVM-RFE
BACKGROUND: In class prediction problems using microarray data, gene selection is essential to improve the prediction accuracy and to identify potential marker genes for a disease. Among numerous existing methods for gene selection, support vector machine-based recursive feature elimination (SVM-RFE...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2006
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1790716/ https://www.ncbi.nlm.nih.gov/pubmed/17187691 http://dx.doi.org/10.1186/1471-2105-7-543 |
_version_ | 1782132126822432768 |
---|---|
author | Niijima, Satoshi Kuhara, Satoru |
author_facet | Niijima, Satoshi Kuhara, Satoru |
author_sort | Niijima, Satoshi |
collection | PubMed |
description | BACKGROUND: In class prediction problems using microarray data, gene selection is essential to improve the prediction accuracy and to identify potential marker genes for a disease. Among numerous existing methods for gene selection, support vector machine-based recursive feature elimination (SVM-RFE) has become one of the leading methods and is being widely used. The SVM-based approach performs gene selection using the weight vector of the hyperplane constructed by the samples on the margin. However, the performance can be easily affected by noise and outliers, when it is applied to noisy, small sample size microarray data. RESULTS: In this paper, we propose a recursive gene selection method using the discriminant vector of the maximum margin criterion (MMC), which is a variant of classical linear discriminant analysis (LDA). To overcome the computational drawback of classical LDA and the problem of high dimensionality, we present efficient and stable algorithms for MMC-based RFE (MMC-RFE). The MMC-RFE algorithms naturally extend to multi-class cases. The performance of MMC-RFE was extensively compared with that of SVM-RFE using nine cancer microarray datasets, including four multi-class datasets. CONCLUSION: Our extensive comparison has demonstrated that for binary-class datasets MMC-RFE tends to show intermediate performance between hard-margin SVM-RFE and SVM-RFE with a properly chosen soft-margin parameter. Notably, MMC-RFE achieves significantly better performance with a smaller number of genes than SVM-RFE for multi-class datasets. The results suggest that MMC-RFE is less sensitive to noise and outliers due to the use of average margin, and thus may be useful for biomarker discovery from noisy data. |
format | Text |
id | pubmed-1790716 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2006 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-17907162007-02-05 Recursive gene selection based on maximum margin criterion: a comparison with SVM-RFE Niijima, Satoshi Kuhara, Satoru BMC Bioinformatics Research Article BACKGROUND: In class prediction problems using microarray data, gene selection is essential to improve the prediction accuracy and to identify potential marker genes for a disease. Among numerous existing methods for gene selection, support vector machine-based recursive feature elimination (SVM-RFE) has become one of the leading methods and is being widely used. The SVM-based approach performs gene selection using the weight vector of the hyperplane constructed by the samples on the margin. However, the performance can be easily affected by noise and outliers, when it is applied to noisy, small sample size microarray data. RESULTS: In this paper, we propose a recursive gene selection method using the discriminant vector of the maximum margin criterion (MMC), which is a variant of classical linear discriminant analysis (LDA). To overcome the computational drawback of classical LDA and the problem of high dimensionality, we present efficient and stable algorithms for MMC-based RFE (MMC-RFE). The MMC-RFE algorithms naturally extend to multi-class cases. The performance of MMC-RFE was extensively compared with that of SVM-RFE using nine cancer microarray datasets, including four multi-class datasets. CONCLUSION: Our extensive comparison has demonstrated that for binary-class datasets MMC-RFE tends to show intermediate performance between hard-margin SVM-RFE and SVM-RFE with a properly chosen soft-margin parameter. Notably, MMC-RFE achieves significantly better performance with a smaller number of genes than SVM-RFE for multi-class datasets. The results suggest that MMC-RFE is less sensitive to noise and outliers due to the use of average margin, and thus may be useful for biomarker discovery from noisy data. BioMed Central 2006-12-25 /pmc/articles/PMC1790716/ /pubmed/17187691 http://dx.doi.org/10.1186/1471-2105-7-543 Text en Copyright © 2006 Niijima and Kuhara; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Niijima, Satoshi Kuhara, Satoru Recursive gene selection based on maximum margin criterion: a comparison with SVM-RFE |
title | Recursive gene selection based on maximum margin criterion: a comparison with SVM-RFE |
title_full | Recursive gene selection based on maximum margin criterion: a comparison with SVM-RFE |
title_fullStr | Recursive gene selection based on maximum margin criterion: a comparison with SVM-RFE |
title_full_unstemmed | Recursive gene selection based on maximum margin criterion: a comparison with SVM-RFE |
title_short | Recursive gene selection based on maximum margin criterion: a comparison with SVM-RFE |
title_sort | recursive gene selection based on maximum margin criterion: a comparison with svm-rfe |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1790716/ https://www.ncbi.nlm.nih.gov/pubmed/17187691 http://dx.doi.org/10.1186/1471-2105-7-543 |
work_keys_str_mv | AT niijimasatoshi recursivegeneselectionbasedonmaximummargincriterionacomparisonwithsvmrfe AT kuharasatoru recursivegeneselectionbasedonmaximummargincriterionacomparisonwithsvmrfe |