Cargando…

Recursive Feature Selection with Significant Variables of Support Vectors

The development of DNA microarray makes researchers screen thousands of genes simultaneously and it also helps determine high- and low-expression level genes in normal and disease tissues. Selecting relevant genes for cancer classification is an important issue. Most of the gene selection methods us...

Descripción completa

Detalles Bibliográficos
Autores principales: Tsai, Chen-An, Huang, Chien-Hsun, Chang, Ching-Wei, Chen, Chun-Houh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3426197/
https://www.ncbi.nlm.nih.gov/pubmed/22927888
http://dx.doi.org/10.1155/2012/712542
_version_ 1782241480359804928
author Tsai, Chen-An
Huang, Chien-Hsun
Chang, Ching-Wei
Chen, Chun-Houh
author_facet Tsai, Chen-An
Huang, Chien-Hsun
Chang, Ching-Wei
Chen, Chun-Houh
author_sort Tsai, Chen-An
collection PubMed
description The development of DNA microarray makes researchers screen thousands of genes simultaneously and it also helps determine high- and low-expression level genes in normal and disease tissues. Selecting relevant genes for cancer classification is an important issue. Most of the gene selection methods use univariate ranking criteria and arbitrarily choose a threshold to choose genes. However, the parameter setting may not be compatible to the selected classification algorithms. In this paper, we propose a new gene selection method (SVM-t) based on the use of t-statistics embedded in support vector machine. We compared the performance to two similar SVM-based methods: SVM recursive feature elimination (SVMRFE) and recursive support vector machine (RSVM). The three methods were compared based on extensive simulation experiments and analyses of two published microarray datasets. In the simulation experiments, we found that the proposed method is more robust in selecting informative genes than SVMRFE and RSVM and capable to attain good classification performance when the variations of informative and noninformative genes are different. In the analysis of two microarray datasets, the proposed method yields better performance in identifying fewer genes with good prediction accuracy, compared to SVMRFE and RSVM.
format Online
Article
Text
id pubmed-3426197
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-34261972012-08-27 Recursive Feature Selection with Significant Variables of Support Vectors Tsai, Chen-An Huang, Chien-Hsun Chang, Ching-Wei Chen, Chun-Houh Comput Math Methods Med Research Article The development of DNA microarray makes researchers screen thousands of genes simultaneously and it also helps determine high- and low-expression level genes in normal and disease tissues. Selecting relevant genes for cancer classification is an important issue. Most of the gene selection methods use univariate ranking criteria and arbitrarily choose a threshold to choose genes. However, the parameter setting may not be compatible to the selected classification algorithms. In this paper, we propose a new gene selection method (SVM-t) based on the use of t-statistics embedded in support vector machine. We compared the performance to two similar SVM-based methods: SVM recursive feature elimination (SVMRFE) and recursive support vector machine (RSVM). The three methods were compared based on extensive simulation experiments and analyses of two published microarray datasets. In the simulation experiments, we found that the proposed method is more robust in selecting informative genes than SVMRFE and RSVM and capable to attain good classification performance when the variations of informative and noninformative genes are different. In the analysis of two microarray datasets, the proposed method yields better performance in identifying fewer genes with good prediction accuracy, compared to SVMRFE and RSVM. Hindawi Publishing Corporation 2012 2012-08-15 /pmc/articles/PMC3426197/ /pubmed/22927888 http://dx.doi.org/10.1155/2012/712542 Text en Copyright © 2012 Chen-An Tsai et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Tsai, Chen-An
Huang, Chien-Hsun
Chang, Ching-Wei
Chen, Chun-Houh
Recursive Feature Selection with Significant Variables of Support Vectors
title Recursive Feature Selection with Significant Variables of Support Vectors
title_full Recursive Feature Selection with Significant Variables of Support Vectors
title_fullStr Recursive Feature Selection with Significant Variables of Support Vectors
title_full_unstemmed Recursive Feature Selection with Significant Variables of Support Vectors
title_short Recursive Feature Selection with Significant Variables of Support Vectors
title_sort recursive feature selection with significant variables of support vectors
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3426197/
https://www.ncbi.nlm.nih.gov/pubmed/22927888
http://dx.doi.org/10.1155/2012/712542
work_keys_str_mv AT tsaichenan recursivefeatureselectionwithsignificantvariablesofsupportvectors
AT huangchienhsun recursivefeatureselectionwithsignificantvariablesofsupportvectors
AT changchingwei recursivefeatureselectionwithsignificantvariablesofsupportvectors
AT chenchunhouh recursivefeatureselectionwithsignificantvariablesofsupportvectors