Cargando…
Recursive Feature Selection with Significant Variables of Support Vectors
The development of DNA microarray makes researchers screen thousands of genes simultaneously and it also helps determine high- and low-expression level genes in normal and disease tissues. Selecting relevant genes for cancer classification is an important issue. Most of the gene selection methods us...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3426197/ https://www.ncbi.nlm.nih.gov/pubmed/22927888 http://dx.doi.org/10.1155/2012/712542 |
_version_ | 1782241480359804928 |
---|---|
author | Tsai, Chen-An Huang, Chien-Hsun Chang, Ching-Wei Chen, Chun-Houh |
author_facet | Tsai, Chen-An Huang, Chien-Hsun Chang, Ching-Wei Chen, Chun-Houh |
author_sort | Tsai, Chen-An |
collection | PubMed |
description | The development of DNA microarray makes researchers screen thousands of genes simultaneously and it also helps determine high- and low-expression level genes in normal and disease tissues. Selecting relevant genes for cancer classification is an important issue. Most of the gene selection methods use univariate ranking criteria and arbitrarily choose a threshold to choose genes. However, the parameter setting may not be compatible to the selected classification algorithms. In this paper, we propose a new gene selection method (SVM-t) based on the use of t-statistics embedded in support vector machine. We compared the performance to two similar SVM-based methods: SVM recursive feature elimination (SVMRFE) and recursive support vector machine (RSVM). The three methods were compared based on extensive simulation experiments and analyses of two published microarray datasets. In the simulation experiments, we found that the proposed method is more robust in selecting informative genes than SVMRFE and RSVM and capable to attain good classification performance when the variations of informative and noninformative genes are different. In the analysis of two microarray datasets, the proposed method yields better performance in identifying fewer genes with good prediction accuracy, compared to SVMRFE and RSVM. |
format | Online Article Text |
id | pubmed-3426197 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-34261972012-08-27 Recursive Feature Selection with Significant Variables of Support Vectors Tsai, Chen-An Huang, Chien-Hsun Chang, Ching-Wei Chen, Chun-Houh Comput Math Methods Med Research Article The development of DNA microarray makes researchers screen thousands of genes simultaneously and it also helps determine high- and low-expression level genes in normal and disease tissues. Selecting relevant genes for cancer classification is an important issue. Most of the gene selection methods use univariate ranking criteria and arbitrarily choose a threshold to choose genes. However, the parameter setting may not be compatible to the selected classification algorithms. In this paper, we propose a new gene selection method (SVM-t) based on the use of t-statistics embedded in support vector machine. We compared the performance to two similar SVM-based methods: SVM recursive feature elimination (SVMRFE) and recursive support vector machine (RSVM). The three methods were compared based on extensive simulation experiments and analyses of two published microarray datasets. In the simulation experiments, we found that the proposed method is more robust in selecting informative genes than SVMRFE and RSVM and capable to attain good classification performance when the variations of informative and noninformative genes are different. In the analysis of two microarray datasets, the proposed method yields better performance in identifying fewer genes with good prediction accuracy, compared to SVMRFE and RSVM. Hindawi Publishing Corporation 2012 2012-08-15 /pmc/articles/PMC3426197/ /pubmed/22927888 http://dx.doi.org/10.1155/2012/712542 Text en Copyright © 2012 Chen-An Tsai et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Tsai, Chen-An Huang, Chien-Hsun Chang, Ching-Wei Chen, Chun-Houh Recursive Feature Selection with Significant Variables of Support Vectors |
title | Recursive Feature Selection with Significant Variables of Support Vectors |
title_full | Recursive Feature Selection with Significant Variables of Support Vectors |
title_fullStr | Recursive Feature Selection with Significant Variables of Support Vectors |
title_full_unstemmed | Recursive Feature Selection with Significant Variables of Support Vectors |
title_short | Recursive Feature Selection with Significant Variables of Support Vectors |
title_sort | recursive feature selection with significant variables of support vectors |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3426197/ https://www.ncbi.nlm.nih.gov/pubmed/22927888 http://dx.doi.org/10.1155/2012/712542 |
work_keys_str_mv | AT tsaichenan recursivefeatureselectionwithsignificantvariablesofsupportvectors AT huangchienhsun recursivefeatureselectionwithsignificantvariablesofsupportvectors AT changchingwei recursivefeatureselectionwithsignificantvariablesofsupportvectors AT chenchunhouh recursivefeatureselectionwithsignificantvariablesofsupportvectors |