Feature selection using Haar wavelet power spectrum

BACKGROUND: Feature selection is an approach to overcoming the 'curse of dimensionality' in complex research problems such as disease classification using microarrays. Statistical methods are the most widely used in this domain, but most of them do not suit a wide range of datasets. The transform oriented sig...

Bibliographic Details
Main authors: Subramani, Prabakaran; Sahu, Rajendra; Verma, Shekhar
Format: Text
Language: English
Published: BioMed Central 2006
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1618414/
https://www.ncbi.nlm.nih.gov/pubmed/17022808
http://dx.doi.org/10.1186/1471-2105-7-432
_version_ 1782130521580503040
author Subramani, Prabakaran
Sahu, Rajendra
Verma, Shekhar
author_facet Subramani, Prabakaran
Sahu, Rajendra
Verma, Shekhar
author_sort Subramani, Prabakaran
collection PubMed
description BACKGROUND: Feature selection is an approach to overcoming the 'curse of dimensionality' in complex research problems such as disease classification using microarrays. Statistical methods are the most widely used in this domain, but most of them do not suit a wide range of datasets. Transform-oriented signal processing techniques have been little explored here, although fields such as image and video processing use them well. Wavelets, one such technique, have the potential to be used for feature selection. The aim of this paper is to assess the capability of the Haar wavelet power spectrum for clustering and gene selection based on expression data in the context of disease classification, and to propose a method based on the Haar wavelet power spectrum. RESULTS: Haar wavelet power spectra of genes were analysed and were observed to differ across diagnostic categories. This difference in the trend and magnitude of the spectrum may be utilized in gene selection. Most of the genes selected by earlier, more complex methods were also selected by the very simple present method. Earlier works showed that only a few genes are enough to approach the classification problem [1]. Hence the present method may be tried in conjunction with other classification methods. The technique was applied without removing noise from the data, to test the robustness of the method against noise and outliers. No special software or complex implementation is needed. The quality of the genes selected by the present method was analysed through their gene expression data. Most of them were observed to be relevant to the classification problem, since they were dominant in the diagnostic category of the dataset for which they were selected as features. CONCLUSION: In the present paper, the problem of feature selection for microarray gene expression data was considered. We analyzed the wavelet power spectrum of genes and proposed a clustering and feature selection method, useful for classification, based on the Haar wavelet power spectrum. The application of this technique in this area is novel, simple, and faster than other methods, and suits a wide range of data types. The results are encouraging and shed light on the possibility of using this technique for problem domains such as disease classification, gene network identification and personalized drug design.
format Text
id pubmed-1618414
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-16184142006-10-20 Feature selection using Haar wavelet power spectrum Subramani, Prabakaran Sahu, Rajendra Verma, Shekhar BMC Bioinformatics Research Article BioMed Central 2006-10-05 /pmc/articles/PMC1618414/ /pubmed/17022808 http://dx.doi.org/10.1186/1471-2105-7-432 Text en Copyright © 2006 Subramani et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Subramani, Prabakaran
Sahu, Rajendra
Verma, Shekhar
Feature selection using Haar wavelet power spectrum
title Feature selection using Haar wavelet power spectrum
title_full Feature selection using Haar wavelet power spectrum
title_fullStr Feature selection using Haar wavelet power spectrum
title_full_unstemmed Feature selection using Haar wavelet power spectrum
title_short Feature selection using Haar wavelet power spectrum
title_sort feature selection using haar wavelet power spectrum
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1618414/
https://www.ncbi.nlm.nih.gov/pubmed/17022808
http://dx.doi.org/10.1186/1471-2105-7-432
work_keys_str_mv AT subramaniprabakaran featureselectionusinghaarwaveletpowerspectrum
AT sahurajendra featureselectionusinghaarwaveletpowerspectrum
AT vermashekhar featureselectionusinghaarwaveletpowerspectrum