Cargando…

An Efficient Feature Subset Selection Algorithm for Classification of Multidimensional Dataset

Multidimensional medical data classification has recently received increased attention by researchers working on machine learning and data mining. In multidimensional dataset (MDD) each instance is associated with multiple class values. Due to its complex nature, feature selection and classifier bui...

Descripción completa

Detalles Bibliográficos
Autores principales: Devaraj, Senthilkumar, Paulraj, S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4601565/
https://www.ncbi.nlm.nih.gov/pubmed/26491718
http://dx.doi.org/10.1155/2015/821798
_version_ 1782394567324073984
author Devaraj, Senthilkumar
Paulraj, S.
author_facet Devaraj, Senthilkumar
Paulraj, S.
author_sort Devaraj, Senthilkumar
collection PubMed
description Multidimensional medical data classification has recently received increased attention by researchers working on machine learning and data mining. In multidimensional dataset (MDD) each instance is associated with multiple class values. Due to its complex nature, feature selection and classifier built from the MDD are typically more expensive or time-consuming. Therefore, we need a robust feature selection technique for selecting the optimum single subset of the features of the MDD for further analysis or to design a classifier. In this paper, an efficient feature selection algorithm is proposed for the classification of MDD. The proposed multidimensional feature subset selection (MFSS) algorithm yields a unique feature subset for further analysis or to build a classifier and there is a computational advantage on MDD compared with the existing feature selection algorithms. The proposed work is applied to benchmark multidimensional datasets. The number of features was reduced to 3% minimum and 30% maximum by using the proposed MFSS. In conclusion, the study results show that MFSS is an efficient feature selection algorithm without affecting the classification accuracy even for the reduced number of features. Also the proposed MFSS algorithm is suitable for both problem transformation and algorithm adaptation and it has great potentials in those applications generating multidimensional datasets.
format Online
Article
Text
id pubmed-4601565
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-46015652015-10-21 An Efficient Feature Subset Selection Algorithm for Classification of Multidimensional Dataset Devaraj, Senthilkumar Paulraj, S. ScientificWorldJournal Research Article Multidimensional medical data classification has recently received increased attention by researchers working on machine learning and data mining. In multidimensional dataset (MDD) each instance is associated with multiple class values. Due to its complex nature, feature selection and classifier built from the MDD are typically more expensive or time-consuming. Therefore, we need a robust feature selection technique for selecting the optimum single subset of the features of the MDD for further analysis or to design a classifier. In this paper, an efficient feature selection algorithm is proposed for the classification of MDD. The proposed multidimensional feature subset selection (MFSS) algorithm yields a unique feature subset for further analysis or to build a classifier and there is a computational advantage on MDD compared with the existing feature selection algorithms. The proposed work is applied to benchmark multidimensional datasets. The number of features was reduced to 3% minimum and 30% maximum by using the proposed MFSS. In conclusion, the study results show that MFSS is an efficient feature selection algorithm without affecting the classification accuracy even for the reduced number of features. Also the proposed MFSS algorithm is suitable for both problem transformation and algorithm adaptation and it has great potentials in those applications generating multidimensional datasets. Hindawi Publishing Corporation 2015 2015-09-28 /pmc/articles/PMC4601565/ /pubmed/26491718 http://dx.doi.org/10.1155/2015/821798 Text en Copyright © 2015 S. Devaraj and S. Paulraj. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Devaraj, Senthilkumar
Paulraj, S.
An Efficient Feature Subset Selection Algorithm for Classification of Multidimensional Dataset
title An Efficient Feature Subset Selection Algorithm for Classification of Multidimensional Dataset
title_full An Efficient Feature Subset Selection Algorithm for Classification of Multidimensional Dataset
title_fullStr An Efficient Feature Subset Selection Algorithm for Classification of Multidimensional Dataset
title_full_unstemmed An Efficient Feature Subset Selection Algorithm for Classification of Multidimensional Dataset
title_short An Efficient Feature Subset Selection Algorithm for Classification of Multidimensional Dataset
title_sort efficient feature subset selection algorithm for classification of multidimensional dataset
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4601565/
https://www.ncbi.nlm.nih.gov/pubmed/26491718
http://dx.doi.org/10.1155/2015/821798
work_keys_str_mv AT devarajsenthilkumar anefficientfeaturesubsetselectionalgorithmforclassificationofmultidimensionaldataset
AT paulrajs anefficientfeaturesubsetselectionalgorithmforclassificationofmultidimensionaldataset
AT devarajsenthilkumar efficientfeaturesubsetselectionalgorithmforclassificationofmultidimensionaldataset
AT paulrajs efficientfeaturesubsetselectionalgorithmforclassificationofmultidimensionaldataset