Cargando…

An Efficient Hybrid Feature Selection Method Using the Artificial Immune Algorithm for High-Dimensional Data

Feature selection provides the optimal subset of features for data mining models. However, current feature selection methods for high-dimensional data also require a better balance between feature subset quality and computational cost. In this paper, an efficient hybrid feature selection method (HFI...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhu, Yongbin, Li, Tao, Li, Wenshan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9584659/
https://www.ncbi.nlm.nih.gov/pubmed/36275946
http://dx.doi.org/10.1155/2022/1452301
_version_ 1784813318612975616
author Zhu, Yongbin
Li, Tao
Li, Wenshan
author_facet Zhu, Yongbin
Li, Tao
Li, Wenshan
author_sort Zhu, Yongbin
collection PubMed
description Feature selection provides the optimal subset of features for data mining models. However, current feature selection methods for high-dimensional data also require a better balance between feature subset quality and computational cost. In this paper, an efficient hybrid feature selection method (HFIA) based on artificial immune algorithm optimization is proposed to solve the feature selection problem of high-dimensional data. The algorithm combines filter algorithms and improves clone selection algorithms to explore the feature space of high-dimensional data. According to the target requirements of feature selection, combined with biological research results, this method introduces the lethal mutation mechanism and the Cauchy operator to improve the search performance of the algorithm. Moreover, the adaptive adjustment factor is introduced in the mutation and update phases of the algorithm. The effective combination of these mechanisms enables the algorithm to obtain a better search ability and lower computational costs. Experimental comparisons with 19 state-of-the-art feature selection methods are conducted on 25 high-dimensional benchmark datasets. The results show that the feature reduction rate for all datasets is above 99%, and the performance improvement for the classifier is between 5% and 48.33%. Compared with the five classical filtering feature selection methods, the computational cost of HFIA is lower than the two of them, and it is far better than these five algorithms in terms of the feature reduction rate and classification accuracy improvement. Compared with the 14 hybrid feature selection methods reported in the latest literature, the average winning rates in terms of classification accuracy, feature reduction rate, and computational cost are 85.83%, 88.33%, and 96.67%, respectively.
format Online
Article
Text
id pubmed-9584659
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-95846592022-10-21 An Efficient Hybrid Feature Selection Method Using the Artificial Immune Algorithm for High-Dimensional Data Zhu, Yongbin Li, Tao Li, Wenshan Comput Intell Neurosci Research Article Feature selection provides the optimal subset of features for data mining models. However, current feature selection methods for high-dimensional data also require a better balance between feature subset quality and computational cost. In this paper, an efficient hybrid feature selection method (HFIA) based on artificial immune algorithm optimization is proposed to solve the feature selection problem of high-dimensional data. The algorithm combines filter algorithms and improves clone selection algorithms to explore the feature space of high-dimensional data. According to the target requirements of feature selection, combined with biological research results, this method introduces the lethal mutation mechanism and the Cauchy operator to improve the search performance of the algorithm. Moreover, the adaptive adjustment factor is introduced in the mutation and update phases of the algorithm. The effective combination of these mechanisms enables the algorithm to obtain a better search ability and lower computational costs. Experimental comparisons with 19 state-of-the-art feature selection methods are conducted on 25 high-dimensional benchmark datasets. The results show that the feature reduction rate for all datasets is above 99%, and the performance improvement for the classifier is between 5% and 48.33%. Compared with the five classical filtering feature selection methods, the computational cost of HFIA is lower than the two of them, and it is far better than these five algorithms in terms of the feature reduction rate and classification accuracy improvement. Compared with the 14 hybrid feature selection methods reported in the latest literature, the average winning rates in terms of classification accuracy, feature reduction rate, and computational cost are 85.83%, 88.33%, and 96.67%, respectively. Hindawi 2022-10-13 /pmc/articles/PMC9584659/ /pubmed/36275946 http://dx.doi.org/10.1155/2022/1452301 Text en Copyright © 2022 Yongbin Zhu et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Zhu, Yongbin
Li, Tao
Li, Wenshan
An Efficient Hybrid Feature Selection Method Using the Artificial Immune Algorithm for High-Dimensional Data
title An Efficient Hybrid Feature Selection Method Using the Artificial Immune Algorithm for High-Dimensional Data
title_full An Efficient Hybrid Feature Selection Method Using the Artificial Immune Algorithm for High-Dimensional Data
title_fullStr An Efficient Hybrid Feature Selection Method Using the Artificial Immune Algorithm for High-Dimensional Data
title_full_unstemmed An Efficient Hybrid Feature Selection Method Using the Artificial Immune Algorithm for High-Dimensional Data
title_short An Efficient Hybrid Feature Selection Method Using the Artificial Immune Algorithm for High-Dimensional Data
title_sort efficient hybrid feature selection method using the artificial immune algorithm for high-dimensional data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9584659/
https://www.ncbi.nlm.nih.gov/pubmed/36275946
http://dx.doi.org/10.1155/2022/1452301
work_keys_str_mv AT zhuyongbin anefficienthybridfeatureselectionmethodusingtheartificialimmunealgorithmforhighdimensionaldata
AT litao anefficienthybridfeatureselectionmethodusingtheartificialimmunealgorithmforhighdimensionaldata
AT liwenshan anefficienthybridfeatureselectionmethodusingtheartificialimmunealgorithmforhighdimensionaldata
AT zhuyongbin efficienthybridfeatureselectionmethodusingtheartificialimmunealgorithmforhighdimensionaldata
AT litao efficienthybridfeatureselectionmethodusingtheartificialimmunealgorithmforhighdimensionaldata
AT liwenshan efficienthybridfeatureselectionmethodusingtheartificialimmunealgorithmforhighdimensionaldata