Cargando…
An Efficient Hybrid Feature Selection Method Using the Artificial Immune Algorithm for High-Dimensional Data
Feature selection provides the optimal subset of features for data mining models. However, current feature selection methods for high-dimensional data also require a better balance between feature subset quality and computational cost. In this paper, an efficient hybrid feature selection method (HFI...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9584659/ https://www.ncbi.nlm.nih.gov/pubmed/36275946 http://dx.doi.org/10.1155/2022/1452301 |
_version_ | 1784813318612975616 |
---|---|
author | Zhu, Yongbin Li, Tao Li, Wenshan |
author_facet | Zhu, Yongbin Li, Tao Li, Wenshan |
author_sort | Zhu, Yongbin |
collection | PubMed |
description | Feature selection provides the optimal subset of features for data mining models. However, current feature selection methods for high-dimensional data also require a better balance between feature subset quality and computational cost. In this paper, an efficient hybrid feature selection method (HFIA) based on artificial immune algorithm optimization is proposed to solve the feature selection problem of high-dimensional data. The algorithm combines filter algorithms and improves clone selection algorithms to explore the feature space of high-dimensional data. According to the target requirements of feature selection, combined with biological research results, this method introduces the lethal mutation mechanism and the Cauchy operator to improve the search performance of the algorithm. Moreover, the adaptive adjustment factor is introduced in the mutation and update phases of the algorithm. The effective combination of these mechanisms enables the algorithm to obtain a better search ability and lower computational costs. Experimental comparisons with 19 state-of-the-art feature selection methods are conducted on 25 high-dimensional benchmark datasets. The results show that the feature reduction rate for all datasets is above 99%, and the performance improvement for the classifier is between 5% and 48.33%. Compared with the five classical filtering feature selection methods, the computational cost of HFIA is lower than the two of them, and it is far better than these five algorithms in terms of the feature reduction rate and classification accuracy improvement. Compared with the 14 hybrid feature selection methods reported in the latest literature, the average winning rates in terms of classification accuracy, feature reduction rate, and computational cost are 85.83%, 88.33%, and 96.67%, respectively. |
format | Online Article Text |
id | pubmed-9584659 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-95846592022-10-21 An Efficient Hybrid Feature Selection Method Using the Artificial Immune Algorithm for High-Dimensional Data Zhu, Yongbin Li, Tao Li, Wenshan Comput Intell Neurosci Research Article Feature selection provides the optimal subset of features for data mining models. However, current feature selection methods for high-dimensional data also require a better balance between feature subset quality and computational cost. In this paper, an efficient hybrid feature selection method (HFIA) based on artificial immune algorithm optimization is proposed to solve the feature selection problem of high-dimensional data. The algorithm combines filter algorithms and improves clone selection algorithms to explore the feature space of high-dimensional data. According to the target requirements of feature selection, combined with biological research results, this method introduces the lethal mutation mechanism and the Cauchy operator to improve the search performance of the algorithm. Moreover, the adaptive adjustment factor is introduced in the mutation and update phases of the algorithm. The effective combination of these mechanisms enables the algorithm to obtain a better search ability and lower computational costs. Experimental comparisons with 19 state-of-the-art feature selection methods are conducted on 25 high-dimensional benchmark datasets. The results show that the feature reduction rate for all datasets is above 99%, and the performance improvement for the classifier is between 5% and 48.33%. Compared with the five classical filtering feature selection methods, the computational cost of HFIA is lower than the two of them, and it is far better than these five algorithms in terms of the feature reduction rate and classification accuracy improvement. Compared with the 14 hybrid feature selection methods reported in the latest literature, the average winning rates in terms of classification accuracy, feature reduction rate, and computational cost are 85.83%, 88.33%, and 96.67%, respectively. Hindawi 2022-10-13 /pmc/articles/PMC9584659/ /pubmed/36275946 http://dx.doi.org/10.1155/2022/1452301 Text en Copyright © 2022 Yongbin Zhu et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Zhu, Yongbin Li, Tao Li, Wenshan An Efficient Hybrid Feature Selection Method Using the Artificial Immune Algorithm for High-Dimensional Data |
title | An Efficient Hybrid Feature Selection Method Using the Artificial Immune Algorithm for High-Dimensional Data |
title_full | An Efficient Hybrid Feature Selection Method Using the Artificial Immune Algorithm for High-Dimensional Data |
title_fullStr | An Efficient Hybrid Feature Selection Method Using the Artificial Immune Algorithm for High-Dimensional Data |
title_full_unstemmed | An Efficient Hybrid Feature Selection Method Using the Artificial Immune Algorithm for High-Dimensional Data |
title_short | An Efficient Hybrid Feature Selection Method Using the Artificial Immune Algorithm for High-Dimensional Data |
title_sort | efficient hybrid feature selection method using the artificial immune algorithm for high-dimensional data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9584659/ https://www.ncbi.nlm.nih.gov/pubmed/36275946 http://dx.doi.org/10.1155/2022/1452301 |
work_keys_str_mv | AT zhuyongbin anefficienthybridfeatureselectionmethodusingtheartificialimmunealgorithmforhighdimensionaldata AT litao anefficienthybridfeatureselectionmethodusingtheartificialimmunealgorithmforhighdimensionaldata AT liwenshan anefficienthybridfeatureselectionmethodusingtheartificialimmunealgorithmforhighdimensionaldata AT zhuyongbin efficienthybridfeatureselectionmethodusingtheartificialimmunealgorithmforhighdimensionaldata AT litao efficienthybridfeatureselectionmethodusingtheartificialimmunealgorithmforhighdimensionaldata AT liwenshan efficienthybridfeatureselectionmethodusingtheartificialimmunealgorithmforhighdimensionaldata |