Cargando…

Nested ensemble selection: An effective hybrid feature selection method

It has been shown that while feature selection algorithms are able to distinguish between relevant and irrelevant features, they fail to differentiate between relevant and redundant and correlated features. To address this issue, we propose a highly effective approach, called Nested Ensemble Selecti...

Descripción completa

Detalles Bibliográficos
Autores principales: Kamalov, Firuz, Sulieman, Hana, Moussa, Sherif, Reyes, Jorge Avante, Safaraliev, Murodbek
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10558945/
https://www.ncbi.nlm.nih.gov/pubmed/37809839
http://dx.doi.org/10.1016/j.heliyon.2023.e19686
_version_ 1785117395956793344
author Kamalov, Firuz
Sulieman, Hana
Moussa, Sherif
Reyes, Jorge Avante
Safaraliev, Murodbek
author_facet Kamalov, Firuz
Sulieman, Hana
Moussa, Sherif
Reyes, Jorge Avante
Safaraliev, Murodbek
author_sort Kamalov, Firuz
collection PubMed
description It has been shown that while feature selection algorithms are able to distinguish between relevant and irrelevant features, they fail to differentiate between relevant and redundant and correlated features. To address this issue, we propose a highly effective approach, called Nested Ensemble Selection (NES), that is based on a combination of filter and wrapper methods. The proposed feature selection algorithm differs from the existing filter-wrapper hybrid methods in its simplicity and efficiency as well as precision. The new algorithm is able to separate the relevant variables from the irrelevant as well as the redundant and correlated features. Furthermore, we provide a robust heuristic for identifying the optimal number of selected features which remains one of the greatest challenges in feature selection. Numerical experiments on synthetic and real-life data demonstrate the effectiveness of the proposed method. The NES algorithm achieves perfect precision on the synthetic data and near optimal accuracy on the real-life data. The proposed method is compared against several popular algorithms including mRMR, Boruta, genetic, recursive feature elimination, Lasso, and Elastic Net. The results show that NES significantly outperforms the benchmarks algorithms especially on multi-class datasets.
format Online
Article
Text
id pubmed-10558945
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-105589452023-10-08 Nested ensemble selection: An effective hybrid feature selection method Kamalov, Firuz Sulieman, Hana Moussa, Sherif Reyes, Jorge Avante Safaraliev, Murodbek Heliyon Research Article It has been shown that while feature selection algorithms are able to distinguish between relevant and irrelevant features, they fail to differentiate between relevant and redundant and correlated features. To address this issue, we propose a highly effective approach, called Nested Ensemble Selection (NES), that is based on a combination of filter and wrapper methods. The proposed feature selection algorithm differs from the existing filter-wrapper hybrid methods in its simplicity and efficiency as well as precision. The new algorithm is able to separate the relevant variables from the irrelevant as well as the redundant and correlated features. Furthermore, we provide a robust heuristic for identifying the optimal number of selected features which remains one of the greatest challenges in feature selection. Numerical experiments on synthetic and real-life data demonstrate the effectiveness of the proposed method. The NES algorithm achieves perfect precision on the synthetic data and near optimal accuracy on the real-life data. The proposed method is compared against several popular algorithms including mRMR, Boruta, genetic, recursive feature elimination, Lasso, and Elastic Net. The results show that NES significantly outperforms the benchmarks algorithms especially on multi-class datasets. Elsevier 2023-09-09 /pmc/articles/PMC10558945/ /pubmed/37809839 http://dx.doi.org/10.1016/j.heliyon.2023.e19686 Text en © 2023 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Research Article
Kamalov, Firuz
Sulieman, Hana
Moussa, Sherif
Reyes, Jorge Avante
Safaraliev, Murodbek
Nested ensemble selection: An effective hybrid feature selection method
title Nested ensemble selection: An effective hybrid feature selection method
title_full Nested ensemble selection: An effective hybrid feature selection method
title_fullStr Nested ensemble selection: An effective hybrid feature selection method
title_full_unstemmed Nested ensemble selection: An effective hybrid feature selection method
title_short Nested ensemble selection: An effective hybrid feature selection method
title_sort nested ensemble selection: an effective hybrid feature selection method
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10558945/
https://www.ncbi.nlm.nih.gov/pubmed/37809839
http://dx.doi.org/10.1016/j.heliyon.2023.e19686
work_keys_str_mv AT kamalovfiruz nestedensembleselectionaneffectivehybridfeatureselectionmethod
AT suliemanhana nestedensembleselectionaneffectivehybridfeatureselectionmethod
AT moussasherif nestedensembleselectionaneffectivehybridfeatureselectionmethod
AT reyesjorgeavante nestedensembleselectionaneffectivehybridfeatureselectionmethod
AT safaralievmurodbek nestedensembleselectionaneffectivehybridfeatureselectionmethod