Cargando…
Nested ensemble selection: An effective hybrid feature selection method
It has been shown that while feature selection algorithms are able to distinguish between relevant and irrelevant features, they fail to differentiate between relevant and redundant and correlated features. To address this issue, we propose a highly effective approach, called Nested Ensemble Selecti...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10558945/ https://www.ncbi.nlm.nih.gov/pubmed/37809839 http://dx.doi.org/10.1016/j.heliyon.2023.e19686 |
_version_ | 1785117395956793344 |
---|---|
author | Kamalov, Firuz Sulieman, Hana Moussa, Sherif Reyes, Jorge Avante Safaraliev, Murodbek |
author_facet | Kamalov, Firuz Sulieman, Hana Moussa, Sherif Reyes, Jorge Avante Safaraliev, Murodbek |
author_sort | Kamalov, Firuz |
collection | PubMed |
description | It has been shown that while feature selection algorithms are able to distinguish between relevant and irrelevant features, they fail to differentiate between relevant and redundant and correlated features. To address this issue, we propose a highly effective approach, called Nested Ensemble Selection (NES), that is based on a combination of filter and wrapper methods. The proposed feature selection algorithm differs from the existing filter-wrapper hybrid methods in its simplicity and efficiency as well as precision. The new algorithm is able to separate the relevant variables from the irrelevant as well as the redundant and correlated features. Furthermore, we provide a robust heuristic for identifying the optimal number of selected features which remains one of the greatest challenges in feature selection. Numerical experiments on synthetic and real-life data demonstrate the effectiveness of the proposed method. The NES algorithm achieves perfect precision on the synthetic data and near optimal accuracy on the real-life data. The proposed method is compared against several popular algorithms including mRMR, Boruta, genetic, recursive feature elimination, Lasso, and Elastic Net. The results show that NES significantly outperforms the benchmarks algorithms especially on multi-class datasets. |
format | Online Article Text |
id | pubmed-10558945 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-105589452023-10-08 Nested ensemble selection: An effective hybrid feature selection method Kamalov, Firuz Sulieman, Hana Moussa, Sherif Reyes, Jorge Avante Safaraliev, Murodbek Heliyon Research Article It has been shown that while feature selection algorithms are able to distinguish between relevant and irrelevant features, they fail to differentiate between relevant and redundant and correlated features. To address this issue, we propose a highly effective approach, called Nested Ensemble Selection (NES), that is based on a combination of filter and wrapper methods. The proposed feature selection algorithm differs from the existing filter-wrapper hybrid methods in its simplicity and efficiency as well as precision. The new algorithm is able to separate the relevant variables from the irrelevant as well as the redundant and correlated features. Furthermore, we provide a robust heuristic for identifying the optimal number of selected features which remains one of the greatest challenges in feature selection. Numerical experiments on synthetic and real-life data demonstrate the effectiveness of the proposed method. The NES algorithm achieves perfect precision on the synthetic data and near optimal accuracy on the real-life data. The proposed method is compared against several popular algorithms including mRMR, Boruta, genetic, recursive feature elimination, Lasso, and Elastic Net. The results show that NES significantly outperforms the benchmarks algorithms especially on multi-class datasets. Elsevier 2023-09-09 /pmc/articles/PMC10558945/ /pubmed/37809839 http://dx.doi.org/10.1016/j.heliyon.2023.e19686 Text en © 2023 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Research Article Kamalov, Firuz Sulieman, Hana Moussa, Sherif Reyes, Jorge Avante Safaraliev, Murodbek Nested ensemble selection: An effective hybrid feature selection method |
title | Nested ensemble selection: An effective hybrid feature selection method |
title_full | Nested ensemble selection: An effective hybrid feature selection method |
title_fullStr | Nested ensemble selection: An effective hybrid feature selection method |
title_full_unstemmed | Nested ensemble selection: An effective hybrid feature selection method |
title_short | Nested ensemble selection: An effective hybrid feature selection method |
title_sort | nested ensemble selection: an effective hybrid feature selection method |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10558945/ https://www.ncbi.nlm.nih.gov/pubmed/37809839 http://dx.doi.org/10.1016/j.heliyon.2023.e19686 |
work_keys_str_mv | AT kamalovfiruz nestedensembleselectionaneffectivehybridfeatureselectionmethod AT suliemanhana nestedensembleselectionaneffectivehybridfeatureselectionmethod AT moussasherif nestedensembleselectionaneffectivehybridfeatureselectionmethod AT reyesjorgeavante nestedensembleselectionaneffectivehybridfeatureselectionmethod AT safaralievmurodbek nestedensembleselectionaneffectivehybridfeatureselectionmethod |