Cargando…
Binary dwarf mongoose optimizer for solving high-dimensional feature selection problems
Selecting appropriate feature subsets is a vital task in machine learning. Its main goal is to remove noisy, irrelevant, and redundant feature subsets that could negatively impact the learning model’s accuracy and improve classification performance without information loss. Therefore, more advanced...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9536540/ https://www.ncbi.nlm.nih.gov/pubmed/36201524 http://dx.doi.org/10.1371/journal.pone.0274850 |
_version_ | 1784803001346555904 |
---|---|
author | Akinola, Olatunji A. Agushaka, Jeffrey O. Ezugwu, Absalom E. |
author_facet | Akinola, Olatunji A. Agushaka, Jeffrey O. Ezugwu, Absalom E. |
author_sort | Akinola, Olatunji A. |
collection | PubMed |
description | Selecting appropriate feature subsets is a vital task in machine learning. Its main goal is to remove noisy, irrelevant, and redundant feature subsets that could negatively impact the learning model’s accuracy and improve classification performance without information loss. Therefore, more advanced optimization methods have been employed to locate the optimal subset of features. This paper presents a binary version of the dwarf mongoose optimization called the BDMO algorithm to solve the high-dimensional feature selection problem. The effectiveness of this approach was validated using 18 high-dimensional datasets from the Arizona State University feature selection repository and compared the efficacy of the BDMO with other well-known feature selection techniques in the literature. The results show that the BDMO outperforms other methods producing the least average fitness value in 14 out of 18 datasets which means that it achieved 77.77% on the overall best fitness values. The result also shows BDMO demonstrating stability by returning the least standard deviation (SD) value in 13 of 18 datasets (72.22%). Furthermore, the study achieved higher validation accuracy in 15 of the 18 datasets (83.33%) over other methods. The proposed approach also yielded the highest validation accuracy attainable in the COIL20 and Leukemia datasets which vividly portray the superiority of the BDMO. |
format | Online Article Text |
id | pubmed-9536540 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-95365402022-10-07 Binary dwarf mongoose optimizer for solving high-dimensional feature selection problems Akinola, Olatunji A. Agushaka, Jeffrey O. Ezugwu, Absalom E. PLoS One Research Article Selecting appropriate feature subsets is a vital task in machine learning. Its main goal is to remove noisy, irrelevant, and redundant feature subsets that could negatively impact the learning model’s accuracy and improve classification performance without information loss. Therefore, more advanced optimization methods have been employed to locate the optimal subset of features. This paper presents a binary version of the dwarf mongoose optimization called the BDMO algorithm to solve the high-dimensional feature selection problem. The effectiveness of this approach was validated using 18 high-dimensional datasets from the Arizona State University feature selection repository and compared the efficacy of the BDMO with other well-known feature selection techniques in the literature. The results show that the BDMO outperforms other methods producing the least average fitness value in 14 out of 18 datasets which means that it achieved 77.77% on the overall best fitness values. The result also shows BDMO demonstrating stability by returning the least standard deviation (SD) value in 13 of 18 datasets (72.22%). Furthermore, the study achieved higher validation accuracy in 15 of the 18 datasets (83.33%) over other methods. The proposed approach also yielded the highest validation accuracy attainable in the COIL20 and Leukemia datasets which vividly portray the superiority of the BDMO. Public Library of Science 2022-10-06 /pmc/articles/PMC9536540/ /pubmed/36201524 http://dx.doi.org/10.1371/journal.pone.0274850 Text en © 2022 Akinola et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Akinola, Olatunji A. Agushaka, Jeffrey O. Ezugwu, Absalom E. Binary dwarf mongoose optimizer for solving high-dimensional feature selection problems |
title | Binary dwarf mongoose optimizer for solving high-dimensional feature selection problems |
title_full | Binary dwarf mongoose optimizer for solving high-dimensional feature selection problems |
title_fullStr | Binary dwarf mongoose optimizer for solving high-dimensional feature selection problems |
title_full_unstemmed | Binary dwarf mongoose optimizer for solving high-dimensional feature selection problems |
title_short | Binary dwarf mongoose optimizer for solving high-dimensional feature selection problems |
title_sort | binary dwarf mongoose optimizer for solving high-dimensional feature selection problems |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9536540/ https://www.ncbi.nlm.nih.gov/pubmed/36201524 http://dx.doi.org/10.1371/journal.pone.0274850 |
work_keys_str_mv | AT akinolaolatunjia binarydwarfmongooseoptimizerforsolvinghighdimensionalfeatureselectionproblems AT agushakajeffreyo binarydwarfmongooseoptimizerforsolvinghighdimensionalfeatureselectionproblems AT ezugwuabsalome binarydwarfmongooseoptimizerforsolvinghighdimensionalfeatureselectionproblems |