Cargando…

Binary dwarf mongoose optimizer for solving high-dimensional feature selection problems

Selecting appropriate feature subsets is a vital task in machine learning. Its main goal is to remove noisy, irrelevant, and redundant feature subsets that could negatively impact the learning model’s accuracy and improve classification performance without information loss. Therefore, more advanced...

Descripción completa

Detalles Bibliográficos
Autores principales: Akinola, Olatunji A., Agushaka, Jeffrey O., Ezugwu, Absalom E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9536540/
https://www.ncbi.nlm.nih.gov/pubmed/36201524
http://dx.doi.org/10.1371/journal.pone.0274850
_version_ 1784803001346555904
author Akinola, Olatunji A.
Agushaka, Jeffrey O.
Ezugwu, Absalom E.
author_facet Akinola, Olatunji A.
Agushaka, Jeffrey O.
Ezugwu, Absalom E.
author_sort Akinola, Olatunji A.
collection PubMed
description Selecting appropriate feature subsets is a vital task in machine learning. Its main goal is to remove noisy, irrelevant, and redundant feature subsets that could negatively impact the learning model’s accuracy and improve classification performance without information loss. Therefore, more advanced optimization methods have been employed to locate the optimal subset of features. This paper presents a binary version of the dwarf mongoose optimization called the BDMO algorithm to solve the high-dimensional feature selection problem. The effectiveness of this approach was validated using 18 high-dimensional datasets from the Arizona State University feature selection repository and compared the efficacy of the BDMO with other well-known feature selection techniques in the literature. The results show that the BDMO outperforms other methods producing the least average fitness value in 14 out of 18 datasets which means that it achieved 77.77% on the overall best fitness values. The result also shows BDMO demonstrating stability by returning the least standard deviation (SD) value in 13 of 18 datasets (72.22%). Furthermore, the study achieved higher validation accuracy in 15 of the 18 datasets (83.33%) over other methods. The proposed approach also yielded the highest validation accuracy attainable in the COIL20 and Leukemia datasets which vividly portray the superiority of the BDMO.
format Online
Article
Text
id pubmed-9536540
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-95365402022-10-07 Binary dwarf mongoose optimizer for solving high-dimensional feature selection problems Akinola, Olatunji A. Agushaka, Jeffrey O. Ezugwu, Absalom E. PLoS One Research Article Selecting appropriate feature subsets is a vital task in machine learning. Its main goal is to remove noisy, irrelevant, and redundant feature subsets that could negatively impact the learning model’s accuracy and improve classification performance without information loss. Therefore, more advanced optimization methods have been employed to locate the optimal subset of features. This paper presents a binary version of the dwarf mongoose optimization called the BDMO algorithm to solve the high-dimensional feature selection problem. The effectiveness of this approach was validated using 18 high-dimensional datasets from the Arizona State University feature selection repository and compared the efficacy of the BDMO with other well-known feature selection techniques in the literature. The results show that the BDMO outperforms other methods producing the least average fitness value in 14 out of 18 datasets which means that it achieved 77.77% on the overall best fitness values. The result also shows BDMO demonstrating stability by returning the least standard deviation (SD) value in 13 of 18 datasets (72.22%). Furthermore, the study achieved higher validation accuracy in 15 of the 18 datasets (83.33%) over other methods. The proposed approach also yielded the highest validation accuracy attainable in the COIL20 and Leukemia datasets which vividly portray the superiority of the BDMO. Public Library of Science 2022-10-06 /pmc/articles/PMC9536540/ /pubmed/36201524 http://dx.doi.org/10.1371/journal.pone.0274850 Text en © 2022 Akinola et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Akinola, Olatunji A.
Agushaka, Jeffrey O.
Ezugwu, Absalom E.
Binary dwarf mongoose optimizer for solving high-dimensional feature selection problems
title Binary dwarf mongoose optimizer for solving high-dimensional feature selection problems
title_full Binary dwarf mongoose optimizer for solving high-dimensional feature selection problems
title_fullStr Binary dwarf mongoose optimizer for solving high-dimensional feature selection problems
title_full_unstemmed Binary dwarf mongoose optimizer for solving high-dimensional feature selection problems
title_short Binary dwarf mongoose optimizer for solving high-dimensional feature selection problems
title_sort binary dwarf mongoose optimizer for solving high-dimensional feature selection problems
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9536540/
https://www.ncbi.nlm.nih.gov/pubmed/36201524
http://dx.doi.org/10.1371/journal.pone.0274850
work_keys_str_mv AT akinolaolatunjia binarydwarfmongooseoptimizerforsolvinghighdimensionalfeatureselectionproblems
AT agushakajeffreyo binarydwarfmongooseoptimizerforsolvinghighdimensionalfeatureselectionproblems
AT ezugwuabsalome binarydwarfmongooseoptimizerforsolvinghighdimensionalfeatureselectionproblems