Cargando…

A Novel Method Based on Nonlinear Binary Grasshopper Whale Optimization Algorithm for Feature Selection

Feature Selection (FS) is considered as an important preprocessing step in data mining and is used to remove redundant or unrelated features from high-dimensional data. Most optimization algorithms for FS problems are not balanced in search. A hybrid algorithm called nonlinear binary grasshopper wha...

Descripción completa

Detalles Bibliográficos
Autores principales: Fang, Lingling, Liang, Xiyue
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Nature Singapore 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9449924/
https://www.ncbi.nlm.nih.gov/pubmed/36089930
http://dx.doi.org/10.1007/s42235-022-00253-6
_version_ 1784784408599855104
author Fang, Lingling
Liang, Xiyue
author_facet Fang, Lingling
Liang, Xiyue
author_sort Fang, Lingling
collection PubMed
description Feature Selection (FS) is considered as an important preprocessing step in data mining and is used to remove redundant or unrelated features from high-dimensional data. Most optimization algorithms for FS problems are not balanced in search. A hybrid algorithm called nonlinear binary grasshopper whale optimization algorithm (NL-BGWOA) is proposed to solve the problem in this paper. In the proposed method, a new position updating strategy combining the position changes of whales and grasshoppers population is expressed, which optimizes the diversity of searching in the target domain. Ten distinct high-dimensional UCI datasets, the multi-modal Parkinson's speech datasets, and the COVID-19 symptom dataset are used to validate the proposed method. It has been demonstrated that the proposed NL-BGWOA performs well across most of high-dimensional datasets, which shows a high accuracy rate of up to 0.9895. Furthermore, the experimental results on the medical datasets also demonstrate the advantages of the proposed method in actual FS problem, including accuracy, size of feature subsets, and fitness with best values of 0.913, 5.7, and 0.0873, respectively. The results reveal that the proposed NL-BGWOA has comprehensive superiority in solving the FS problem of high-dimensional data.
format Online
Article
Text
id pubmed-9449924
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer Nature Singapore
record_format MEDLINE/PubMed
spelling pubmed-94499242022-09-07 A Novel Method Based on Nonlinear Binary Grasshopper Whale Optimization Algorithm for Feature Selection Fang, Lingling Liang, Xiyue J Bionic Eng Research Article Feature Selection (FS) is considered as an important preprocessing step in data mining and is used to remove redundant or unrelated features from high-dimensional data. Most optimization algorithms for FS problems are not balanced in search. A hybrid algorithm called nonlinear binary grasshopper whale optimization algorithm (NL-BGWOA) is proposed to solve the problem in this paper. In the proposed method, a new position updating strategy combining the position changes of whales and grasshoppers population is expressed, which optimizes the diversity of searching in the target domain. Ten distinct high-dimensional UCI datasets, the multi-modal Parkinson's speech datasets, and the COVID-19 symptom dataset are used to validate the proposed method. It has been demonstrated that the proposed NL-BGWOA performs well across most of high-dimensional datasets, which shows a high accuracy rate of up to 0.9895. Furthermore, the experimental results on the medical datasets also demonstrate the advantages of the proposed method in actual FS problem, including accuracy, size of feature subsets, and fitness with best values of 0.913, 5.7, and 0.0873, respectively. The results reveal that the proposed NL-BGWOA has comprehensive superiority in solving the FS problem of high-dimensional data. Springer Nature Singapore 2022-09-07 2023 /pmc/articles/PMC9449924/ /pubmed/36089930 http://dx.doi.org/10.1007/s42235-022-00253-6 Text en © Jilin University 2022, Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Research Article
Fang, Lingling
Liang, Xiyue
A Novel Method Based on Nonlinear Binary Grasshopper Whale Optimization Algorithm for Feature Selection
title A Novel Method Based on Nonlinear Binary Grasshopper Whale Optimization Algorithm for Feature Selection
title_full A Novel Method Based on Nonlinear Binary Grasshopper Whale Optimization Algorithm for Feature Selection
title_fullStr A Novel Method Based on Nonlinear Binary Grasshopper Whale Optimization Algorithm for Feature Selection
title_full_unstemmed A Novel Method Based on Nonlinear Binary Grasshopper Whale Optimization Algorithm for Feature Selection
title_short A Novel Method Based on Nonlinear Binary Grasshopper Whale Optimization Algorithm for Feature Selection
title_sort novel method based on nonlinear binary grasshopper whale optimization algorithm for feature selection
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9449924/
https://www.ncbi.nlm.nih.gov/pubmed/36089930
http://dx.doi.org/10.1007/s42235-022-00253-6
work_keys_str_mv AT fanglingling anovelmethodbasedonnonlinearbinarygrasshopperwhaleoptimizationalgorithmforfeatureselection
AT liangxiyue anovelmethodbasedonnonlinearbinarygrasshopperwhaleoptimizationalgorithmforfeatureselection
AT fanglingling novelmethodbasedonnonlinearbinarygrasshopperwhaleoptimizationalgorithmforfeatureselection
AT liangxiyue novelmethodbasedonnonlinearbinarygrasshopperwhaleoptimizationalgorithmforfeatureselection