Cargando…

Boosting Slime Mould Algorithm for High-Dimensional Gene Data Mining: Diversity Analysis and Feature Selection

Slime mould algorithm (SMA) is a new metaheuristic algorithm, which simulates the behavior and morphology changes of slime mould during foraging. The slime mould algorithm has good performance; however, the basic version of SMA still has some problems. When faced with some complex problems, it may f...

Descripción completa

Detalles Bibliográficos
Autores principales: Qiu, Feng, Guo, Ran, Chen, Huiling, Liang, Guoxi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9584684/
https://www.ncbi.nlm.nih.gov/pubmed/36277020
http://dx.doi.org/10.1155/2022/8011003
_version_ 1784813324529041408
author Qiu, Feng
Guo, Ran
Chen, Huiling
Liang, Guoxi
author_facet Qiu, Feng
Guo, Ran
Chen, Huiling
Liang, Guoxi
author_sort Qiu, Feng
collection PubMed
description Slime mould algorithm (SMA) is a new metaheuristic algorithm, which simulates the behavior and morphology changes of slime mould during foraging. The slime mould algorithm has good performance; however, the basic version of SMA still has some problems. When faced with some complex problems, it may fall into local optimum and cannot find the optimal solution. Aiming at this problem, an improved SMA is proposed to alleviate the disadvantages of SMA. Based on the original SMA, Gaussian mutation and Levy flight are introduced to improve the global search performance of the SMA. Adding Gaussian mutation to SMA can improve the diversity of the population, and Levy flight can alleviate the local optimum of SMA, so that the algorithm can find the optimal solution as soon as possible. In order to verify the effectiveness of the proposed algorithm, a continuous version of the proposed algorithm, GLSMA, is tested on 33 classical continuous optimization problems. Then, on 14 high-dimensional gene datasets, the effectiveness of the proposed discrete version, namely, BGLSMA, is verified by comparing with other feature selection algorithm. Experimental results reveal that the performance of the continuous version of the algorithm is better than the original algorithm, and the defects of the original algorithm are alleviated. Besides, the discrete version of the algorithm has a higher classification accuracy when fewer features are selected. This proves that the improved algorithm has practical value in high-dimensional gene feature selection.
format Online
Article
Text
id pubmed-9584684
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-95846842022-10-21 Boosting Slime Mould Algorithm for High-Dimensional Gene Data Mining: Diversity Analysis and Feature Selection Qiu, Feng Guo, Ran Chen, Huiling Liang, Guoxi Comput Math Methods Med Research Article Slime mould algorithm (SMA) is a new metaheuristic algorithm, which simulates the behavior and morphology changes of slime mould during foraging. The slime mould algorithm has good performance; however, the basic version of SMA still has some problems. When faced with some complex problems, it may fall into local optimum and cannot find the optimal solution. Aiming at this problem, an improved SMA is proposed to alleviate the disadvantages of SMA. Based on the original SMA, Gaussian mutation and Levy flight are introduced to improve the global search performance of the SMA. Adding Gaussian mutation to SMA can improve the diversity of the population, and Levy flight can alleviate the local optimum of SMA, so that the algorithm can find the optimal solution as soon as possible. In order to verify the effectiveness of the proposed algorithm, a continuous version of the proposed algorithm, GLSMA, is tested on 33 classical continuous optimization problems. Then, on 14 high-dimensional gene datasets, the effectiveness of the proposed discrete version, namely, BGLSMA, is verified by comparing with other feature selection algorithm. Experimental results reveal that the performance of the continuous version of the algorithm is better than the original algorithm, and the defects of the original algorithm are alleviated. Besides, the discrete version of the algorithm has a higher classification accuracy when fewer features are selected. This proves that the improved algorithm has practical value in high-dimensional gene feature selection. Hindawi 2022-10-13 /pmc/articles/PMC9584684/ /pubmed/36277020 http://dx.doi.org/10.1155/2022/8011003 Text en Copyright © 2022 Feng Qiu et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Qiu, Feng
Guo, Ran
Chen, Huiling
Liang, Guoxi
Boosting Slime Mould Algorithm for High-Dimensional Gene Data Mining: Diversity Analysis and Feature Selection
title Boosting Slime Mould Algorithm for High-Dimensional Gene Data Mining: Diversity Analysis and Feature Selection
title_full Boosting Slime Mould Algorithm for High-Dimensional Gene Data Mining: Diversity Analysis and Feature Selection
title_fullStr Boosting Slime Mould Algorithm for High-Dimensional Gene Data Mining: Diversity Analysis and Feature Selection
title_full_unstemmed Boosting Slime Mould Algorithm for High-Dimensional Gene Data Mining: Diversity Analysis and Feature Selection
title_short Boosting Slime Mould Algorithm for High-Dimensional Gene Data Mining: Diversity Analysis and Feature Selection
title_sort boosting slime mould algorithm for high-dimensional gene data mining: diversity analysis and feature selection
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9584684/
https://www.ncbi.nlm.nih.gov/pubmed/36277020
http://dx.doi.org/10.1155/2022/8011003
work_keys_str_mv AT qiufeng boostingslimemouldalgorithmforhighdimensionalgenedataminingdiversityanalysisandfeatureselection
AT guoran boostingslimemouldalgorithmforhighdimensionalgenedataminingdiversityanalysisandfeatureselection
AT chenhuiling boostingslimemouldalgorithmforhighdimensionalgenedataminingdiversityanalysisandfeatureselection
AT liangguoxi boostingslimemouldalgorithmforhighdimensionalgenedataminingdiversityanalysisandfeatureselection