Cargando…

Coronavirus herd immunity optimizer with greedy crossover for feature selection in medical diagnosis

The importance of medical data and the crucial nature of the decisions that are based on such data, as well as the large increase in its volume, has encouraged researchers to develop feature selection (FS)-based approaches to identify the most relevant data for specific medical problems In this pape...

Descripción completa

Detalles Bibliográficos
Autores principales: Alweshah, Mohammed, Alkhalaileh, Saleh, Al-Betar, Mohammed Azmi, Bakar, Azuraliza Abu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier B.V. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8553647/
https://www.ncbi.nlm.nih.gov/pubmed/34728909
http://dx.doi.org/10.1016/j.knosys.2021.107629
_version_ 1784591622809321472
author Alweshah, Mohammed
Alkhalaileh, Saleh
Al-Betar, Mohammed Azmi
Bakar, Azuraliza Abu
author_facet Alweshah, Mohammed
Alkhalaileh, Saleh
Al-Betar, Mohammed Azmi
Bakar, Azuraliza Abu
author_sort Alweshah, Mohammed
collection PubMed
description The importance of medical data and the crucial nature of the decisions that are based on such data, as well as the large increase in its volume, has encouraged researchers to develop feature selection (FS)-based approaches to identify the most relevant data for specific medical problems In this paper, two intelligent wrapper FS approaches based on a new metaheuristic algorithm named the coronavirus herd immunity optimizer (CHIO) were applied with and without the incorporation of a greedy crossover (GC) operator strategy to enhance exploration of the search space by CHIO. The two proposed approaches, CHIO and CHIO-GC, were evaluated using 23 medical benchmark datasets and a real-world COVID-19 dataset. The experimental results indicated that CHIO-GC outperformed CHIO in terms of search capability, as reflected in classification accuracy, selection size, F-measure, standard deviation and convergence speed. The GC operator was able to enhance the balance between exploration and exploitation of the CHIO in the search and correct suboptimal solutions for faster convergence. The proposed CHIO-GC was also compared with two previous wrapper FS approaches, namely, binary moth flame optimization with Lévy flight (LBMFO_V3) and the hyper learning binary dragonfly algorithm (HLBDA), as well as four filter methods namely, Chi-square, Relief, correlation-based feature selection and information gain. CHIO-GC surpassed LBMFO_V3 and the four filter methods with an accuracy rate of 0.79 on 23 medical benchmark datasets. CHIO-GC also surpassed HLBDA with an accuracy rate of 0.93 when applied to the COVID-19 dataset. These encouraging results were obtained by striking a sufficient balance between the two search phases of CHIO-GC during the hunt for correct solutions, which also increased the convergence rate. This was accomplished by integrating a greedy crossover technique into the CHIO algorithm to remedy the inferior solutions found during premature convergence and while locked into a local optimum search space.
format Online
Article
Text
id pubmed-8553647
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier B.V.
record_format MEDLINE/PubMed
spelling pubmed-85536472021-10-29 Coronavirus herd immunity optimizer with greedy crossover for feature selection in medical diagnosis Alweshah, Mohammed Alkhalaileh, Saleh Al-Betar, Mohammed Azmi Bakar, Azuraliza Abu Knowl Based Syst Article The importance of medical data and the crucial nature of the decisions that are based on such data, as well as the large increase in its volume, has encouraged researchers to develop feature selection (FS)-based approaches to identify the most relevant data for specific medical problems In this paper, two intelligent wrapper FS approaches based on a new metaheuristic algorithm named the coronavirus herd immunity optimizer (CHIO) were applied with and without the incorporation of a greedy crossover (GC) operator strategy to enhance exploration of the search space by CHIO. The two proposed approaches, CHIO and CHIO-GC, were evaluated using 23 medical benchmark datasets and a real-world COVID-19 dataset. The experimental results indicated that CHIO-GC outperformed CHIO in terms of search capability, as reflected in classification accuracy, selection size, F-measure, standard deviation and convergence speed. The GC operator was able to enhance the balance between exploration and exploitation of the CHIO in the search and correct suboptimal solutions for faster convergence. The proposed CHIO-GC was also compared with two previous wrapper FS approaches, namely, binary moth flame optimization with Lévy flight (LBMFO_V3) and the hyper learning binary dragonfly algorithm (HLBDA), as well as four filter methods namely, Chi-square, Relief, correlation-based feature selection and information gain. CHIO-GC surpassed LBMFO_V3 and the four filter methods with an accuracy rate of 0.79 on 23 medical benchmark datasets. CHIO-GC also surpassed HLBDA with an accuracy rate of 0.93 when applied to the COVID-19 dataset. These encouraging results were obtained by striking a sufficient balance between the two search phases of CHIO-GC during the hunt for correct solutions, which also increased the convergence rate. This was accomplished by integrating a greedy crossover technique into the CHIO algorithm to remedy the inferior solutions found during premature convergence and while locked into a local optimum search space. Elsevier B.V. 2022-01-10 2021-10-29 /pmc/articles/PMC8553647/ /pubmed/34728909 http://dx.doi.org/10.1016/j.knosys.2021.107629 Text en © 2021 Elsevier B.V. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Alweshah, Mohammed
Alkhalaileh, Saleh
Al-Betar, Mohammed Azmi
Bakar, Azuraliza Abu
Coronavirus herd immunity optimizer with greedy crossover for feature selection in medical diagnosis
title Coronavirus herd immunity optimizer with greedy crossover for feature selection in medical diagnosis
title_full Coronavirus herd immunity optimizer with greedy crossover for feature selection in medical diagnosis
title_fullStr Coronavirus herd immunity optimizer with greedy crossover for feature selection in medical diagnosis
title_full_unstemmed Coronavirus herd immunity optimizer with greedy crossover for feature selection in medical diagnosis
title_short Coronavirus herd immunity optimizer with greedy crossover for feature selection in medical diagnosis
title_sort coronavirus herd immunity optimizer with greedy crossover for feature selection in medical diagnosis
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8553647/
https://www.ncbi.nlm.nih.gov/pubmed/34728909
http://dx.doi.org/10.1016/j.knosys.2021.107629
work_keys_str_mv AT alweshahmohammed coronavirusherdimmunityoptimizerwithgreedycrossoverforfeatureselectioninmedicaldiagnosis
AT alkhalailehsaleh coronavirusherdimmunityoptimizerwithgreedycrossoverforfeatureselectioninmedicaldiagnosis
AT albetarmohammedazmi coronavirusherdimmunityoptimizerwithgreedycrossoverforfeatureselectioninmedicaldiagnosis
AT bakarazuralizaabu coronavirusherdimmunityoptimizerwithgreedycrossoverforfeatureselectioninmedicaldiagnosis