Cargando…
Coronavirus herd immunity optimizer with greedy crossover for feature selection in medical diagnosis
The importance of medical data and the crucial nature of the decisions that are based on such data, as well as the large increase in its volume, has encouraged researchers to develop feature selection (FS)-based approaches to identify the most relevant data for specific medical problems In this pape...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier B.V.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8553647/ https://www.ncbi.nlm.nih.gov/pubmed/34728909 http://dx.doi.org/10.1016/j.knosys.2021.107629 |
_version_ | 1784591622809321472 |
---|---|
author | Alweshah, Mohammed Alkhalaileh, Saleh Al-Betar, Mohammed Azmi Bakar, Azuraliza Abu |
author_facet | Alweshah, Mohammed Alkhalaileh, Saleh Al-Betar, Mohammed Azmi Bakar, Azuraliza Abu |
author_sort | Alweshah, Mohammed |
collection | PubMed |
description | The importance of medical data and the crucial nature of the decisions that are based on such data, as well as the large increase in its volume, has encouraged researchers to develop feature selection (FS)-based approaches to identify the most relevant data for specific medical problems In this paper, two intelligent wrapper FS approaches based on a new metaheuristic algorithm named the coronavirus herd immunity optimizer (CHIO) were applied with and without the incorporation of a greedy crossover (GC) operator strategy to enhance exploration of the search space by CHIO. The two proposed approaches, CHIO and CHIO-GC, were evaluated using 23 medical benchmark datasets and a real-world COVID-19 dataset. The experimental results indicated that CHIO-GC outperformed CHIO in terms of search capability, as reflected in classification accuracy, selection size, F-measure, standard deviation and convergence speed. The GC operator was able to enhance the balance between exploration and exploitation of the CHIO in the search and correct suboptimal solutions for faster convergence. The proposed CHIO-GC was also compared with two previous wrapper FS approaches, namely, binary moth flame optimization with Lévy flight (LBMFO_V3) and the hyper learning binary dragonfly algorithm (HLBDA), as well as four filter methods namely, Chi-square, Relief, correlation-based feature selection and information gain. CHIO-GC surpassed LBMFO_V3 and the four filter methods with an accuracy rate of 0.79 on 23 medical benchmark datasets. CHIO-GC also surpassed HLBDA with an accuracy rate of 0.93 when applied to the COVID-19 dataset. These encouraging results were obtained by striking a sufficient balance between the two search phases of CHIO-GC during the hunt for correct solutions, which also increased the convergence rate. This was accomplished by integrating a greedy crossover technique into the CHIO algorithm to remedy the inferior solutions found during premature convergence and while locked into a local optimum search space. |
format | Online Article Text |
id | pubmed-8553647 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Elsevier B.V. |
record_format | MEDLINE/PubMed |
spelling | pubmed-85536472021-10-29 Coronavirus herd immunity optimizer with greedy crossover for feature selection in medical diagnosis Alweshah, Mohammed Alkhalaileh, Saleh Al-Betar, Mohammed Azmi Bakar, Azuraliza Abu Knowl Based Syst Article The importance of medical data and the crucial nature of the decisions that are based on such data, as well as the large increase in its volume, has encouraged researchers to develop feature selection (FS)-based approaches to identify the most relevant data for specific medical problems In this paper, two intelligent wrapper FS approaches based on a new metaheuristic algorithm named the coronavirus herd immunity optimizer (CHIO) were applied with and without the incorporation of a greedy crossover (GC) operator strategy to enhance exploration of the search space by CHIO. The two proposed approaches, CHIO and CHIO-GC, were evaluated using 23 medical benchmark datasets and a real-world COVID-19 dataset. The experimental results indicated that CHIO-GC outperformed CHIO in terms of search capability, as reflected in classification accuracy, selection size, F-measure, standard deviation and convergence speed. The GC operator was able to enhance the balance between exploration and exploitation of the CHIO in the search and correct suboptimal solutions for faster convergence. The proposed CHIO-GC was also compared with two previous wrapper FS approaches, namely, binary moth flame optimization with Lévy flight (LBMFO_V3) and the hyper learning binary dragonfly algorithm (HLBDA), as well as four filter methods namely, Chi-square, Relief, correlation-based feature selection and information gain. CHIO-GC surpassed LBMFO_V3 and the four filter methods with an accuracy rate of 0.79 on 23 medical benchmark datasets. CHIO-GC also surpassed HLBDA with an accuracy rate of 0.93 when applied to the COVID-19 dataset. These encouraging results were obtained by striking a sufficient balance between the two search phases of CHIO-GC during the hunt for correct solutions, which also increased the convergence rate. This was accomplished by integrating a greedy crossover technique into the CHIO algorithm to remedy the inferior solutions found during premature convergence and while locked into a local optimum search space. Elsevier B.V. 2022-01-10 2021-10-29 /pmc/articles/PMC8553647/ /pubmed/34728909 http://dx.doi.org/10.1016/j.knosys.2021.107629 Text en © 2021 Elsevier B.V. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article Alweshah, Mohammed Alkhalaileh, Saleh Al-Betar, Mohammed Azmi Bakar, Azuraliza Abu Coronavirus herd immunity optimizer with greedy crossover for feature selection in medical diagnosis |
title | Coronavirus herd immunity optimizer with greedy crossover for feature selection in medical diagnosis |
title_full | Coronavirus herd immunity optimizer with greedy crossover for feature selection in medical diagnosis |
title_fullStr | Coronavirus herd immunity optimizer with greedy crossover for feature selection in medical diagnosis |
title_full_unstemmed | Coronavirus herd immunity optimizer with greedy crossover for feature selection in medical diagnosis |
title_short | Coronavirus herd immunity optimizer with greedy crossover for feature selection in medical diagnosis |
title_sort | coronavirus herd immunity optimizer with greedy crossover for feature selection in medical diagnosis |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8553647/ https://www.ncbi.nlm.nih.gov/pubmed/34728909 http://dx.doi.org/10.1016/j.knosys.2021.107629 |
work_keys_str_mv | AT alweshahmohammed coronavirusherdimmunityoptimizerwithgreedycrossoverforfeatureselectioninmedicaldiagnosis AT alkhalailehsaleh coronavirusherdimmunityoptimizerwithgreedycrossoverforfeatureselectioninmedicaldiagnosis AT albetarmohammedazmi coronavirusherdimmunityoptimizerwithgreedycrossoverforfeatureselectioninmedicaldiagnosis AT bakarazuralizaabu coronavirusherdimmunityoptimizerwithgreedycrossoverforfeatureselectioninmedicaldiagnosis |