Cargando…

A novel firefly algorithm approach for efficient feature selection with COVID-19 dataset

Feature selection is one of the most important challenges in machine learning and data science. This process is usually performed in the data preprocessing phase, where the data is transformed to a proper format for further operations by machine learning algorithm. Many real-world datasets are highl...

Descripción completa

Detalles Bibliográficos
Autores principales: Bacanin, Nebojsa, Venkatachalam, K., Bezdan, Timea, Zivkovic, Miodrag, Abouhawwash, Mohamed
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier B.V. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9901218/
https://www.ncbi.nlm.nih.gov/pubmed/36785847
http://dx.doi.org/10.1016/j.micpro.2023.104778
_version_ 1784882993157898240
author Bacanin, Nebojsa
Venkatachalam, K.
Bezdan, Timea
Zivkovic, Miodrag
Abouhawwash, Mohamed
author_facet Bacanin, Nebojsa
Venkatachalam, K.
Bezdan, Timea
Zivkovic, Miodrag
Abouhawwash, Mohamed
author_sort Bacanin, Nebojsa
collection PubMed
description Feature selection is one of the most important challenges in machine learning and data science. This process is usually performed in the data preprocessing phase, where the data is transformed to a proper format for further operations by machine learning algorithm. Many real-world datasets are highly dimensional with many irrelevant, even redundant features. These kinds of features do not improve classification accuracy and can even shrink down performance of a classifier. The goal of feature selection is to find optimal (or sub-optimal) subset of features that contain relevant information about the dataset from which machine learning algorithms can derive useful conclusions. In this manuscript, a novel version of firefly algorithm (FA) is proposed and adapted for feature selection challenge. Proposed method significantly improves performance of the basic FA, and also outperforms other state-of-the-art metaheuristics for both, benchmark bound-constrained and practical feature selection tasks. Method was first validated on standard unconstrained benchmarks and later it was applied for feature selection by using 21 standard University of California, Irvine (UCL) datasets. Moreover, presented approach was also tested for relatively novel COVID-19 dataset for predicting patients health, and one microcontroller microarray dataset. Results obtained in all practical simulations attest robustness and efficiency of proposed algorithm in terms of convergence, solutions’ quality and classification accuracy. More precisely, the proposed approach obtained the best classification accuracy on 13 out of 21 total datasets, significantly outperforming other competitor methods.
format Online
Article
Text
id pubmed-9901218
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier B.V.
record_format MEDLINE/PubMed
spelling pubmed-99012182023-02-07 A novel firefly algorithm approach for efficient feature selection with COVID-19 dataset Bacanin, Nebojsa Venkatachalam, K. Bezdan, Timea Zivkovic, Miodrag Abouhawwash, Mohamed Microprocess Microsyst Article Feature selection is one of the most important challenges in machine learning and data science. This process is usually performed in the data preprocessing phase, where the data is transformed to a proper format for further operations by machine learning algorithm. Many real-world datasets are highly dimensional with many irrelevant, even redundant features. These kinds of features do not improve classification accuracy and can even shrink down performance of a classifier. The goal of feature selection is to find optimal (or sub-optimal) subset of features that contain relevant information about the dataset from which machine learning algorithms can derive useful conclusions. In this manuscript, a novel version of firefly algorithm (FA) is proposed and adapted for feature selection challenge. Proposed method significantly improves performance of the basic FA, and also outperforms other state-of-the-art metaheuristics for both, benchmark bound-constrained and practical feature selection tasks. Method was first validated on standard unconstrained benchmarks and later it was applied for feature selection by using 21 standard University of California, Irvine (UCL) datasets. Moreover, presented approach was also tested for relatively novel COVID-19 dataset for predicting patients health, and one microcontroller microarray dataset. Results obtained in all practical simulations attest robustness and efficiency of proposed algorithm in terms of convergence, solutions’ quality and classification accuracy. More precisely, the proposed approach obtained the best classification accuracy on 13 out of 21 total datasets, significantly outperforming other competitor methods. Elsevier B.V. 2023-04 2023-02-06 /pmc/articles/PMC9901218/ /pubmed/36785847 http://dx.doi.org/10.1016/j.micpro.2023.104778 Text en © 2023 Elsevier B.V. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Bacanin, Nebojsa
Venkatachalam, K.
Bezdan, Timea
Zivkovic, Miodrag
Abouhawwash, Mohamed
A novel firefly algorithm approach for efficient feature selection with COVID-19 dataset
title A novel firefly algorithm approach for efficient feature selection with COVID-19 dataset
title_full A novel firefly algorithm approach for efficient feature selection with COVID-19 dataset
title_fullStr A novel firefly algorithm approach for efficient feature selection with COVID-19 dataset
title_full_unstemmed A novel firefly algorithm approach for efficient feature selection with COVID-19 dataset
title_short A novel firefly algorithm approach for efficient feature selection with COVID-19 dataset
title_sort novel firefly algorithm approach for efficient feature selection with covid-19 dataset
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9901218/
https://www.ncbi.nlm.nih.gov/pubmed/36785847
http://dx.doi.org/10.1016/j.micpro.2023.104778
work_keys_str_mv AT bacaninnebojsa anovelfireflyalgorithmapproachforefficientfeatureselectionwithcovid19dataset
AT venkatachalamk anovelfireflyalgorithmapproachforefficientfeatureselectionwithcovid19dataset
AT bezdantimea anovelfireflyalgorithmapproachforefficientfeatureselectionwithcovid19dataset
AT zivkovicmiodrag anovelfireflyalgorithmapproachforefficientfeatureselectionwithcovid19dataset
AT abouhawwashmohamed anovelfireflyalgorithmapproachforefficientfeatureselectionwithcovid19dataset
AT bacaninnebojsa novelfireflyalgorithmapproachforefficientfeatureselectionwithcovid19dataset
AT venkatachalamk novelfireflyalgorithmapproachforefficientfeatureselectionwithcovid19dataset
AT bezdantimea novelfireflyalgorithmapproachforefficientfeatureselectionwithcovid19dataset
AT zivkovicmiodrag novelfireflyalgorithmapproachforefficientfeatureselectionwithcovid19dataset
AT abouhawwashmohamed novelfireflyalgorithmapproachforefficientfeatureselectionwithcovid19dataset