Cargando…

Handling class imbalance in COVID-19 chest X-ray images classification: Using SMOTE and weighted loss

Healthcare systems worldwide have been struggling since the beginning of the COVID-19 pandemic. The early diagnosis of this unprecedented infection has become their ultimate objective. Detecting positive patients from chest X-ray images is a quick and efficient solution for overloaded hospitals. Man...

Descripción completa

Detalles Bibliográficos
Autores principales: Chamseddine, Ekram, Mansouri, Nesrine, Soui, Makram, Abed, Mourad
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier B.V. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9422401/
https://www.ncbi.nlm.nih.gov/pubmed/36061418
http://dx.doi.org/10.1016/j.asoc.2022.109588
_version_ 1784777803611242496
author Chamseddine, Ekram
Mansouri, Nesrine
Soui, Makram
Abed, Mourad
author_facet Chamseddine, Ekram
Mansouri, Nesrine
Soui, Makram
Abed, Mourad
author_sort Chamseddine, Ekram
collection PubMed
description Healthcare systems worldwide have been struggling since the beginning of the COVID-19 pandemic. The early diagnosis of this unprecedented infection has become their ultimate objective. Detecting positive patients from chest X-ray images is a quick and efficient solution for overloaded hospitals. Many studies based on deep learning (DL) techniques have shown high performance in classifying COVID-19 chest X-ray images. However, most of these studies suffer from a class imbalance problem mainly due to the limited number of COVID-19 samples. Such a problem may significantly reduce the efficiency of DL classifiers. In this work, we aim to build an accurate model that assists clinicians in the early diagnosis of COVID-19 using balanced data. To this end, we trained six state-of-the-art convolutional neural networks (CNNs) via transfer learning (TL) on three different COVID-19 datasets. The models were developed to perform a multi-classification task that distinguishes between COVID-19, normal, and viral pneumonia cases. To address the class imbalance issue, we first investigated the Weighted Categorical Loss (WCL) and then the Synthetic Minority Oversampling Technique (SMOTE) on each dataset separately. After a comparative study of the obtained results, we selected the model that achieved high classification results in terms of accuracy, sensitivity, specificity, precision, F1 score, and AUC compared to other recent works. DenseNet201 and VGG-19 claimed the best scores. With an accuracy of 98.87%, an F1_Score of 98.21%, a sensitivity of 98.86%, a specificity of 99.43%, a precision of 100%, and an AUC of 99.15%, the WCL combined with CheXNet outperformed the other examined models.
format Online
Article
Text
id pubmed-9422401
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier B.V.
record_format MEDLINE/PubMed
spelling pubmed-94224012022-08-30 Handling class imbalance in COVID-19 chest X-ray images classification: Using SMOTE and weighted loss Chamseddine, Ekram Mansouri, Nesrine Soui, Makram Abed, Mourad Appl Soft Comput Article Healthcare systems worldwide have been struggling since the beginning of the COVID-19 pandemic. The early diagnosis of this unprecedented infection has become their ultimate objective. Detecting positive patients from chest X-ray images is a quick and efficient solution for overloaded hospitals. Many studies based on deep learning (DL) techniques have shown high performance in classifying COVID-19 chest X-ray images. However, most of these studies suffer from a class imbalance problem mainly due to the limited number of COVID-19 samples. Such a problem may significantly reduce the efficiency of DL classifiers. In this work, we aim to build an accurate model that assists clinicians in the early diagnosis of COVID-19 using balanced data. To this end, we trained six state-of-the-art convolutional neural networks (CNNs) via transfer learning (TL) on three different COVID-19 datasets. The models were developed to perform a multi-classification task that distinguishes between COVID-19, normal, and viral pneumonia cases. To address the class imbalance issue, we first investigated the Weighted Categorical Loss (WCL) and then the Synthetic Minority Oversampling Technique (SMOTE) on each dataset separately. After a comparative study of the obtained results, we selected the model that achieved high classification results in terms of accuracy, sensitivity, specificity, precision, F1 score, and AUC compared to other recent works. DenseNet201 and VGG-19 claimed the best scores. With an accuracy of 98.87%, an F1_Score of 98.21%, a sensitivity of 98.86%, a specificity of 99.43%, a precision of 100%, and an AUC of 99.15%, the WCL combined with CheXNet outperformed the other examined models. Elsevier B.V. 2022-11 2022-08-29 /pmc/articles/PMC9422401/ /pubmed/36061418 http://dx.doi.org/10.1016/j.asoc.2022.109588 Text en © 2022 Elsevier B.V. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Chamseddine, Ekram
Mansouri, Nesrine
Soui, Makram
Abed, Mourad
Handling class imbalance in COVID-19 chest X-ray images classification: Using SMOTE and weighted loss
title Handling class imbalance in COVID-19 chest X-ray images classification: Using SMOTE and weighted loss
title_full Handling class imbalance in COVID-19 chest X-ray images classification: Using SMOTE and weighted loss
title_fullStr Handling class imbalance in COVID-19 chest X-ray images classification: Using SMOTE and weighted loss
title_full_unstemmed Handling class imbalance in COVID-19 chest X-ray images classification: Using SMOTE and weighted loss
title_short Handling class imbalance in COVID-19 chest X-ray images classification: Using SMOTE and weighted loss
title_sort handling class imbalance in covid-19 chest x-ray images classification: using smote and weighted loss
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9422401/
https://www.ncbi.nlm.nih.gov/pubmed/36061418
http://dx.doi.org/10.1016/j.asoc.2022.109588
work_keys_str_mv AT chamseddineekram handlingclassimbalanceincovid19chestxrayimagesclassificationusingsmoteandweightedloss
AT mansourinesrine handlingclassimbalanceincovid19chestxrayimagesclassificationusingsmoteandweightedloss
AT souimakram handlingclassimbalanceincovid19chestxrayimagesclassificationusingsmoteandweightedloss
AT abedmourad handlingclassimbalanceincovid19chestxrayimagesclassificationusingsmoteandweightedloss