Cargando…

COVID-19 chest X-ray image classification in the presence of noisy labels()

The Corona Virus Disease 2019 (COVID-19) has been declared a worldwide pandemic, and a key method for diagnosing COVID-19 is chest X-ray imaging. The application of convolutional neural network with medical imaging helps to diagnose the disease accurately, where the label quality plays an important...

Descripción completa

Detalles Bibliográficos
Autores principales: Ying, Xiaoqing, Liu, Hao, Huang, Rong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier B.V. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9826538/
https://www.ncbi.nlm.nih.gov/pubmed/36644695
http://dx.doi.org/10.1016/j.displa.2023.102370
_version_ 1784866876082356224
author Ying, Xiaoqing
Liu, Hao
Huang, Rong
author_facet Ying, Xiaoqing
Liu, Hao
Huang, Rong
author_sort Ying, Xiaoqing
collection PubMed
description The Corona Virus Disease 2019 (COVID-19) has been declared a worldwide pandemic, and a key method for diagnosing COVID-19 is chest X-ray imaging. The application of convolutional neural network with medical imaging helps to diagnose the disease accurately, where the label quality plays an important role in the classification problem of COVID-19 chest X-rays. However, most of the existing classification methods ignore the problem that the labels are hardly completely true and effective, and noisy labels lead to a significant degradation in the performance of image classification frameworks. In addition, due to the wide distribution of lesions and the large number of local features of COVID-19 chest X-ray images, existing label recovery algorithms have to face the bottleneck problem of the difficult reuse of noisy samples. Therefore, this paper introduces a general classification framework for COVID-19 chest X-ray images with noisy labels and proposes a noisy label recovery algorithm based on subset label iterative propagation and replacement (SLIPR). Specifically, the proposed algorithm first obtains random subsets of the samples multiple times. Then, it integrates several techniques such as principal component analysis, low-rank representation, neighborhood graph regularization, and k-nearest neighbor for feature extraction and image classification. Finally, multi-level weight distribution and replacement are performed on the labels to cleanse the noise. In addition, for the label-recovered dataset, high confidence samples are further selected as the training set to improve the stability and accuracy of the classification framework without affecting its inherent performance. In this paper, three typical datasets are chosen to conduct extensive experiments and comparisons of existing algorithms under different metrics. Experimental results on three publicly available COVID-19 chest X-ray image datasets show that the proposed algorithm can effectively recover noisy labels and improve the accuracy of the image classification framework by 18.9% on the Tawsifur dataset, 19.92% on the Skytells dataset, and 16.72% on the CXRs dataset. Compared to the state-of-the-art algorithms, the gain of classification accuracy of SLIPR on the three datasets can reach 8.67%-19.38%, and the proposed algorithm also has certain scalability while ensuring data integrity.
format Online
Article
Text
id pubmed-9826538
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier B.V.
record_format MEDLINE/PubMed
spelling pubmed-98265382023-01-09 COVID-19 chest X-ray image classification in the presence of noisy labels() Ying, Xiaoqing Liu, Hao Huang, Rong Displays Article The Corona Virus Disease 2019 (COVID-19) has been declared a worldwide pandemic, and a key method for diagnosing COVID-19 is chest X-ray imaging. The application of convolutional neural network with medical imaging helps to diagnose the disease accurately, where the label quality plays an important role in the classification problem of COVID-19 chest X-rays. However, most of the existing classification methods ignore the problem that the labels are hardly completely true and effective, and noisy labels lead to a significant degradation in the performance of image classification frameworks. In addition, due to the wide distribution of lesions and the large number of local features of COVID-19 chest X-ray images, existing label recovery algorithms have to face the bottleneck problem of the difficult reuse of noisy samples. Therefore, this paper introduces a general classification framework for COVID-19 chest X-ray images with noisy labels and proposes a noisy label recovery algorithm based on subset label iterative propagation and replacement (SLIPR). Specifically, the proposed algorithm first obtains random subsets of the samples multiple times. Then, it integrates several techniques such as principal component analysis, low-rank representation, neighborhood graph regularization, and k-nearest neighbor for feature extraction and image classification. Finally, multi-level weight distribution and replacement are performed on the labels to cleanse the noise. In addition, for the label-recovered dataset, high confidence samples are further selected as the training set to improve the stability and accuracy of the classification framework without affecting its inherent performance. In this paper, three typical datasets are chosen to conduct extensive experiments and comparisons of existing algorithms under different metrics. Experimental results on three publicly available COVID-19 chest X-ray image datasets show that the proposed algorithm can effectively recover noisy labels and improve the accuracy of the image classification framework by 18.9% on the Tawsifur dataset, 19.92% on the Skytells dataset, and 16.72% on the CXRs dataset. Compared to the state-of-the-art algorithms, the gain of classification accuracy of SLIPR on the three datasets can reach 8.67%-19.38%, and the proposed algorithm also has certain scalability while ensuring data integrity. Elsevier B.V. 2023-04 2023-01-08 /pmc/articles/PMC9826538/ /pubmed/36644695 http://dx.doi.org/10.1016/j.displa.2023.102370 Text en © 2023 Elsevier B.V. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Ying, Xiaoqing
Liu, Hao
Huang, Rong
COVID-19 chest X-ray image classification in the presence of noisy labels()
title COVID-19 chest X-ray image classification in the presence of noisy labels()
title_full COVID-19 chest X-ray image classification in the presence of noisy labels()
title_fullStr COVID-19 chest X-ray image classification in the presence of noisy labels()
title_full_unstemmed COVID-19 chest X-ray image classification in the presence of noisy labels()
title_short COVID-19 chest X-ray image classification in the presence of noisy labels()
title_sort covid-19 chest x-ray image classification in the presence of noisy labels()
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9826538/
https://www.ncbi.nlm.nih.gov/pubmed/36644695
http://dx.doi.org/10.1016/j.displa.2023.102370
work_keys_str_mv AT yingxiaoqing covid19chestxrayimageclassificationinthepresenceofnoisylabels
AT liuhao covid19chestxrayimageclassificationinthepresenceofnoisylabels
AT huangrong covid19chestxrayimageclassificationinthepresenceofnoisylabels