Cargando…

Evaluation of deep learning for COVID‐19 diagnosis: Impact of image dataset organization

INTRODUCTION: Coronavirus disease 2019 (COVID‐19) has spread all over the world showing high transmissibility. Many studies have proposed diverse diagnostic methods based on deep learning using chest X‐ray images focusing on performance improvement. In reviewing them, this study noticed that evaluat...

Descripción completa

Detalles Bibliográficos
Autores principales:	Kim, Ga Young, Kim, Jae Yong, Kim, Chae Hyeon, Kim, Sung Min
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	John Wiley and Sons Inc. 2021
Materias:	Medical Imaging
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8292699/ https://www.ncbi.nlm.nih.gov/pubmed/34159697 http://dx.doi.org/10.1002/acm2.13320

_version_	1783724877320028160
author	Kim, Ga Young Kim, Jae Yong Kim, Chae Hyeon Kim, Sung Min
author_facet	Kim, Ga Young Kim, Jae Yong Kim, Chae Hyeon Kim, Sung Min
author_sort	Kim, Ga Young
collection	PubMed
description	INTRODUCTION: Coronavirus disease 2019 (COVID‐19) has spread all over the world showing high transmissibility. Many studies have proposed diverse diagnostic methods based on deep learning using chest X‐ray images focusing on performance improvement. In reviewing them, this study noticed that evaluation results might be influenced by dataset organization. Therefore, this study identified whether the high‐performance values can prove the clinical application potential. METHODS: This study selected chest X‐ray image databases which have been widely applied in previous studies. One database includes images for COVID‐19, while the others consist of normal and pneumonia images. Then, the COVID‐19 classification model was designed and trained on diverse database compositions and evaluated using confusion matrix‐based metrics. Also, each database was analyzed by graphical representation methods. RESULTS: The performance was significantly different according to dataset composition. Overall, higher performance was identified on the dataset organized with different databases for each class, compared with the dataset from same database. Also, there were significant differences in the image characteristics between different databases. CONCLUSIONS: The experimental results indicate that model may be trained based on differences of the image characteristics between databases and not on lesion features. This shows that evaluation metrics can be influenced by dataset organization, and high metric values would not directly mean the potential for clinical application. These emphasize the importance of suitable dataset organization for applying COVID‐19 diagnosis methods to real clinical sites. Radiologists should sufficiently understand about this issue as actual user of these methods.
format	Online Article Text
id	pubmed-8292699
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	John Wiley and Sons Inc.
record_format	MEDLINE/PubMed
spelling	pubmed-82926992021-07-22 Evaluation of deep learning for COVID‐19 diagnosis: Impact of image dataset organization Kim, Ga Young Kim, Jae Yong Kim, Chae Hyeon Kim, Sung Min J Appl Clin Med Phys Medical Imaging INTRODUCTION: Coronavirus disease 2019 (COVID‐19) has spread all over the world showing high transmissibility. Many studies have proposed diverse diagnostic methods based on deep learning using chest X‐ray images focusing on performance improvement. In reviewing them, this study noticed that evaluation results might be influenced by dataset organization. Therefore, this study identified whether the high‐performance values can prove the clinical application potential. METHODS: This study selected chest X‐ray image databases which have been widely applied in previous studies. One database includes images for COVID‐19, while the others consist of normal and pneumonia images. Then, the COVID‐19 classification model was designed and trained on diverse database compositions and evaluated using confusion matrix‐based metrics. Also, each database was analyzed by graphical representation methods. RESULTS: The performance was significantly different according to dataset composition. Overall, higher performance was identified on the dataset organized with different databases for each class, compared with the dataset from same database. Also, there were significant differences in the image characteristics between different databases. CONCLUSIONS: The experimental results indicate that model may be trained based on differences of the image characteristics between databases and not on lesion features. This shows that evaluation metrics can be influenced by dataset organization, and high metric values would not directly mean the potential for clinical application. These emphasize the importance of suitable dataset organization for applying COVID‐19 diagnosis methods to real clinical sites. Radiologists should sufficiently understand about this issue as actual user of these methods. John Wiley and Sons Inc. 2021-06-23 /pmc/articles/PMC8292699/ /pubmed/34159697 http://dx.doi.org/10.1002/acm2.13320 Text en © 2021 The Authors. Journal of Applied Clinical Medical Physics published by Wiley Periodicals LLC on behalf of American Association of Physicists in Medicine. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Medical Imaging Kim, Ga Young Kim, Jae Yong Kim, Chae Hyeon Kim, Sung Min Evaluation of deep learning for COVID‐19 diagnosis: Impact of image dataset organization
title	Evaluation of deep learning for COVID‐19 diagnosis: Impact of image dataset organization
title_full	Evaluation of deep learning for COVID‐19 diagnosis: Impact of image dataset organization
title_fullStr	Evaluation of deep learning for COVID‐19 diagnosis: Impact of image dataset organization
title_full_unstemmed	Evaluation of deep learning for COVID‐19 diagnosis: Impact of image dataset organization
title_short	Evaluation of deep learning for COVID‐19 diagnosis: Impact of image dataset organization
title_sort	evaluation of deep learning for covid‐19 diagnosis: impact of image dataset organization
topic	Medical Imaging
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8292699/ https://www.ncbi.nlm.nih.gov/pubmed/34159697 http://dx.doi.org/10.1002/acm2.13320
work_keys_str_mv	AT kimgayoung evaluationofdeeplearningforcovid19diagnosisimpactofimagedatasetorganization AT kimjaeyong evaluationofdeeplearningforcovid19diagnosisimpactofimagedatasetorganization AT kimchaehyeon evaluationofdeeplearningforcovid19diagnosisimpactofimagedatasetorganization AT kimsungmin evaluationofdeeplearningforcovid19diagnosisimpactofimagedatasetorganization

Evaluation of deep learning for COVID‐19 diagnosis: Impact of image dataset organization

Ejemplares similares