Cargando…

Discovery of a Generalization Gap of Convolutional Neural Networks on COVID-19 X-Rays Classification

A number of recent papers have shown experimental evidence that suggests it is possible to build highly accurate deep neural network models to detect COVID-19 from chest X-ray images. In this paper, we show that good generalization to unseen sources has not been achieved. Experiments with richer dat...

Descripción completa

Detalles Bibliográficos
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	IEEE 2021
Materias:	Biomedical Engineering
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8224464/ https://www.ncbi.nlm.nih.gov/pubmed/34178559 http://dx.doi.org/10.1109/ACCESS.2021.3079716

_version_	1783711894621650944
collection	PubMed
description	A number of recent papers have shown experimental evidence that suggests it is possible to build highly accurate deep neural network models to detect COVID-19 from chest X-ray images. In this paper, we show that good generalization to unseen sources has not been achieved. Experiments with richer data sets than have previously been used show models have high accuracy on seen sources, but poor accuracy on unseen sources. The reason for the disparity is that the convolutional neural network model, which learns features, can focus on differences in X-ray machines or in positioning within the machines, for example. Any feature that a person would clearly rule out is called a confounding feature. Some of the models were trained on COVID-19 image data taken from publications, which may be different than raw images. Some data sets were of pediatric cases with pneumonia where COVID-19 chest X-rays are almost exclusively from adults, so lung size becomes a spurious feature that can be exploited. In this work, we have eliminated many confounding features by working with as close to raw data as possible. Still, deep learned models may leverage source specific confounders to differentiate COVID-19 from pneumonia preventing generalizing to new data sources (i.e. external sites). Our models have achieved an AUC of 1.00 on seen data sources but in the worst case only scored an AUC of 0.38 on unseen ones. This indicates that such models need further assessment/development before they can be broadly clinically deployed. An example of fine-tuning to improve performance at a new site is given.
format	Online Article Text
id	pubmed-8224464
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	IEEE
record_format	MEDLINE/PubMed
spelling	pubmed-82244642021-06-24 Discovery of a Generalization Gap of Convolutional Neural Networks on COVID-19 X-Rays Classification IEEE Access Biomedical Engineering A number of recent papers have shown experimental evidence that suggests it is possible to build highly accurate deep neural network models to detect COVID-19 from chest X-ray images. In this paper, we show that good generalization to unseen sources has not been achieved. Experiments with richer data sets than have previously been used show models have high accuracy on seen sources, but poor accuracy on unseen sources. The reason for the disparity is that the convolutional neural network model, which learns features, can focus on differences in X-ray machines or in positioning within the machines, for example. Any feature that a person would clearly rule out is called a confounding feature. Some of the models were trained on COVID-19 image data taken from publications, which may be different than raw images. Some data sets were of pediatric cases with pneumonia where COVID-19 chest X-rays are almost exclusively from adults, so lung size becomes a spurious feature that can be exploited. In this work, we have eliminated many confounding features by working with as close to raw data as possible. Still, deep learned models may leverage source specific confounders to differentiate COVID-19 from pneumonia preventing generalizing to new data sources (i.e. external sites). Our models have achieved an AUC of 1.00 on seen data sources but in the worst case only scored an AUC of 0.38 on unseen ones. This indicates that such models need further assessment/development before they can be broadly clinically deployed. An example of fine-tuning to improve performance at a new site is given. IEEE 2021-05-13 /pmc/articles/PMC8224464/ /pubmed/34178559 http://dx.doi.org/10.1109/ACCESS.2021.3079716 Text en This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/ https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
spellingShingle	Biomedical Engineering Discovery of a Generalization Gap of Convolutional Neural Networks on COVID-19 X-Rays Classification
title	Discovery of a Generalization Gap of Convolutional Neural Networks on COVID-19 X-Rays Classification
title_full	Discovery of a Generalization Gap of Convolutional Neural Networks on COVID-19 X-Rays Classification
title_fullStr	Discovery of a Generalization Gap of Convolutional Neural Networks on COVID-19 X-Rays Classification
title_full_unstemmed	Discovery of a Generalization Gap of Convolutional Neural Networks on COVID-19 X-Rays Classification
title_short	Discovery of a Generalization Gap of Convolutional Neural Networks on COVID-19 X-Rays Classification
title_sort	discovery of a generalization gap of convolutional neural networks on covid-19 x-rays classification
topic	Biomedical Engineering
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8224464/ https://www.ncbi.nlm.nih.gov/pubmed/34178559 http://dx.doi.org/10.1109/ACCESS.2021.3079716
work_keys_str_mv	AT discoveryofageneralizationgapofconvolutionalneuralnetworksoncovid19xraysclassification AT discoveryofageneralizationgapofconvolutionalneuralnetworksoncovid19xraysclassification AT discoveryofageneralizationgapofconvolutionalneuralnetworksoncovid19xraysclassification AT discoveryofageneralizationgapofconvolutionalneuralnetworksoncovid19xraysclassification AT discoveryofageneralizationgapofconvolutionalneuralnetworksoncovid19xraysclassification

Discovery of a Generalization Gap of Convolutional Neural Networks on COVID-19 X-Rays Classification

Ejemplares similares