Cargando…

Training certified detectives to track down the intrinsic shortcuts in COVID-19 chest x-ray data sets

Deep learning faces a significant challenge wherein the trained models often underperform when used with external test data sets. This issue has been attributed to spurious correlations between irrelevant features in the input data and corresponding labels. This study uses the classification of COVI...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Ran, Griner, Dalton, Garrett, John W., Qi, Zhihua, Chen, Guang-Hong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10403557/
https://www.ncbi.nlm.nih.gov/pubmed/37542078
http://dx.doi.org/10.1038/s41598-023-39855-3
_version_ 1785085094251200512
author Zhang, Ran
Griner, Dalton
Garrett, John W.
Qi, Zhihua
Chen, Guang-Hong
author_facet Zhang, Ran
Griner, Dalton
Garrett, John W.
Qi, Zhihua
Chen, Guang-Hong
author_sort Zhang, Ran
collection PubMed
description Deep learning faces a significant challenge wherein the trained models often underperform when used with external test data sets. This issue has been attributed to spurious correlations between irrelevant features in the input data and corresponding labels. This study uses the classification of COVID-19 from chest x-ray radiographs as an example to demonstrate that the image contrast and sharpness, which are characteristics of a chest radiograph dependent on data acquisition systems and imaging parameters, can be intrinsic shortcuts that impair the model’s generalizability. The study proposes training certified shortcut detective models that meet a set of qualification criteria which can then identify these intrinsic shortcuts in a curated data set.
format Online
Article
Text
id pubmed-10403557
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-104035572023-08-06 Training certified detectives to track down the intrinsic shortcuts in COVID-19 chest x-ray data sets Zhang, Ran Griner, Dalton Garrett, John W. Qi, Zhihua Chen, Guang-Hong Sci Rep Article Deep learning faces a significant challenge wherein the trained models often underperform when used with external test data sets. This issue has been attributed to spurious correlations between irrelevant features in the input data and corresponding labels. This study uses the classification of COVID-19 from chest x-ray radiographs as an example to demonstrate that the image contrast and sharpness, which are characteristics of a chest radiograph dependent on data acquisition systems and imaging parameters, can be intrinsic shortcuts that impair the model’s generalizability. The study proposes training certified shortcut detective models that meet a set of qualification criteria which can then identify these intrinsic shortcuts in a curated data set. Nature Publishing Group UK 2023-08-04 /pmc/articles/PMC10403557/ /pubmed/37542078 http://dx.doi.org/10.1038/s41598-023-39855-3 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Zhang, Ran
Griner, Dalton
Garrett, John W.
Qi, Zhihua
Chen, Guang-Hong
Training certified detectives to track down the intrinsic shortcuts in COVID-19 chest x-ray data sets
title Training certified detectives to track down the intrinsic shortcuts in COVID-19 chest x-ray data sets
title_full Training certified detectives to track down the intrinsic shortcuts in COVID-19 chest x-ray data sets
title_fullStr Training certified detectives to track down the intrinsic shortcuts in COVID-19 chest x-ray data sets
title_full_unstemmed Training certified detectives to track down the intrinsic shortcuts in COVID-19 chest x-ray data sets
title_short Training certified detectives to track down the intrinsic shortcuts in COVID-19 chest x-ray data sets
title_sort training certified detectives to track down the intrinsic shortcuts in covid-19 chest x-ray data sets
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10403557/
https://www.ncbi.nlm.nih.gov/pubmed/37542078
http://dx.doi.org/10.1038/s41598-023-39855-3
work_keys_str_mv AT zhangran trainingcertifieddetectivestotrackdowntheintrinsicshortcutsincovid19chestxraydatasets
AT grinerdalton trainingcertifieddetectivestotrackdowntheintrinsicshortcutsincovid19chestxraydatasets
AT garrettjohnw trainingcertifieddetectivestotrackdowntheintrinsicshortcutsincovid19chestxraydatasets
AT qizhihua trainingcertifieddetectivestotrackdowntheintrinsicshortcutsincovid19chestxraydatasets
AT chenguanghong trainingcertifieddetectivestotrackdowntheintrinsicshortcutsincovid19chestxraydatasets