Cargando…
Training certified detectives to track down the intrinsic shortcuts in COVID-19 chest x-ray data sets
Deep learning faces a significant challenge wherein the trained models often underperform when used with external test data sets. This issue has been attributed to spurious correlations between irrelevant features in the input data and corresponding labels. This study uses the classification of COVI...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10403557/ https://www.ncbi.nlm.nih.gov/pubmed/37542078 http://dx.doi.org/10.1038/s41598-023-39855-3 |
_version_ | 1785085094251200512 |
---|---|
author | Zhang, Ran Griner, Dalton Garrett, John W. Qi, Zhihua Chen, Guang-Hong |
author_facet | Zhang, Ran Griner, Dalton Garrett, John W. Qi, Zhihua Chen, Guang-Hong |
author_sort | Zhang, Ran |
collection | PubMed |
description | Deep learning faces a significant challenge wherein the trained models often underperform when used with external test data sets. This issue has been attributed to spurious correlations between irrelevant features in the input data and corresponding labels. This study uses the classification of COVID-19 from chest x-ray radiographs as an example to demonstrate that the image contrast and sharpness, which are characteristics of a chest radiograph dependent on data acquisition systems and imaging parameters, can be intrinsic shortcuts that impair the model’s generalizability. The study proposes training certified shortcut detective models that meet a set of qualification criteria which can then identify these intrinsic shortcuts in a curated data set. |
format | Online Article Text |
id | pubmed-10403557 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-104035572023-08-06 Training certified detectives to track down the intrinsic shortcuts in COVID-19 chest x-ray data sets Zhang, Ran Griner, Dalton Garrett, John W. Qi, Zhihua Chen, Guang-Hong Sci Rep Article Deep learning faces a significant challenge wherein the trained models often underperform when used with external test data sets. This issue has been attributed to spurious correlations between irrelevant features in the input data and corresponding labels. This study uses the classification of COVID-19 from chest x-ray radiographs as an example to demonstrate that the image contrast and sharpness, which are characteristics of a chest radiograph dependent on data acquisition systems and imaging parameters, can be intrinsic shortcuts that impair the model’s generalizability. The study proposes training certified shortcut detective models that meet a set of qualification criteria which can then identify these intrinsic shortcuts in a curated data set. Nature Publishing Group UK 2023-08-04 /pmc/articles/PMC10403557/ /pubmed/37542078 http://dx.doi.org/10.1038/s41598-023-39855-3 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Zhang, Ran Griner, Dalton Garrett, John W. Qi, Zhihua Chen, Guang-Hong Training certified detectives to track down the intrinsic shortcuts in COVID-19 chest x-ray data sets |
title | Training certified detectives to track down the intrinsic shortcuts in COVID-19 chest x-ray data sets |
title_full | Training certified detectives to track down the intrinsic shortcuts in COVID-19 chest x-ray data sets |
title_fullStr | Training certified detectives to track down the intrinsic shortcuts in COVID-19 chest x-ray data sets |
title_full_unstemmed | Training certified detectives to track down the intrinsic shortcuts in COVID-19 chest x-ray data sets |
title_short | Training certified detectives to track down the intrinsic shortcuts in COVID-19 chest x-ray data sets |
title_sort | training certified detectives to track down the intrinsic shortcuts in covid-19 chest x-ray data sets |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10403557/ https://www.ncbi.nlm.nih.gov/pubmed/37542078 http://dx.doi.org/10.1038/s41598-023-39855-3 |
work_keys_str_mv | AT zhangran trainingcertifieddetectivestotrackdowntheintrinsicshortcutsincovid19chestxraydatasets AT grinerdalton trainingcertifieddetectivestotrackdowntheintrinsicshortcutsincovid19chestxraydatasets AT garrettjohnw trainingcertifieddetectivestotrackdowntheintrinsicshortcutsincovid19chestxraydatasets AT qizhihua trainingcertifieddetectivestotrackdowntheintrinsicshortcutsincovid19chestxraydatasets AT chenguanghong trainingcertifieddetectivestotrackdowntheintrinsicshortcutsincovid19chestxraydatasets |