Cargando…

Training certified detectives to track down the intrinsic shortcuts in COVID-19 chest x-ray data sets

Deep learning faces a significant challenge wherein the trained models often underperform when used with external test data sets. This issue has been attributed to spurious correlations between irrelevant features in the input data and corresponding labels. This study uses the classification of COVI...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Ran, Griner, Dalton, Garrett, John W., Qi, Zhihua, Chen, Guang-Hong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Journal Experts 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10168454/
https://www.ncbi.nlm.nih.gov/pubmed/37162826
http://dx.doi.org/10.21203/rs.3.rs-2818347/v1
Descripción
Sumario:Deep learning faces a significant challenge wherein the trained models often underperform when used with external test data sets. This issue has been attributed to spurious correlations between irrelevant features in the input data and corresponding labels. This study uses the classification of COVID-19 from chest x-ray radiographs as an example to demonstrate that the image contrast and sharpness, which are characteristics of a chest radiograph dependent on data acquisition systems and imaging parameters, can be intrinsic shortcuts that impair the model’s generalizability. The study proposes training certified shortcut detective models that meet a set of qualification criteria which can then identify these intrinsic shortcuts in a curated data set.