Cargando…

Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement

In response to the COVID-19 global pandemic, recent research has proposed creating deep learning based models that use chest radiographs (CXRs) in a variety of clinical tasks to help manage the crisis. However, the size of existing datasets of CXRs from COVID-19+ patients are relatively small, and r...

Descripción completa

Detalles Bibliográficos
Autores principales: Trivedi, Anusua, Robinson, Caleb, Blazes, Marian, Ortiz, Anthony, Desbiens, Jocelyn, Gupta, Sunil, Dodhia, Rahul, Bhatraju, Pavan K., Liles, W. Conrad, Kalpathy-Cramer, Jayashree, Lee, Aaron Y., Lavista Ferres, Juan M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9536609/
https://www.ncbi.nlm.nih.gov/pubmed/36201483
http://dx.doi.org/10.1371/journal.pone.0274098
_version_ 1784803017246113792
author Trivedi, Anusua
Robinson, Caleb
Blazes, Marian
Ortiz, Anthony
Desbiens, Jocelyn
Gupta, Sunil
Dodhia, Rahul
Bhatraju, Pavan K.
Liles, W. Conrad
Kalpathy-Cramer, Jayashree
Lee, Aaron Y.
Lavista Ferres, Juan M.
author_facet Trivedi, Anusua
Robinson, Caleb
Blazes, Marian
Ortiz, Anthony
Desbiens, Jocelyn
Gupta, Sunil
Dodhia, Rahul
Bhatraju, Pavan K.
Liles, W. Conrad
Kalpathy-Cramer, Jayashree
Lee, Aaron Y.
Lavista Ferres, Juan M.
author_sort Trivedi, Anusua
collection PubMed
description In response to the COVID-19 global pandemic, recent research has proposed creating deep learning based models that use chest radiographs (CXRs) in a variety of clinical tasks to help manage the crisis. However, the size of existing datasets of CXRs from COVID-19+ patients are relatively small, and researchers often pool CXR data from multiple sources, for example, using different x-ray machines in various patient populations under different clinical scenarios. Deep learning models trained on such datasets have been shown to overfit to erroneous features instead of learning pulmonary characteristics in a phenomenon known as shortcut learning. We propose adding feature disentanglement to the training process. This technique forces the models to identify pulmonary features from the images and penalizes them for learning features that can discriminate between the original datasets that the images come from. We find that models trained in this way indeed have better generalization performance on unseen data; in the best case we found that it improved AUC by 0.13 on held out data. We further find that this outperforms masking out non-lung parts of the CXRs and performing histogram equalization, both of which are recently proposed methods for removing biases in CXR datasets.
format Online
Article
Text
id pubmed-9536609
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-95366092022-10-07 Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement Trivedi, Anusua Robinson, Caleb Blazes, Marian Ortiz, Anthony Desbiens, Jocelyn Gupta, Sunil Dodhia, Rahul Bhatraju, Pavan K. Liles, W. Conrad Kalpathy-Cramer, Jayashree Lee, Aaron Y. Lavista Ferres, Juan M. PLoS One Research Article In response to the COVID-19 global pandemic, recent research has proposed creating deep learning based models that use chest radiographs (CXRs) in a variety of clinical tasks to help manage the crisis. However, the size of existing datasets of CXRs from COVID-19+ patients are relatively small, and researchers often pool CXR data from multiple sources, for example, using different x-ray machines in various patient populations under different clinical scenarios. Deep learning models trained on such datasets have been shown to overfit to erroneous features instead of learning pulmonary characteristics in a phenomenon known as shortcut learning. We propose adding feature disentanglement to the training process. This technique forces the models to identify pulmonary features from the images and penalizes them for learning features that can discriminate between the original datasets that the images come from. We find that models trained in this way indeed have better generalization performance on unseen data; in the best case we found that it improved AUC by 0.13 on held out data. We further find that this outperforms masking out non-lung parts of the CXRs and performing histogram equalization, both of which are recently proposed methods for removing biases in CXR datasets. Public Library of Science 2022-10-06 /pmc/articles/PMC9536609/ /pubmed/36201483 http://dx.doi.org/10.1371/journal.pone.0274098 Text en © 2022 Trivedi et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Trivedi, Anusua
Robinson, Caleb
Blazes, Marian
Ortiz, Anthony
Desbiens, Jocelyn
Gupta, Sunil
Dodhia, Rahul
Bhatraju, Pavan K.
Liles, W. Conrad
Kalpathy-Cramer, Jayashree
Lee, Aaron Y.
Lavista Ferres, Juan M.
Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement
title Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement
title_full Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement
title_fullStr Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement
title_full_unstemmed Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement
title_short Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement
title_sort deep learning models for covid-19 chest x-ray classification: preventing shortcut learning using feature disentanglement
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9536609/
https://www.ncbi.nlm.nih.gov/pubmed/36201483
http://dx.doi.org/10.1371/journal.pone.0274098
work_keys_str_mv AT trivedianusua deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement
AT robinsoncaleb deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement
AT blazesmarian deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement
AT ortizanthony deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement
AT desbiensjocelyn deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement
AT guptasunil deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement
AT dodhiarahul deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement
AT bhatrajupavank deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement
AT lileswconrad deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement
AT kalpathycramerjayashree deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement
AT leeaarony deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement
AT lavistaferresjuanm deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement