Cargando…
Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement
In response to the COVID-19 global pandemic, recent research has proposed creating deep learning based models that use chest radiographs (CXRs) in a variety of clinical tasks to help manage the crisis. However, the size of existing datasets of CXRs from COVID-19+ patients are relatively small, and r...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9536609/ https://www.ncbi.nlm.nih.gov/pubmed/36201483 http://dx.doi.org/10.1371/journal.pone.0274098 |
_version_ | 1784803017246113792 |
---|---|
author | Trivedi, Anusua Robinson, Caleb Blazes, Marian Ortiz, Anthony Desbiens, Jocelyn Gupta, Sunil Dodhia, Rahul Bhatraju, Pavan K. Liles, W. Conrad Kalpathy-Cramer, Jayashree Lee, Aaron Y. Lavista Ferres, Juan M. |
author_facet | Trivedi, Anusua Robinson, Caleb Blazes, Marian Ortiz, Anthony Desbiens, Jocelyn Gupta, Sunil Dodhia, Rahul Bhatraju, Pavan K. Liles, W. Conrad Kalpathy-Cramer, Jayashree Lee, Aaron Y. Lavista Ferres, Juan M. |
author_sort | Trivedi, Anusua |
collection | PubMed |
description | In response to the COVID-19 global pandemic, recent research has proposed creating deep learning based models that use chest radiographs (CXRs) in a variety of clinical tasks to help manage the crisis. However, the size of existing datasets of CXRs from COVID-19+ patients are relatively small, and researchers often pool CXR data from multiple sources, for example, using different x-ray machines in various patient populations under different clinical scenarios. Deep learning models trained on such datasets have been shown to overfit to erroneous features instead of learning pulmonary characteristics in a phenomenon known as shortcut learning. We propose adding feature disentanglement to the training process. This technique forces the models to identify pulmonary features from the images and penalizes them for learning features that can discriminate between the original datasets that the images come from. We find that models trained in this way indeed have better generalization performance on unseen data; in the best case we found that it improved AUC by 0.13 on held out data. We further find that this outperforms masking out non-lung parts of the CXRs and performing histogram equalization, both of which are recently proposed methods for removing biases in CXR datasets. |
format | Online Article Text |
id | pubmed-9536609 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-95366092022-10-07 Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement Trivedi, Anusua Robinson, Caleb Blazes, Marian Ortiz, Anthony Desbiens, Jocelyn Gupta, Sunil Dodhia, Rahul Bhatraju, Pavan K. Liles, W. Conrad Kalpathy-Cramer, Jayashree Lee, Aaron Y. Lavista Ferres, Juan M. PLoS One Research Article In response to the COVID-19 global pandemic, recent research has proposed creating deep learning based models that use chest radiographs (CXRs) in a variety of clinical tasks to help manage the crisis. However, the size of existing datasets of CXRs from COVID-19+ patients are relatively small, and researchers often pool CXR data from multiple sources, for example, using different x-ray machines in various patient populations under different clinical scenarios. Deep learning models trained on such datasets have been shown to overfit to erroneous features instead of learning pulmonary characteristics in a phenomenon known as shortcut learning. We propose adding feature disentanglement to the training process. This technique forces the models to identify pulmonary features from the images and penalizes them for learning features that can discriminate between the original datasets that the images come from. We find that models trained in this way indeed have better generalization performance on unseen data; in the best case we found that it improved AUC by 0.13 on held out data. We further find that this outperforms masking out non-lung parts of the CXRs and performing histogram equalization, both of which are recently proposed methods for removing biases in CXR datasets. Public Library of Science 2022-10-06 /pmc/articles/PMC9536609/ /pubmed/36201483 http://dx.doi.org/10.1371/journal.pone.0274098 Text en © 2022 Trivedi et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Trivedi, Anusua Robinson, Caleb Blazes, Marian Ortiz, Anthony Desbiens, Jocelyn Gupta, Sunil Dodhia, Rahul Bhatraju, Pavan K. Liles, W. Conrad Kalpathy-Cramer, Jayashree Lee, Aaron Y. Lavista Ferres, Juan M. Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement |
title | Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement |
title_full | Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement |
title_fullStr | Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement |
title_full_unstemmed | Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement |
title_short | Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement |
title_sort | deep learning models for covid-19 chest x-ray classification: preventing shortcut learning using feature disentanglement |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9536609/ https://www.ncbi.nlm.nih.gov/pubmed/36201483 http://dx.doi.org/10.1371/journal.pone.0274098 |
work_keys_str_mv | AT trivedianusua deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement AT robinsoncaleb deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement AT blazesmarian deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement AT ortizanthony deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement AT desbiensjocelyn deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement AT guptasunil deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement AT dodhiarahul deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement AT bhatrajupavank deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement AT lileswconrad deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement AT kalpathycramerjayashree deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement AT leeaarony deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement AT lavistaferresjuanm deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement |