Cargando…

Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement

In response to the COVID-19 global pandemic, recent research has proposed creating deep learning based models that use chest radiographs (CXRs) in a variety of clinical tasks to help manage the crisis. However, the size of existing datasets of CXRs from COVID-19+ patients are relatively small, and r...

Descripción completa

Detalles Bibliográficos
Autores principales: Robinson, Caleb, Trivedi, Anusua, Blazes, Marian, Ortiz, Anthony, Desbiens, Jocelyn, Gupta, Sunil, Dodhia, Rahul, Bhatraju, Pavan K., Liles, W. Conrad, Lee, Aaron, Kalpathy-Cramer, Jayashree, Ferres, Juan M. Lavista
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7885941/
https://www.ncbi.nlm.nih.gov/pubmed/33594382
http://dx.doi.org/10.1101/2021.02.11.20196766
_version_ 1783651696197500928
author Robinson, Caleb
Trivedi, Anusua
Blazes, Marian
Ortiz, Anthony
Desbiens, Jocelyn
Gupta, Sunil
Dodhia, Rahul
Bhatraju, Pavan K.
Liles, W. Conrad
Lee, Aaron
Kalpathy-Cramer, Jayashree
Ferres, Juan M. Lavista
author_facet Robinson, Caleb
Trivedi, Anusua
Blazes, Marian
Ortiz, Anthony
Desbiens, Jocelyn
Gupta, Sunil
Dodhia, Rahul
Bhatraju, Pavan K.
Liles, W. Conrad
Lee, Aaron
Kalpathy-Cramer, Jayashree
Ferres, Juan M. Lavista
author_sort Robinson, Caleb
collection PubMed
description In response to the COVID-19 global pandemic, recent research has proposed creating deep learning based models that use chest radiographs (CXRs) in a variety of clinical tasks to help manage the crisis. However, the size of existing datasets of CXRs from COVID-19+ patients are relatively small, and researchers often pool CXR data from multiple sources, for example, using different x-ray machines in various patient populations under different clinical scenarios. Deep learning models trained on such datasets have been shown to overfit to erroneous features instead of learning pulmonary characteristics – a phenomenon known as shortcut learning. We propose adding feature disentanglement to the training process, forcing the models to identify pulmonary features from the images while penalizing them for learning features that can discriminate between the original datasets that the images come from. We find that models trained in this way indeed have better generalization performance on unseen data; in the best case we found that it improved AUC by 0.13 on held out data. We further find that this outperforms masking out non-lung parts of the CXRs and performing histogram equalization, both of which are recently proposed methods for removing biases in CXR datasets.
format Online
Article
Text
id pubmed-7885941
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-78859412021-02-17 Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement Robinson, Caleb Trivedi, Anusua Blazes, Marian Ortiz, Anthony Desbiens, Jocelyn Gupta, Sunil Dodhia, Rahul Bhatraju, Pavan K. Liles, W. Conrad Lee, Aaron Kalpathy-Cramer, Jayashree Ferres, Juan M. Lavista medRxiv Article In response to the COVID-19 global pandemic, recent research has proposed creating deep learning based models that use chest radiographs (CXRs) in a variety of clinical tasks to help manage the crisis. However, the size of existing datasets of CXRs from COVID-19+ patients are relatively small, and researchers often pool CXR data from multiple sources, for example, using different x-ray machines in various patient populations under different clinical scenarios. Deep learning models trained on such datasets have been shown to overfit to erroneous features instead of learning pulmonary characteristics – a phenomenon known as shortcut learning. We propose adding feature disentanglement to the training process, forcing the models to identify pulmonary features from the images while penalizing them for learning features that can discriminate between the original datasets that the images come from. We find that models trained in this way indeed have better generalization performance on unseen data; in the best case we found that it improved AUC by 0.13 on held out data. We further find that this outperforms masking out non-lung parts of the CXRs and performing histogram equalization, both of which are recently proposed methods for removing biases in CXR datasets. Cold Spring Harbor Laboratory 2021-02-13 /pmc/articles/PMC7885941/ /pubmed/33594382 http://dx.doi.org/10.1101/2021.02.11.20196766 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle Article
Robinson, Caleb
Trivedi, Anusua
Blazes, Marian
Ortiz, Anthony
Desbiens, Jocelyn
Gupta, Sunil
Dodhia, Rahul
Bhatraju, Pavan K.
Liles, W. Conrad
Lee, Aaron
Kalpathy-Cramer, Jayashree
Ferres, Juan M. Lavista
Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement
title Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement
title_full Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement
title_fullStr Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement
title_full_unstemmed Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement
title_short Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement
title_sort deep learning models for covid-19 chest x-ray classification: preventing shortcut learning using feature disentanglement
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7885941/
https://www.ncbi.nlm.nih.gov/pubmed/33594382
http://dx.doi.org/10.1101/2021.02.11.20196766
work_keys_str_mv AT robinsoncaleb deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement
AT trivedianusua deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement
AT blazesmarian deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement
AT ortizanthony deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement
AT desbiensjocelyn deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement
AT guptasunil deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement
AT dodhiarahul deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement
AT bhatrajupavank deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement
AT lileswconrad deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement
AT leeaaron deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement
AT kalpathycramerjayashree deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement
AT ferresjuanmlavista deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement