Cargando…
Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement
In response to the COVID-19 global pandemic, recent research has proposed creating deep learning based models that use chest radiographs (CXRs) in a variety of clinical tasks to help manage the crisis. However, the size of existing datasets of CXRs from COVID-19+ patients are relatively small, and r...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7885941/ https://www.ncbi.nlm.nih.gov/pubmed/33594382 http://dx.doi.org/10.1101/2021.02.11.20196766 |
_version_ | 1783651696197500928 |
---|---|
author | Robinson, Caleb Trivedi, Anusua Blazes, Marian Ortiz, Anthony Desbiens, Jocelyn Gupta, Sunil Dodhia, Rahul Bhatraju, Pavan K. Liles, W. Conrad Lee, Aaron Kalpathy-Cramer, Jayashree Ferres, Juan M. Lavista |
author_facet | Robinson, Caleb Trivedi, Anusua Blazes, Marian Ortiz, Anthony Desbiens, Jocelyn Gupta, Sunil Dodhia, Rahul Bhatraju, Pavan K. Liles, W. Conrad Lee, Aaron Kalpathy-Cramer, Jayashree Ferres, Juan M. Lavista |
author_sort | Robinson, Caleb |
collection | PubMed |
description | In response to the COVID-19 global pandemic, recent research has proposed creating deep learning based models that use chest radiographs (CXRs) in a variety of clinical tasks to help manage the crisis. However, the size of existing datasets of CXRs from COVID-19+ patients are relatively small, and researchers often pool CXR data from multiple sources, for example, using different x-ray machines in various patient populations under different clinical scenarios. Deep learning models trained on such datasets have been shown to overfit to erroneous features instead of learning pulmonary characteristics – a phenomenon known as shortcut learning. We propose adding feature disentanglement to the training process, forcing the models to identify pulmonary features from the images while penalizing them for learning features that can discriminate between the original datasets that the images come from. We find that models trained in this way indeed have better generalization performance on unseen data; in the best case we found that it improved AUC by 0.13 on held out data. We further find that this outperforms masking out non-lung parts of the CXRs and performing histogram equalization, both of which are recently proposed methods for removing biases in CXR datasets. |
format | Online Article Text |
id | pubmed-7885941 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-78859412021-02-17 Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement Robinson, Caleb Trivedi, Anusua Blazes, Marian Ortiz, Anthony Desbiens, Jocelyn Gupta, Sunil Dodhia, Rahul Bhatraju, Pavan K. Liles, W. Conrad Lee, Aaron Kalpathy-Cramer, Jayashree Ferres, Juan M. Lavista medRxiv Article In response to the COVID-19 global pandemic, recent research has proposed creating deep learning based models that use chest radiographs (CXRs) in a variety of clinical tasks to help manage the crisis. However, the size of existing datasets of CXRs from COVID-19+ patients are relatively small, and researchers often pool CXR data from multiple sources, for example, using different x-ray machines in various patient populations under different clinical scenarios. Deep learning models trained on such datasets have been shown to overfit to erroneous features instead of learning pulmonary characteristics – a phenomenon known as shortcut learning. We propose adding feature disentanglement to the training process, forcing the models to identify pulmonary features from the images while penalizing them for learning features that can discriminate between the original datasets that the images come from. We find that models trained in this way indeed have better generalization performance on unseen data; in the best case we found that it improved AUC by 0.13 on held out data. We further find that this outperforms masking out non-lung parts of the CXRs and performing histogram equalization, both of which are recently proposed methods for removing biases in CXR datasets. Cold Spring Harbor Laboratory 2021-02-13 /pmc/articles/PMC7885941/ /pubmed/33594382 http://dx.doi.org/10.1101/2021.02.11.20196766 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use. |
spellingShingle | Article Robinson, Caleb Trivedi, Anusua Blazes, Marian Ortiz, Anthony Desbiens, Jocelyn Gupta, Sunil Dodhia, Rahul Bhatraju, Pavan K. Liles, W. Conrad Lee, Aaron Kalpathy-Cramer, Jayashree Ferres, Juan M. Lavista Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement |
title | Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement |
title_full | Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement |
title_fullStr | Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement |
title_full_unstemmed | Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement |
title_short | Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement |
title_sort | deep learning models for covid-19 chest x-ray classification: preventing shortcut learning using feature disentanglement |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7885941/ https://www.ncbi.nlm.nih.gov/pubmed/33594382 http://dx.doi.org/10.1101/2021.02.11.20196766 |
work_keys_str_mv | AT robinsoncaleb deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement AT trivedianusua deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement AT blazesmarian deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement AT ortizanthony deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement AT desbiensjocelyn deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement AT guptasunil deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement AT dodhiarahul deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement AT bhatrajupavank deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement AT lileswconrad deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement AT leeaaron deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement AT kalpathycramerjayashree deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement AT ferresjuanmlavista deeplearningmodelsforcovid19chestxrayclassificationpreventingshortcutlearningusingfeaturedisentanglement |