Cargando…
Deep learning with robustness to missing data: A novel approach to the detection of COVID-19
In the context of the current global pandemic and the limitations of the RT-PCR test, we propose a novel deep learning architecture, DFCN (Denoising Fully Connected Network). Since medical facilities around the world differ enormously in what laboratory tests or chest imaging may be available, DFCN...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8323880/ https://www.ncbi.nlm.nih.gov/pubmed/34329354 http://dx.doi.org/10.1371/journal.pone.0255301 |
_version_ | 1783731323321450496 |
---|---|
author | Çallı, Erdi Murphy, Keelin Kurstjens, Steef Samson, Tijs Herpers, Robert Smits, Henk Rutten, Matthieu van Ginneken, Bram |
author_facet | Çallı, Erdi Murphy, Keelin Kurstjens, Steef Samson, Tijs Herpers, Robert Smits, Henk Rutten, Matthieu van Ginneken, Bram |
author_sort | Çallı, Erdi |
collection | PubMed |
description | In the context of the current global pandemic and the limitations of the RT-PCR test, we propose a novel deep learning architecture, DFCN (Denoising Fully Connected Network). Since medical facilities around the world differ enormously in what laboratory tests or chest imaging may be available, DFCN is designed to be robust to missing input data. An ablation study extensively evaluates the performance benefits of the DFCN as well as its robustness to missing inputs. Data from 1088 patients with confirmed RT-PCR results are obtained from two independent medical facilities. The data includes results from 27 laboratory tests and a chest x-ray scored by a deep learning model. Training and test datasets are taken from different medical facilities. Data is made publicly available. The performance of DFCN in predicting the RT-PCR result is compared with 3 related architectures as well as a Random Forest baseline. All models are trained with varying levels of masked input data to encourage robustness to missing inputs. Missing data is simulated at test time by masking inputs randomly. DFCN outperforms all other models with statistical significance using random subsets of input data with 2-27 available inputs. When all 28 inputs are available DFCN obtains an AUC of 0.924, higher than any other model. Furthermore, with clinically meaningful subsets of parameters consisting of just 6 and 7 inputs respectively, DFCN achieves higher AUCs than any other model, with values of 0.909 and 0.919. |
format | Online Article Text |
id | pubmed-8323880 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-83238802021-07-31 Deep learning with robustness to missing data: A novel approach to the detection of COVID-19 Çallı, Erdi Murphy, Keelin Kurstjens, Steef Samson, Tijs Herpers, Robert Smits, Henk Rutten, Matthieu van Ginneken, Bram PLoS One Research Article In the context of the current global pandemic and the limitations of the RT-PCR test, we propose a novel deep learning architecture, DFCN (Denoising Fully Connected Network). Since medical facilities around the world differ enormously in what laboratory tests or chest imaging may be available, DFCN is designed to be robust to missing input data. An ablation study extensively evaluates the performance benefits of the DFCN as well as its robustness to missing inputs. Data from 1088 patients with confirmed RT-PCR results are obtained from two independent medical facilities. The data includes results from 27 laboratory tests and a chest x-ray scored by a deep learning model. Training and test datasets are taken from different medical facilities. Data is made publicly available. The performance of DFCN in predicting the RT-PCR result is compared with 3 related architectures as well as a Random Forest baseline. All models are trained with varying levels of masked input data to encourage robustness to missing inputs. Missing data is simulated at test time by masking inputs randomly. DFCN outperforms all other models with statistical significance using random subsets of input data with 2-27 available inputs. When all 28 inputs are available DFCN obtains an AUC of 0.924, higher than any other model. Furthermore, with clinically meaningful subsets of parameters consisting of just 6 and 7 inputs respectively, DFCN achieves higher AUCs than any other model, with values of 0.909 and 0.919. Public Library of Science 2021-07-30 /pmc/articles/PMC8323880/ /pubmed/34329354 http://dx.doi.org/10.1371/journal.pone.0255301 Text en © 2021 Çallı et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Çallı, Erdi Murphy, Keelin Kurstjens, Steef Samson, Tijs Herpers, Robert Smits, Henk Rutten, Matthieu van Ginneken, Bram Deep learning with robustness to missing data: A novel approach to the detection of COVID-19 |
title | Deep learning with robustness to missing data: A novel approach to the detection of COVID-19 |
title_full | Deep learning with robustness to missing data: A novel approach to the detection of COVID-19 |
title_fullStr | Deep learning with robustness to missing data: A novel approach to the detection of COVID-19 |
title_full_unstemmed | Deep learning with robustness to missing data: A novel approach to the detection of COVID-19 |
title_short | Deep learning with robustness to missing data: A novel approach to the detection of COVID-19 |
title_sort | deep learning with robustness to missing data: a novel approach to the detection of covid-19 |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8323880/ https://www.ncbi.nlm.nih.gov/pubmed/34329354 http://dx.doi.org/10.1371/journal.pone.0255301 |
work_keys_str_mv | AT callıerdi deeplearningwithrobustnesstomissingdataanovelapproachtothedetectionofcovid19 AT murphykeelin deeplearningwithrobustnesstomissingdataanovelapproachtothedetectionofcovid19 AT kurstjenssteef deeplearningwithrobustnesstomissingdataanovelapproachtothedetectionofcovid19 AT samsontijs deeplearningwithrobustnesstomissingdataanovelapproachtothedetectionofcovid19 AT herpersrobert deeplearningwithrobustnesstomissingdataanovelapproachtothedetectionofcovid19 AT smitshenk deeplearningwithrobustnesstomissingdataanovelapproachtothedetectionofcovid19 AT ruttenmatthieu deeplearningwithrobustnesstomissingdataanovelapproachtothedetectionofcovid19 AT vanginnekenbram deeplearningwithrobustnesstomissingdataanovelapproachtothedetectionofcovid19 |