Cargando…

Deep learning with robustness to missing data: A novel approach to the detection of COVID-19

In the context of the current global pandemic and the limitations of the RT-PCR test, we propose a novel deep learning architecture, DFCN (Denoising Fully Connected Network). Since medical facilities around the world differ enormously in what laboratory tests or chest imaging may be available, DFCN...

Descripción completa

Detalles Bibliográficos
Autores principales: Çallı, Erdi, Murphy, Keelin, Kurstjens, Steef, Samson, Tijs, Herpers, Robert, Smits, Henk, Rutten, Matthieu, van Ginneken, Bram
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8323880/
https://www.ncbi.nlm.nih.gov/pubmed/34329354
http://dx.doi.org/10.1371/journal.pone.0255301
_version_ 1783731323321450496
author Çallı, Erdi
Murphy, Keelin
Kurstjens, Steef
Samson, Tijs
Herpers, Robert
Smits, Henk
Rutten, Matthieu
van Ginneken, Bram
author_facet Çallı, Erdi
Murphy, Keelin
Kurstjens, Steef
Samson, Tijs
Herpers, Robert
Smits, Henk
Rutten, Matthieu
van Ginneken, Bram
author_sort Çallı, Erdi
collection PubMed
description In the context of the current global pandemic and the limitations of the RT-PCR test, we propose a novel deep learning architecture, DFCN (Denoising Fully Connected Network). Since medical facilities around the world differ enormously in what laboratory tests or chest imaging may be available, DFCN is designed to be robust to missing input data. An ablation study extensively evaluates the performance benefits of the DFCN as well as its robustness to missing inputs. Data from 1088 patients with confirmed RT-PCR results are obtained from two independent medical facilities. The data includes results from 27 laboratory tests and a chest x-ray scored by a deep learning model. Training and test datasets are taken from different medical facilities. Data is made publicly available. The performance of DFCN in predicting the RT-PCR result is compared with 3 related architectures as well as a Random Forest baseline. All models are trained with varying levels of masked input data to encourage robustness to missing inputs. Missing data is simulated at test time by masking inputs randomly. DFCN outperforms all other models with statistical significance using random subsets of input data with 2-27 available inputs. When all 28 inputs are available DFCN obtains an AUC of 0.924, higher than any other model. Furthermore, with clinically meaningful subsets of parameters consisting of just 6 and 7 inputs respectively, DFCN achieves higher AUCs than any other model, with values of 0.909 and 0.919.
format Online
Article
Text
id pubmed-8323880
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-83238802021-07-31 Deep learning with robustness to missing data: A novel approach to the detection of COVID-19 Çallı, Erdi Murphy, Keelin Kurstjens, Steef Samson, Tijs Herpers, Robert Smits, Henk Rutten, Matthieu van Ginneken, Bram PLoS One Research Article In the context of the current global pandemic and the limitations of the RT-PCR test, we propose a novel deep learning architecture, DFCN (Denoising Fully Connected Network). Since medical facilities around the world differ enormously in what laboratory tests or chest imaging may be available, DFCN is designed to be robust to missing input data. An ablation study extensively evaluates the performance benefits of the DFCN as well as its robustness to missing inputs. Data from 1088 patients with confirmed RT-PCR results are obtained from two independent medical facilities. The data includes results from 27 laboratory tests and a chest x-ray scored by a deep learning model. Training and test datasets are taken from different medical facilities. Data is made publicly available. The performance of DFCN in predicting the RT-PCR result is compared with 3 related architectures as well as a Random Forest baseline. All models are trained with varying levels of masked input data to encourage robustness to missing inputs. Missing data is simulated at test time by masking inputs randomly. DFCN outperforms all other models with statistical significance using random subsets of input data with 2-27 available inputs. When all 28 inputs are available DFCN obtains an AUC of 0.924, higher than any other model. Furthermore, with clinically meaningful subsets of parameters consisting of just 6 and 7 inputs respectively, DFCN achieves higher AUCs than any other model, with values of 0.909 and 0.919. Public Library of Science 2021-07-30 /pmc/articles/PMC8323880/ /pubmed/34329354 http://dx.doi.org/10.1371/journal.pone.0255301 Text en © 2021 Çallı et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Çallı, Erdi
Murphy, Keelin
Kurstjens, Steef
Samson, Tijs
Herpers, Robert
Smits, Henk
Rutten, Matthieu
van Ginneken, Bram
Deep learning with robustness to missing data: A novel approach to the detection of COVID-19
title Deep learning with robustness to missing data: A novel approach to the detection of COVID-19
title_full Deep learning with robustness to missing data: A novel approach to the detection of COVID-19
title_fullStr Deep learning with robustness to missing data: A novel approach to the detection of COVID-19
title_full_unstemmed Deep learning with robustness to missing data: A novel approach to the detection of COVID-19
title_short Deep learning with robustness to missing data: A novel approach to the detection of COVID-19
title_sort deep learning with robustness to missing data: a novel approach to the detection of covid-19
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8323880/
https://www.ncbi.nlm.nih.gov/pubmed/34329354
http://dx.doi.org/10.1371/journal.pone.0255301
work_keys_str_mv AT callıerdi deeplearningwithrobustnesstomissingdataanovelapproachtothedetectionofcovid19
AT murphykeelin deeplearningwithrobustnesstomissingdataanovelapproachtothedetectionofcovid19
AT kurstjenssteef deeplearningwithrobustnesstomissingdataanovelapproachtothedetectionofcovid19
AT samsontijs deeplearningwithrobustnesstomissingdataanovelapproachtothedetectionofcovid19
AT herpersrobert deeplearningwithrobustnesstomissingdataanovelapproachtothedetectionofcovid19
AT smitshenk deeplearningwithrobustnesstomissingdataanovelapproachtothedetectionofcovid19
AT ruttenmatthieu deeplearningwithrobustnesstomissingdataanovelapproachtothedetectionofcovid19
AT vanginnekenbram deeplearningwithrobustnesstomissingdataanovelapproachtothedetectionofcovid19