Cargando…

Deep Bayesian Gaussian processes for uncertainty estimation in electronic health records

One major impediment to the wider use of deep learning for clinical decision making is the difficulty of assigning a level of confidence to model predictions. Currently, deep Bayesian neural networks and sparse Gaussian processes are the main two scalable uncertainty estimation methods. However, dee...

Descripción completa

Detalles Bibliográficos
Autores principales:	Li, Yikuan, Rao, Shishir, Hassaine, Abdelaali, Ramakrishnan, Rema, Canoy, Dexter, Salimi-Khorshidi, Gholamreza, Mamouei, Mohammad, Lukasiewicz, Thomas, Rahimi, Kazem
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group UK 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8526832/ https://www.ncbi.nlm.nih.gov/pubmed/34667200 http://dx.doi.org/10.1038/s41598-021-00144-6

_version_	1784585945972998144
author	Li, Yikuan Rao, Shishir Hassaine, Abdelaali Ramakrishnan, Rema Canoy, Dexter Salimi-Khorshidi, Gholamreza Mamouei, Mohammad Lukasiewicz, Thomas Rahimi, Kazem
author_facet	Li, Yikuan Rao, Shishir Hassaine, Abdelaali Ramakrishnan, Rema Canoy, Dexter Salimi-Khorshidi, Gholamreza Mamouei, Mohammad Lukasiewicz, Thomas Rahimi, Kazem
author_sort	Li, Yikuan
collection	PubMed
description	One major impediment to the wider use of deep learning for clinical decision making is the difficulty of assigning a level of confidence to model predictions. Currently, deep Bayesian neural networks and sparse Gaussian processes are the main two scalable uncertainty estimation methods. However, deep Bayesian neural networks suffer from lack of expressiveness, and more expressive models such as deep kernel learning, which is an extension of sparse Gaussian process, captures only the uncertainty from the higher-level latent space. Therefore, the deep learning model under it lacks interpretability and ignores uncertainty from the raw data. In this paper, we merge features of the deep Bayesian learning framework with deep kernel learning to leverage the strengths of both methods for a more comprehensive uncertainty estimation. Through a series of experiments on predicting the first incidence of heart failure, diabetes and depression applied to large-scale electronic medical records, we demonstrate that our method is better at capturing uncertainty than both Gaussian processes and deep Bayesian neural networks in terms of indicating data insufficiency and identifying misclassifications, with a comparable generalization performance. Furthermore, by assessing the accuracy and area under the receiver operating characteristic curve over the predictive probability, we show that our method is less susceptible to making overconfident predictions, especially for the minority class in imbalanced datasets. Finally, we demonstrate how uncertainty information derived by the model can inform risk factor analysis towards model interpretability.
format	Online Article Text
id	pubmed-8526832
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Nature Publishing Group UK
record_format	MEDLINE/PubMed
spelling	pubmed-85268322021-10-22 Deep Bayesian Gaussian processes for uncertainty estimation in electronic health records Li, Yikuan Rao, Shishir Hassaine, Abdelaali Ramakrishnan, Rema Canoy, Dexter Salimi-Khorshidi, Gholamreza Mamouei, Mohammad Lukasiewicz, Thomas Rahimi, Kazem Sci Rep Article One major impediment to the wider use of deep learning for clinical decision making is the difficulty of assigning a level of confidence to model predictions. Currently, deep Bayesian neural networks and sparse Gaussian processes are the main two scalable uncertainty estimation methods. However, deep Bayesian neural networks suffer from lack of expressiveness, and more expressive models such as deep kernel learning, which is an extension of sparse Gaussian process, captures only the uncertainty from the higher-level latent space. Therefore, the deep learning model under it lacks interpretability and ignores uncertainty from the raw data. In this paper, we merge features of the deep Bayesian learning framework with deep kernel learning to leverage the strengths of both methods for a more comprehensive uncertainty estimation. Through a series of experiments on predicting the first incidence of heart failure, diabetes and depression applied to large-scale electronic medical records, we demonstrate that our method is better at capturing uncertainty than both Gaussian processes and deep Bayesian neural networks in terms of indicating data insufficiency and identifying misclassifications, with a comparable generalization performance. Furthermore, by assessing the accuracy and area under the receiver operating characteristic curve over the predictive probability, we show that our method is less susceptible to making overconfident predictions, especially for the minority class in imbalanced datasets. Finally, we demonstrate how uncertainty information derived by the model can inform risk factor analysis towards model interpretability. Nature Publishing Group UK 2021-10-19 /pmc/articles/PMC8526832/ /pubmed/34667200 http://dx.doi.org/10.1038/s41598-021-00144-6 Text en © The Author(s) 2021, corrected publication 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle	Article Li, Yikuan Rao, Shishir Hassaine, Abdelaali Ramakrishnan, Rema Canoy, Dexter Salimi-Khorshidi, Gholamreza Mamouei, Mohammad Lukasiewicz, Thomas Rahimi, Kazem Deep Bayesian Gaussian processes for uncertainty estimation in electronic health records
title	Deep Bayesian Gaussian processes for uncertainty estimation in electronic health records
title_full	Deep Bayesian Gaussian processes for uncertainty estimation in electronic health records
title_fullStr	Deep Bayesian Gaussian processes for uncertainty estimation in electronic health records
title_full_unstemmed	Deep Bayesian Gaussian processes for uncertainty estimation in electronic health records
title_short	Deep Bayesian Gaussian processes for uncertainty estimation in electronic health records
title_sort	deep bayesian gaussian processes for uncertainty estimation in electronic health records
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8526832/ https://www.ncbi.nlm.nih.gov/pubmed/34667200 http://dx.doi.org/10.1038/s41598-021-00144-6
work_keys_str_mv	AT liyikuan deepbayesiangaussianprocessesforuncertaintyestimationinelectronichealthrecords AT raoshishir deepbayesiangaussianprocessesforuncertaintyestimationinelectronichealthrecords AT hassaineabdelaali deepbayesiangaussianprocessesforuncertaintyestimationinelectronichealthrecords AT ramakrishnanrema deepbayesiangaussianprocessesforuncertaintyestimationinelectronichealthrecords AT canoydexter deepbayesiangaussianprocessesforuncertaintyestimationinelectronichealthrecords AT salimikhorshidigholamreza deepbayesiangaussianprocessesforuncertaintyestimationinelectronichealthrecords AT mamoueimohammad deepbayesiangaussianprocessesforuncertaintyestimationinelectronichealthrecords AT lukasiewiczthomas deepbayesiangaussianprocessesforuncertaintyestimationinelectronichealthrecords AT rahimikazem deepbayesiangaussianprocessesforuncertaintyestimationinelectronichealthrecords

Deep Bayesian Gaussian processes for uncertainty estimation in electronic health records

Ejemplares similares