Cargando…

Sparse multi-output Gaussian processes for online medical time series prediction

BACKGROUND: For real-time monitoring of hospital patients, high-quality inference of patients’ health status using all information available from clinical covariates and lab test results is essential to enable successful medical interventions and improve patient outcomes. Developing a computational...

Descripción completa

Detalles Bibliográficos
Autores principales: Cheng, Li-Fang, Dumitrascu, Bianca, Darnell, Gregory, Chivers, Corey, Draugelis, Michael, Li, Kai, Engelhardt, Barbara E
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7341595/
https://www.ncbi.nlm.nih.gov/pubmed/32641134
http://dx.doi.org/10.1186/s12911-020-1069-4
_version_ 1783555270692044800
author Cheng, Li-Fang
Dumitrascu, Bianca
Darnell, Gregory
Chivers, Corey
Draugelis, Michael
Li, Kai
Engelhardt, Barbara E
author_facet Cheng, Li-Fang
Dumitrascu, Bianca
Darnell, Gregory
Chivers, Corey
Draugelis, Michael
Li, Kai
Engelhardt, Barbara E
author_sort Cheng, Li-Fang
collection PubMed
description BACKGROUND: For real-time monitoring of hospital patients, high-quality inference of patients’ health status using all information available from clinical covariates and lab test results is essential to enable successful medical interventions and improve patient outcomes. Developing a computational framework that can learn from observational large-scale electronic health records (EHRs) and make accurate real-time predictions is a critical step. In this work, we develop and explore a Bayesian nonparametric model based on multi-output Gaussian process (GP) regression for hospital patient monitoring. METHODS: We propose MedGP, a statistical framework that incorporates 24 clinical covariates and supports a rich reference data set from which relationships between observed covariates may be inferred and exploited for high-quality inference of patient state over time. To do this, we develop a highly structured sparse GP kernel to enable tractable computation over tens of thousands of time points while estimating correlations among clinical covariates, patients, and periodicity in patient observations. MedGP has a number of benefits over current methods, including (i) not requiring an alignment of the time series data, (ii) quantifying confidence regions in the predictions, (iii) exploiting a vast and rich database of patients, and (iv) inferring interpretable relationships among clinical covariates. RESULTS: We evaluate and compare results from MedGP on the task of online prediction for three patient subgroups from two medical data sets across 8,043 patients. We find MedGP improves online prediction over baseline and state-of-the-art methods for nearly all covariates across different disease subgroups and hospitals. CONCLUSIONS: The MedGP framework is robust and efficient in estimating the temporal dependencies from sparse and irregularly sampled medical time series data for online prediction. The publicly available code is at https://github.com/bee-hive/MedGP.
format Online
Article
Text
id pubmed-7341595
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-73415952020-07-14 Sparse multi-output Gaussian processes for online medical time series prediction Cheng, Li-Fang Dumitrascu, Bianca Darnell, Gregory Chivers, Corey Draugelis, Michael Li, Kai Engelhardt, Barbara E BMC Med Inform Decis Mak Technical Advance BACKGROUND: For real-time monitoring of hospital patients, high-quality inference of patients’ health status using all information available from clinical covariates and lab test results is essential to enable successful medical interventions and improve patient outcomes. Developing a computational framework that can learn from observational large-scale electronic health records (EHRs) and make accurate real-time predictions is a critical step. In this work, we develop and explore a Bayesian nonparametric model based on multi-output Gaussian process (GP) regression for hospital patient monitoring. METHODS: We propose MedGP, a statistical framework that incorporates 24 clinical covariates and supports a rich reference data set from which relationships between observed covariates may be inferred and exploited for high-quality inference of patient state over time. To do this, we develop a highly structured sparse GP kernel to enable tractable computation over tens of thousands of time points while estimating correlations among clinical covariates, patients, and periodicity in patient observations. MedGP has a number of benefits over current methods, including (i) not requiring an alignment of the time series data, (ii) quantifying confidence regions in the predictions, (iii) exploiting a vast and rich database of patients, and (iv) inferring interpretable relationships among clinical covariates. RESULTS: We evaluate and compare results from MedGP on the task of online prediction for three patient subgroups from two medical data sets across 8,043 patients. We find MedGP improves online prediction over baseline and state-of-the-art methods for nearly all covariates across different disease subgroups and hospitals. CONCLUSIONS: The MedGP framework is robust and efficient in estimating the temporal dependencies from sparse and irregularly sampled medical time series data for online prediction. The publicly available code is at https://github.com/bee-hive/MedGP. BioMed Central 2020-07-08 /pmc/articles/PMC7341595/ /pubmed/32641134 http://dx.doi.org/10.1186/s12911-020-1069-4 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visithttp://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Technical Advance
Cheng, Li-Fang
Dumitrascu, Bianca
Darnell, Gregory
Chivers, Corey
Draugelis, Michael
Li, Kai
Engelhardt, Barbara E
Sparse multi-output Gaussian processes for online medical time series prediction
title Sparse multi-output Gaussian processes for online medical time series prediction
title_full Sparse multi-output Gaussian processes for online medical time series prediction
title_fullStr Sparse multi-output Gaussian processes for online medical time series prediction
title_full_unstemmed Sparse multi-output Gaussian processes for online medical time series prediction
title_short Sparse multi-output Gaussian processes for online medical time series prediction
title_sort sparse multi-output gaussian processes for online medical time series prediction
topic Technical Advance
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7341595/
https://www.ncbi.nlm.nih.gov/pubmed/32641134
http://dx.doi.org/10.1186/s12911-020-1069-4
work_keys_str_mv AT chenglifang sparsemultioutputgaussianprocessesforonlinemedicaltimeseriesprediction
AT dumitrascubianca sparsemultioutputgaussianprocessesforonlinemedicaltimeseriesprediction
AT darnellgregory sparsemultioutputgaussianprocessesforonlinemedicaltimeseriesprediction
AT chiverscorey sparsemultioutputgaussianprocessesforonlinemedicaltimeseriesprediction
AT draugelismichael sparsemultioutputgaussianprocessesforonlinemedicaltimeseriesprediction
AT likai sparsemultioutputgaussianprocessesforonlinemedicaltimeseriesprediction
AT engelhardtbarbarae sparsemultioutputgaussianprocessesforonlinemedicaltimeseriesprediction