
HealtheDataLab – a cloud computing solution for data science and advanced analytics in healthcare with application to predicting multi-center pediatric readmissions

BACKGROUND: There is a shortage of medical informatics and data science platforms using cloud computing on electronic medical record (EMR) data, and with computing capacity for analyzing big data. We implemented, described, and applied a cloud computing solution utilizing the fast health interoperab...


Bibliographic Details
Main authors: Ehwerhemuepha, Louis; Gasperino, Gary; Bischoff, Nathaniel; Taraman, Sharief; Chang, Anthony; Feaster, William
Format: Online Article Text
Language: English
Published: BioMed Central 2020
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7304122/
https://www.ncbi.nlm.nih.gov/pubmed/32560653
http://dx.doi.org/10.1186/s12911-020-01153-7
_version_ 1783548202886103040
author Ehwerhemuepha, Louis
Gasperino, Gary
Bischoff, Nathaniel
Taraman, Sharief
Chang, Anthony
Feaster, William
author_facet Ehwerhemuepha, Louis
Gasperino, Gary
Bischoff, Nathaniel
Taraman, Sharief
Chang, Anthony
Feaster, William
author_sort Ehwerhemuepha, Louis
collection PubMed
description BACKGROUND: There is a shortage of medical informatics and data science platforms that use cloud computing on electronic medical record (EMR) data and have the computing capacity to analyze big data. We implemented, described, and applied a cloud computing solution utilizing the Fast Healthcare Interoperability Resources (FHIR) standard and a state-of-the-art parallel distributed computing platform for advanced analytics. METHODS: We utilized the architecture of the modern predictive analytics platform called Cerner® HealtheDataLab and described the suite of cloud computing services and Apache projects that it relies on. We validated the platform by replicating and improving on a previous single-institution pediatric readmission study/model and by developing a multi-center model of all-cause readmission for pediatric-age patients using the Cerner® Health Facts Deidentified Database (now updated and referred to as the Cerner Real World Data). Based on a priori inclusion criteria, we retrieved a subset of 1.4 million pediatric encounters from 48 hospitals in the database. We built and analyzed corresponding random forest and multilayer perceptron (MLP) neural network models using HealtheDataLab. RESULTS: Using the HealtheDataLab platform, we developed a random forest model and an MLP model with AUC of 0.8446 (0.8444, 0.8447) and 0.8451 (0.8449, 0.8453), respectively. We showed the distribution of model performance across hospitals and identified a set of novel variables under previous resource utilization and generic medications that may be used to improve existing readmission models. CONCLUSION: Our results suggest that high-performance, elastic cloud computing infrastructures such as the platform presented here can be used to develop highly predictive models on EMR data in a secure and robust environment. This in turn can lead to new clinical insights and discoveries.
format Online
Article
Text
id pubmed-7304122
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-73041222020-06-22 HealtheDataLab – a cloud computing solution for data science and advanced analytics in healthcare with application to predicting multi-center pediatric readmissions Ehwerhemuepha, Louis Gasperino, Gary Bischoff, Nathaniel Taraman, Sharief Chang, Anthony Feaster, William BMC Med Inform Decis Mak Research Article BACKGROUND: There is a shortage of medical informatics and data science platforms using cloud computing on electronic medical record (EMR) data, and with computing capacity for analyzing big data. We implemented, described, and applied a cloud computing solution utilizing the fast health interoperability resources (FHIR) standardization and state-of-the-art parallel distributed computing platform for advanced analytics. METHODS: We utilized the architecture of the modern predictive analytics platform called Cerner® HealtheDataLab and described the suite of cloud computing services and Apache Projects that it relies on. We validated the platform by replicating and improving on a previous single pediatric institution study/model on readmission and developing a multi-center model of all-cause readmission for pediatric-age patients using the Cerner® Health Facts Deidentified Database (now updated and referred to as the Cerner Real World Data). We retrieved a subset of 1.4 million pediatric encounters consisting of 48 hospitals’ data on pediatric encounters in the database based on a priori inclusion criteria. We built and analyzed corresponding random forest and multilayer perceptron (MLP) neural network models using HealtheDataLab. RESULTS: Using the HealtheDataLab platform, we developed a random forest model and multi-layer perceptron model with AUC of 0.8446 (0.8444, 0.8447) and 0.8451 (0.8449, 0.8453) respectively. 
We showed the distribution in model performance across hospitals and identified a set of novel variables under previous resource utilization and generic medications that may be used to improve existing readmission models. CONCLUSION: Our results suggest that high performance, elastic cloud computing infrastructures such as the platform presented here can be used for the development of highly predictive models on EMR data in a secure and robust environment. This in turn can lead to new clinical insights/discoveries. BioMed Central 2020-06-19 /pmc/articles/PMC7304122/ /pubmed/32560653 http://dx.doi.org/10.1186/s12911-020-01153-7 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Ehwerhemuepha, Louis
Gasperino, Gary
Bischoff, Nathaniel
Taraman, Sharief
Chang, Anthony
Feaster, William
HealtheDataLab – a cloud computing solution for data science and advanced analytics in healthcare with application to predicting multi-center pediatric readmissions
title HealtheDataLab – a cloud computing solution for data science and advanced analytics in healthcare with application to predicting multi-center pediatric readmissions
title_full HealtheDataLab – a cloud computing solution for data science and advanced analytics in healthcare with application to predicting multi-center pediatric readmissions
title_fullStr HealtheDataLab – a cloud computing solution for data science and advanced analytics in healthcare with application to predicting multi-center pediatric readmissions
title_full_unstemmed HealtheDataLab – a cloud computing solution for data science and advanced analytics in healthcare with application to predicting multi-center pediatric readmissions
title_short HealtheDataLab – a cloud computing solution for data science and advanced analytics in healthcare with application to predicting multi-center pediatric readmissions
title_sort healthedatalab – a cloud computing solution for data science and advanced analytics in healthcare with application to predicting multi-center pediatric readmissions
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7304122/
https://www.ncbi.nlm.nih.gov/pubmed/32560653
http://dx.doi.org/10.1186/s12911-020-01153-7
work_keys_str_mv AT ehwerhemuephalouis healthedatalabacloudcomputingsolutionfordatascienceandadvancedanalyticsinhealthcarewithapplicationtopredictingmulticenterpediatricreadmissions
AT gasperinogary healthedatalabacloudcomputingsolutionfordatascienceandadvancedanalyticsinhealthcarewithapplicationtopredictingmulticenterpediatricreadmissions
AT bischoffnathaniel healthedatalabacloudcomputingsolutionfordatascienceandadvancedanalyticsinhealthcarewithapplicationtopredictingmulticenterpediatricreadmissions
AT taramansharief healthedatalabacloudcomputingsolutionfordatascienceandadvancedanalyticsinhealthcarewithapplicationtopredictingmulticenterpediatricreadmissions
AT changanthony healthedatalabacloudcomputingsolutionfordatascienceandadvancedanalyticsinhealthcarewithapplicationtopredictingmulticenterpediatricreadmissions
AT feasterwilliam healthedatalabacloudcomputingsolutionfordatascienceandadvancedanalyticsinhealthcarewithapplicationtopredictingmulticenterpediatricreadmissions