Cargando…

Machine learning-based prediction models for home discharge in patients with COVID-19: Development and evaluation using electronic health records

OBJECTIVE: This study aimed to develop and validate predictive models using electronic health records (EHR) data to determine whether hospitalized COVID-19-positive patients would be admitted to alternative medical care or discharged home. METHODS: We conducted a retrospective cohort study using dei...

Descripción completa

Detalles Bibliográficos
Autores principales: Zapata, Ruben D., Huang, Shu, Morris, Earl, Wang, Chang, Harle, Christopher, Magoc, Tanja, Mardini, Mamoun, Loftus, Tyler, Modave, François
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10588875/
https://www.ncbi.nlm.nih.gov/pubmed/37862334
http://dx.doi.org/10.1371/journal.pone.0292888
_version_ 1785123671803691008
author Zapata, Ruben D.
Huang, Shu
Morris, Earl
Wang, Chang
Harle, Christopher
Magoc, Tanja
Mardini, Mamoun
Loftus, Tyler
Modave, François
author_facet Zapata, Ruben D.
Huang, Shu
Morris, Earl
Wang, Chang
Harle, Christopher
Magoc, Tanja
Mardini, Mamoun
Loftus, Tyler
Modave, François
author_sort Zapata, Ruben D.
collection PubMed
description OBJECTIVE: This study aimed to develop and validate predictive models using electronic health records (EHR) data to determine whether hospitalized COVID-19-positive patients would be admitted to alternative medical care or discharged home. METHODS: We conducted a retrospective cohort study using deidentified data from the University of Florida Health Integrated Data Repository. The study included 1,578 adult patients (≥18 years) who tested positive for COVID-19 while hospitalized, comprising 960 (60.8%) female patients with a mean (SD) age of 51.86 (18.49) years and 618 (39.2%) male patients with a mean (SD) age of 54.35 (18.48) years. Machine learning (ML) model training involved cross-validation to assess their performance in predicting patient disposition. RESULTS: We developed and validated six supervised ML-based prediction models (logistic regression, Gaussian Naïve Bayes, k-nearest neighbors, decision trees, random forest, and support vector machine classifier) to predict patient discharge status. The models were evaluated based on the area under the receiver operating characteristic curve (ROC-AUC), precision, accuracy, F1 score, and Brier score. The random forest classifier exhibited the highest performance, achieving an accuracy of 0.84 and an AUC of 0.72. Logistic regression (accuracy: 0.85, AUC: 0.71), k-nearest neighbor (accuracy: 0.84, AUC: 0.63), decision tree (accuracy: 0.84, AUC: 0.61), Gaussian Naïve Bayes (accuracy: 0.84, AUC: 0.66), and support vector machine classifier (accuracy: 0.84, AUC: 0.67) also demonstrated valuable predictive capabilities. SIGNIFICANCE: This study’s findings are crucial for efficiently allocating healthcare resources during pandemics like COVID-19. By harnessing ML techniques and EHR data, we can create predictive tools to identify patients at greater risk of severe symptoms based on their medical histories. The models developed here serve as a foundation for expanding the toolkit available to healthcare professionals and organizations. Additionally, explainable ML methods, such as Shapley Additive Explanations, aid in uncovering underlying data features that inform healthcare decision-making processes.
format Online
Article
Text
id pubmed-10588875
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-105888752023-10-21 Machine learning-based prediction models for home discharge in patients with COVID-19: Development and evaluation using electronic health records Zapata, Ruben D. Huang, Shu Morris, Earl Wang, Chang Harle, Christopher Magoc, Tanja Mardini, Mamoun Loftus, Tyler Modave, François PLoS One Research Article OBJECTIVE: This study aimed to develop and validate predictive models using electronic health records (EHR) data to determine whether hospitalized COVID-19-positive patients would be admitted to alternative medical care or discharged home. METHODS: We conducted a retrospective cohort study using deidentified data from the University of Florida Health Integrated Data Repository. The study included 1,578 adult patients (≥18 years) who tested positive for COVID-19 while hospitalized, comprising 960 (60.8%) female patients with a mean (SD) age of 51.86 (18.49) years and 618 (39.2%) male patients with a mean (SD) age of 54.35 (18.48) years. Machine learning (ML) model training involved cross-validation to assess their performance in predicting patient disposition. RESULTS: We developed and validated six supervised ML-based prediction models (logistic regression, Gaussian Naïve Bayes, k-nearest neighbors, decision trees, random forest, and support vector machine classifier) to predict patient discharge status. The models were evaluated based on the area under the receiver operating characteristic curve (ROC-AUC), precision, accuracy, F1 score, and Brier score. The random forest classifier exhibited the highest performance, achieving an accuracy of 0.84 and an AUC of 0.72. Logistic regression (accuracy: 0.85, AUC: 0.71), k-nearest neighbor (accuracy: 0.84, AUC: 0.63), decision tree (accuracy: 0.84, AUC: 0.61), Gaussian Naïve Bayes (accuracy: 0.84, AUC: 0.66), and support vector machine classifier (accuracy: 0.84, AUC: 0.67) also demonstrated valuable predictive capabilities. SIGNIFICANCE: This study’s findings are crucial for efficiently allocating healthcare resources during pandemics like COVID-19. By harnessing ML techniques and EHR data, we can create predictive tools to identify patients at greater risk of severe symptoms based on their medical histories. The models developed here serve as a foundation for expanding the toolkit available to healthcare professionals and organizations. Additionally, explainable ML methods, such as Shapley Additive Explanations, aid in uncovering underlying data features that inform healthcare decision-making processes. Public Library of Science 2023-10-20 /pmc/articles/PMC10588875/ /pubmed/37862334 http://dx.doi.org/10.1371/journal.pone.0292888 Text en © 2023 Zapata et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Zapata, Ruben D.
Huang, Shu
Morris, Earl
Wang, Chang
Harle, Christopher
Magoc, Tanja
Mardini, Mamoun
Loftus, Tyler
Modave, François
Machine learning-based prediction models for home discharge in patients with COVID-19: Development and evaluation using electronic health records
title Machine learning-based prediction models for home discharge in patients with COVID-19: Development and evaluation using electronic health records
title_full Machine learning-based prediction models for home discharge in patients with COVID-19: Development and evaluation using electronic health records
title_fullStr Machine learning-based prediction models for home discharge in patients with COVID-19: Development and evaluation using electronic health records
title_full_unstemmed Machine learning-based prediction models for home discharge in patients with COVID-19: Development and evaluation using electronic health records
title_short Machine learning-based prediction models for home discharge in patients with COVID-19: Development and evaluation using electronic health records
title_sort machine learning-based prediction models for home discharge in patients with covid-19: development and evaluation using electronic health records
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10588875/
https://www.ncbi.nlm.nih.gov/pubmed/37862334
http://dx.doi.org/10.1371/journal.pone.0292888
work_keys_str_mv AT zapatarubend machinelearningbasedpredictionmodelsforhomedischargeinpatientswithcovid19developmentandevaluationusingelectronichealthrecords
AT huangshu machinelearningbasedpredictionmodelsforhomedischargeinpatientswithcovid19developmentandevaluationusingelectronichealthrecords
AT morrisearl machinelearningbasedpredictionmodelsforhomedischargeinpatientswithcovid19developmentandevaluationusingelectronichealthrecords
AT wangchang machinelearningbasedpredictionmodelsforhomedischargeinpatientswithcovid19developmentandevaluationusingelectronichealthrecords
AT harlechristopher machinelearningbasedpredictionmodelsforhomedischargeinpatientswithcovid19developmentandevaluationusingelectronichealthrecords
AT magoctanja machinelearningbasedpredictionmodelsforhomedischargeinpatientswithcovid19developmentandevaluationusingelectronichealthrecords
AT mardinimamoun machinelearningbasedpredictionmodelsforhomedischargeinpatientswithcovid19developmentandevaluationusingelectronichealthrecords
AT loftustyler machinelearningbasedpredictionmodelsforhomedischargeinpatientswithcovid19developmentandevaluationusingelectronichealthrecords
AT modavefrancois machinelearningbasedpredictionmodelsforhomedischargeinpatientswithcovid19developmentandevaluationusingelectronichealthrecords