Cargando…

Early risk assessment for COVID-19 patients from emergency department data using machine learning

Since its emergence in late 2019, the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has caused a pandemic with more than 55 million reported cases and 1.3 million estimated deaths worldwide. While epidemiological and clinical characteristics of COVID-19 have been reported, risk factor...

Descripción completa

Detalles Bibliográficos
Autores principales: Heldt, Frank S., Vizcaychipi, Marcela P., Peacock, Sophie, Cinelli, Mattia, McLachlan, Lachlan, Andreotti, Fernando, Jovanović, Stojan, Dürichen, Robert, Lipunova, Nadezda, Fletcher, Robert A., Hancock, Anne, McCarthy, Alex, Pointon, Richard A., Brown, Alexander, Eaton, James, Liddi, Roberto, Mackillop, Lucy, Tarassenko, Lionel, Khan, Rabia T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7892838/
https://www.ncbi.nlm.nih.gov/pubmed/33603086
http://dx.doi.org/10.1038/s41598-021-83784-y
_version_ 1783652932245258240
author Heldt, Frank S.
Vizcaychipi, Marcela P.
Peacock, Sophie
Cinelli, Mattia
McLachlan, Lachlan
Andreotti, Fernando
Jovanović, Stojan
Dürichen, Robert
Lipunova, Nadezda
Fletcher, Robert A.
Hancock, Anne
McCarthy, Alex
Pointon, Richard A.
Brown, Alexander
Eaton, James
Liddi, Roberto
Mackillop, Lucy
Tarassenko, Lionel
Khan, Rabia T.
author_facet Heldt, Frank S.
Vizcaychipi, Marcela P.
Peacock, Sophie
Cinelli, Mattia
McLachlan, Lachlan
Andreotti, Fernando
Jovanović, Stojan
Dürichen, Robert
Lipunova, Nadezda
Fletcher, Robert A.
Hancock, Anne
McCarthy, Alex
Pointon, Richard A.
Brown, Alexander
Eaton, James
Liddi, Roberto
Mackillop, Lucy
Tarassenko, Lionel
Khan, Rabia T.
author_sort Heldt, Frank S.
collection PubMed
description Since its emergence in late 2019, the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has caused a pandemic with more than 55 million reported cases and 1.3 million estimated deaths worldwide. While epidemiological and clinical characteristics of COVID-19 have been reported, risk factors underlying the transition from mild to severe disease among patients remain poorly understood. In this retrospective study, we analysed data of 879 confirmed SARS-CoV-2 positive patients admitted to a two-site NHS Trust hospital in London, England, between January 1st and May 26th, 2020, with a majority of cases occurring in March and April. We extracted anonymised demographic data, physiological clinical variables and laboratory results from electronic healthcare records (EHR) and applied multivariate logistic regression, random forest and extreme gradient boosted trees. To evaluate the potential for early risk assessment, we used data available during patients’ initial presentation at the emergency department (ED) to predict deterioration to one of three clinical endpoints in the remainder of the hospital stay: admission to intensive care, need for invasive mechanical ventilation and in-hospital mortality. Based on the trained models, we extracted the most informative clinical features in determining these patient trajectories. Considering our inclusion criteria, we have identified 129 of 879 (15%) patients that required intensive care, 62 of 878 (7%) patients needing mechanical ventilation, and 193 of 619 (31%) cases of in-hospital mortality. Our models learned successfully from early clinical data and predicted clinical endpoints with high accuracy, the best model achieving area under the receiver operating characteristic (AUC-ROC) scores of 0.76 to 0.87 (F1 scores of 0.42–0.60). Younger patient age was associated with an increased risk of receiving intensive care and ventilation, but lower risk of mortality. Clinical indicators of a patient’s oxygen supply and selected laboratory results, such as blood lactate and creatinine levels, were most predictive of COVID-19 patient trajectories. Among COVID-19 patients machine learning can aid in the early identification of those with a poor prognosis, using EHR data collected during a patient’s first presentation at ED. Patient age and measures of oxygenation status during ED stay are primary indicators of poor patient outcomes.
format Online
Article
Text
id pubmed-7892838
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-78928382021-02-23 Early risk assessment for COVID-19 patients from emergency department data using machine learning Heldt, Frank S. Vizcaychipi, Marcela P. Peacock, Sophie Cinelli, Mattia McLachlan, Lachlan Andreotti, Fernando Jovanović, Stojan Dürichen, Robert Lipunova, Nadezda Fletcher, Robert A. Hancock, Anne McCarthy, Alex Pointon, Richard A. Brown, Alexander Eaton, James Liddi, Roberto Mackillop, Lucy Tarassenko, Lionel Khan, Rabia T. Sci Rep Article Since its emergence in late 2019, the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has caused a pandemic with more than 55 million reported cases and 1.3 million estimated deaths worldwide. While epidemiological and clinical characteristics of COVID-19 have been reported, risk factors underlying the transition from mild to severe disease among patients remain poorly understood. In this retrospective study, we analysed data of 879 confirmed SARS-CoV-2 positive patients admitted to a two-site NHS Trust hospital in London, England, between January 1st and May 26th, 2020, with a majority of cases occurring in March and April. We extracted anonymised demographic data, physiological clinical variables and laboratory results from electronic healthcare records (EHR) and applied multivariate logistic regression, random forest and extreme gradient boosted trees. To evaluate the potential for early risk assessment, we used data available during patients’ initial presentation at the emergency department (ED) to predict deterioration to one of three clinical endpoints in the remainder of the hospital stay: admission to intensive care, need for invasive mechanical ventilation and in-hospital mortality. Based on the trained models, we extracted the most informative clinical features in determining these patient trajectories. Considering our inclusion criteria, we have identified 129 of 879 (15%) patients that required intensive care, 62 of 878 (7%) patients needing mechanical ventilation, and 193 of 619 (31%) cases of in-hospital mortality. Our models learned successfully from early clinical data and predicted clinical endpoints with high accuracy, the best model achieving area under the receiver operating characteristic (AUC-ROC) scores of 0.76 to 0.87 (F1 scores of 0.42–0.60). Younger patient age was associated with an increased risk of receiving intensive care and ventilation, but lower risk of mortality. Clinical indicators of a patient’s oxygen supply and selected laboratory results, such as blood lactate and creatinine levels, were most predictive of COVID-19 patient trajectories. Among COVID-19 patients machine learning can aid in the early identification of those with a poor prognosis, using EHR data collected during a patient’s first presentation at ED. Patient age and measures of oxygenation status during ED stay are primary indicators of poor patient outcomes. Nature Publishing Group UK 2021-02-18 /pmc/articles/PMC7892838/ /pubmed/33603086 http://dx.doi.org/10.1038/s41598-021-83784-y Text en © The Author(s) 2021 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Heldt, Frank S.
Vizcaychipi, Marcela P.
Peacock, Sophie
Cinelli, Mattia
McLachlan, Lachlan
Andreotti, Fernando
Jovanović, Stojan
Dürichen, Robert
Lipunova, Nadezda
Fletcher, Robert A.
Hancock, Anne
McCarthy, Alex
Pointon, Richard A.
Brown, Alexander
Eaton, James
Liddi, Roberto
Mackillop, Lucy
Tarassenko, Lionel
Khan, Rabia T.
Early risk assessment for COVID-19 patients from emergency department data using machine learning
title Early risk assessment for COVID-19 patients from emergency department data using machine learning
title_full Early risk assessment for COVID-19 patients from emergency department data using machine learning
title_fullStr Early risk assessment for COVID-19 patients from emergency department data using machine learning
title_full_unstemmed Early risk assessment for COVID-19 patients from emergency department data using machine learning
title_short Early risk assessment for COVID-19 patients from emergency department data using machine learning
title_sort early risk assessment for covid-19 patients from emergency department data using machine learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7892838/
https://www.ncbi.nlm.nih.gov/pubmed/33603086
http://dx.doi.org/10.1038/s41598-021-83784-y
work_keys_str_mv AT heldtfranks earlyriskassessmentforcovid19patientsfromemergencydepartmentdatausingmachinelearning
AT vizcaychipimarcelap earlyriskassessmentforcovid19patientsfromemergencydepartmentdatausingmachinelearning
AT peacocksophie earlyriskassessmentforcovid19patientsfromemergencydepartmentdatausingmachinelearning
AT cinellimattia earlyriskassessmentforcovid19patientsfromemergencydepartmentdatausingmachinelearning
AT mclachlanlachlan earlyriskassessmentforcovid19patientsfromemergencydepartmentdatausingmachinelearning
AT andreottifernando earlyriskassessmentforcovid19patientsfromemergencydepartmentdatausingmachinelearning
AT jovanovicstojan earlyriskassessmentforcovid19patientsfromemergencydepartmentdatausingmachinelearning
AT durichenrobert earlyriskassessmentforcovid19patientsfromemergencydepartmentdatausingmachinelearning
AT lipunovanadezda earlyriskassessmentforcovid19patientsfromemergencydepartmentdatausingmachinelearning
AT fletcherroberta earlyriskassessmentforcovid19patientsfromemergencydepartmentdatausingmachinelearning
AT hancockanne earlyriskassessmentforcovid19patientsfromemergencydepartmentdatausingmachinelearning
AT mccarthyalex earlyriskassessmentforcovid19patientsfromemergencydepartmentdatausingmachinelearning
AT pointonricharda earlyriskassessmentforcovid19patientsfromemergencydepartmentdatausingmachinelearning
AT brownalexander earlyriskassessmentforcovid19patientsfromemergencydepartmentdatausingmachinelearning
AT eatonjames earlyriskassessmentforcovid19patientsfromemergencydepartmentdatausingmachinelearning
AT liddiroberto earlyriskassessmentforcovid19patientsfromemergencydepartmentdatausingmachinelearning
AT mackilloplucy earlyriskassessmentforcovid19patientsfromemergencydepartmentdatausingmachinelearning
AT tarassenkolionel earlyriskassessmentforcovid19patientsfromemergencydepartmentdatausingmachinelearning
AT khanrabiat earlyriskassessmentforcovid19patientsfromemergencydepartmentdatausingmachinelearning