Cargando…
A machine-learning parsimonious multivariable predictive model of mortality risk in patients with Covid-19
The COVID-19 pandemic is impressively challenging the healthcare system. Several prognostic models have been validated but few of them are implemented in daily practice. The objective of the study was to validate a machine-learning risk prediction model using easy-to-obtain parameters to help to ide...
Autores principales: | , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8551240/ https://www.ncbi.nlm.nih.gov/pubmed/34707184 http://dx.doi.org/10.1038/s41598-021-99905-6 |
_version_ | 1784591113569435648 |
---|---|
author | Murri, Rita Lenkowicz, Jacopo Masciocchi, Carlotta Iacomini, Chiara Fantoni, Massimo Damiani, Andrea Marchetti, Antonio Sergi, Paolo Domenico Angelo Arcuri, Giovanni Cesario, Alfredo Patarnello, Stefano Antonelli, Massimo Bellantone, Rocco Bernabei, Roberto Boccia, Stefania Calabresi, Paolo Cambieri, Andrea Cauda, Roberto Colosimo, Cesare Crea, Filippo De Maria, Ruggero De Stefano, Valerio Franceschi, Francesco Gasbarrini, Antonio Parolini, Ornella Richeldi, Luca Sanguinetti, Maurizio Urbani, Andrea Zega, Maurizio Scambia, Giovanni Valentini, Vincenzo |
author_facet | Murri, Rita Lenkowicz, Jacopo Masciocchi, Carlotta Iacomini, Chiara Fantoni, Massimo Damiani, Andrea Marchetti, Antonio Sergi, Paolo Domenico Angelo Arcuri, Giovanni Cesario, Alfredo Patarnello, Stefano Antonelli, Massimo Bellantone, Rocco Bernabei, Roberto Boccia, Stefania Calabresi, Paolo Cambieri, Andrea Cauda, Roberto Colosimo, Cesare Crea, Filippo De Maria, Ruggero De Stefano, Valerio Franceschi, Francesco Gasbarrini, Antonio Parolini, Ornella Richeldi, Luca Sanguinetti, Maurizio Urbani, Andrea Zega, Maurizio Scambia, Giovanni Valentini, Vincenzo |
author_sort | Murri, Rita |
collection | PubMed |
description | The COVID-19 pandemic is impressively challenging the healthcare system. Several prognostic models have been validated but few of them are implemented in daily practice. The objective of the study was to validate a machine-learning risk prediction model using easy-to-obtain parameters to help to identify patients with COVID-19 who are at higher risk of death. The training cohort included all patients admitted to Fondazione Policlinico Gemelli with COVID-19 from March 5, 2020, to November 5, 2020. Afterward, the model was tested on all patients admitted to the same hospital with COVID-19 from November 6, 2020, to February 5, 2021. The primary outcome was in-hospital case-fatality risk. The out-of-sample performance of the model was estimated from the training set in terms of Area under the Receiving Operator Curve (AUROC) and classification matrix statistics by averaging the results of fivefold cross validation repeated 3-times and comparing the results with those obtained on the test set. An explanation analysis of the model, based on the SHapley Additive exPlanations (SHAP), is also presented. To assess the subsequent time evolution, the change in paO2/FiO2 (P/F) at 48 h after the baseline measurement was plotted against its baseline value. Among the 921 patients included in the training cohort, 120 died (13%). Variables selected for the model were age, platelet count, SpO2, blood urea nitrogen (BUN), hemoglobin, C-reactive protein, neutrophil count, and sodium. The results of the fivefold cross-validation repeated 3-times gave AUROC of 0.87, and statistics of the classification matrix to the Youden index as follows: sensitivity 0.840, specificity 0.774, negative predictive value 0.971. Then, the model was tested on a new population (n = 1463) in which the case-fatality rate was 22.6%. The test model showed AUROC 0.818, sensitivity 0.813, specificity 0.650, negative predictive value 0.922. Considering the first quartile of the predicted risk score (low-risk score group), the case-fatality rate was 1.6%, 17.8% in the second and third quartile (high-risk score group) and 53.5% in the fourth quartile (very high-risk score group). The three risk score groups showed good discrimination for the P/F value at admission, and a positive correlation was found for the low-risk class to P/F at 48 h after admission (adjusted R-squared = 0.48). We developed a predictive model of death for people with SARS-CoV-2 infection by including only easy-to-obtain variables (abnormal blood count, BUN, C-reactive protein, sodium and lower SpO2). It demonstrated good accuracy and high power of discrimination. The simplicity of the model makes the risk prediction applicable for patients in the Emergency Department, or during hospitalization. Although it is reasonable to assume that the model is also applicable in not-hospitalized persons, only appropriate studies can assess the accuracy of the model also for persons at home. |
format | Online Article Text |
id | pubmed-8551240 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-85512402021-10-28 A machine-learning parsimonious multivariable predictive model of mortality risk in patients with Covid-19 Murri, Rita Lenkowicz, Jacopo Masciocchi, Carlotta Iacomini, Chiara Fantoni, Massimo Damiani, Andrea Marchetti, Antonio Sergi, Paolo Domenico Angelo Arcuri, Giovanni Cesario, Alfredo Patarnello, Stefano Antonelli, Massimo Bellantone, Rocco Bernabei, Roberto Boccia, Stefania Calabresi, Paolo Cambieri, Andrea Cauda, Roberto Colosimo, Cesare Crea, Filippo De Maria, Ruggero De Stefano, Valerio Franceschi, Francesco Gasbarrini, Antonio Parolini, Ornella Richeldi, Luca Sanguinetti, Maurizio Urbani, Andrea Zega, Maurizio Scambia, Giovanni Valentini, Vincenzo Sci Rep Article The COVID-19 pandemic is impressively challenging the healthcare system. Several prognostic models have been validated but few of them are implemented in daily practice. The objective of the study was to validate a machine-learning risk prediction model using easy-to-obtain parameters to help to identify patients with COVID-19 who are at higher risk of death. The training cohort included all patients admitted to Fondazione Policlinico Gemelli with COVID-19 from March 5, 2020, to November 5, 2020. Afterward, the model was tested on all patients admitted to the same hospital with COVID-19 from November 6, 2020, to February 5, 2021. The primary outcome was in-hospital case-fatality risk. The out-of-sample performance of the model was estimated from the training set in terms of Area under the Receiving Operator Curve (AUROC) and classification matrix statistics by averaging the results of fivefold cross validation repeated 3-times and comparing the results with those obtained on the test set. An explanation analysis of the model, based on the SHapley Additive exPlanations (SHAP), is also presented. To assess the subsequent time evolution, the change in paO2/FiO2 (P/F) at 48 h after the baseline measurement was plotted against its baseline value. Among the 921 patients included in the training cohort, 120 died (13%). Variables selected for the model were age, platelet count, SpO2, blood urea nitrogen (BUN), hemoglobin, C-reactive protein, neutrophil count, and sodium. The results of the fivefold cross-validation repeated 3-times gave AUROC of 0.87, and statistics of the classification matrix to the Youden index as follows: sensitivity 0.840, specificity 0.774, negative predictive value 0.971. Then, the model was tested on a new population (n = 1463) in which the case-fatality rate was 22.6%. The test model showed AUROC 0.818, sensitivity 0.813, specificity 0.650, negative predictive value 0.922. Considering the first quartile of the predicted risk score (low-risk score group), the case-fatality rate was 1.6%, 17.8% in the second and third quartile (high-risk score group) and 53.5% in the fourth quartile (very high-risk score group). The three risk score groups showed good discrimination for the P/F value at admission, and a positive correlation was found for the low-risk class to P/F at 48 h after admission (adjusted R-squared = 0.48). We developed a predictive model of death for people with SARS-CoV-2 infection by including only easy-to-obtain variables (abnormal blood count, BUN, C-reactive protein, sodium and lower SpO2). It demonstrated good accuracy and high power of discrimination. The simplicity of the model makes the risk prediction applicable for patients in the Emergency Department, or during hospitalization. Although it is reasonable to assume that the model is also applicable in not-hospitalized persons, only appropriate studies can assess the accuracy of the model also for persons at home. Nature Publishing Group UK 2021-10-27 /pmc/articles/PMC8551240/ /pubmed/34707184 http://dx.doi.org/10.1038/s41598-021-99905-6 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Murri, Rita Lenkowicz, Jacopo Masciocchi, Carlotta Iacomini, Chiara Fantoni, Massimo Damiani, Andrea Marchetti, Antonio Sergi, Paolo Domenico Angelo Arcuri, Giovanni Cesario, Alfredo Patarnello, Stefano Antonelli, Massimo Bellantone, Rocco Bernabei, Roberto Boccia, Stefania Calabresi, Paolo Cambieri, Andrea Cauda, Roberto Colosimo, Cesare Crea, Filippo De Maria, Ruggero De Stefano, Valerio Franceschi, Francesco Gasbarrini, Antonio Parolini, Ornella Richeldi, Luca Sanguinetti, Maurizio Urbani, Andrea Zega, Maurizio Scambia, Giovanni Valentini, Vincenzo A machine-learning parsimonious multivariable predictive model of mortality risk in patients with Covid-19 |
title | A machine-learning parsimonious multivariable predictive model of mortality risk in patients with Covid-19 |
title_full | A machine-learning parsimonious multivariable predictive model of mortality risk in patients with Covid-19 |
title_fullStr | A machine-learning parsimonious multivariable predictive model of mortality risk in patients with Covid-19 |
title_full_unstemmed | A machine-learning parsimonious multivariable predictive model of mortality risk in patients with Covid-19 |
title_short | A machine-learning parsimonious multivariable predictive model of mortality risk in patients with Covid-19 |
title_sort | machine-learning parsimonious multivariable predictive model of mortality risk in patients with covid-19 |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8551240/ https://www.ncbi.nlm.nih.gov/pubmed/34707184 http://dx.doi.org/10.1038/s41598-021-99905-6 |
work_keys_str_mv | AT murririta amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT lenkowiczjacopo amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT masciocchicarlotta amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT iacominichiara amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT fantonimassimo amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT damianiandrea amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT marchettiantonio amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT sergipaolodomenicoangelo amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT arcurigiovanni amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT cesarioalfredo amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT patarnellostefano amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT antonellimassimo amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT bellantonerocco amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT bernabeiroberto amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT bocciastefania amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT calabresipaolo amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT cambieriandrea amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT caudaroberto amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT colosimocesare amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT creafilippo amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT demariaruggero amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT destefanovalerio amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT franceschifrancesco amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT gasbarriniantonio amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT paroliniornella amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT richeldiluca amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT sanguinettimaurizio amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT urbaniandrea amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT zegamaurizio amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT scambiagiovanni amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT valentinivincenzo amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT amachinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT murririta machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT lenkowiczjacopo machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT masciocchicarlotta machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT iacominichiara machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT fantonimassimo machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT damianiandrea machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT marchettiantonio machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT sergipaolodomenicoangelo machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT arcurigiovanni machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT cesarioalfredo machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT patarnellostefano machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT antonellimassimo machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT bellantonerocco machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT bernabeiroberto machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT bocciastefania machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT calabresipaolo machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT cambieriandrea machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT caudaroberto machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT colosimocesare machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT creafilippo machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT demariaruggero machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT destefanovalerio machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT franceschifrancesco machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT gasbarriniantonio machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT paroliniornella machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT richeldiluca machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT sanguinettimaurizio machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT urbaniandrea machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT zegamaurizio machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT scambiagiovanni machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT valentinivincenzo machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 AT machinelearningparsimoniousmultivariablepredictivemodelofmortalityriskinpatientswithcovid19 |