Cargando…

Machine learning model from a Spanish cohort for prediction of SARS-COV-2 mortality risk and critical patients

Patients affected by SARS-COV-2 have collapsed healthcare systems around the world. Consequently, different challenges arise regarding the prediction of hospital needs, optimization of resources, diagnostic triage tools and patient evolution, as well as tools that allow us to analyze which are the f...

Descripción completa

Detalles Bibliográficos
Autores principales: Reina Reina, Alejandro, Barrera, José M., Valdivieso, Bernardo, Gas, María-Eugenia, Maté, Alejandro, Trujillo, Juan C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8986770/
https://www.ncbi.nlm.nih.gov/pubmed/35388055
http://dx.doi.org/10.1038/s41598-022-09613-y
_version_ 1784682603223187456
author Reina Reina, Alejandro
Barrera, José M.
Valdivieso, Bernardo
Gas, María-Eugenia
Maté, Alejandro
Trujillo, Juan C.
author_facet Reina Reina, Alejandro
Barrera, José M.
Valdivieso, Bernardo
Gas, María-Eugenia
Maté, Alejandro
Trujillo, Juan C.
author_sort Reina Reina, Alejandro
collection PubMed
description Patients affected by SARS-COV-2 have collapsed healthcare systems around the world. Consequently, different challenges arise regarding the prediction of hospital needs, optimization of resources, diagnostic triage tools and patient evolution, as well as tools that allow us to analyze which are the factors that determine the severity of patients. Currently, it is widely accepted that one of the problems since the pandemic appeared was to detect (i) who patients were about to need Intensive Care Unit (ICU) and (ii) who ones were about not overcome the disease. These critical patients collapsed Hospitals to the point that many surgeries around the world had to be cancelled. Therefore, the aim of this paper is to provide a Machine Learning (ML) model that helps us to prevent when a patient is about to be critical. Although we are in the era of data, regarding the SARS-COV-2 patients, there are currently few tools and solutions that help medical professionals to predict the evolution of patients in order to improve their treatment and the needs of critical resources at hospitals. Moreover, most of these tools have been created from small populations and/or Chinese populations, which carries a high risk of bias. In this paper, we present a model, based on ML techniques, based on 5378 Spanish patients’ data from which a quality cohort of 1201 was extracted to train the model. Our model is capable of predicting the probability of death of patients with SARS-COV-2 based on age, sex and comorbidities of the patient. It also allows what-if analysis, with the inclusion of comorbidities that the patient may develop during the SARS-COV-2 infection. For the training of the model, we have followed an agnostic approach. We explored all the active comorbidities during the SARS-COV-2 infection of the patients with the objective that the model weights the effect of each comorbidity on the patient’s evolution according to the data available. The model has been validated by using stratified cross-validation with k = 5 to prevent class imbalance. We obtained robust results, presenting a high hit rate, with 84.16% accuracy, 83.33% sensitivity, and an Area Under the Curve (AUC) of 0.871. The main advantage of our model, in addition to its high success rate, is that it can be used with medical records in order to predict their diagnosis, allowing the critical population to be identified in advance. Furthermore, it uses the International Classification of Diseases, Ninth Revision, Clinical Modification (ICD 9-CM) standard. In this sense, we should also emphasize that those hospitals using other encodings can add an intermediate layer business to business (B2B) with the aim of making transformations to the same international format.
format Online
Article
Text
id pubmed-8986770
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-89867702022-04-08 Machine learning model from a Spanish cohort for prediction of SARS-COV-2 mortality risk and critical patients Reina Reina, Alejandro Barrera, José M. Valdivieso, Bernardo Gas, María-Eugenia Maté, Alejandro Trujillo, Juan C. Sci Rep Article Patients affected by SARS-COV-2 have collapsed healthcare systems around the world. Consequently, different challenges arise regarding the prediction of hospital needs, optimization of resources, diagnostic triage tools and patient evolution, as well as tools that allow us to analyze which are the factors that determine the severity of patients. Currently, it is widely accepted that one of the problems since the pandemic appeared was to detect (i) who patients were about to need Intensive Care Unit (ICU) and (ii) who ones were about not overcome the disease. These critical patients collapsed Hospitals to the point that many surgeries around the world had to be cancelled. Therefore, the aim of this paper is to provide a Machine Learning (ML) model that helps us to prevent when a patient is about to be critical. Although we are in the era of data, regarding the SARS-COV-2 patients, there are currently few tools and solutions that help medical professionals to predict the evolution of patients in order to improve their treatment and the needs of critical resources at hospitals. Moreover, most of these tools have been created from small populations and/or Chinese populations, which carries a high risk of bias. In this paper, we present a model, based on ML techniques, based on 5378 Spanish patients’ data from which a quality cohort of 1201 was extracted to train the model. Our model is capable of predicting the probability of death of patients with SARS-COV-2 based on age, sex and comorbidities of the patient. It also allows what-if analysis, with the inclusion of comorbidities that the patient may develop during the SARS-COV-2 infection. For the training of the model, we have followed an agnostic approach. We explored all the active comorbidities during the SARS-COV-2 infection of the patients with the objective that the model weights the effect of each comorbidity on the patient’s evolution according to the data available. The model has been validated by using stratified cross-validation with k = 5 to prevent class imbalance. We obtained robust results, presenting a high hit rate, with 84.16% accuracy, 83.33% sensitivity, and an Area Under the Curve (AUC) of 0.871. The main advantage of our model, in addition to its high success rate, is that it can be used with medical records in order to predict their diagnosis, allowing the critical population to be identified in advance. Furthermore, it uses the International Classification of Diseases, Ninth Revision, Clinical Modification (ICD 9-CM) standard. In this sense, we should also emphasize that those hospitals using other encodings can add an intermediate layer business to business (B2B) with the aim of making transformations to the same international format. Nature Publishing Group UK 2022-04-06 /pmc/articles/PMC8986770/ /pubmed/35388055 http://dx.doi.org/10.1038/s41598-022-09613-y Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Reina Reina, Alejandro
Barrera, José M.
Valdivieso, Bernardo
Gas, María-Eugenia
Maté, Alejandro
Trujillo, Juan C.
Machine learning model from a Spanish cohort for prediction of SARS-COV-2 mortality risk and critical patients
title Machine learning model from a Spanish cohort for prediction of SARS-COV-2 mortality risk and critical patients
title_full Machine learning model from a Spanish cohort for prediction of SARS-COV-2 mortality risk and critical patients
title_fullStr Machine learning model from a Spanish cohort for prediction of SARS-COV-2 mortality risk and critical patients
title_full_unstemmed Machine learning model from a Spanish cohort for prediction of SARS-COV-2 mortality risk and critical patients
title_short Machine learning model from a Spanish cohort for prediction of SARS-COV-2 mortality risk and critical patients
title_sort machine learning model from a spanish cohort for prediction of sars-cov-2 mortality risk and critical patients
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8986770/
https://www.ncbi.nlm.nih.gov/pubmed/35388055
http://dx.doi.org/10.1038/s41598-022-09613-y
work_keys_str_mv AT reinareinaalejandro machinelearningmodelfromaspanishcohortforpredictionofsarscov2mortalityriskandcriticalpatients
AT barrerajosem machinelearningmodelfromaspanishcohortforpredictionofsarscov2mortalityriskandcriticalpatients
AT valdiviesobernardo machinelearningmodelfromaspanishcohortforpredictionofsarscov2mortalityriskandcriticalpatients
AT gasmariaeugenia machinelearningmodelfromaspanishcohortforpredictionofsarscov2mortalityriskandcriticalpatients
AT matealejandro machinelearningmodelfromaspanishcohortforpredictionofsarscov2mortalityriskandcriticalpatients
AT trujillojuanc machinelearningmodelfromaspanishcohortforpredictionofsarscov2mortalityriskandcriticalpatients