Cargando…
Machine learning model from a Spanish cohort for prediction of SARS-COV-2 mortality risk and critical patients
Patients affected by SARS-COV-2 have collapsed healthcare systems around the world. Consequently, different challenges arise regarding the prediction of hospital needs, optimization of resources, diagnostic triage tools and patient evolution, as well as tools that allow us to analyze which are the f...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8986770/ https://www.ncbi.nlm.nih.gov/pubmed/35388055 http://dx.doi.org/10.1038/s41598-022-09613-y |
_version_ | 1784682603223187456 |
---|---|
author | Reina Reina, Alejandro Barrera, José M. Valdivieso, Bernardo Gas, María-Eugenia Maté, Alejandro Trujillo, Juan C. |
author_facet | Reina Reina, Alejandro Barrera, José M. Valdivieso, Bernardo Gas, María-Eugenia Maté, Alejandro Trujillo, Juan C. |
author_sort | Reina Reina, Alejandro |
collection | PubMed |
description | Patients affected by SARS-COV-2 have collapsed healthcare systems around the world. Consequently, different challenges arise regarding the prediction of hospital needs, optimization of resources, diagnostic triage tools and patient evolution, as well as tools that allow us to analyze which are the factors that determine the severity of patients. Currently, it is widely accepted that one of the problems since the pandemic appeared was to detect (i) who patients were about to need Intensive Care Unit (ICU) and (ii) who ones were about not overcome the disease. These critical patients collapsed Hospitals to the point that many surgeries around the world had to be cancelled. Therefore, the aim of this paper is to provide a Machine Learning (ML) model that helps us to prevent when a patient is about to be critical. Although we are in the era of data, regarding the SARS-COV-2 patients, there are currently few tools and solutions that help medical professionals to predict the evolution of patients in order to improve their treatment and the needs of critical resources at hospitals. Moreover, most of these tools have been created from small populations and/or Chinese populations, which carries a high risk of bias. In this paper, we present a model, based on ML techniques, based on 5378 Spanish patients’ data from which a quality cohort of 1201 was extracted to train the model. Our model is capable of predicting the probability of death of patients with SARS-COV-2 based on age, sex and comorbidities of the patient. It also allows what-if analysis, with the inclusion of comorbidities that the patient may develop during the SARS-COV-2 infection. For the training of the model, we have followed an agnostic approach. We explored all the active comorbidities during the SARS-COV-2 infection of the patients with the objective that the model weights the effect of each comorbidity on the patient’s evolution according to the data available. The model has been validated by using stratified cross-validation with k = 5 to prevent class imbalance. We obtained robust results, presenting a high hit rate, with 84.16% accuracy, 83.33% sensitivity, and an Area Under the Curve (AUC) of 0.871. The main advantage of our model, in addition to its high success rate, is that it can be used with medical records in order to predict their diagnosis, allowing the critical population to be identified in advance. Furthermore, it uses the International Classification of Diseases, Ninth Revision, Clinical Modification (ICD 9-CM) standard. In this sense, we should also emphasize that those hospitals using other encodings can add an intermediate layer business to business (B2B) with the aim of making transformations to the same international format. |
format | Online Article Text |
id | pubmed-8986770 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-89867702022-04-08 Machine learning model from a Spanish cohort for prediction of SARS-COV-2 mortality risk and critical patients Reina Reina, Alejandro Barrera, José M. Valdivieso, Bernardo Gas, María-Eugenia Maté, Alejandro Trujillo, Juan C. Sci Rep Article Patients affected by SARS-COV-2 have collapsed healthcare systems around the world. Consequently, different challenges arise regarding the prediction of hospital needs, optimization of resources, diagnostic triage tools and patient evolution, as well as tools that allow us to analyze which are the factors that determine the severity of patients. Currently, it is widely accepted that one of the problems since the pandemic appeared was to detect (i) who patients were about to need Intensive Care Unit (ICU) and (ii) who ones were about not overcome the disease. These critical patients collapsed Hospitals to the point that many surgeries around the world had to be cancelled. Therefore, the aim of this paper is to provide a Machine Learning (ML) model that helps us to prevent when a patient is about to be critical. Although we are in the era of data, regarding the SARS-COV-2 patients, there are currently few tools and solutions that help medical professionals to predict the evolution of patients in order to improve their treatment and the needs of critical resources at hospitals. Moreover, most of these tools have been created from small populations and/or Chinese populations, which carries a high risk of bias. In this paper, we present a model, based on ML techniques, based on 5378 Spanish patients’ data from which a quality cohort of 1201 was extracted to train the model. Our model is capable of predicting the probability of death of patients with SARS-COV-2 based on age, sex and comorbidities of the patient. It also allows what-if analysis, with the inclusion of comorbidities that the patient may develop during the SARS-COV-2 infection. For the training of the model, we have followed an agnostic approach. We explored all the active comorbidities during the SARS-COV-2 infection of the patients with the objective that the model weights the effect of each comorbidity on the patient’s evolution according to the data available. The model has been validated by using stratified cross-validation with k = 5 to prevent class imbalance. We obtained robust results, presenting a high hit rate, with 84.16% accuracy, 83.33% sensitivity, and an Area Under the Curve (AUC) of 0.871. The main advantage of our model, in addition to its high success rate, is that it can be used with medical records in order to predict their diagnosis, allowing the critical population to be identified in advance. Furthermore, it uses the International Classification of Diseases, Ninth Revision, Clinical Modification (ICD 9-CM) standard. In this sense, we should also emphasize that those hospitals using other encodings can add an intermediate layer business to business (B2B) with the aim of making transformations to the same international format. Nature Publishing Group UK 2022-04-06 /pmc/articles/PMC8986770/ /pubmed/35388055 http://dx.doi.org/10.1038/s41598-022-09613-y Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Reina Reina, Alejandro Barrera, José M. Valdivieso, Bernardo Gas, María-Eugenia Maté, Alejandro Trujillo, Juan C. Machine learning model from a Spanish cohort for prediction of SARS-COV-2 mortality risk and critical patients |
title | Machine learning model from a Spanish cohort for prediction of SARS-COV-2 mortality risk and critical patients |
title_full | Machine learning model from a Spanish cohort for prediction of SARS-COV-2 mortality risk and critical patients |
title_fullStr | Machine learning model from a Spanish cohort for prediction of SARS-COV-2 mortality risk and critical patients |
title_full_unstemmed | Machine learning model from a Spanish cohort for prediction of SARS-COV-2 mortality risk and critical patients |
title_short | Machine learning model from a Spanish cohort for prediction of SARS-COV-2 mortality risk and critical patients |
title_sort | machine learning model from a spanish cohort for prediction of sars-cov-2 mortality risk and critical patients |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8986770/ https://www.ncbi.nlm.nih.gov/pubmed/35388055 http://dx.doi.org/10.1038/s41598-022-09613-y |
work_keys_str_mv | AT reinareinaalejandro machinelearningmodelfromaspanishcohortforpredictionofsarscov2mortalityriskandcriticalpatients AT barrerajosem machinelearningmodelfromaspanishcohortforpredictionofsarscov2mortalityriskandcriticalpatients AT valdiviesobernardo machinelearningmodelfromaspanishcohortforpredictionofsarscov2mortalityriskandcriticalpatients AT gasmariaeugenia machinelearningmodelfromaspanishcohortforpredictionofsarscov2mortalityriskandcriticalpatients AT matealejandro machinelearningmodelfromaspanishcohortforpredictionofsarscov2mortalityriskandcriticalpatients AT trujillojuanc machinelearningmodelfromaspanishcohortforpredictionofsarscov2mortalityriskandcriticalpatients |