Cargando…

Machine learning prediction for mortality of patients diagnosed with COVID-19: a nationwide Korean cohort study

The rapid spread of COVID-19 has resulted in the shortage of medical resources, which necessitates accurate prognosis prediction to triage patients effectively. This study used the nationwide cohort of South Korea to develop a machine learning model to predict prognosis based on sociodemographic and...

Descripción completa

Detalles Bibliográficos
Autores principales: An, Chansik, Lim, Hyunsun, Kim, Dong-Wook, Chang, Jung Hyun, Choi, Yoon Jung, Kim, Seong Woo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7599238/
https://www.ncbi.nlm.nih.gov/pubmed/33127965
http://dx.doi.org/10.1038/s41598-020-75767-2
_version_ 1783602830004715520
author An, Chansik
Lim, Hyunsun
Kim, Dong-Wook
Chang, Jung Hyun
Choi, Yoon Jung
Kim, Seong Woo
author_facet An, Chansik
Lim, Hyunsun
Kim, Dong-Wook
Chang, Jung Hyun
Choi, Yoon Jung
Kim, Seong Woo
author_sort An, Chansik
collection PubMed
description The rapid spread of COVID-19 has resulted in the shortage of medical resources, which necessitates accurate prognosis prediction to triage patients effectively. This study used the nationwide cohort of South Korea to develop a machine learning model to predict prognosis based on sociodemographic and medical information. Of 10,237 COVID-19 patients, 228 (2.2%) died, 7772 (75.9%) recovered, and 2237 (21.9%) were still in isolation or being treated at the last follow-up (April 16, 2020). The Cox proportional hazards regression analysis revealed that age > 70, male sex, moderate or severe disability, the presence of symptoms, nursing home residence, and comorbidities of diabetes mellitus (DM), chronic lung disease, or asthma were significantly associated with increased risk of mortality (p ≤ 0.047). For machine learning, the least absolute shrinkage and selection operator (LASSO), linear support vector machine (SVM), SVM with radial basis function kernel, random forest (RF), and k-nearest neighbors were tested. In prediction of mortality, LASSO and linear SVM demonstrated high sensitivities (90.7% [95% confidence interval: 83.3, 97.3] and 92.0% [85.9, 98.1], respectively) and specificities (91.4% [90.3, 92.5] and 91.8%, [90.7, 92.9], respectively) while maintaining high specificities > 90%, as well as high area under the receiver operating characteristics curves (0.963 [0.946, 0.979] and 0.962 [0.945, 0.979], respectively). The most significant predictors for LASSO included old age and preexisting DM or cancer; for RF they were old age, infection route (cluster infection or infection from personal contact), and underlying hypertension. The proposed prediction model may be helpful for the quick triage of patients without having to wait for the results of additional tests such as laboratory or radiologic studies, during a pandemic when limited medical resources must be wisely allocated without hesitation.
format Online
Article
Text
id pubmed-7599238
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-75992382020-11-03 Machine learning prediction for mortality of patients diagnosed with COVID-19: a nationwide Korean cohort study An, Chansik Lim, Hyunsun Kim, Dong-Wook Chang, Jung Hyun Choi, Yoon Jung Kim, Seong Woo Sci Rep Article The rapid spread of COVID-19 has resulted in the shortage of medical resources, which necessitates accurate prognosis prediction to triage patients effectively. This study used the nationwide cohort of South Korea to develop a machine learning model to predict prognosis based on sociodemographic and medical information. Of 10,237 COVID-19 patients, 228 (2.2%) died, 7772 (75.9%) recovered, and 2237 (21.9%) were still in isolation or being treated at the last follow-up (April 16, 2020). The Cox proportional hazards regression analysis revealed that age > 70, male sex, moderate or severe disability, the presence of symptoms, nursing home residence, and comorbidities of diabetes mellitus (DM), chronic lung disease, or asthma were significantly associated with increased risk of mortality (p ≤ 0.047). For machine learning, the least absolute shrinkage and selection operator (LASSO), linear support vector machine (SVM), SVM with radial basis function kernel, random forest (RF), and k-nearest neighbors were tested. In prediction of mortality, LASSO and linear SVM demonstrated high sensitivities (90.7% [95% confidence interval: 83.3, 97.3] and 92.0% [85.9, 98.1], respectively) and specificities (91.4% [90.3, 92.5] and 91.8%, [90.7, 92.9], respectively) while maintaining high specificities > 90%, as well as high area under the receiver operating characteristics curves (0.963 [0.946, 0.979] and 0.962 [0.945, 0.979], respectively). The most significant predictors for LASSO included old age and preexisting DM or cancer; for RF they were old age, infection route (cluster infection or infection from personal contact), and underlying hypertension. The proposed prediction model may be helpful for the quick triage of patients without having to wait for the results of additional tests such as laboratory or radiologic studies, during a pandemic when limited medical resources must be wisely allocated without hesitation. Nature Publishing Group UK 2020-10-30 /pmc/articles/PMC7599238/ /pubmed/33127965 http://dx.doi.org/10.1038/s41598-020-75767-2 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
An, Chansik
Lim, Hyunsun
Kim, Dong-Wook
Chang, Jung Hyun
Choi, Yoon Jung
Kim, Seong Woo
Machine learning prediction for mortality of patients diagnosed with COVID-19: a nationwide Korean cohort study
title Machine learning prediction for mortality of patients diagnosed with COVID-19: a nationwide Korean cohort study
title_full Machine learning prediction for mortality of patients diagnosed with COVID-19: a nationwide Korean cohort study
title_fullStr Machine learning prediction for mortality of patients diagnosed with COVID-19: a nationwide Korean cohort study
title_full_unstemmed Machine learning prediction for mortality of patients diagnosed with COVID-19: a nationwide Korean cohort study
title_short Machine learning prediction for mortality of patients diagnosed with COVID-19: a nationwide Korean cohort study
title_sort machine learning prediction for mortality of patients diagnosed with covid-19: a nationwide korean cohort study
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7599238/
https://www.ncbi.nlm.nih.gov/pubmed/33127965
http://dx.doi.org/10.1038/s41598-020-75767-2
work_keys_str_mv AT anchansik machinelearningpredictionformortalityofpatientsdiagnosedwithcovid19anationwidekoreancohortstudy
AT limhyunsun machinelearningpredictionformortalityofpatientsdiagnosedwithcovid19anationwidekoreancohortstudy
AT kimdongwook machinelearningpredictionformortalityofpatientsdiagnosedwithcovid19anationwidekoreancohortstudy
AT changjunghyun machinelearningpredictionformortalityofpatientsdiagnosedwithcovid19anationwidekoreancohortstudy
AT choiyoonjung machinelearningpredictionformortalityofpatientsdiagnosedwithcovid19anationwidekoreancohortstudy
AT kimseongwoo machinelearningpredictionformortalityofpatientsdiagnosedwithcovid19anationwidekoreancohortstudy