Cargando…

Development and Validation of Predictors for the Survival of Patients With COVID-19 Based on Machine Learning

Background: The outbreak of COVID-19 attracted the attention of the whole world. Our study aimed to explore the predictors for the survival of patients with COVID-19 by machine learning. Methods: We conducted a retrospective analysis and used the idea of machine learning to train the data of COVID-1...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Yongfeng, Chen, Qianjun, Liu, Tao, Luo, Ping, Zhou, Yi, Liu, Minghui, Xiong, Bei, Zhou, Fuling
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8493244/
https://www.ncbi.nlm.nih.gov/pubmed/34631727
http://dx.doi.org/10.3389/fmed.2021.683431
_version_ 1784579083110187008
author Zhao, Yongfeng
Chen, Qianjun
Liu, Tao
Luo, Ping
Zhou, Yi
Liu, Minghui
Xiong, Bei
Zhou, Fuling
author_facet Zhao, Yongfeng
Chen, Qianjun
Liu, Tao
Luo, Ping
Zhou, Yi
Liu, Minghui
Xiong, Bei
Zhou, Fuling
author_sort Zhao, Yongfeng
collection PubMed
description Background: The outbreak of COVID-19 attracted the attention of the whole world. Our study aimed to explore the predictors for the survival of patients with COVID-19 by machine learning. Methods: We conducted a retrospective analysis and used the idea of machine learning to train the data of COVID-19 patients in Leishenshan Hospital through the logical regression algorithm provided by scikit-learn. Results: Of 2010 patients, 42 deaths were recorded until March 29, 2020. The mortality rate was 2.09%. There were 6,812 records after data features combination and data arrangement, 3,025 records with high-quality after deleting incomplete data by manual checking, and 5,738 records after data balancing finally by the method of Borderline-1 Smote. The results of 10 times of data training by logistic regression model showed that albumin, saturation of pulse oxygen at admission, alanine aminotransferase, and percentage of neutrophils were possibly associated with the survival of patients. The results of 10 times of data training including age, sex, and height beyond the laboratory measurements showed that percentage of neutrophils, saturation of pulse oxygen at admission, alanine aminotransferase, sex, and albumin were possibly associated with the survival of patients. The rates of precision, recall, and f1-score of the two training models were all higher than 0.9 and relatively stable. Conclusions: We demonstrated that percentage of neutrophils, saturation of pulse oxygen at admission, alanine aminotransferase, sex, and albumin were possibly associated with the survival of patients with COVID-19.
format Online
Article
Text
id pubmed-8493244
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-84932442021-10-07 Development and Validation of Predictors for the Survival of Patients With COVID-19 Based on Machine Learning Zhao, Yongfeng Chen, Qianjun Liu, Tao Luo, Ping Zhou, Yi Liu, Minghui Xiong, Bei Zhou, Fuling Front Med (Lausanne) Medicine Background: The outbreak of COVID-19 attracted the attention of the whole world. Our study aimed to explore the predictors for the survival of patients with COVID-19 by machine learning. Methods: We conducted a retrospective analysis and used the idea of machine learning to train the data of COVID-19 patients in Leishenshan Hospital through the logical regression algorithm provided by scikit-learn. Results: Of 2010 patients, 42 deaths were recorded until March 29, 2020. The mortality rate was 2.09%. There were 6,812 records after data features combination and data arrangement, 3,025 records with high-quality after deleting incomplete data by manual checking, and 5,738 records after data balancing finally by the method of Borderline-1 Smote. The results of 10 times of data training by logistic regression model showed that albumin, saturation of pulse oxygen at admission, alanine aminotransferase, and percentage of neutrophils were possibly associated with the survival of patients. The results of 10 times of data training including age, sex, and height beyond the laboratory measurements showed that percentage of neutrophils, saturation of pulse oxygen at admission, alanine aminotransferase, sex, and albumin were possibly associated with the survival of patients. The rates of precision, recall, and f1-score of the two training models were all higher than 0.9 and relatively stable. Conclusions: We demonstrated that percentage of neutrophils, saturation of pulse oxygen at admission, alanine aminotransferase, sex, and albumin were possibly associated with the survival of patients with COVID-19. Frontiers Media S.A. 2021-09-22 /pmc/articles/PMC8493244/ /pubmed/34631727 http://dx.doi.org/10.3389/fmed.2021.683431 Text en Copyright © 2021 Zhao, Chen, Liu, Luo, Zhou, Liu, Xiong and Zhou. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Medicine
Zhao, Yongfeng
Chen, Qianjun
Liu, Tao
Luo, Ping
Zhou, Yi
Liu, Minghui
Xiong, Bei
Zhou, Fuling
Development and Validation of Predictors for the Survival of Patients With COVID-19 Based on Machine Learning
title Development and Validation of Predictors for the Survival of Patients With COVID-19 Based on Machine Learning
title_full Development and Validation of Predictors for the Survival of Patients With COVID-19 Based on Machine Learning
title_fullStr Development and Validation of Predictors for the Survival of Patients With COVID-19 Based on Machine Learning
title_full_unstemmed Development and Validation of Predictors for the Survival of Patients With COVID-19 Based on Machine Learning
title_short Development and Validation of Predictors for the Survival of Patients With COVID-19 Based on Machine Learning
title_sort development and validation of predictors for the survival of patients with covid-19 based on machine learning
topic Medicine
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8493244/
https://www.ncbi.nlm.nih.gov/pubmed/34631727
http://dx.doi.org/10.3389/fmed.2021.683431
work_keys_str_mv AT zhaoyongfeng developmentandvalidationofpredictorsforthesurvivalofpatientswithcovid19basedonmachinelearning
AT chenqianjun developmentandvalidationofpredictorsforthesurvivalofpatientswithcovid19basedonmachinelearning
AT liutao developmentandvalidationofpredictorsforthesurvivalofpatientswithcovid19basedonmachinelearning
AT luoping developmentandvalidationofpredictorsforthesurvivalofpatientswithcovid19basedonmachinelearning
AT zhouyi developmentandvalidationofpredictorsforthesurvivalofpatientswithcovid19basedonmachinelearning
AT liuminghui developmentandvalidationofpredictorsforthesurvivalofpatientswithcovid19basedonmachinelearning
AT xiongbei developmentandvalidationofpredictorsforthesurvivalofpatientswithcovid19basedonmachinelearning
AT zhoufuling developmentandvalidationofpredictorsforthesurvivalofpatientswithcovid19basedonmachinelearning