Cargando…
Statistical Analysis and Machine Learning Prediction of Disease Outcomes for COVID-19 and Pneumonia Patients
The Coronavirus Disease 2019 (COVID-19) has spread all over the world and impacted many people’s lives. The characteristics of COVID-19 and other types of pneumonia have both similarities and differences, which confused doctors initially to separate and understand them. Here we presented a retrospec...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9063041/ https://www.ncbi.nlm.nih.gov/pubmed/35521216 http://dx.doi.org/10.3389/fcimb.2022.838749 |
_version_ | 1784699082344759296 |
---|---|
author | Zhao, Yu Zhang, Rusen Zhong, Yi Wang, Jingjing Weng, Zuquan Luo, Heng Chen, Cunrong |
author_facet | Zhao, Yu Zhang, Rusen Zhong, Yi Wang, Jingjing Weng, Zuquan Luo, Heng Chen, Cunrong |
author_sort | Zhao, Yu |
collection | PubMed |
description | The Coronavirus Disease 2019 (COVID-19) has spread all over the world and impacted many people’s lives. The characteristics of COVID-19 and other types of pneumonia have both similarities and differences, which confused doctors initially to separate and understand them. Here we presented a retrospective analysis for both COVID-19 and other types of pneumonia by combining the COVID-19 clinical data, eICU and MIMIC-III databases. Machine learning models, including logistic regression, random forest, XGBoost and deep learning neural networks, were developed to predict the severity of COVID-19 infections as well as the mortality of pneumonia patients in intensive care units (ICU). Statistical analysis and feature interpretation, including the analysis of two-level attention mechanisms on both temporal and non-temporal features, were utilized to understand the associations between different clinical variables and disease outcomes. For the COVID-19 data, the XGBoost model obtained the best performance on the test set (AUROC = 1.000 and AUPRC = 0.833). On the MIMIC-III and eICU pneumonia datasets, our deep learning model (Bi-LSTM_Attn) was able to identify clinical variables associated with death of pneumonia patients (AUROC = 0.924 and AUPRC = 0.802 for 24-hour observation window and 12-hour prediction window). The results highlighted clinical indicators, such as the lymphocyte counts, that may help the doctors to predict the disease progression and outcomes for both COVID-19 and other types of pneumonia. |
format | Online Article Text |
id | pubmed-9063041 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-90630412022-05-04 Statistical Analysis and Machine Learning Prediction of Disease Outcomes for COVID-19 and Pneumonia Patients Zhao, Yu Zhang, Rusen Zhong, Yi Wang, Jingjing Weng, Zuquan Luo, Heng Chen, Cunrong Front Cell Infect Microbiol Cellular and Infection Microbiology The Coronavirus Disease 2019 (COVID-19) has spread all over the world and impacted many people’s lives. The characteristics of COVID-19 and other types of pneumonia have both similarities and differences, which confused doctors initially to separate and understand them. Here we presented a retrospective analysis for both COVID-19 and other types of pneumonia by combining the COVID-19 clinical data, eICU and MIMIC-III databases. Machine learning models, including logistic regression, random forest, XGBoost and deep learning neural networks, were developed to predict the severity of COVID-19 infections as well as the mortality of pneumonia patients in intensive care units (ICU). Statistical analysis and feature interpretation, including the analysis of two-level attention mechanisms on both temporal and non-temporal features, were utilized to understand the associations between different clinical variables and disease outcomes. For the COVID-19 data, the XGBoost model obtained the best performance on the test set (AUROC = 1.000 and AUPRC = 0.833). On the MIMIC-III and eICU pneumonia datasets, our deep learning model (Bi-LSTM_Attn) was able to identify clinical variables associated with death of pneumonia patients (AUROC = 0.924 and AUPRC = 0.802 for 24-hour observation window and 12-hour prediction window). The results highlighted clinical indicators, such as the lymphocyte counts, that may help the doctors to predict the disease progression and outcomes for both COVID-19 and other types of pneumonia. Frontiers Media S.A. 2022-04-19 /pmc/articles/PMC9063041/ /pubmed/35521216 http://dx.doi.org/10.3389/fcimb.2022.838749 Text en Copyright © 2022 Zhao, Zhang, Zhong, Wang, Weng, Luo and Chen https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Cellular and Infection Microbiology Zhao, Yu Zhang, Rusen Zhong, Yi Wang, Jingjing Weng, Zuquan Luo, Heng Chen, Cunrong Statistical Analysis and Machine Learning Prediction of Disease Outcomes for COVID-19 and Pneumonia Patients |
title | Statistical Analysis and Machine Learning Prediction of Disease Outcomes for COVID-19 and Pneumonia Patients |
title_full | Statistical Analysis and Machine Learning Prediction of Disease Outcomes for COVID-19 and Pneumonia Patients |
title_fullStr | Statistical Analysis and Machine Learning Prediction of Disease Outcomes for COVID-19 and Pneumonia Patients |
title_full_unstemmed | Statistical Analysis and Machine Learning Prediction of Disease Outcomes for COVID-19 and Pneumonia Patients |
title_short | Statistical Analysis and Machine Learning Prediction of Disease Outcomes for COVID-19 and Pneumonia Patients |
title_sort | statistical analysis and machine learning prediction of disease outcomes for covid-19 and pneumonia patients |
topic | Cellular and Infection Microbiology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9063041/ https://www.ncbi.nlm.nih.gov/pubmed/35521216 http://dx.doi.org/10.3389/fcimb.2022.838749 |
work_keys_str_mv | AT zhaoyu statisticalanalysisandmachinelearningpredictionofdiseaseoutcomesforcovid19andpneumoniapatients AT zhangrusen statisticalanalysisandmachinelearningpredictionofdiseaseoutcomesforcovid19andpneumoniapatients AT zhongyi statisticalanalysisandmachinelearningpredictionofdiseaseoutcomesforcovid19andpneumoniapatients AT wangjingjing statisticalanalysisandmachinelearningpredictionofdiseaseoutcomesforcovid19andpneumoniapatients AT wengzuquan statisticalanalysisandmachinelearningpredictionofdiseaseoutcomesforcovid19andpneumoniapatients AT luoheng statisticalanalysisandmachinelearningpredictionofdiseaseoutcomesforcovid19andpneumoniapatients AT chencunrong statisticalanalysisandmachinelearningpredictionofdiseaseoutcomesforcovid19andpneumoniapatients |