Cargando…
Machine learning for prediction of in-hospital mortality in lung cancer patients admitted to intensive care unit
BACKGROUNDS: The in-hospital mortality in lung cancer patients admitted to intensive care unit (ICU) is extremely high. This study intended to adopt machine learning algorithm models to predict in-hospital mortality of critically ill lung cancer for providing relative information in clinical decisio...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9879439/ https://www.ncbi.nlm.nih.gov/pubmed/36701342 http://dx.doi.org/10.1371/journal.pone.0280606 |
_version_ | 1784878688633880576 |
---|---|
author | Huang, Tianzhi Le, Dejin Yuan, Lili Xu, Shoujia Peng, Xiulan |
author_facet | Huang, Tianzhi Le, Dejin Yuan, Lili Xu, Shoujia Peng, Xiulan |
author_sort | Huang, Tianzhi |
collection | PubMed |
description | BACKGROUNDS: The in-hospital mortality in lung cancer patients admitted to intensive care unit (ICU) is extremely high. This study intended to adopt machine learning algorithm models to predict in-hospital mortality of critically ill lung cancer for providing relative information in clinical decision-making. METHODS: Data were extracted from the Medical Information Mart for Intensive Care-IV (MIMIC-IV) for a training cohort and data extracted from the Medical Information Mart for eICU Collaborative Research Database (eICU-CRD) database for a validation cohort. Logistic regression, random forest, decision tree, light gradient boosting machine (LightGBM), eXtreme gradient boosting (XGBoost), and an ensemble (random forest+LightGBM+XGBoost) model were used for prediction of in-hospital mortality and important feature extraction. The AUC (area under receiver operating curve), accuracy, F1 score and recall were used to evaluate the predictive performance of each model. Shapley Additive exPlanations (SHAP) values were calculated to evaluate feature importance of each feature. RESULTS: Overall, there were 653 (24.8%) in-hospital mortality in the training cohort, and 523 (21.7%) in-hospital mortality in the validation cohort. Among the six machine learning models, the ensemble model achieved the best performance. The top 5 most influential features were the sequential organ failure assessment (SOFA) score, albumin, the oxford acute severity of illness score (OASIS) score, anion gap and bilirubin in random forest and XGBoost model. The SHAP summary plot was used to illustrate the positive or negative effects of the top 15 features attributed to the XGBoost model. CONCLUSION: The ensemble model performed best and might be applied to forecast in-hospital mortality of critically ill lung cancer patients, and the SOFA score was the most important feature in all models. These results might offer valuable and significant reference for ICU clinicians’ decision-making in advance. |
format | Online Article Text |
id | pubmed-9879439 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-98794392023-01-27 Machine learning for prediction of in-hospital mortality in lung cancer patients admitted to intensive care unit Huang, Tianzhi Le, Dejin Yuan, Lili Xu, Shoujia Peng, Xiulan PLoS One Research Article BACKGROUNDS: The in-hospital mortality in lung cancer patients admitted to intensive care unit (ICU) is extremely high. This study intended to adopt machine learning algorithm models to predict in-hospital mortality of critically ill lung cancer for providing relative information in clinical decision-making. METHODS: Data were extracted from the Medical Information Mart for Intensive Care-IV (MIMIC-IV) for a training cohort and data extracted from the Medical Information Mart for eICU Collaborative Research Database (eICU-CRD) database for a validation cohort. Logistic regression, random forest, decision tree, light gradient boosting machine (LightGBM), eXtreme gradient boosting (XGBoost), and an ensemble (random forest+LightGBM+XGBoost) model were used for prediction of in-hospital mortality and important feature extraction. The AUC (area under receiver operating curve), accuracy, F1 score and recall were used to evaluate the predictive performance of each model. Shapley Additive exPlanations (SHAP) values were calculated to evaluate feature importance of each feature. RESULTS: Overall, there were 653 (24.8%) in-hospital mortality in the training cohort, and 523 (21.7%) in-hospital mortality in the validation cohort. Among the six machine learning models, the ensemble model achieved the best performance. The top 5 most influential features were the sequential organ failure assessment (SOFA) score, albumin, the oxford acute severity of illness score (OASIS) score, anion gap and bilirubin in random forest and XGBoost model. The SHAP summary plot was used to illustrate the positive or negative effects of the top 15 features attributed to the XGBoost model. CONCLUSION: The ensemble model performed best and might be applied to forecast in-hospital mortality of critically ill lung cancer patients, and the SOFA score was the most important feature in all models. These results might offer valuable and significant reference for ICU clinicians’ decision-making in advance. Public Library of Science 2023-01-26 /pmc/articles/PMC9879439/ /pubmed/36701342 http://dx.doi.org/10.1371/journal.pone.0280606 Text en © 2023 Huang et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Huang, Tianzhi Le, Dejin Yuan, Lili Xu, Shoujia Peng, Xiulan Machine learning for prediction of in-hospital mortality in lung cancer patients admitted to intensive care unit |
title | Machine learning for prediction of in-hospital mortality in lung cancer patients admitted to intensive care unit |
title_full | Machine learning for prediction of in-hospital mortality in lung cancer patients admitted to intensive care unit |
title_fullStr | Machine learning for prediction of in-hospital mortality in lung cancer patients admitted to intensive care unit |
title_full_unstemmed | Machine learning for prediction of in-hospital mortality in lung cancer patients admitted to intensive care unit |
title_short | Machine learning for prediction of in-hospital mortality in lung cancer patients admitted to intensive care unit |
title_sort | machine learning for prediction of in-hospital mortality in lung cancer patients admitted to intensive care unit |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9879439/ https://www.ncbi.nlm.nih.gov/pubmed/36701342 http://dx.doi.org/10.1371/journal.pone.0280606 |
work_keys_str_mv | AT huangtianzhi machinelearningforpredictionofinhospitalmortalityinlungcancerpatientsadmittedtointensivecareunit AT ledejin machinelearningforpredictionofinhospitalmortalityinlungcancerpatientsadmittedtointensivecareunit AT yuanlili machinelearningforpredictionofinhospitalmortalityinlungcancerpatientsadmittedtointensivecareunit AT xushoujia machinelearningforpredictionofinhospitalmortalityinlungcancerpatientsadmittedtointensivecareunit AT pengxiulan machinelearningforpredictionofinhospitalmortalityinlungcancerpatientsadmittedtointensivecareunit |