Cargando…

Interpretable prediction of cardiopulmonary complications after non-small cell lung cancer surgery based on machine learning and SHapley additive exPlanations

INTRODUCTION: Lung cancer is a prevalent malignancy globally, with approximately 20% of patients developing cardiopulmonary complications after lobectomy. In order to prevent complications, an accurate and personalized method based on machine learning (ML) is required. METHODS: During the period of...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhai, Yihai, Lin, Xue, Wei, Qiaolin, Pu, Yuanjin, Pang, Yonghui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10359813/
https://www.ncbi.nlm.nih.gov/pubmed/37483738
http://dx.doi.org/10.1016/j.heliyon.2023.e17772
_version_ 1785075966020681728
author Zhai, Yihai
Lin, Xue
Wei, Qiaolin
Pu, Yuanjin
Pang, Yonghui
author_facet Zhai, Yihai
Lin, Xue
Wei, Qiaolin
Pu, Yuanjin
Pang, Yonghui
author_sort Zhai, Yihai
collection PubMed
description INTRODUCTION: Lung cancer is a prevalent malignancy globally, with approximately 20% of patients developing cardiopulmonary complications after lobectomy. In order to prevent complications, an accurate and personalized method based on machine learning (ML) is required. METHODS: During the period of 2017–2021, a retrospective analysis was conducted on the medical records of patients who had undergone lobectomy for non-small cell lung cancer (NSCLC). We performed logical regression, decision tree (DT), random forest (RF), gradient boost DT, and eXtreme gradient boosting analyses to establish an ML model. The ten-fold cross-validation was used to evaluate the performance of multiple ML models based on various evaluation metrics, including accuracy, precision, recall, F1 score, and area under the receiver operating (AUC). Additionally, we also calculated the Kappa value of these model. Each model used grid search to optimize hyper-parameters and then used the interpretability method to provide explanations for the model's Decisions. RESULTS: The study included 718 eligible patients, among whom the incidence of postoperative cardiopulmonary complications was 20.89%. The RF model showed the best comprehensive performance among all models, and its ten-fold cross-validation accuracy, precision, recall, F1 score, and AUC were (OR and 95% confidence interval [CI]) 0.786 (0.738–0.834), 0.803 (0.735–0.872), 0.738 (0.678–0.797), 0.766 (0.714–0.818), 0.856 (0.815–0.898), respectively. The kappa value of the RF model was 0.696 (0.617–0.768). The SHAP method showed that gender, age, and intraoperative blood loss were closely associated with postoperative cardiopulmonary complications. CONCLUSION: The application of ML methods for predicting postoperative cardiopulmonary complications based on clinical data of patients with NSCLC showed a good performance. The results indicate that ML combined with the SHAP individualized interpretation method has practical clinical value.
format Online
Article
Text
id pubmed-10359813
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-103598132023-07-22 Interpretable prediction of cardiopulmonary complications after non-small cell lung cancer surgery based on machine learning and SHapley additive exPlanations Zhai, Yihai Lin, Xue Wei, Qiaolin Pu, Yuanjin Pang, Yonghui Heliyon Research Article INTRODUCTION: Lung cancer is a prevalent malignancy globally, with approximately 20% of patients developing cardiopulmonary complications after lobectomy. In order to prevent complications, an accurate and personalized method based on machine learning (ML) is required. METHODS: During the period of 2017–2021, a retrospective analysis was conducted on the medical records of patients who had undergone lobectomy for non-small cell lung cancer (NSCLC). We performed logical regression, decision tree (DT), random forest (RF), gradient boost DT, and eXtreme gradient boosting analyses to establish an ML model. The ten-fold cross-validation was used to evaluate the performance of multiple ML models based on various evaluation metrics, including accuracy, precision, recall, F1 score, and area under the receiver operating (AUC). Additionally, we also calculated the Kappa value of these model. Each model used grid search to optimize hyper-parameters and then used the interpretability method to provide explanations for the model's Decisions. RESULTS: The study included 718 eligible patients, among whom the incidence of postoperative cardiopulmonary complications was 20.89%. The RF model showed the best comprehensive performance among all models, and its ten-fold cross-validation accuracy, precision, recall, F1 score, and AUC were (OR and 95% confidence interval [CI]) 0.786 (0.738–0.834), 0.803 (0.735–0.872), 0.738 (0.678–0.797), 0.766 (0.714–0.818), 0.856 (0.815–0.898), respectively. The kappa value of the RF model was 0.696 (0.617–0.768). The SHAP method showed that gender, age, and intraoperative blood loss were closely associated with postoperative cardiopulmonary complications. CONCLUSION: The application of ML methods for predicting postoperative cardiopulmonary complications based on clinical data of patients with NSCLC showed a good performance. The results indicate that ML combined with the SHAP individualized interpretation method has practical clinical value. Elsevier 2023-07-03 /pmc/articles/PMC10359813/ /pubmed/37483738 http://dx.doi.org/10.1016/j.heliyon.2023.e17772 Text en © 2023 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Research Article
Zhai, Yihai
Lin, Xue
Wei, Qiaolin
Pu, Yuanjin
Pang, Yonghui
Interpretable prediction of cardiopulmonary complications after non-small cell lung cancer surgery based on machine learning and SHapley additive exPlanations
title Interpretable prediction of cardiopulmonary complications after non-small cell lung cancer surgery based on machine learning and SHapley additive exPlanations
title_full Interpretable prediction of cardiopulmonary complications after non-small cell lung cancer surgery based on machine learning and SHapley additive exPlanations
title_fullStr Interpretable prediction of cardiopulmonary complications after non-small cell lung cancer surgery based on machine learning and SHapley additive exPlanations
title_full_unstemmed Interpretable prediction of cardiopulmonary complications after non-small cell lung cancer surgery based on machine learning and SHapley additive exPlanations
title_short Interpretable prediction of cardiopulmonary complications after non-small cell lung cancer surgery based on machine learning and SHapley additive exPlanations
title_sort interpretable prediction of cardiopulmonary complications after non-small cell lung cancer surgery based on machine learning and shapley additive explanations
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10359813/
https://www.ncbi.nlm.nih.gov/pubmed/37483738
http://dx.doi.org/10.1016/j.heliyon.2023.e17772
work_keys_str_mv AT zhaiyihai interpretablepredictionofcardiopulmonarycomplicationsafternonsmallcelllungcancersurgerybasedonmachinelearningandshapleyadditiveexplanations
AT linxue interpretablepredictionofcardiopulmonarycomplicationsafternonsmallcelllungcancersurgerybasedonmachinelearningandshapleyadditiveexplanations
AT weiqiaolin interpretablepredictionofcardiopulmonarycomplicationsafternonsmallcelllungcancersurgerybasedonmachinelearningandshapleyadditiveexplanations
AT puyuanjin interpretablepredictionofcardiopulmonarycomplicationsafternonsmallcelllungcancersurgerybasedonmachinelearningandshapleyadditiveexplanations
AT pangyonghui interpretablepredictionofcardiopulmonarycomplicationsafternonsmallcelllungcancersurgerybasedonmachinelearningandshapleyadditiveexplanations