Cargando…

Development and Validation of a Risk Prediction Model for Venous Thromboembolism in Lung Cancer Patients Using Machine Learning

BACKGROUND: There is currently a lack of model for predicting the occurrence of venous thromboembolism (VTE) in patients with lung cancer. Machine learning (ML) techniques are being increasingly adapted for use in the medical field because of their capabilities of intelligent analysis and scalabilit...

Descripción completa

Detalles Bibliográficos
Autores principales:	Lei, Haike, Zhang, Mengyang, Wu, Zeyi, Liu, Chun, Li, Xiaosheng, Zhou, Wei, Long, Bo, Ma, Jiayang, Zhang, Huiyi, Wang, Ying, Wang, Guixue, Gong, Mengchun, Hong, Na, Liu, Haixia, Wu, Yongzhong
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2022
Materias:	Cardiovascular Medicine
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8934875/ https://www.ncbi.nlm.nih.gov/pubmed/35321110 http://dx.doi.org/10.3389/fcvm.2022.845210

_version_	1784671925810757632
author	Lei, Haike Zhang, Mengyang Wu, Zeyi Liu, Chun Li, Xiaosheng Zhou, Wei Long, Bo Ma, Jiayang Zhang, Huiyi Wang, Ying Wang, Guixue Gong, Mengchun Hong, Na Liu, Haixia Wu, Yongzhong
author_facet	Lei, Haike Zhang, Mengyang Wu, Zeyi Liu, Chun Li, Xiaosheng Zhou, Wei Long, Bo Ma, Jiayang Zhang, Huiyi Wang, Ying Wang, Guixue Gong, Mengchun Hong, Na Liu, Haixia Wu, Yongzhong
author_sort	Lei, Haike
collection	PubMed
description	BACKGROUND: There is currently a lack of model for predicting the occurrence of venous thromboembolism (VTE) in patients with lung cancer. Machine learning (ML) techniques are being increasingly adapted for use in the medical field because of their capabilities of intelligent analysis and scalability. This study aimed to develop and validate ML models to predict the incidence of VTE among lung cancer patients. METHODS: Data of lung cancer patients from a Grade 3A cancer hospital in China with and without VTE were included. Patient characteristics and clinical predictors related to VTE were collected. The primary endpoint was the diagnosis of VTE during index hospitalization. We calculated and compared the area under the receiver operating characteristic curve (AUROC) using the selected best-performed model (Random Forest model) through multiple model comparison, as well as investigated feature contributions during the training process with both permutation importance scores and the impurity-based feature importance scores in random forest model. RESULTS: In total, 3,398 patients were included in our study, 125 of whom experienced VTE during their hospital stay. The ROC curve and precision–recall curve (PRC) for Random Forest Model showed an AUROC of 0.91 (95% CI: 0.893–0.926) and an AUPRC of 0.43 (95% CI: 0.363–0.500). For the simplified model, five most relevant features were selected: Karnofsky Performance Status (KPS), a history of VTE, recombinant human endostatin, EGFR-TKI, and platelet count. We re-trained a random forest classifier with results of the AUROC of 0.87 (95% CI: 0.802–0.917) and AUPRC of 0.30 (95% CI: 0.265–0.358), respectively. CONCLUSION: According to the study results, there was no conspicuous decrease in the model’s performance when use fewer features to predict, we concluded that our simplified model would be more applicable in real-life clinical settings. The developed model using ML algorithms in our study has the potential to improve the early detection and prediction of the incidence of VTE in patients with lung cancer.
format	Online Article Text
id	pubmed-8934875
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-89348752022-03-22 Development and Validation of a Risk Prediction Model for Venous Thromboembolism in Lung Cancer Patients Using Machine Learning Lei, Haike Zhang, Mengyang Wu, Zeyi Liu, Chun Li, Xiaosheng Zhou, Wei Long, Bo Ma, Jiayang Zhang, Huiyi Wang, Ying Wang, Guixue Gong, Mengchun Hong, Na Liu, Haixia Wu, Yongzhong Front Cardiovasc Med Cardiovascular Medicine BACKGROUND: There is currently a lack of model for predicting the occurrence of venous thromboembolism (VTE) in patients with lung cancer. Machine learning (ML) techniques are being increasingly adapted for use in the medical field because of their capabilities of intelligent analysis and scalability. This study aimed to develop and validate ML models to predict the incidence of VTE among lung cancer patients. METHODS: Data of lung cancer patients from a Grade 3A cancer hospital in China with and without VTE were included. Patient characteristics and clinical predictors related to VTE were collected. The primary endpoint was the diagnosis of VTE during index hospitalization. We calculated and compared the area under the receiver operating characteristic curve (AUROC) using the selected best-performed model (Random Forest model) through multiple model comparison, as well as investigated feature contributions during the training process with both permutation importance scores and the impurity-based feature importance scores in random forest model. RESULTS: In total, 3,398 patients were included in our study, 125 of whom experienced VTE during their hospital stay. The ROC curve and precision–recall curve (PRC) for Random Forest Model showed an AUROC of 0.91 (95% CI: 0.893–0.926) and an AUPRC of 0.43 (95% CI: 0.363–0.500). For the simplified model, five most relevant features were selected: Karnofsky Performance Status (KPS), a history of VTE, recombinant human endostatin, EGFR-TKI, and platelet count. We re-trained a random forest classifier with results of the AUROC of 0.87 (95% CI: 0.802–0.917) and AUPRC of 0.30 (95% CI: 0.265–0.358), respectively. CONCLUSION: According to the study results, there was no conspicuous decrease in the model’s performance when use fewer features to predict, we concluded that our simplified model would be more applicable in real-life clinical settings. The developed model using ML algorithms in our study has the potential to improve the early detection and prediction of the incidence of VTE in patients with lung cancer. Frontiers Media S.A. 2022-03-07 /pmc/articles/PMC8934875/ /pubmed/35321110 http://dx.doi.org/10.3389/fcvm.2022.845210 Text en Copyright © 2022 Lei, Zhang, Wu, Liu, Li, Zhou, Long, Ma, Zhang, Wang, Wang, Gong, Hong, Liu and Wu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Cardiovascular Medicine Lei, Haike Zhang, Mengyang Wu, Zeyi Liu, Chun Li, Xiaosheng Zhou, Wei Long, Bo Ma, Jiayang Zhang, Huiyi Wang, Ying Wang, Guixue Gong, Mengchun Hong, Na Liu, Haixia Wu, Yongzhong Development and Validation of a Risk Prediction Model for Venous Thromboembolism in Lung Cancer Patients Using Machine Learning
title	Development and Validation of a Risk Prediction Model for Venous Thromboembolism in Lung Cancer Patients Using Machine Learning
title_full	Development and Validation of a Risk Prediction Model for Venous Thromboembolism in Lung Cancer Patients Using Machine Learning
title_fullStr	Development and Validation of a Risk Prediction Model for Venous Thromboembolism in Lung Cancer Patients Using Machine Learning
title_full_unstemmed	Development and Validation of a Risk Prediction Model for Venous Thromboembolism in Lung Cancer Patients Using Machine Learning
title_short	Development and Validation of a Risk Prediction Model for Venous Thromboembolism in Lung Cancer Patients Using Machine Learning
title_sort	development and validation of a risk prediction model for venous thromboembolism in lung cancer patients using machine learning
topic	Cardiovascular Medicine
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8934875/ https://www.ncbi.nlm.nih.gov/pubmed/35321110 http://dx.doi.org/10.3389/fcvm.2022.845210
work_keys_str_mv	AT leihaike developmentandvalidationofariskpredictionmodelforvenousthromboembolisminlungcancerpatientsusingmachinelearning AT zhangmengyang developmentandvalidationofariskpredictionmodelforvenousthromboembolisminlungcancerpatientsusingmachinelearning AT wuzeyi developmentandvalidationofariskpredictionmodelforvenousthromboembolisminlungcancerpatientsusingmachinelearning AT liuchun developmentandvalidationofariskpredictionmodelforvenousthromboembolisminlungcancerpatientsusingmachinelearning AT lixiaosheng developmentandvalidationofariskpredictionmodelforvenousthromboembolisminlungcancerpatientsusingmachinelearning AT zhouwei developmentandvalidationofariskpredictionmodelforvenousthromboembolisminlungcancerpatientsusingmachinelearning AT longbo developmentandvalidationofariskpredictionmodelforvenousthromboembolisminlungcancerpatientsusingmachinelearning AT majiayang developmentandvalidationofariskpredictionmodelforvenousthromboembolisminlungcancerpatientsusingmachinelearning AT zhanghuiyi developmentandvalidationofariskpredictionmodelforvenousthromboembolisminlungcancerpatientsusingmachinelearning AT wangying developmentandvalidationofariskpredictionmodelforvenousthromboembolisminlungcancerpatientsusingmachinelearning AT wangguixue developmentandvalidationofariskpredictionmodelforvenousthromboembolisminlungcancerpatientsusingmachinelearning AT gongmengchun developmentandvalidationofariskpredictionmodelforvenousthromboembolisminlungcancerpatientsusingmachinelearning AT hongna developmentandvalidationofariskpredictionmodelforvenousthromboembolisminlungcancerpatientsusingmachinelearning AT liuhaixia developmentandvalidationofariskpredictionmodelforvenousthromboembolisminlungcancerpatientsusingmachinelearning AT wuyongzhong developmentandvalidationofariskpredictionmodelforvenousthromboembolisminlungcancerpatientsusingmachinelearning

Development and Validation of a Risk Prediction Model for Venous Thromboembolism in Lung Cancer Patients Using Machine Learning

Ejemplares similares