Cargando…

Prediction of Postoperative Lung Function in Lung Cancer Patients Using Machine Learning Models

BACKGROUND: Surgical resection is the standard treatment for early-stage lung cancer. Since postoperative lung function is related to mortality, predicted postoperative lung function is used to determine the treatment modality. The aim of this study was to evaluate the predictive performance of line...

Descripción completa

Detalles Bibliográficos
Autores principales: Kwon, Oh Beom, Han, Solji, Lee, Hwa Young, Kang, Hye Seon, Kim, Sung Kyoung, Kim, Ju Sang, Park, Chan Kwon, Lee, Sang Haak, Kim, Seung Joon, Kim, Jin Woo, Yeo, Chang Dong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Korean Academy of Tuberculosis and Respiratory Diseases 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10323210/
https://www.ncbi.nlm.nih.gov/pubmed/37038881
http://dx.doi.org/10.4046/trd.2022.0048
_version_ 1785068920426725376
author Kwon, Oh Beom
Han, Solji
Lee, Hwa Young
Kang, Hye Seon
Kim, Sung Kyoung
Kim, Ju Sang
Park, Chan Kwon
Lee, Sang Haak
Kim, Seung Joon
Kim, Jin Woo
Yeo, Chang Dong
author_facet Kwon, Oh Beom
Han, Solji
Lee, Hwa Young
Kang, Hye Seon
Kim, Sung Kyoung
Kim, Ju Sang
Park, Chan Kwon
Lee, Sang Haak
Kim, Seung Joon
Kim, Jin Woo
Yeo, Chang Dong
author_sort Kwon, Oh Beom
collection PubMed
description BACKGROUND: Surgical resection is the standard treatment for early-stage lung cancer. Since postoperative lung function is related to mortality, predicted postoperative lung function is used to determine the treatment modality. The aim of this study was to evaluate the predictive performance of linear regression and machine learning models. METHODS: We extracted data from the Clinical Data Warehouse and developed three sets: set I, the linear regression model; set II, machine learning models omitting the missing data: and set III, machine learning models imputing the missing data. Six machine learning models, the least absolute shrinkage and selection operator (LASSO), Ridge regression, ElasticNet, Random Forest, eXtreme gradient boosting (XGBoost), and the light gradient boosting machine (LightGBM) were implemented. The forced expiratory volume in 1 second measured 6 months after surgery was defined as the outcome. Five-fold cross-validation was performed for hyperparameter tuning of the machine learning models. The dataset was split into training and test datasets at a 70:30 ratio. Implementation was done after dataset splitting in set III. Predictive performance was evaluated by R(2) and mean squared error (MSE) in the three sets. RESULTS: A total of 1,487 patients were included in sets I and III and 896 patients were included in set II. In set I, the R(2) value was 0.27 and in set II, LightGBM was the best model with the highest R(2) value of 0.5 and the lowest MSE of 154.95. In set III, LightGBM was the best model with the highest R(2) value of 0.56 and the lowest MSE of 174.07. CONCLUSION: The LightGBM model showed the best performance in predicting postoperative lung function.
format Online
Article
Text
id pubmed-10323210
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher The Korean Academy of Tuberculosis and Respiratory Diseases
record_format MEDLINE/PubMed
spelling pubmed-103232102023-07-07 Prediction of Postoperative Lung Function in Lung Cancer Patients Using Machine Learning Models Kwon, Oh Beom Han, Solji Lee, Hwa Young Kang, Hye Seon Kim, Sung Kyoung Kim, Ju Sang Park, Chan Kwon Lee, Sang Haak Kim, Seung Joon Kim, Jin Woo Yeo, Chang Dong Tuberc Respir Dis (Seoul) Original Article BACKGROUND: Surgical resection is the standard treatment for early-stage lung cancer. Since postoperative lung function is related to mortality, predicted postoperative lung function is used to determine the treatment modality. The aim of this study was to evaluate the predictive performance of linear regression and machine learning models. METHODS: We extracted data from the Clinical Data Warehouse and developed three sets: set I, the linear regression model; set II, machine learning models omitting the missing data: and set III, machine learning models imputing the missing data. Six machine learning models, the least absolute shrinkage and selection operator (LASSO), Ridge regression, ElasticNet, Random Forest, eXtreme gradient boosting (XGBoost), and the light gradient boosting machine (LightGBM) were implemented. The forced expiratory volume in 1 second measured 6 months after surgery was defined as the outcome. Five-fold cross-validation was performed for hyperparameter tuning of the machine learning models. The dataset was split into training and test datasets at a 70:30 ratio. Implementation was done after dataset splitting in set III. Predictive performance was evaluated by R(2) and mean squared error (MSE) in the three sets. RESULTS: A total of 1,487 patients were included in sets I and III and 896 patients were included in set II. In set I, the R(2) value was 0.27 and in set II, LightGBM was the best model with the highest R(2) value of 0.5 and the lowest MSE of 154.95. In set III, LightGBM was the best model with the highest R(2) value of 0.56 and the lowest MSE of 174.07. CONCLUSION: The LightGBM model showed the best performance in predicting postoperative lung function. The Korean Academy of Tuberculosis and Respiratory Diseases 2023-07 2023-04-11 /pmc/articles/PMC10323210/ /pubmed/37038881 http://dx.doi.org/10.4046/trd.2022.0048 Text en Copyright © 2023 The Korean Academy of Tuberculosis and Respiratory Diseases https://creativecommons.org/licenses/by-nc/4.0/It is identical to the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ).
spellingShingle Original Article
Kwon, Oh Beom
Han, Solji
Lee, Hwa Young
Kang, Hye Seon
Kim, Sung Kyoung
Kim, Ju Sang
Park, Chan Kwon
Lee, Sang Haak
Kim, Seung Joon
Kim, Jin Woo
Yeo, Chang Dong
Prediction of Postoperative Lung Function in Lung Cancer Patients Using Machine Learning Models
title Prediction of Postoperative Lung Function in Lung Cancer Patients Using Machine Learning Models
title_full Prediction of Postoperative Lung Function in Lung Cancer Patients Using Machine Learning Models
title_fullStr Prediction of Postoperative Lung Function in Lung Cancer Patients Using Machine Learning Models
title_full_unstemmed Prediction of Postoperative Lung Function in Lung Cancer Patients Using Machine Learning Models
title_short Prediction of Postoperative Lung Function in Lung Cancer Patients Using Machine Learning Models
title_sort prediction of postoperative lung function in lung cancer patients using machine learning models
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10323210/
https://www.ncbi.nlm.nih.gov/pubmed/37038881
http://dx.doi.org/10.4046/trd.2022.0048
work_keys_str_mv AT kwonohbeom predictionofpostoperativelungfunctioninlungcancerpatientsusingmachinelearningmodels
AT hansolji predictionofpostoperativelungfunctioninlungcancerpatientsusingmachinelearningmodels
AT leehwayoung predictionofpostoperativelungfunctioninlungcancerpatientsusingmachinelearningmodels
AT kanghyeseon predictionofpostoperativelungfunctioninlungcancerpatientsusingmachinelearningmodels
AT kimsungkyoung predictionofpostoperativelungfunctioninlungcancerpatientsusingmachinelearningmodels
AT kimjusang predictionofpostoperativelungfunctioninlungcancerpatientsusingmachinelearningmodels
AT parkchankwon predictionofpostoperativelungfunctioninlungcancerpatientsusingmachinelearningmodels
AT leesanghaak predictionofpostoperativelungfunctioninlungcancerpatientsusingmachinelearningmodels
AT kimseungjoon predictionofpostoperativelungfunctioninlungcancerpatientsusingmachinelearningmodels
AT kimjinwoo predictionofpostoperativelungfunctioninlungcancerpatientsusingmachinelearningmodels
AT yeochangdong predictionofpostoperativelungfunctioninlungcancerpatientsusingmachinelearningmodels