Cargando…

Prediction of lung papillary adenocarcinoma-specific survival using ensemble machine learning models

Accurate prognostic prediction is crucial for treatment decision-making in lung papillary adenocarcinoma (LPADC). The aim of this study was to predict cancer-specific survival in LPADC using ensemble machine learning and classical Cox regression models. Moreover, models were evaluated to provide rec...

Descripción completa

Detalles Bibliográficos
Autores principales: Xia, Kaide, Chen, Dinghua, Jin, Shuai, Yi, Xinglin, Luo, Li
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10491759/
https://www.ncbi.nlm.nih.gov/pubmed/37684259
http://dx.doi.org/10.1038/s41598-023-40779-1
_version_ 1785104126990876672
author Xia, Kaide
Chen, Dinghua
Jin, Shuai
Yi, Xinglin
Luo, Li
author_facet Xia, Kaide
Chen, Dinghua
Jin, Shuai
Yi, Xinglin
Luo, Li
author_sort Xia, Kaide
collection PubMed
description Accurate prognostic prediction is crucial for treatment decision-making in lung papillary adenocarcinoma (LPADC). The aim of this study was to predict cancer-specific survival in LPADC using ensemble machine learning and classical Cox regression models. Moreover, models were evaluated to provide recommendations based on quantitative data for personalized treatment of LPADC. Data of patients diagnosed with LPADC (2004–2018) were extracted from the Surveillance, Epidemiology, and End Results database. The set of samples was randomly divided into the training and validation sets at a ratio of 7:3. Three ensemble models were selected, namely gradient boosting survival (GBS), random survival forest (RSF), and extra survival trees (EST). In addition, Cox proportional hazards (CoxPH) regression was used to construct the prognostic models. The Harrell’s concordance index (C-index), integrated Brier score (IBS), and area under the time-dependent receiver operating characteristic curve (time-dependent AUC) were used to evaluate the performance of the predictive models. A user-friendly web access panel was provided to easily evaluate the model for the prediction of survival and treatment recommendations. A total of 3615 patients were randomly divided into the training and validation cohorts (n = 2530 and 1085, respectively). The extra survival trees, RSF, GBS, and CoxPH models showed good discriminative ability and calibration in both the training and validation cohorts (mean of time-dependent AUC: > 0.84 and > 0.82; C-index: > 0.79 and > 0.77; IBS: < 0.16 and < 0.17, respectively). The RSF and GBS models were more consistent than the CoxPH model in predicting long-term survival. We implemented the developed models as web applications for deployment into clinical practice (accessible through https://shinyshine-820-lpaprediction-model-z3ubbu.streamlit.app/). All four prognostic models showed good discriminative ability and calibration. The RSF and GBS models exhibited the highest effectiveness among all models in predicting the long-term cancer-specific survival of patients with LPADC. This approach may facilitate the development of personalized treatment plans and prediction of prognosis for LPADC.
format Online
Article
Text
id pubmed-10491759
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-104917592023-09-10 Prediction of lung papillary adenocarcinoma-specific survival using ensemble machine learning models Xia, Kaide Chen, Dinghua Jin, Shuai Yi, Xinglin Luo, Li Sci Rep Article Accurate prognostic prediction is crucial for treatment decision-making in lung papillary adenocarcinoma (LPADC). The aim of this study was to predict cancer-specific survival in LPADC using ensemble machine learning and classical Cox regression models. Moreover, models were evaluated to provide recommendations based on quantitative data for personalized treatment of LPADC. Data of patients diagnosed with LPADC (2004–2018) were extracted from the Surveillance, Epidemiology, and End Results database. The set of samples was randomly divided into the training and validation sets at a ratio of 7:3. Three ensemble models were selected, namely gradient boosting survival (GBS), random survival forest (RSF), and extra survival trees (EST). In addition, Cox proportional hazards (CoxPH) regression was used to construct the prognostic models. The Harrell’s concordance index (C-index), integrated Brier score (IBS), and area under the time-dependent receiver operating characteristic curve (time-dependent AUC) were used to evaluate the performance of the predictive models. A user-friendly web access panel was provided to easily evaluate the model for the prediction of survival and treatment recommendations. A total of 3615 patients were randomly divided into the training and validation cohorts (n = 2530 and 1085, respectively). The extra survival trees, RSF, GBS, and CoxPH models showed good discriminative ability and calibration in both the training and validation cohorts (mean of time-dependent AUC: > 0.84 and > 0.82; C-index: > 0.79 and > 0.77; IBS: < 0.16 and < 0.17, respectively). The RSF and GBS models were more consistent than the CoxPH model in predicting long-term survival. We implemented the developed models as web applications for deployment into clinical practice (accessible through https://shinyshine-820-lpaprediction-model-z3ubbu.streamlit.app/). All four prognostic models showed good discriminative ability and calibration. The RSF and GBS models exhibited the highest effectiveness among all models in predicting the long-term cancer-specific survival of patients with LPADC. This approach may facilitate the development of personalized treatment plans and prediction of prognosis for LPADC. Nature Publishing Group UK 2023-09-08 /pmc/articles/PMC10491759/ /pubmed/37684259 http://dx.doi.org/10.1038/s41598-023-40779-1 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Xia, Kaide
Chen, Dinghua
Jin, Shuai
Yi, Xinglin
Luo, Li
Prediction of lung papillary adenocarcinoma-specific survival using ensemble machine learning models
title Prediction of lung papillary adenocarcinoma-specific survival using ensemble machine learning models
title_full Prediction of lung papillary adenocarcinoma-specific survival using ensemble machine learning models
title_fullStr Prediction of lung papillary adenocarcinoma-specific survival using ensemble machine learning models
title_full_unstemmed Prediction of lung papillary adenocarcinoma-specific survival using ensemble machine learning models
title_short Prediction of lung papillary adenocarcinoma-specific survival using ensemble machine learning models
title_sort prediction of lung papillary adenocarcinoma-specific survival using ensemble machine learning models
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10491759/
https://www.ncbi.nlm.nih.gov/pubmed/37684259
http://dx.doi.org/10.1038/s41598-023-40779-1
work_keys_str_mv AT xiakaide predictionoflungpapillaryadenocarcinomaspecificsurvivalusingensemblemachinelearningmodels
AT chendinghua predictionoflungpapillaryadenocarcinomaspecificsurvivalusingensemblemachinelearningmodels
AT jinshuai predictionoflungpapillaryadenocarcinomaspecificsurvivalusingensemblemachinelearningmodels
AT yixinglin predictionoflungpapillaryadenocarcinomaspecificsurvivalusingensemblemachinelearningmodels
AT luoli predictionoflungpapillaryadenocarcinomaspecificsurvivalusingensemblemachinelearningmodels