Cargando…
Prediction of lung papillary adenocarcinoma-specific survival using ensemble machine learning models
Accurate prognostic prediction is crucial for treatment decision-making in lung papillary adenocarcinoma (LPADC). The aim of this study was to predict cancer-specific survival in LPADC using ensemble machine learning and classical Cox regression models. Moreover, models were evaluated to provide rec...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10491759/ https://www.ncbi.nlm.nih.gov/pubmed/37684259 http://dx.doi.org/10.1038/s41598-023-40779-1 |
_version_ | 1785104126990876672 |
---|---|
author | Xia, Kaide Chen, Dinghua Jin, Shuai Yi, Xinglin Luo, Li |
author_facet | Xia, Kaide Chen, Dinghua Jin, Shuai Yi, Xinglin Luo, Li |
author_sort | Xia, Kaide |
collection | PubMed |
description | Accurate prognostic prediction is crucial for treatment decision-making in lung papillary adenocarcinoma (LPADC). The aim of this study was to predict cancer-specific survival in LPADC using ensemble machine learning and classical Cox regression models. Moreover, models were evaluated to provide recommendations based on quantitative data for personalized treatment of LPADC. Data of patients diagnosed with LPADC (2004–2018) were extracted from the Surveillance, Epidemiology, and End Results database. The set of samples was randomly divided into the training and validation sets at a ratio of 7:3. Three ensemble models were selected, namely gradient boosting survival (GBS), random survival forest (RSF), and extra survival trees (EST). In addition, Cox proportional hazards (CoxPH) regression was used to construct the prognostic models. The Harrell’s concordance index (C-index), integrated Brier score (IBS), and area under the time-dependent receiver operating characteristic curve (time-dependent AUC) were used to evaluate the performance of the predictive models. A user-friendly web access panel was provided to easily evaluate the model for the prediction of survival and treatment recommendations. A total of 3615 patients were randomly divided into the training and validation cohorts (n = 2530 and 1085, respectively). The extra survival trees, RSF, GBS, and CoxPH models showed good discriminative ability and calibration in both the training and validation cohorts (mean of time-dependent AUC: > 0.84 and > 0.82; C-index: > 0.79 and > 0.77; IBS: < 0.16 and < 0.17, respectively). The RSF and GBS models were more consistent than the CoxPH model in predicting long-term survival. We implemented the developed models as web applications for deployment into clinical practice (accessible through https://shinyshine-820-lpaprediction-model-z3ubbu.streamlit.app/). All four prognostic models showed good discriminative ability and calibration. The RSF and GBS models exhibited the highest effectiveness among all models in predicting the long-term cancer-specific survival of patients with LPADC. This approach may facilitate the development of personalized treatment plans and prediction of prognosis for LPADC. |
format | Online Article Text |
id | pubmed-10491759 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-104917592023-09-10 Prediction of lung papillary adenocarcinoma-specific survival using ensemble machine learning models Xia, Kaide Chen, Dinghua Jin, Shuai Yi, Xinglin Luo, Li Sci Rep Article Accurate prognostic prediction is crucial for treatment decision-making in lung papillary adenocarcinoma (LPADC). The aim of this study was to predict cancer-specific survival in LPADC using ensemble machine learning and classical Cox regression models. Moreover, models were evaluated to provide recommendations based on quantitative data for personalized treatment of LPADC. Data of patients diagnosed with LPADC (2004–2018) were extracted from the Surveillance, Epidemiology, and End Results database. The set of samples was randomly divided into the training and validation sets at a ratio of 7:3. Three ensemble models were selected, namely gradient boosting survival (GBS), random survival forest (RSF), and extra survival trees (EST). In addition, Cox proportional hazards (CoxPH) regression was used to construct the prognostic models. The Harrell’s concordance index (C-index), integrated Brier score (IBS), and area under the time-dependent receiver operating characteristic curve (time-dependent AUC) were used to evaluate the performance of the predictive models. A user-friendly web access panel was provided to easily evaluate the model for the prediction of survival and treatment recommendations. A total of 3615 patients were randomly divided into the training and validation cohorts (n = 2530 and 1085, respectively). The extra survival trees, RSF, GBS, and CoxPH models showed good discriminative ability and calibration in both the training and validation cohorts (mean of time-dependent AUC: > 0.84 and > 0.82; C-index: > 0.79 and > 0.77; IBS: < 0.16 and < 0.17, respectively). The RSF and GBS models were more consistent than the CoxPH model in predicting long-term survival. We implemented the developed models as web applications for deployment into clinical practice (accessible through https://shinyshine-820-lpaprediction-model-z3ubbu.streamlit.app/). All four prognostic models showed good discriminative ability and calibration. The RSF and GBS models exhibited the highest effectiveness among all models in predicting the long-term cancer-specific survival of patients with LPADC. This approach may facilitate the development of personalized treatment plans and prediction of prognosis for LPADC. Nature Publishing Group UK 2023-09-08 /pmc/articles/PMC10491759/ /pubmed/37684259 http://dx.doi.org/10.1038/s41598-023-40779-1 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Xia, Kaide Chen, Dinghua Jin, Shuai Yi, Xinglin Luo, Li Prediction of lung papillary adenocarcinoma-specific survival using ensemble machine learning models |
title | Prediction of lung papillary adenocarcinoma-specific survival using ensemble machine learning models |
title_full | Prediction of lung papillary adenocarcinoma-specific survival using ensemble machine learning models |
title_fullStr | Prediction of lung papillary adenocarcinoma-specific survival using ensemble machine learning models |
title_full_unstemmed | Prediction of lung papillary adenocarcinoma-specific survival using ensemble machine learning models |
title_short | Prediction of lung papillary adenocarcinoma-specific survival using ensemble machine learning models |
title_sort | prediction of lung papillary adenocarcinoma-specific survival using ensemble machine learning models |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10491759/ https://www.ncbi.nlm.nih.gov/pubmed/37684259 http://dx.doi.org/10.1038/s41598-023-40779-1 |
work_keys_str_mv | AT xiakaide predictionoflungpapillaryadenocarcinomaspecificsurvivalusingensemblemachinelearningmodels AT chendinghua predictionoflungpapillaryadenocarcinomaspecificsurvivalusingensemblemachinelearningmodels AT jinshuai predictionoflungpapillaryadenocarcinomaspecificsurvivalusingensemblemachinelearningmodels AT yixinglin predictionoflungpapillaryadenocarcinomaspecificsurvivalusingensemblemachinelearningmodels AT luoli predictionoflungpapillaryadenocarcinomaspecificsurvivalusingensemblemachinelearningmodels |