Cargando…

Risk Assessment of Pulmonary Metastasis for Cervical Cancer Patients by Ensemble Learning Models: A Large Population Based Real-World Study

OBJECTIVE: Pulmonary metastasis (PM) is an independent risk factor affecting the prognosis of cervical patients, but it still lacks a prediction. This study aimed to develop machine learning-based predictive models for PM. METHODS: A total of 22,766 patients diagnosed with or without PM from the Sur...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhu, Menglin, Wang, Bo, Wang, Tiejun, Chen, Yilin, He, Du
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Dove 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8628546/
https://www.ncbi.nlm.nih.gov/pubmed/34853529
http://dx.doi.org/10.2147/IJGM.S338389
_version_ 1784607021025198080
author Zhu, Menglin
Wang, Bo
Wang, Tiejun
Chen, Yilin
He, Du
author_facet Zhu, Menglin
Wang, Bo
Wang, Tiejun
Chen, Yilin
He, Du
author_sort Zhu, Menglin
collection PubMed
description OBJECTIVE: Pulmonary metastasis (PM) is an independent risk factor affecting the prognosis of cervical patients, but it still lacks a prediction. This study aimed to develop machine learning-based predictive models for PM. METHODS: A total of 22,766 patients diagnosed with or without PM from the Surveillance, Epidemiology, and End Results (SEER) database were enrolled in this study. The cohort was randomly split into a train set (70%) and a validation set (30%). In addition, 884 Chinese patients from two tertiary medical centers were included as an external validation set. Duplicated and useless candidate variables were excluded, and sixteen variables were included for the machine learning algorithm. We developed five predictive models, including the generalized linear model (GLM), random forest model (RFM), naive Bayesian model (NBM), artificial neural networks model (ANNM), and decision tree model (DTM). The predictive performance of these models was evaluated by the receiver operating characteristic (ROC) curve and calibration curve. The Cox proportional hazard model (CPHM) and competing risk model (CRM) were also included for survival outcome prediction. RESULTS: Of the patients included in the analysis, 2456 (4.38%) patients were diagnosed with PM. Age, organ-site metastasis (liver, bone, brain), distant lymph metastasis, tumor size, and pathology were the important predictors of PM. The RFM with 9 variables introduced was identified as the best predictive model for PM (AUC = 0.972, 95% CI: 0.958–0.986). The C-index for the CPHM and CRM was 0.626 (95% CI: 0.604–0.648) and 0.611 (95% CI: 0.586–0.636), respectively. CONCLUSION: The prediction algorithm derived by machine-learning-based methods shows a robust ability to predict PM. This result suggests that machine learning techniques have the potential to improve the development and validation of predictive modeling in cervical patients with PM.
format Online
Article
Text
id pubmed-8628546
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Dove
record_format MEDLINE/PubMed
spelling pubmed-86285462021-11-30 Risk Assessment of Pulmonary Metastasis for Cervical Cancer Patients by Ensemble Learning Models: A Large Population Based Real-World Study Zhu, Menglin Wang, Bo Wang, Tiejun Chen, Yilin He, Du Int J Gen Med Original Research OBJECTIVE: Pulmonary metastasis (PM) is an independent risk factor affecting the prognosis of cervical patients, but it still lacks a prediction. This study aimed to develop machine learning-based predictive models for PM. METHODS: A total of 22,766 patients diagnosed with or without PM from the Surveillance, Epidemiology, and End Results (SEER) database were enrolled in this study. The cohort was randomly split into a train set (70%) and a validation set (30%). In addition, 884 Chinese patients from two tertiary medical centers were included as an external validation set. Duplicated and useless candidate variables were excluded, and sixteen variables were included for the machine learning algorithm. We developed five predictive models, including the generalized linear model (GLM), random forest model (RFM), naive Bayesian model (NBM), artificial neural networks model (ANNM), and decision tree model (DTM). The predictive performance of these models was evaluated by the receiver operating characteristic (ROC) curve and calibration curve. The Cox proportional hazard model (CPHM) and competing risk model (CRM) were also included for survival outcome prediction. RESULTS: Of the patients included in the analysis, 2456 (4.38%) patients were diagnosed with PM. Age, organ-site metastasis (liver, bone, brain), distant lymph metastasis, tumor size, and pathology were the important predictors of PM. The RFM with 9 variables introduced was identified as the best predictive model for PM (AUC = 0.972, 95% CI: 0.958–0.986). The C-index for the CPHM and CRM was 0.626 (95% CI: 0.604–0.648) and 0.611 (95% CI: 0.586–0.636), respectively. CONCLUSION: The prediction algorithm derived by machine-learning-based methods shows a robust ability to predict PM. This result suggests that machine learning techniques have the potential to improve the development and validation of predictive modeling in cervical patients with PM. Dove 2021-11-23 /pmc/articles/PMC8628546/ /pubmed/34853529 http://dx.doi.org/10.2147/IJGM.S338389 Text en © 2021 Zhu et al. https://creativecommons.org/licenses/by-nc/3.0/This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms.php and incorporate the Creative Commons Attribution – Non Commercial (unported, v3.0) License (http://creativecommons.org/licenses/by-nc/3.0/ (https://creativecommons.org/licenses/by-nc/3.0/) ). By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms (https://www.dovepress.com/terms.php).
spellingShingle Original Research
Zhu, Menglin
Wang, Bo
Wang, Tiejun
Chen, Yilin
He, Du
Risk Assessment of Pulmonary Metastasis for Cervical Cancer Patients by Ensemble Learning Models: A Large Population Based Real-World Study
title Risk Assessment of Pulmonary Metastasis for Cervical Cancer Patients by Ensemble Learning Models: A Large Population Based Real-World Study
title_full Risk Assessment of Pulmonary Metastasis for Cervical Cancer Patients by Ensemble Learning Models: A Large Population Based Real-World Study
title_fullStr Risk Assessment of Pulmonary Metastasis for Cervical Cancer Patients by Ensemble Learning Models: A Large Population Based Real-World Study
title_full_unstemmed Risk Assessment of Pulmonary Metastasis for Cervical Cancer Patients by Ensemble Learning Models: A Large Population Based Real-World Study
title_short Risk Assessment of Pulmonary Metastasis for Cervical Cancer Patients by Ensemble Learning Models: A Large Population Based Real-World Study
title_sort risk assessment of pulmonary metastasis for cervical cancer patients by ensemble learning models: a large population based real-world study
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8628546/
https://www.ncbi.nlm.nih.gov/pubmed/34853529
http://dx.doi.org/10.2147/IJGM.S338389
work_keys_str_mv AT zhumenglin riskassessmentofpulmonarymetastasisforcervicalcancerpatientsbyensemblelearningmodelsalargepopulationbasedrealworldstudy
AT wangbo riskassessmentofpulmonarymetastasisforcervicalcancerpatientsbyensemblelearningmodelsalargepopulationbasedrealworldstudy
AT wangtiejun riskassessmentofpulmonarymetastasisforcervicalcancerpatientsbyensemblelearningmodelsalargepopulationbasedrealworldstudy
AT chenyilin riskassessmentofpulmonarymetastasisforcervicalcancerpatientsbyensemblelearningmodelsalargepopulationbasedrealworldstudy
AT hedu riskassessmentofpulmonarymetastasisforcervicalcancerpatientsbyensemblelearningmodelsalargepopulationbasedrealworldstudy