Cargando…

A 5‐year survival status prognosis of nonmetastatic cervical cancer patients through machine learning algorithms

BACKGROUND: Prediction models with high accuracy rates for nonmetastatic cervical cancer (CC) patients are limited. This study aimed to construct and compare predictive models on the basis of machine learning (ML) algorithms for predicting the 5‐year survival status of CC patients through using the...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Wenke, Lu, Yanwei, Shou, Huafeng, Xu, Hong’en, Shi, Lei, Geng, Xiaolu, Song, Tao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10067071/
https://www.ncbi.nlm.nih.gov/pubmed/36479910
http://dx.doi.org/10.1002/cam4.5477
_version_ 1785018388923285504
author Yu, Wenke
Lu, Yanwei
Shou, Huafeng
Xu, Hong’en
Shi, Lei
Geng, Xiaolu
Song, Tao
author_facet Yu, Wenke
Lu, Yanwei
Shou, Huafeng
Xu, Hong’en
Shi, Lei
Geng, Xiaolu
Song, Tao
author_sort Yu, Wenke
collection PubMed
description BACKGROUND: Prediction models with high accuracy rates for nonmetastatic cervical cancer (CC) patients are limited. This study aimed to construct and compare predictive models on the basis of machine learning (ML) algorithms for predicting the 5‐year survival status of CC patients through using the Surveillance, Epidemiology, and End Results public database of the National Cancer Institute. METHODS: The data registered from 2004 to 2016 were extracted and randomly divided into training and validation cohorts (8:2). The least absolute shrinkage and selection operator (LASSO) regression was employed to identify significant factors. Then, four predictive models were constructed, including logistic regression (LR), random forest (RF), support vector machine (SVM), and extreme gradient boosting (XGBoost). The predictive models were evaluated and compared using Receiver‐operating characteristics with areas under the curves (AUCs) and decision curve analysis (DCA), respectively. RESULTS: A total of 13,802 patients were involved and classified into training (N = 11,041) and validation (N = 2761) cohorts. By using the LASSO regression method, seven factors were identified. In the training cohort, the XGBoost model showed the best performance (AUC = 0.8400) compared to the other three models (all p < 0.05 by Delong's test). In the validation cohort, the XGBoost model also demonstrated a superior prediction ability (AUC = 0.8365) than LR and SVM models (both p < 0.05 by Delong's test), although the difference was not statistically significant between the XGBoost and the RF models (p = 0.4251 by Delong's test). Based on the DCA results, the XGBoost model was also superior, and feature importance analysis indicated that the tumor stage was the most important variable among the seven factors. CONCLUSIONS: The XGBoost model proved to be an effective algorithm with better prediction abilities. This model is proposed to support better decision‐making for nonmetastatic CC patients in the future.
format Online
Article
Text
id pubmed-10067071
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-100670712023-04-03 A 5‐year survival status prognosis of nonmetastatic cervical cancer patients through machine learning algorithms Yu, Wenke Lu, Yanwei Shou, Huafeng Xu, Hong’en Shi, Lei Geng, Xiaolu Song, Tao Cancer Med RESEARCH ARTICLES BACKGROUND: Prediction models with high accuracy rates for nonmetastatic cervical cancer (CC) patients are limited. This study aimed to construct and compare predictive models on the basis of machine learning (ML) algorithms for predicting the 5‐year survival status of CC patients through using the Surveillance, Epidemiology, and End Results public database of the National Cancer Institute. METHODS: The data registered from 2004 to 2016 were extracted and randomly divided into training and validation cohorts (8:2). The least absolute shrinkage and selection operator (LASSO) regression was employed to identify significant factors. Then, four predictive models were constructed, including logistic regression (LR), random forest (RF), support vector machine (SVM), and extreme gradient boosting (XGBoost). The predictive models were evaluated and compared using Receiver‐operating characteristics with areas under the curves (AUCs) and decision curve analysis (DCA), respectively. RESULTS: A total of 13,802 patients were involved and classified into training (N = 11,041) and validation (N = 2761) cohorts. By using the LASSO regression method, seven factors were identified. In the training cohort, the XGBoost model showed the best performance (AUC = 0.8400) compared to the other three models (all p < 0.05 by Delong's test). In the validation cohort, the XGBoost model also demonstrated a superior prediction ability (AUC = 0.8365) than LR and SVM models (both p < 0.05 by Delong's test), although the difference was not statistically significant between the XGBoost and the RF models (p = 0.4251 by Delong's test). Based on the DCA results, the XGBoost model was also superior, and feature importance analysis indicated that the tumor stage was the most important variable among the seven factors. CONCLUSIONS: The XGBoost model proved to be an effective algorithm with better prediction abilities. This model is proposed to support better decision‐making for nonmetastatic CC patients in the future. John Wiley and Sons Inc. 2022-12-08 /pmc/articles/PMC10067071/ /pubmed/36479910 http://dx.doi.org/10.1002/cam4.5477 Text en © 2022 The Authors. Cancer Medicine published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle RESEARCH ARTICLES
Yu, Wenke
Lu, Yanwei
Shou, Huafeng
Xu, Hong’en
Shi, Lei
Geng, Xiaolu
Song, Tao
A 5‐year survival status prognosis of nonmetastatic cervical cancer patients through machine learning algorithms
title A 5‐year survival status prognosis of nonmetastatic cervical cancer patients through machine learning algorithms
title_full A 5‐year survival status prognosis of nonmetastatic cervical cancer patients through machine learning algorithms
title_fullStr A 5‐year survival status prognosis of nonmetastatic cervical cancer patients through machine learning algorithms
title_full_unstemmed A 5‐year survival status prognosis of nonmetastatic cervical cancer patients through machine learning algorithms
title_short A 5‐year survival status prognosis of nonmetastatic cervical cancer patients through machine learning algorithms
title_sort 5‐year survival status prognosis of nonmetastatic cervical cancer patients through machine learning algorithms
topic RESEARCH ARTICLES
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10067071/
https://www.ncbi.nlm.nih.gov/pubmed/36479910
http://dx.doi.org/10.1002/cam4.5477
work_keys_str_mv AT yuwenke a5yearsurvivalstatusprognosisofnonmetastaticcervicalcancerpatientsthroughmachinelearningalgorithms
AT luyanwei a5yearsurvivalstatusprognosisofnonmetastaticcervicalcancerpatientsthroughmachinelearningalgorithms
AT shouhuafeng a5yearsurvivalstatusprognosisofnonmetastaticcervicalcancerpatientsthroughmachinelearningalgorithms
AT xuhongen a5yearsurvivalstatusprognosisofnonmetastaticcervicalcancerpatientsthroughmachinelearningalgorithms
AT shilei a5yearsurvivalstatusprognosisofnonmetastaticcervicalcancerpatientsthroughmachinelearningalgorithms
AT gengxiaolu a5yearsurvivalstatusprognosisofnonmetastaticcervicalcancerpatientsthroughmachinelearningalgorithms
AT songtao a5yearsurvivalstatusprognosisofnonmetastaticcervicalcancerpatientsthroughmachinelearningalgorithms
AT yuwenke 5yearsurvivalstatusprognosisofnonmetastaticcervicalcancerpatientsthroughmachinelearningalgorithms
AT luyanwei 5yearsurvivalstatusprognosisofnonmetastaticcervicalcancerpatientsthroughmachinelearningalgorithms
AT shouhuafeng 5yearsurvivalstatusprognosisofnonmetastaticcervicalcancerpatientsthroughmachinelearningalgorithms
AT xuhongen 5yearsurvivalstatusprognosisofnonmetastaticcervicalcancerpatientsthroughmachinelearningalgorithms
AT shilei 5yearsurvivalstatusprognosisofnonmetastaticcervicalcancerpatientsthroughmachinelearningalgorithms
AT gengxiaolu 5yearsurvivalstatusprognosisofnonmetastaticcervicalcancerpatientsthroughmachinelearningalgorithms
AT songtao 5yearsurvivalstatusprognosisofnonmetastaticcervicalcancerpatientsthroughmachinelearningalgorithms