Cargando…
A 5‐year survival status prognosis of nonmetastatic cervical cancer patients through machine learning algorithms
BACKGROUND: Prediction models with high accuracy rates for nonmetastatic cervical cancer (CC) patients are limited. This study aimed to construct and compare predictive models on the basis of machine learning (ML) algorithms for predicting the 5‐year survival status of CC patients through using the...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10067071/ https://www.ncbi.nlm.nih.gov/pubmed/36479910 http://dx.doi.org/10.1002/cam4.5477 |
_version_ | 1785018388923285504 |
---|---|
author | Yu, Wenke Lu, Yanwei Shou, Huafeng Xu, Hong’en Shi, Lei Geng, Xiaolu Song, Tao |
author_facet | Yu, Wenke Lu, Yanwei Shou, Huafeng Xu, Hong’en Shi, Lei Geng, Xiaolu Song, Tao |
author_sort | Yu, Wenke |
collection | PubMed |
description | BACKGROUND: Prediction models with high accuracy rates for nonmetastatic cervical cancer (CC) patients are limited. This study aimed to construct and compare predictive models on the basis of machine learning (ML) algorithms for predicting the 5‐year survival status of CC patients through using the Surveillance, Epidemiology, and End Results public database of the National Cancer Institute. METHODS: The data registered from 2004 to 2016 were extracted and randomly divided into training and validation cohorts (8:2). The least absolute shrinkage and selection operator (LASSO) regression was employed to identify significant factors. Then, four predictive models were constructed, including logistic regression (LR), random forest (RF), support vector machine (SVM), and extreme gradient boosting (XGBoost). The predictive models were evaluated and compared using Receiver‐operating characteristics with areas under the curves (AUCs) and decision curve analysis (DCA), respectively. RESULTS: A total of 13,802 patients were involved and classified into training (N = 11,041) and validation (N = 2761) cohorts. By using the LASSO regression method, seven factors were identified. In the training cohort, the XGBoost model showed the best performance (AUC = 0.8400) compared to the other three models (all p < 0.05 by Delong's test). In the validation cohort, the XGBoost model also demonstrated a superior prediction ability (AUC = 0.8365) than LR and SVM models (both p < 0.05 by Delong's test), although the difference was not statistically significant between the XGBoost and the RF models (p = 0.4251 by Delong's test). Based on the DCA results, the XGBoost model was also superior, and feature importance analysis indicated that the tumor stage was the most important variable among the seven factors. CONCLUSIONS: The XGBoost model proved to be an effective algorithm with better prediction abilities. This model is proposed to support better decision‐making for nonmetastatic CC patients in the future. |
format | Online Article Text |
id | pubmed-10067071 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-100670712023-04-03 A 5‐year survival status prognosis of nonmetastatic cervical cancer patients through machine learning algorithms Yu, Wenke Lu, Yanwei Shou, Huafeng Xu, Hong’en Shi, Lei Geng, Xiaolu Song, Tao Cancer Med RESEARCH ARTICLES BACKGROUND: Prediction models with high accuracy rates for nonmetastatic cervical cancer (CC) patients are limited. This study aimed to construct and compare predictive models on the basis of machine learning (ML) algorithms for predicting the 5‐year survival status of CC patients through using the Surveillance, Epidemiology, and End Results public database of the National Cancer Institute. METHODS: The data registered from 2004 to 2016 were extracted and randomly divided into training and validation cohorts (8:2). The least absolute shrinkage and selection operator (LASSO) regression was employed to identify significant factors. Then, four predictive models were constructed, including logistic regression (LR), random forest (RF), support vector machine (SVM), and extreme gradient boosting (XGBoost). The predictive models were evaluated and compared using Receiver‐operating characteristics with areas under the curves (AUCs) and decision curve analysis (DCA), respectively. RESULTS: A total of 13,802 patients were involved and classified into training (N = 11,041) and validation (N = 2761) cohorts. By using the LASSO regression method, seven factors were identified. In the training cohort, the XGBoost model showed the best performance (AUC = 0.8400) compared to the other three models (all p < 0.05 by Delong's test). In the validation cohort, the XGBoost model also demonstrated a superior prediction ability (AUC = 0.8365) than LR and SVM models (both p < 0.05 by Delong's test), although the difference was not statistically significant between the XGBoost and the RF models (p = 0.4251 by Delong's test). Based on the DCA results, the XGBoost model was also superior, and feature importance analysis indicated that the tumor stage was the most important variable among the seven factors. CONCLUSIONS: The XGBoost model proved to be an effective algorithm with better prediction abilities. This model is proposed to support better decision‐making for nonmetastatic CC patients in the future. John Wiley and Sons Inc. 2022-12-08 /pmc/articles/PMC10067071/ /pubmed/36479910 http://dx.doi.org/10.1002/cam4.5477 Text en © 2022 The Authors. Cancer Medicine published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | RESEARCH ARTICLES Yu, Wenke Lu, Yanwei Shou, Huafeng Xu, Hong’en Shi, Lei Geng, Xiaolu Song, Tao A 5‐year survival status prognosis of nonmetastatic cervical cancer patients through machine learning algorithms |
title | A 5‐year survival status prognosis of nonmetastatic cervical cancer patients through machine learning algorithms |
title_full | A 5‐year survival status prognosis of nonmetastatic cervical cancer patients through machine learning algorithms |
title_fullStr | A 5‐year survival status prognosis of nonmetastatic cervical cancer patients through machine learning algorithms |
title_full_unstemmed | A 5‐year survival status prognosis of nonmetastatic cervical cancer patients through machine learning algorithms |
title_short | A 5‐year survival status prognosis of nonmetastatic cervical cancer patients through machine learning algorithms |
title_sort | 5‐year survival status prognosis of nonmetastatic cervical cancer patients through machine learning algorithms |
topic | RESEARCH ARTICLES |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10067071/ https://www.ncbi.nlm.nih.gov/pubmed/36479910 http://dx.doi.org/10.1002/cam4.5477 |
work_keys_str_mv | AT yuwenke a5yearsurvivalstatusprognosisofnonmetastaticcervicalcancerpatientsthroughmachinelearningalgorithms AT luyanwei a5yearsurvivalstatusprognosisofnonmetastaticcervicalcancerpatientsthroughmachinelearningalgorithms AT shouhuafeng a5yearsurvivalstatusprognosisofnonmetastaticcervicalcancerpatientsthroughmachinelearningalgorithms AT xuhongen a5yearsurvivalstatusprognosisofnonmetastaticcervicalcancerpatientsthroughmachinelearningalgorithms AT shilei a5yearsurvivalstatusprognosisofnonmetastaticcervicalcancerpatientsthroughmachinelearningalgorithms AT gengxiaolu a5yearsurvivalstatusprognosisofnonmetastaticcervicalcancerpatientsthroughmachinelearningalgorithms AT songtao a5yearsurvivalstatusprognosisofnonmetastaticcervicalcancerpatientsthroughmachinelearningalgorithms AT yuwenke 5yearsurvivalstatusprognosisofnonmetastaticcervicalcancerpatientsthroughmachinelearningalgorithms AT luyanwei 5yearsurvivalstatusprognosisofnonmetastaticcervicalcancerpatientsthroughmachinelearningalgorithms AT shouhuafeng 5yearsurvivalstatusprognosisofnonmetastaticcervicalcancerpatientsthroughmachinelearningalgorithms AT xuhongen 5yearsurvivalstatusprognosisofnonmetastaticcervicalcancerpatientsthroughmachinelearningalgorithms AT shilei 5yearsurvivalstatusprognosisofnonmetastaticcervicalcancerpatientsthroughmachinelearningalgorithms AT gengxiaolu 5yearsurvivalstatusprognosisofnonmetastaticcervicalcancerpatientsthroughmachinelearningalgorithms AT songtao 5yearsurvivalstatusprognosisofnonmetastaticcervicalcancerpatientsthroughmachinelearningalgorithms |