Cargando…

A Machine Learning Algorithm for Predicting the Risk of Developing to M1b Stage of Patients With Germ Cell Testicular Cancer

OBJECTIVE: Distant metastasis other than non-regional lymph nodes and lung (i.e., M1b stage) significantly contributes to the poor survival prognosis of patients with germ cell testicular cancer (GCTC). The aim of this study was to develop a machine learning (ML) algorithm model to predict the risk...

Descripción completa

Detalles Bibliográficos
Autores principales: Ding, Li, Wang, Kun, Zhang, Chi, Zhang, Yang, Wang, Kanlirong, Li, Wang, Wang, Junqi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9277219/
https://www.ncbi.nlm.nih.gov/pubmed/35844840
http://dx.doi.org/10.3389/fpubh.2022.916513
_version_ 1784745908289667072
author Ding, Li
Wang, Kun
Zhang, Chi
Zhang, Yang
Wang, Kanlirong
Li, Wang
Wang, Junqi
author_facet Ding, Li
Wang, Kun
Zhang, Chi
Zhang, Yang
Wang, Kanlirong
Li, Wang
Wang, Junqi
author_sort Ding, Li
collection PubMed
description OBJECTIVE: Distant metastasis other than non-regional lymph nodes and lung (i.e., M1b stage) significantly contributes to the poor survival prognosis of patients with germ cell testicular cancer (GCTC). The aim of this study was to develop a machine learning (ML) algorithm model to predict the risk of patients with GCTC developing the M1b stage, which can be used to assist in early intervention of patients. METHODS: The clinical and pathological data of patients with GCTC were obtained from the Surveillance, Epidemiology, and End Results (SEER) database. Combing the patient's characteristic variables, we applied six machine learning (ML) algorithms to develop the predictive models, including logistic regression(LR), eXtreme Gradient Boosting (XGBoost), light Gradient Boosting Machine (lightGBM), random forest (RF), multilayer perceptron (MLP), and k-nearest neighbor (kNN). Model performances were evaluated by 10-fold cross-receiver operating characteristic (ROC) curves, which calculated the area under the curve (AUC) of models for predictive accuracy. A total of 54 patients from our own center (October 2006 to June 2021) were collected as the external validation cohort. RESULTS: A total of 4,323 patients eligible for inclusion were screened for enrollment from the SEER database, of which 178 (4.12%) developing M1b stage. Multivariate logistic regression showed that lymph node dissection (LND), T stage, N stage, lung metastases, and distant lymph node metastases were the independent predictors of developing M1b stage risk. The models based on both the XGBoost and RF algorithms showed stable and efficient prediction performance in the training and external validation groups. CONCLUSION: S-stage is not an independent factor for predicting the risk of developing the M1b stage of patients with GCTC. The ML models based on both XGBoost and RF algorithms have high predictive effectiveness and may be used to predict the risk of developing the M1b stage of patients with GCTC, which is of promising value in clinical decision-making. Models still need to be tested with a larger sample of real-world data.
format Online
Article
Text
id pubmed-9277219
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-92772192022-07-14 A Machine Learning Algorithm for Predicting the Risk of Developing to M1b Stage of Patients With Germ Cell Testicular Cancer Ding, Li Wang, Kun Zhang, Chi Zhang, Yang Wang, Kanlirong Li, Wang Wang, Junqi Front Public Health Public Health OBJECTIVE: Distant metastasis other than non-regional lymph nodes and lung (i.e., M1b stage) significantly contributes to the poor survival prognosis of patients with germ cell testicular cancer (GCTC). The aim of this study was to develop a machine learning (ML) algorithm model to predict the risk of patients with GCTC developing the M1b stage, which can be used to assist in early intervention of patients. METHODS: The clinical and pathological data of patients with GCTC were obtained from the Surveillance, Epidemiology, and End Results (SEER) database. Combing the patient's characteristic variables, we applied six machine learning (ML) algorithms to develop the predictive models, including logistic regression(LR), eXtreme Gradient Boosting (XGBoost), light Gradient Boosting Machine (lightGBM), random forest (RF), multilayer perceptron (MLP), and k-nearest neighbor (kNN). Model performances were evaluated by 10-fold cross-receiver operating characteristic (ROC) curves, which calculated the area under the curve (AUC) of models for predictive accuracy. A total of 54 patients from our own center (October 2006 to June 2021) were collected as the external validation cohort. RESULTS: A total of 4,323 patients eligible for inclusion were screened for enrollment from the SEER database, of which 178 (4.12%) developing M1b stage. Multivariate logistic regression showed that lymph node dissection (LND), T stage, N stage, lung metastases, and distant lymph node metastases were the independent predictors of developing M1b stage risk. The models based on both the XGBoost and RF algorithms showed stable and efficient prediction performance in the training and external validation groups. CONCLUSION: S-stage is not an independent factor for predicting the risk of developing the M1b stage of patients with GCTC. The ML models based on both XGBoost and RF algorithms have high predictive effectiveness and may be used to predict the risk of developing the M1b stage of patients with GCTC, which is of promising value in clinical decision-making. Models still need to be tested with a larger sample of real-world data. Frontiers Media S.A. 2022-06-29 /pmc/articles/PMC9277219/ /pubmed/35844840 http://dx.doi.org/10.3389/fpubh.2022.916513 Text en Copyright © 2022 Ding, Wang, Zhang, Zhang, Wang, Li and Wang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Public Health
Ding, Li
Wang, Kun
Zhang, Chi
Zhang, Yang
Wang, Kanlirong
Li, Wang
Wang, Junqi
A Machine Learning Algorithm for Predicting the Risk of Developing to M1b Stage of Patients With Germ Cell Testicular Cancer
title A Machine Learning Algorithm for Predicting the Risk of Developing to M1b Stage of Patients With Germ Cell Testicular Cancer
title_full A Machine Learning Algorithm for Predicting the Risk of Developing to M1b Stage of Patients With Germ Cell Testicular Cancer
title_fullStr A Machine Learning Algorithm for Predicting the Risk of Developing to M1b Stage of Patients With Germ Cell Testicular Cancer
title_full_unstemmed A Machine Learning Algorithm for Predicting the Risk of Developing to M1b Stage of Patients With Germ Cell Testicular Cancer
title_short A Machine Learning Algorithm for Predicting the Risk of Developing to M1b Stage of Patients With Germ Cell Testicular Cancer
title_sort machine learning algorithm for predicting the risk of developing to m1b stage of patients with germ cell testicular cancer
topic Public Health
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9277219/
https://www.ncbi.nlm.nih.gov/pubmed/35844840
http://dx.doi.org/10.3389/fpubh.2022.916513
work_keys_str_mv AT dingli amachinelearningalgorithmforpredictingtheriskofdevelopingtom1bstageofpatientswithgermcelltesticularcancer
AT wangkun amachinelearningalgorithmforpredictingtheriskofdevelopingtom1bstageofpatientswithgermcelltesticularcancer
AT zhangchi amachinelearningalgorithmforpredictingtheriskofdevelopingtom1bstageofpatientswithgermcelltesticularcancer
AT zhangyang amachinelearningalgorithmforpredictingtheriskofdevelopingtom1bstageofpatientswithgermcelltesticularcancer
AT wangkanlirong amachinelearningalgorithmforpredictingtheriskofdevelopingtom1bstageofpatientswithgermcelltesticularcancer
AT liwang amachinelearningalgorithmforpredictingtheriskofdevelopingtom1bstageofpatientswithgermcelltesticularcancer
AT wangjunqi amachinelearningalgorithmforpredictingtheriskofdevelopingtom1bstageofpatientswithgermcelltesticularcancer
AT dingli machinelearningalgorithmforpredictingtheriskofdevelopingtom1bstageofpatientswithgermcelltesticularcancer
AT wangkun machinelearningalgorithmforpredictingtheriskofdevelopingtom1bstageofpatientswithgermcelltesticularcancer
AT zhangchi machinelearningalgorithmforpredictingtheriskofdevelopingtom1bstageofpatientswithgermcelltesticularcancer
AT zhangyang machinelearningalgorithmforpredictingtheriskofdevelopingtom1bstageofpatientswithgermcelltesticularcancer
AT wangkanlirong machinelearningalgorithmforpredictingtheriskofdevelopingtom1bstageofpatientswithgermcelltesticularcancer
AT liwang machinelearningalgorithmforpredictingtheriskofdevelopingtom1bstageofpatientswithgermcelltesticularcancer
AT wangjunqi machinelearningalgorithmforpredictingtheriskofdevelopingtom1bstageofpatientswithgermcelltesticularcancer