Cargando…
A Machine Learning Algorithm for Predicting the Risk of Developing to M1b Stage of Patients With Germ Cell Testicular Cancer
OBJECTIVE: Distant metastasis other than non-regional lymph nodes and lung (i.e., M1b stage) significantly contributes to the poor survival prognosis of patients with germ cell testicular cancer (GCTC). The aim of this study was to develop a machine learning (ML) algorithm model to predict the risk...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9277219/ https://www.ncbi.nlm.nih.gov/pubmed/35844840 http://dx.doi.org/10.3389/fpubh.2022.916513 |
_version_ | 1784745908289667072 |
---|---|
author | Ding, Li Wang, Kun Zhang, Chi Zhang, Yang Wang, Kanlirong Li, Wang Wang, Junqi |
author_facet | Ding, Li Wang, Kun Zhang, Chi Zhang, Yang Wang, Kanlirong Li, Wang Wang, Junqi |
author_sort | Ding, Li |
collection | PubMed |
description | OBJECTIVE: Distant metastasis other than non-regional lymph nodes and lung (i.e., M1b stage) significantly contributes to the poor survival prognosis of patients with germ cell testicular cancer (GCTC). The aim of this study was to develop a machine learning (ML) algorithm model to predict the risk of patients with GCTC developing the M1b stage, which can be used to assist in early intervention of patients. METHODS: The clinical and pathological data of patients with GCTC were obtained from the Surveillance, Epidemiology, and End Results (SEER) database. Combing the patient's characteristic variables, we applied six machine learning (ML) algorithms to develop the predictive models, including logistic regression(LR), eXtreme Gradient Boosting (XGBoost), light Gradient Boosting Machine (lightGBM), random forest (RF), multilayer perceptron (MLP), and k-nearest neighbor (kNN). Model performances were evaluated by 10-fold cross-receiver operating characteristic (ROC) curves, which calculated the area under the curve (AUC) of models for predictive accuracy. A total of 54 patients from our own center (October 2006 to June 2021) were collected as the external validation cohort. RESULTS: A total of 4,323 patients eligible for inclusion were screened for enrollment from the SEER database, of which 178 (4.12%) developing M1b stage. Multivariate logistic regression showed that lymph node dissection (LND), T stage, N stage, lung metastases, and distant lymph node metastases were the independent predictors of developing M1b stage risk. The models based on both the XGBoost and RF algorithms showed stable and efficient prediction performance in the training and external validation groups. CONCLUSION: S-stage is not an independent factor for predicting the risk of developing the M1b stage of patients with GCTC. The ML models based on both XGBoost and RF algorithms have high predictive effectiveness and may be used to predict the risk of developing the M1b stage of patients with GCTC, which is of promising value in clinical decision-making. Models still need to be tested with a larger sample of real-world data. |
format | Online Article Text |
id | pubmed-9277219 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-92772192022-07-14 A Machine Learning Algorithm for Predicting the Risk of Developing to M1b Stage of Patients With Germ Cell Testicular Cancer Ding, Li Wang, Kun Zhang, Chi Zhang, Yang Wang, Kanlirong Li, Wang Wang, Junqi Front Public Health Public Health OBJECTIVE: Distant metastasis other than non-regional lymph nodes and lung (i.e., M1b stage) significantly contributes to the poor survival prognosis of patients with germ cell testicular cancer (GCTC). The aim of this study was to develop a machine learning (ML) algorithm model to predict the risk of patients with GCTC developing the M1b stage, which can be used to assist in early intervention of patients. METHODS: The clinical and pathological data of patients with GCTC were obtained from the Surveillance, Epidemiology, and End Results (SEER) database. Combing the patient's characteristic variables, we applied six machine learning (ML) algorithms to develop the predictive models, including logistic regression(LR), eXtreme Gradient Boosting (XGBoost), light Gradient Boosting Machine (lightGBM), random forest (RF), multilayer perceptron (MLP), and k-nearest neighbor (kNN). Model performances were evaluated by 10-fold cross-receiver operating characteristic (ROC) curves, which calculated the area under the curve (AUC) of models for predictive accuracy. A total of 54 patients from our own center (October 2006 to June 2021) were collected as the external validation cohort. RESULTS: A total of 4,323 patients eligible for inclusion were screened for enrollment from the SEER database, of which 178 (4.12%) developing M1b stage. Multivariate logistic regression showed that lymph node dissection (LND), T stage, N stage, lung metastases, and distant lymph node metastases were the independent predictors of developing M1b stage risk. The models based on both the XGBoost and RF algorithms showed stable and efficient prediction performance in the training and external validation groups. CONCLUSION: S-stage is not an independent factor for predicting the risk of developing the M1b stage of patients with GCTC. The ML models based on both XGBoost and RF algorithms have high predictive effectiveness and may be used to predict the risk of developing the M1b stage of patients with GCTC, which is of promising value in clinical decision-making. Models still need to be tested with a larger sample of real-world data. Frontiers Media S.A. 2022-06-29 /pmc/articles/PMC9277219/ /pubmed/35844840 http://dx.doi.org/10.3389/fpubh.2022.916513 Text en Copyright © 2022 Ding, Wang, Zhang, Zhang, Wang, Li and Wang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Public Health Ding, Li Wang, Kun Zhang, Chi Zhang, Yang Wang, Kanlirong Li, Wang Wang, Junqi A Machine Learning Algorithm for Predicting the Risk of Developing to M1b Stage of Patients With Germ Cell Testicular Cancer |
title | A Machine Learning Algorithm for Predicting the Risk of Developing to M1b Stage of Patients With Germ Cell Testicular Cancer |
title_full | A Machine Learning Algorithm for Predicting the Risk of Developing to M1b Stage of Patients With Germ Cell Testicular Cancer |
title_fullStr | A Machine Learning Algorithm for Predicting the Risk of Developing to M1b Stage of Patients With Germ Cell Testicular Cancer |
title_full_unstemmed | A Machine Learning Algorithm for Predicting the Risk of Developing to M1b Stage of Patients With Germ Cell Testicular Cancer |
title_short | A Machine Learning Algorithm for Predicting the Risk of Developing to M1b Stage of Patients With Germ Cell Testicular Cancer |
title_sort | machine learning algorithm for predicting the risk of developing to m1b stage of patients with germ cell testicular cancer |
topic | Public Health |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9277219/ https://www.ncbi.nlm.nih.gov/pubmed/35844840 http://dx.doi.org/10.3389/fpubh.2022.916513 |
work_keys_str_mv | AT dingli amachinelearningalgorithmforpredictingtheriskofdevelopingtom1bstageofpatientswithgermcelltesticularcancer AT wangkun amachinelearningalgorithmforpredictingtheriskofdevelopingtom1bstageofpatientswithgermcelltesticularcancer AT zhangchi amachinelearningalgorithmforpredictingtheriskofdevelopingtom1bstageofpatientswithgermcelltesticularcancer AT zhangyang amachinelearningalgorithmforpredictingtheriskofdevelopingtom1bstageofpatientswithgermcelltesticularcancer AT wangkanlirong amachinelearningalgorithmforpredictingtheriskofdevelopingtom1bstageofpatientswithgermcelltesticularcancer AT liwang amachinelearningalgorithmforpredictingtheriskofdevelopingtom1bstageofpatientswithgermcelltesticularcancer AT wangjunqi amachinelearningalgorithmforpredictingtheriskofdevelopingtom1bstageofpatientswithgermcelltesticularcancer AT dingli machinelearningalgorithmforpredictingtheriskofdevelopingtom1bstageofpatientswithgermcelltesticularcancer AT wangkun machinelearningalgorithmforpredictingtheriskofdevelopingtom1bstageofpatientswithgermcelltesticularcancer AT zhangchi machinelearningalgorithmforpredictingtheriskofdevelopingtom1bstageofpatientswithgermcelltesticularcancer AT zhangyang machinelearningalgorithmforpredictingtheriskofdevelopingtom1bstageofpatientswithgermcelltesticularcancer AT wangkanlirong machinelearningalgorithmforpredictingtheriskofdevelopingtom1bstageofpatientswithgermcelltesticularcancer AT liwang machinelearningalgorithmforpredictingtheriskofdevelopingtom1bstageofpatientswithgermcelltesticularcancer AT wangjunqi machinelearningalgorithmforpredictingtheriskofdevelopingtom1bstageofpatientswithgermcelltesticularcancer |