Cargando…

A Novel Machine Learning Algorithm Combined With Multivariate Analysis for the Prognosis of Renal Collecting Duct Carcinoma

OBJECTIVES: To investigate the clinical and non-clinical characteristics that may affect the prognosis of patients with renal collecting duct carcinoma (CDC) and to develop an accurate prognostic model for this disease. METHODS: The characteristics of 215 CDC patients were obtained from the U.S. Nat...

Descripción completa

Detalles Bibliográficos
Autores principales: Wei, Liwei, Huang, Yongdi, Chen, Zheng, Li, Jinhua, Huang, Guangyi, Qin, Xiaoping, Cui, Lihong, Zhuo, Yumin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8792389/
https://www.ncbi.nlm.nih.gov/pubmed/35096579
http://dx.doi.org/10.3389/fonc.2021.777735
_version_ 1784640350337368064
author Wei, Liwei
Huang, Yongdi
Chen, Zheng
Li, Jinhua
Huang, Guangyi
Qin, Xiaoping
Cui, Lihong
Zhuo, Yumin
author_facet Wei, Liwei
Huang, Yongdi
Chen, Zheng
Li, Jinhua
Huang, Guangyi
Qin, Xiaoping
Cui, Lihong
Zhuo, Yumin
author_sort Wei, Liwei
collection PubMed
description OBJECTIVES: To investigate the clinical and non-clinical characteristics that may affect the prognosis of patients with renal collecting duct carcinoma (CDC) and to develop an accurate prognostic model for this disease. METHODS: The characteristics of 215 CDC patients were obtained from the U.S. National Cancer Institute’s surveillance, epidemiology and end results database from 2004 to 2016. Univariate Cox proportional hazard model and Kaplan-Meier analysis were used to compare the impact of different factors on overall survival (OS). 10 variables were included to establish a machine learning (ML) model. Model performance was evaluated by the receiver operating characteristic curves (ROC) and calibration plots for predictive accuracy and decision curve analysis (DCA) were obtained to estimate its clinical benefits. RESULTS: The median follow-up and survival time was 16 months during which 164 (76.3%) patients died. 4.2, 32.1, 50.7 and 13.0% of patients were histological grade I, II, III, and IV, respectively. At diagnosis up to 61.9% of patients presented with a pT3 stage or higher tumor, and 36.7% of CDC patients had metastatic disease. 10 most clinical and non-clinical factors including M stage, tumor size, T stage, histological grade, N stage, radiotherapy, chemotherapy, age at diagnosis, surgery and the geographical region where the care delivered was either purchased or referred and these were allocated 95, 82, 78, 72, 49, 38, 36, 35, 28 and 21 points, respectively. The points were calculated by the XGBoost according to their importance. The XGBoost models showed the best predictive performance compared with other algorithms. DCA showed our models could be used to support clinical decisions in 1-3-year OS models. CONCLUSIONS: Our ML models had the highest predictive accuracy and net benefits, which may potentially help clinicians to make clinical decisions and follow-up strategies for patients with CDC. Larger studies are needed to better understand this aggressive tumor.
format Online
Article
Text
id pubmed-8792389
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-87923892022-01-28 A Novel Machine Learning Algorithm Combined With Multivariate Analysis for the Prognosis of Renal Collecting Duct Carcinoma Wei, Liwei Huang, Yongdi Chen, Zheng Li, Jinhua Huang, Guangyi Qin, Xiaoping Cui, Lihong Zhuo, Yumin Front Oncol Oncology OBJECTIVES: To investigate the clinical and non-clinical characteristics that may affect the prognosis of patients with renal collecting duct carcinoma (CDC) and to develop an accurate prognostic model for this disease. METHODS: The characteristics of 215 CDC patients were obtained from the U.S. National Cancer Institute’s surveillance, epidemiology and end results database from 2004 to 2016. Univariate Cox proportional hazard model and Kaplan-Meier analysis were used to compare the impact of different factors on overall survival (OS). 10 variables were included to establish a machine learning (ML) model. Model performance was evaluated by the receiver operating characteristic curves (ROC) and calibration plots for predictive accuracy and decision curve analysis (DCA) were obtained to estimate its clinical benefits. RESULTS: The median follow-up and survival time was 16 months during which 164 (76.3%) patients died. 4.2, 32.1, 50.7 and 13.0% of patients were histological grade I, II, III, and IV, respectively. At diagnosis up to 61.9% of patients presented with a pT3 stage or higher tumor, and 36.7% of CDC patients had metastatic disease. 10 most clinical and non-clinical factors including M stage, tumor size, T stage, histological grade, N stage, radiotherapy, chemotherapy, age at diagnosis, surgery and the geographical region where the care delivered was either purchased or referred and these were allocated 95, 82, 78, 72, 49, 38, 36, 35, 28 and 21 points, respectively. The points were calculated by the XGBoost according to their importance. The XGBoost models showed the best predictive performance compared with other algorithms. DCA showed our models could be used to support clinical decisions in 1-3-year OS models. CONCLUSIONS: Our ML models had the highest predictive accuracy and net benefits, which may potentially help clinicians to make clinical decisions and follow-up strategies for patients with CDC. Larger studies are needed to better understand this aggressive tumor. Frontiers Media S.A. 2022-01-13 /pmc/articles/PMC8792389/ /pubmed/35096579 http://dx.doi.org/10.3389/fonc.2021.777735 Text en Copyright © 2022 Wei, Huang, Chen, Li, Huang, Qin, Cui and Zhuo https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Oncology
Wei, Liwei
Huang, Yongdi
Chen, Zheng
Li, Jinhua
Huang, Guangyi
Qin, Xiaoping
Cui, Lihong
Zhuo, Yumin
A Novel Machine Learning Algorithm Combined With Multivariate Analysis for the Prognosis of Renal Collecting Duct Carcinoma
title A Novel Machine Learning Algorithm Combined With Multivariate Analysis for the Prognosis of Renal Collecting Duct Carcinoma
title_full A Novel Machine Learning Algorithm Combined With Multivariate Analysis for the Prognosis of Renal Collecting Duct Carcinoma
title_fullStr A Novel Machine Learning Algorithm Combined With Multivariate Analysis for the Prognosis of Renal Collecting Duct Carcinoma
title_full_unstemmed A Novel Machine Learning Algorithm Combined With Multivariate Analysis for the Prognosis of Renal Collecting Duct Carcinoma
title_short A Novel Machine Learning Algorithm Combined With Multivariate Analysis for the Prognosis of Renal Collecting Duct Carcinoma
title_sort novel machine learning algorithm combined with multivariate analysis for the prognosis of renal collecting duct carcinoma
topic Oncology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8792389/
https://www.ncbi.nlm.nih.gov/pubmed/35096579
http://dx.doi.org/10.3389/fonc.2021.777735
work_keys_str_mv AT weiliwei anovelmachinelearningalgorithmcombinedwithmultivariateanalysisfortheprognosisofrenalcollectingductcarcinoma
AT huangyongdi anovelmachinelearningalgorithmcombinedwithmultivariateanalysisfortheprognosisofrenalcollectingductcarcinoma
AT chenzheng anovelmachinelearningalgorithmcombinedwithmultivariateanalysisfortheprognosisofrenalcollectingductcarcinoma
AT lijinhua anovelmachinelearningalgorithmcombinedwithmultivariateanalysisfortheprognosisofrenalcollectingductcarcinoma
AT huangguangyi anovelmachinelearningalgorithmcombinedwithmultivariateanalysisfortheprognosisofrenalcollectingductcarcinoma
AT qinxiaoping anovelmachinelearningalgorithmcombinedwithmultivariateanalysisfortheprognosisofrenalcollectingductcarcinoma
AT cuilihong anovelmachinelearningalgorithmcombinedwithmultivariateanalysisfortheprognosisofrenalcollectingductcarcinoma
AT zhuoyumin anovelmachinelearningalgorithmcombinedwithmultivariateanalysisfortheprognosisofrenalcollectingductcarcinoma
AT weiliwei novelmachinelearningalgorithmcombinedwithmultivariateanalysisfortheprognosisofrenalcollectingductcarcinoma
AT huangyongdi novelmachinelearningalgorithmcombinedwithmultivariateanalysisfortheprognosisofrenalcollectingductcarcinoma
AT chenzheng novelmachinelearningalgorithmcombinedwithmultivariateanalysisfortheprognosisofrenalcollectingductcarcinoma
AT lijinhua novelmachinelearningalgorithmcombinedwithmultivariateanalysisfortheprognosisofrenalcollectingductcarcinoma
AT huangguangyi novelmachinelearningalgorithmcombinedwithmultivariateanalysisfortheprognosisofrenalcollectingductcarcinoma
AT qinxiaoping novelmachinelearningalgorithmcombinedwithmultivariateanalysisfortheprognosisofrenalcollectingductcarcinoma
AT cuilihong novelmachinelearningalgorithmcombinedwithmultivariateanalysisfortheprognosisofrenalcollectingductcarcinoma
AT zhuoyumin novelmachinelearningalgorithmcombinedwithmultivariateanalysisfortheprognosisofrenalcollectingductcarcinoma