Cargando…

Development and External Validation of Machine Learning-Based Models for Predicting Lung Metastasis in Kidney Cancer: A Large Population-Based Study

The accuracy of indices widely used to evaluate lung metastasis (LM) in patients with kidney cancer (KC) is insufficient. Therefore, we aimed at developing a model to estimate the risk of developing LM in KC based on a large population size and machine learning algorithms. Demographic and clinicopat...

Descripción completa

Detalles Bibliográficos
Autores principales: Yi, Xinglin, Zhang, Yuhan, Cai, Juan, Hu, Yu, Wen, Kai, Xie, Pan, Yin, Na, Zhou, Xiangdong, Luo, Hu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10299882/
https://www.ncbi.nlm.nih.gov/pubmed/37383704
http://dx.doi.org/10.1155/2023/8001899
_version_ 1785064464692805632
author Yi, Xinglin
Zhang, Yuhan
Cai, Juan
Hu, Yu
Wen, Kai
Xie, Pan
Yin, Na
Zhou, Xiangdong
Luo, Hu
author_facet Yi, Xinglin
Zhang, Yuhan
Cai, Juan
Hu, Yu
Wen, Kai
Xie, Pan
Yin, Na
Zhou, Xiangdong
Luo, Hu
author_sort Yi, Xinglin
collection PubMed
description The accuracy of indices widely used to evaluate lung metastasis (LM) in patients with kidney cancer (KC) is insufficient. Therefore, we aimed at developing a model to estimate the risk of developing LM in KC based on a large population size and machine learning algorithms. Demographic and clinicopathologic variables of patients with KC diagnosed between 2004 and 2017 were retrospectively analyzed. We performed a univariate logistic regression analysis to identify risk factors for LM in patients with KC. Six machine learning (ML) classifiers were established and tuned using the ten-fold cross-validation method. External validation was performed using clinicopathologic information from 492 patients from the Southwest Hospital, Chongqing, China. Algorithm performance was estimated by analyzing the area under the receiver operating characteristic curve (AUC), accuracy, sensitivity, specificity, precision, recall, F1 score, clinical decision analysis (DCA), and clinical utility curve (CUC). A total of 52,714 eligible patients diagnosed with KC were enrolled, of whom 2,618 developed LM. Variables of age, sex, race, T stage, N stage, tumor size, histology, and grade were identified as important for the prediction of LM. The extreme gradient boosting (XGB) algorithm performed better than other models in both the internal validation (AUC: 0.913, sensitivity: 0.873, specificity: 0.809, and F1 score: 0.325) and the external validation (AUC: 0.904, sensitivity: 0.750, specificity: 0.878, and F1 score: 0.364). This study established a predictive model for LM in KC patients based on ML algorithms which showed high accuracy and applicative value. A web-based predictor was built using the XGB model to help clinicians make more rational and personalized decisions.
format Online
Article
Text
id pubmed-10299882
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-102998822023-06-28 Development and External Validation of Machine Learning-Based Models for Predicting Lung Metastasis in Kidney Cancer: A Large Population-Based Study Yi, Xinglin Zhang, Yuhan Cai, Juan Hu, Yu Wen, Kai Xie, Pan Yin, Na Zhou, Xiangdong Luo, Hu Int J Clin Pract Research Article The accuracy of indices widely used to evaluate lung metastasis (LM) in patients with kidney cancer (KC) is insufficient. Therefore, we aimed at developing a model to estimate the risk of developing LM in KC based on a large population size and machine learning algorithms. Demographic and clinicopathologic variables of patients with KC diagnosed between 2004 and 2017 were retrospectively analyzed. We performed a univariate logistic regression analysis to identify risk factors for LM in patients with KC. Six machine learning (ML) classifiers were established and tuned using the ten-fold cross-validation method. External validation was performed using clinicopathologic information from 492 patients from the Southwest Hospital, Chongqing, China. Algorithm performance was estimated by analyzing the area under the receiver operating characteristic curve (AUC), accuracy, sensitivity, specificity, precision, recall, F1 score, clinical decision analysis (DCA), and clinical utility curve (CUC). A total of 52,714 eligible patients diagnosed with KC were enrolled, of whom 2,618 developed LM. Variables of age, sex, race, T stage, N stage, tumor size, histology, and grade were identified as important for the prediction of LM. The extreme gradient boosting (XGB) algorithm performed better than other models in both the internal validation (AUC: 0.913, sensitivity: 0.873, specificity: 0.809, and F1 score: 0.325) and the external validation (AUC: 0.904, sensitivity: 0.750, specificity: 0.878, and F1 score: 0.364). This study established a predictive model for LM in KC patients based on ML algorithms which showed high accuracy and applicative value. A web-based predictor was built using the XGB model to help clinicians make more rational and personalized decisions. Hindawi 2023-06-20 /pmc/articles/PMC10299882/ /pubmed/37383704 http://dx.doi.org/10.1155/2023/8001899 Text en Copyright © 2023 Xinglin Yi et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Yi, Xinglin
Zhang, Yuhan
Cai, Juan
Hu, Yu
Wen, Kai
Xie, Pan
Yin, Na
Zhou, Xiangdong
Luo, Hu
Development and External Validation of Machine Learning-Based Models for Predicting Lung Metastasis in Kidney Cancer: A Large Population-Based Study
title Development and External Validation of Machine Learning-Based Models for Predicting Lung Metastasis in Kidney Cancer: A Large Population-Based Study
title_full Development and External Validation of Machine Learning-Based Models for Predicting Lung Metastasis in Kidney Cancer: A Large Population-Based Study
title_fullStr Development and External Validation of Machine Learning-Based Models for Predicting Lung Metastasis in Kidney Cancer: A Large Population-Based Study
title_full_unstemmed Development and External Validation of Machine Learning-Based Models for Predicting Lung Metastasis in Kidney Cancer: A Large Population-Based Study
title_short Development and External Validation of Machine Learning-Based Models for Predicting Lung Metastasis in Kidney Cancer: A Large Population-Based Study
title_sort development and external validation of machine learning-based models for predicting lung metastasis in kidney cancer: a large population-based study
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10299882/
https://www.ncbi.nlm.nih.gov/pubmed/37383704
http://dx.doi.org/10.1155/2023/8001899
work_keys_str_mv AT yixinglin developmentandexternalvalidationofmachinelearningbasedmodelsforpredictinglungmetastasisinkidneycanceralargepopulationbasedstudy
AT zhangyuhan developmentandexternalvalidationofmachinelearningbasedmodelsforpredictinglungmetastasisinkidneycanceralargepopulationbasedstudy
AT caijuan developmentandexternalvalidationofmachinelearningbasedmodelsforpredictinglungmetastasisinkidneycanceralargepopulationbasedstudy
AT huyu developmentandexternalvalidationofmachinelearningbasedmodelsforpredictinglungmetastasisinkidneycanceralargepopulationbasedstudy
AT wenkai developmentandexternalvalidationofmachinelearningbasedmodelsforpredictinglungmetastasisinkidneycanceralargepopulationbasedstudy
AT xiepan developmentandexternalvalidationofmachinelearningbasedmodelsforpredictinglungmetastasisinkidneycanceralargepopulationbasedstudy
AT yinna developmentandexternalvalidationofmachinelearningbasedmodelsforpredictinglungmetastasisinkidneycanceralargepopulationbasedstudy
AT zhouxiangdong developmentandexternalvalidationofmachinelearningbasedmodelsforpredictinglungmetastasisinkidneycanceralargepopulationbasedstudy
AT luohu developmentandexternalvalidationofmachinelearningbasedmodelsforpredictinglungmetastasisinkidneycanceralargepopulationbasedstudy