Cargando…

Development, comparison, and validation of four intelligent, practical machine learning models for patients with prostate-specific antigen in the gray zone

PURPOSE: Machine learning prediction models based on LogisticRegression, XGBoost, GaussianNB, and LGBMClassifier for patients in the prostate-specific antigen gray zone are to be developed and compared, identifying valuable predictors. Predictive models are to be integrated into actual clinical deci...

Descripción completa

Detalles Bibliográficos
Autores principales:	Liu, Taobin, Zhang, Xiaoming, Chen, Ru, Deng, Xinxi, Fu, Bin
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2023
Materias:	Oncology
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10285702/ https://www.ncbi.nlm.nih.gov/pubmed/37361597 http://dx.doi.org/10.3389/fonc.2023.1157384

_version_	1785061662775050240
author	Liu, Taobin Zhang, Xiaoming Chen, Ru Deng, Xinxi Fu, Bin
author_facet	Liu, Taobin Zhang, Xiaoming Chen, Ru Deng, Xinxi Fu, Bin
author_sort	Liu, Taobin
collection	PubMed
description	PURPOSE: Machine learning prediction models based on LogisticRegression, XGBoost, GaussianNB, and LGBMClassifier for patients in the prostate-specific antigen gray zone are to be developed and compared, identifying valuable predictors. Predictive models are to be integrated into actual clinical decisions. METHODS: Patient information was collected from December 01, 2014 to December 01, 2022 from the Department of Urology, The First Affiliated Hospital of Nanchang University. Patients with a pathological diagnosis of prostate hyperplasia or prostate cancer (any PCa) and having a prostate-specific antigen (PSA) level of 4–10 ng/mL before prostate puncture were included in the initial information collection. Eventually, 756 patients were selected. Age, total prostate-specific antigen (tPSA), free prostate-specific antigen (fPSA), fPSA/tPSA, prostate volume (PV), prostate-specific antigen density (PSAD), (fPSA/tPSA)/PSAD, and the prostate MRI results of these patients were recorded. After univariate and multivariate logistic analyses, statistically significant predictors were screened to build and compare machine learning models based on LogisticRegression, XGBoost, GaussianNB, and LGBMClassifier to determine more valuable predictors. RESULTS: Machine learning prediction models based on LogisticRegression, XGBoost, GaussianNB, and LGBMClassifier exhibit higher predictive power than individual metrics. The area under the curve (AUC) (95% CI), accuracy, sensitivity, specificity, positive predictive value, negative predictive value, and F1 score of the LogisticRegression machine learning prediction model were 0.932 (0.881–0.983), 0.792, 0.824, 0.919, 0.652, 0.920, and 0.728, respectively; of the XGBoost machine learning prediction model were 0.813 (0.723–0.904), 0.771, 0.800, 0.768, 0.737, 0.793 and 0.767, respectively; of the GaussianNB machine learning prediction model were 0.902 (0.843–0.962), 0.813, 0.875, 0.819, 0.600, 0.909, and 0.712, respectively; and of the LGBMClassifier machine learning prediction model were 0.886 (0.809–0.963), 0.833, 0.882, 0.806, 0.725, 0.911, and 0.796, respectively. The LogisticRegression machine learning prediction model has the highest AUC among all prediction models, and the difference between the AUC of the LogisticRegression prediction model and those of XGBoost, GaussianNB, and LGBMClassifier is statistically significant (p < 0.001). CONCLUSION: Machine learning prediction models based on LogisticRegression, XGBoost, GaussianNB, and LGBMClassifier algorithms exhibit superior predictability for patients in the PSA gray area, with the LogisticRegression model yielding the best prediction. The aforementioned predictive models can be used for actual clinical decision-making.
format	Online Article Text
id	pubmed-10285702
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-102857022023-06-23 Development, comparison, and validation of four intelligent, practical machine learning models for patients with prostate-specific antigen in the gray zone Liu, Taobin Zhang, Xiaoming Chen, Ru Deng, Xinxi Fu, Bin Front Oncol Oncology PURPOSE: Machine learning prediction models based on LogisticRegression, XGBoost, GaussianNB, and LGBMClassifier for patients in the prostate-specific antigen gray zone are to be developed and compared, identifying valuable predictors. Predictive models are to be integrated into actual clinical decisions. METHODS: Patient information was collected from December 01, 2014 to December 01, 2022 from the Department of Urology, The First Affiliated Hospital of Nanchang University. Patients with a pathological diagnosis of prostate hyperplasia or prostate cancer (any PCa) and having a prostate-specific antigen (PSA) level of 4–10 ng/mL before prostate puncture were included in the initial information collection. Eventually, 756 patients were selected. Age, total prostate-specific antigen (tPSA), free prostate-specific antigen (fPSA), fPSA/tPSA, prostate volume (PV), prostate-specific antigen density (PSAD), (fPSA/tPSA)/PSAD, and the prostate MRI results of these patients were recorded. After univariate and multivariate logistic analyses, statistically significant predictors were screened to build and compare machine learning models based on LogisticRegression, XGBoost, GaussianNB, and LGBMClassifier to determine more valuable predictors. RESULTS: Machine learning prediction models based on LogisticRegression, XGBoost, GaussianNB, and LGBMClassifier exhibit higher predictive power than individual metrics. The area under the curve (AUC) (95% CI), accuracy, sensitivity, specificity, positive predictive value, negative predictive value, and F1 score of the LogisticRegression machine learning prediction model were 0.932 (0.881–0.983), 0.792, 0.824, 0.919, 0.652, 0.920, and 0.728, respectively; of the XGBoost machine learning prediction model were 0.813 (0.723–0.904), 0.771, 0.800, 0.768, 0.737, 0.793 and 0.767, respectively; of the GaussianNB machine learning prediction model were 0.902 (0.843–0.962), 0.813, 0.875, 0.819, 0.600, 0.909, and 0.712, respectively; and of the LGBMClassifier machine learning prediction model were 0.886 (0.809–0.963), 0.833, 0.882, 0.806, 0.725, 0.911, and 0.796, respectively. The LogisticRegression machine learning prediction model has the highest AUC among all prediction models, and the difference between the AUC of the LogisticRegression prediction model and those of XGBoost, GaussianNB, and LGBMClassifier is statistically significant (p < 0.001). CONCLUSION: Machine learning prediction models based on LogisticRegression, XGBoost, GaussianNB, and LGBMClassifier algorithms exhibit superior predictability for patients in the PSA gray area, with the LogisticRegression model yielding the best prediction. The aforementioned predictive models can be used for actual clinical decision-making. Frontiers Media S.A. 2023-06-08 /pmc/articles/PMC10285702/ /pubmed/37361597 http://dx.doi.org/10.3389/fonc.2023.1157384 Text en Copyright © 2023 Liu, Zhang, Chen, Deng and Fu https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Oncology Liu, Taobin Zhang, Xiaoming Chen, Ru Deng, Xinxi Fu, Bin Development, comparison, and validation of four intelligent, practical machine learning models for patients with prostate-specific antigen in the gray zone
title	Development, comparison, and validation of four intelligent, practical machine learning models for patients with prostate-specific antigen in the gray zone
title_full	Development, comparison, and validation of four intelligent, practical machine learning models for patients with prostate-specific antigen in the gray zone
title_fullStr	Development, comparison, and validation of four intelligent, practical machine learning models for patients with prostate-specific antigen in the gray zone
title_full_unstemmed	Development, comparison, and validation of four intelligent, practical machine learning models for patients with prostate-specific antigen in the gray zone
title_short	Development, comparison, and validation of four intelligent, practical machine learning models for patients with prostate-specific antigen in the gray zone
title_sort	development, comparison, and validation of four intelligent, practical machine learning models for patients with prostate-specific antigen in the gray zone
topic	Oncology
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10285702/ https://www.ncbi.nlm.nih.gov/pubmed/37361597 http://dx.doi.org/10.3389/fonc.2023.1157384
work_keys_str_mv	AT liutaobin developmentcomparisonandvalidationoffourintelligentpracticalmachinelearningmodelsforpatientswithprostatespecificantigeninthegrayzone AT zhangxiaoming developmentcomparisonandvalidationoffourintelligentpracticalmachinelearningmodelsforpatientswithprostatespecificantigeninthegrayzone AT chenru developmentcomparisonandvalidationoffourintelligentpracticalmachinelearningmodelsforpatientswithprostatespecificantigeninthegrayzone AT dengxinxi developmentcomparisonandvalidationoffourintelligentpracticalmachinelearningmodelsforpatientswithprostatespecificantigeninthegrayzone AT fubin developmentcomparisonandvalidationoffourintelligentpracticalmachinelearningmodelsforpatientswithprostatespecificantigeninthegrayzone

Development, comparison, and validation of four intelligent, practical machine learning models for patients with prostate-specific antigen in the gray zone

Ejemplares similares