
Development, comparison, and validation of four intelligent, practical machine learning models for patients with prostate-specific antigen in the gray zone

PURPOSE: Machine learning prediction models based on LogisticRegression, XGBoost, GaussianNB, and LGBMClassifier for patients in the prostate-specific antigen gray zone are to be developed and compared, identifying valuable predictors. Predictive models are to be integrated into actual clinical deci...


Bibliographic Details
Main Authors: Liu, Taobin; Zhang, Xiaoming; Chen, Ru; Deng, Xinxi; Fu, Bin
Format: Online Article Text
Language: English
Published: Frontiers Media S.A. 2023
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10285702/
https://www.ncbi.nlm.nih.gov/pubmed/37361597
http://dx.doi.org/10.3389/fonc.2023.1157384
_version_ 1785061662775050240
author Liu, Taobin
Zhang, Xiaoming
Chen, Ru
Deng, Xinxi
Fu, Bin
collection PubMed
description PURPOSE: To develop and compare machine learning prediction models based on LogisticRegression, XGBoost, GaussianNB, and LGBMClassifier for patients in the prostate-specific antigen gray zone, to identify valuable predictors, and to integrate the predictive models into actual clinical decisions.
METHODS: Patient information was collected from December 01, 2014 to December 01, 2022 at the Department of Urology, The First Affiliated Hospital of Nanchang University. Patients with a pathological diagnosis of prostate hyperplasia or prostate cancer (any PCa) and a prostate-specific antigen (PSA) level of 4–10 ng/mL before prostate puncture were included in the initial information collection. Eventually, 756 patients were selected. Age, total prostate-specific antigen (tPSA), free prostate-specific antigen (fPSA), fPSA/tPSA, prostate volume (PV), prostate-specific antigen density (PSAD), (fPSA/tPSA)/PSAD, and the prostate MRI results of these patients were recorded. After univariate and multivariate logistic analyses, the statistically significant predictors were used to build and compare machine learning models based on LogisticRegression, XGBoost, GaussianNB, and LGBMClassifier and to determine the more valuable predictors.
RESULTS: Machine learning prediction models based on LogisticRegression, XGBoost, GaussianNB, and LGBMClassifier exhibited higher predictive power than individual metrics. The area under the curve (AUC) (95% CI), accuracy, sensitivity, specificity, positive predictive value, negative predictive value, and F1 score of each model were, respectively:
LogisticRegression: 0.932 (0.881–0.983), 0.792, 0.824, 0.919, 0.652, 0.920, and 0.728
XGBoost: 0.813 (0.723–0.904), 0.771, 0.800, 0.768, 0.737, 0.793, and 0.767
GaussianNB: 0.902 (0.843–0.962), 0.813, 0.875, 0.819, 0.600, 0.909, and 0.712
LGBMClassifier: 0.886 (0.809–0.963), 0.833, 0.882, 0.806, 0.725, 0.911, and 0.796
The LogisticRegression model had the highest AUC among all prediction models, and the difference between its AUC and those of XGBoost, GaussianNB, and LGBMClassifier was statistically significant (p < 0.001).
CONCLUSION: Machine learning prediction models based on the LogisticRegression, XGBoost, GaussianNB, and LGBMClassifier algorithms exhibit superior predictive performance for patients in the PSA gray zone, with the LogisticRegression model yielding the best prediction. These predictive models can be used for actual clinical decision-making.
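The abstract above names several derived predictors (fPSA/tPSA, PSAD, (fPSA/tPSA)/PSAD) and several classification metrics without spelling out how they are computed. The following is a minimal Python sketch using the standard textbook definitions, assuming PSAD is tPSA divided by prostate volume. The function names, the example patient values, and the confusion-matrix counts are hypothetical illustrations and are not taken from the article or its data.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DerivedPredictors:
    f_t_ratio: float        # fPSA / tPSA
    psad: float             # tPSA / PV (assumed standard definition)
    ratio_over_psad: float  # (fPSA / tPSA) / PSAD


def derive_predictors(tpsa: float, fpsa: float, pv: float) -> DerivedPredictors:
    """Compute the derived predictors from tPSA (ng/mL), fPSA (ng/mL), PV (mL)."""
    if tpsa <= 0 or pv <= 0:
        raise ValueError("tPSA and PV must be positive")
    f_t_ratio = fpsa / tpsa
    psad = tpsa / pv
    return DerivedPredictors(f_t_ratio, psad, f_t_ratio / psad)


def binary_metrics(tp: int, fp: int, fn: int, tn: int) -> dict:
    """Standard metrics from a 2x2 confusion matrix (positive = cancer)."""
    if min(tp + fn, tn + fp, tp + fp, tn + fn) == 0:
        raise ValueError("every row and column of the matrix must be non-empty")
    sensitivity = tp / (tp + fn)  # true positive rate
    specificity = tn / (tn + fp)  # true negative rate
    ppv = tp / (tp + fp)          # positive predictive value
    npv = tn / (tn + fn)          # negative predictive value
    return {
        "accuracy": (tp + tn) / (tp + fp + fn + tn),
        "sensitivity": sensitivity,
        "specificity": specificity,
        "ppv": ppv,
        "npv": npv,
        # F1 is the harmonic mean of PPV (precision) and sensitivity (recall).
        "f1": 2 * ppv * sensitivity / (ppv + sensitivity),
    }


if __name__ == "__main__":
    # Hypothetical gray-zone patient: tPSA 6.0 ng/mL, fPSA 1.2 ng/mL, PV 40 mL.
    print(derive_predictors(tpsa=6.0, fpsa=1.2, pv=40.0))
    # Hypothetical test-set counts, not the study's results.
    for name, value in binary_metrics(tp=30, fp=10, fn=5, tn=55).items():
        print(f"{name}: {value:.3f}")
```

The AUC and its confidence interval are not covered here, because they require per-patient predicted probabilities rather than confusion-matrix counts.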
format Online
Article
Text
id pubmed-10285702
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-10285702 2023-06-23 Front Oncol Oncology Frontiers Media S.A. 2023-06-08 /pmc/articles/PMC10285702/ /pubmed/37361597 http://dx.doi.org/10.3389/fonc.2023.1157384 Text en
Copyright © 2023 Liu, Zhang, Chen, Deng and Fu. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
title Development, comparison, and validation of four intelligent, practical machine learning models for patients with prostate-specific antigen in the gray zone
topic Oncology