Cargando…
Development, comparison, and validation of four intelligent, practical machine learning models for patients with prostate-specific antigen in the gray zone
PURPOSE: Machine learning prediction models based on LogisticRegression, XGBoost, GaussianNB, and LGBMClassifier for patients in the prostate-specific antigen gray zone are to be developed and compared, identifying valuable predictors. Predictive models are to be integrated into actual clinical deci...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10285702/ https://www.ncbi.nlm.nih.gov/pubmed/37361597 http://dx.doi.org/10.3389/fonc.2023.1157384 |
_version_ | 1785061662775050240 |
---|---|
author | Liu, Taobin Zhang, Xiaoming Chen, Ru Deng, Xinxi Fu, Bin |
author_facet | Liu, Taobin Zhang, Xiaoming Chen, Ru Deng, Xinxi Fu, Bin |
author_sort | Liu, Taobin |
collection | PubMed |
description | PURPOSE: Machine learning prediction models based on LogisticRegression, XGBoost, GaussianNB, and LGBMClassifier for patients in the prostate-specific antigen gray zone are to be developed and compared, identifying valuable predictors. Predictive models are to be integrated into actual clinical decisions. METHODS: Patient information was collected from December 01, 2014 to December 01, 2022 from the Department of Urology, The First Affiliated Hospital of Nanchang University. Patients with a pathological diagnosis of prostate hyperplasia or prostate cancer (any PCa) and having a prostate-specific antigen (PSA) level of 4–10 ng/mL before prostate puncture were included in the initial information collection. Eventually, 756 patients were selected. Age, total prostate-specific antigen (tPSA), free prostate-specific antigen (fPSA), fPSA/tPSA, prostate volume (PV), prostate-specific antigen density (PSAD), (fPSA/tPSA)/PSAD, and the prostate MRI results of these patients were recorded. After univariate and multivariate logistic analyses, statistically significant predictors were screened to build and compare machine learning models based on LogisticRegression, XGBoost, GaussianNB, and LGBMClassifier to determine more valuable predictors. RESULTS: Machine learning prediction models based on LogisticRegression, XGBoost, GaussianNB, and LGBMClassifier exhibit higher predictive power than individual metrics. The area under the curve (AUC) (95% CI), accuracy, sensitivity, specificity, positive predictive value, negative predictive value, and F1 score of the LogisticRegression machine learning prediction model were 0.932 (0.881–0.983), 0.792, 0.824, 0.919, 0.652, 0.920, and 0.728, respectively; of the XGBoost machine learning prediction model were 0.813 (0.723–0.904), 0.771, 0.800, 0.768, 0.737, 0.793 and 0.767, respectively; of the GaussianNB machine learning prediction model were 0.902 (0.843–0.962), 0.813, 0.875, 0.819, 0.600, 0.909, and 0.712, respectively; and of the LGBMClassifier machine learning prediction model were 0.886 (0.809–0.963), 0.833, 0.882, 0.806, 0.725, 0.911, and 0.796, respectively. The LogisticRegression machine learning prediction model has the highest AUC among all prediction models, and the difference between the AUC of the LogisticRegression prediction model and those of XGBoost, GaussianNB, and LGBMClassifier is statistically significant (p < 0.001). CONCLUSION: Machine learning prediction models based on LogisticRegression, XGBoost, GaussianNB, and LGBMClassifier algorithms exhibit superior predictability for patients in the PSA gray area, with the LogisticRegression model yielding the best prediction. The aforementioned predictive models can be used for actual clinical decision-making. |
format | Online Article Text |
id | pubmed-10285702 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-102857022023-06-23 Development, comparison, and validation of four intelligent, practical machine learning models for patients with prostate-specific antigen in the gray zone Liu, Taobin Zhang, Xiaoming Chen, Ru Deng, Xinxi Fu, Bin Front Oncol Oncology PURPOSE: Machine learning prediction models based on LogisticRegression, XGBoost, GaussianNB, and LGBMClassifier for patients in the prostate-specific antigen gray zone are to be developed and compared, identifying valuable predictors. Predictive models are to be integrated into actual clinical decisions. METHODS: Patient information was collected from December 01, 2014 to December 01, 2022 from the Department of Urology, The First Affiliated Hospital of Nanchang University. Patients with a pathological diagnosis of prostate hyperplasia or prostate cancer (any PCa) and having a prostate-specific antigen (PSA) level of 4–10 ng/mL before prostate puncture were included in the initial information collection. Eventually, 756 patients were selected. Age, total prostate-specific antigen (tPSA), free prostate-specific antigen (fPSA), fPSA/tPSA, prostate volume (PV), prostate-specific antigen density (PSAD), (fPSA/tPSA)/PSAD, and the prostate MRI results of these patients were recorded. After univariate and multivariate logistic analyses, statistically significant predictors were screened to build and compare machine learning models based on LogisticRegression, XGBoost, GaussianNB, and LGBMClassifier to determine more valuable predictors. RESULTS: Machine learning prediction models based on LogisticRegression, XGBoost, GaussianNB, and LGBMClassifier exhibit higher predictive power than individual metrics. The area under the curve (AUC) (95% CI), accuracy, sensitivity, specificity, positive predictive value, negative predictive value, and F1 score of the LogisticRegression machine learning prediction model were 0.932 (0.881–0.983), 0.792, 0.824, 0.919, 0.652, 0.920, and 0.728, respectively; of the XGBoost machine learning prediction model were 0.813 (0.723–0.904), 0.771, 0.800, 0.768, 0.737, 0.793 and 0.767, respectively; of the GaussianNB machine learning prediction model were 0.902 (0.843–0.962), 0.813, 0.875, 0.819, 0.600, 0.909, and 0.712, respectively; and of the LGBMClassifier machine learning prediction model were 0.886 (0.809–0.963), 0.833, 0.882, 0.806, 0.725, 0.911, and 0.796, respectively. The LogisticRegression machine learning prediction model has the highest AUC among all prediction models, and the difference between the AUC of the LogisticRegression prediction model and those of XGBoost, GaussianNB, and LGBMClassifier is statistically significant (p < 0.001). CONCLUSION: Machine learning prediction models based on LogisticRegression, XGBoost, GaussianNB, and LGBMClassifier algorithms exhibit superior predictability for patients in the PSA gray area, with the LogisticRegression model yielding the best prediction. The aforementioned predictive models can be used for actual clinical decision-making. Frontiers Media S.A. 2023-06-08 /pmc/articles/PMC10285702/ /pubmed/37361597 http://dx.doi.org/10.3389/fonc.2023.1157384 Text en Copyright © 2023 Liu, Zhang, Chen, Deng and Fu https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Oncology Liu, Taobin Zhang, Xiaoming Chen, Ru Deng, Xinxi Fu, Bin Development, comparison, and validation of four intelligent, practical machine learning models for patients with prostate-specific antigen in the gray zone |
title | Development, comparison, and validation of four intelligent, practical machine learning models for patients with prostate-specific antigen in the gray zone |
title_full | Development, comparison, and validation of four intelligent, practical machine learning models for patients with prostate-specific antigen in the gray zone |
title_fullStr | Development, comparison, and validation of four intelligent, practical machine learning models for patients with prostate-specific antigen in the gray zone |
title_full_unstemmed | Development, comparison, and validation of four intelligent, practical machine learning models for patients with prostate-specific antigen in the gray zone |
title_short | Development, comparison, and validation of four intelligent, practical machine learning models for patients with prostate-specific antigen in the gray zone |
title_sort | development, comparison, and validation of four intelligent, practical machine learning models for patients with prostate-specific antigen in the gray zone |
topic | Oncology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10285702/ https://www.ncbi.nlm.nih.gov/pubmed/37361597 http://dx.doi.org/10.3389/fonc.2023.1157384 |
work_keys_str_mv | AT liutaobin developmentcomparisonandvalidationoffourintelligentpracticalmachinelearningmodelsforpatientswithprostatespecificantigeninthegrayzone AT zhangxiaoming developmentcomparisonandvalidationoffourintelligentpracticalmachinelearningmodelsforpatientswithprostatespecificantigeninthegrayzone AT chenru developmentcomparisonandvalidationoffourintelligentpracticalmachinelearningmodelsforpatientswithprostatespecificantigeninthegrayzone AT dengxinxi developmentcomparisonandvalidationoffourintelligentpracticalmachinelearningmodelsforpatientswithprostatespecificantigeninthegrayzone AT fubin developmentcomparisonandvalidationoffourintelligentpracticalmachinelearningmodelsforpatientswithprostatespecificantigeninthegrayzone |