Cargando…

Optimization and validation of (18)F-DCFPyL PET radiomics-based machine learning models in intermediate- to high-risk primary prostate cancer

INTRODUCTION: Radiomics extracted from prostate-specific membrane antigen (PSMA)-PET modeled with machine learning (ML) may be used for prediction of disease risk. However, validation of previously proposed approaches is lacking. We aimed to optimize and validate ML models based on (18)F-DCFPyL-PET...

Descripción completa

Detalles Bibliográficos
Autores principales: Luining, Wietske I., Oprea-Lager, Daniela E., Vis, André N., van Moorselaar, Reindert J. A., Knol, Remco J. J., Wondergem, Maurits, Boellaard, Ronald, Cysouw, Matthijs C. F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10635444/
https://www.ncbi.nlm.nih.gov/pubmed/37943772
http://dx.doi.org/10.1371/journal.pone.0293672
Descripción
Sumario:INTRODUCTION: Radiomics extracted from prostate-specific membrane antigen (PSMA)-PET modeled with machine learning (ML) may be used for prediction of disease risk. However, validation of previously proposed approaches is lacking. We aimed to optimize and validate ML models based on (18)F-DCFPyL-PET radiomics for the prediction of lymph-node involvement (LNI), extracapsular extension (ECE), and postoperative Gleason score (GS) in primary prostate cancer (PCa) patients. METHODS: Patients with intermediate- to high-risk PCa who underwent (18)F-DCFPyL-PET/CT before radical prostatectomy with pelvic lymph-node dissection were evaluated. The training dataset included 72 patients, the internal validation dataset 24 patients, and the external validation dataset 27 patients. PSMA-avid intra-prostatic lesions were delineated semi-automatically on PET and 480 radiomics features were extracted. Conventional PET-metrics were derived for comparative analysis. Segmentation, preprocessing, and ML methods were optimized in repeated 5-fold cross-validation (CV) on the training dataset. The trained models were tested on the combined validation dataset. Combat harmonization was applied to external radiomics data. Model performance was assessed using the receiver-operating-characteristics curve (AUC). RESULTS: The CV-AUCs in the training dataset were 0.88, 0.79 and 0.84 for LNI, ECE, and GS, respectively. In the combined validation dataset, the ML models could significantly predict GS with an AUC of 0.78 (p<0.05). However, validation AUCs for LNI and ECE prediction were not significant (0.57 and 0.63, respectively). Conventional PET metrics-based models had comparable AUCs for LNI (0.59, p>0.05) and ECE (0.66, p>0.05), but a lower AUC for GS (0.73, p<0.05). In general, Combat harmonization improved external validation AUCs (-0.03 to +0.18). CONCLUSION: In internal and external validation, (18)F-DCFPyL-PET radiomics-based ML models predicted high postoperative GS but not LNI or ECE in intermediate- to high-risk PCa. Therefore, the clinical benefit seems to be limited. These results underline the need for external and/or multicenter validation of PET radiomics-based ML model analyses to assess their generalizability.