Cargando…

Interpretable machine learning for predicting pathologic complete response in patients treated with chemoradiation therapy for rectal adenocarcinoma

PURPOSE: Pathologic complete response (pCR) is a critical factor in determining whether patients with rectal cancer (RC) should have surgery after neoadjuvant chemoradiotherapy (nCRT). Currently, a pathologist's histological analysis of surgical specimens is necessary for a reliable assessment...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Du, Lee, Sang Ho, Geng, Huaizhi, Zhong, Haoyu, Plastaras, John, Wojcieszynski, Andrzej, Caruana, Richard, Xiao, Ying
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9771385/
https://www.ncbi.nlm.nih.gov/pubmed/36568580
http://dx.doi.org/10.3389/frai.2022.1059033
_version_ 1784854816199016448
author Wang, Du
Lee, Sang Ho
Geng, Huaizhi
Zhong, Haoyu
Plastaras, John
Wojcieszynski, Andrzej
Caruana, Richard
Xiao, Ying
author_facet Wang, Du
Lee, Sang Ho
Geng, Huaizhi
Zhong, Haoyu
Plastaras, John
Wojcieszynski, Andrzej
Caruana, Richard
Xiao, Ying
author_sort Wang, Du
collection PubMed
description PURPOSE: Pathologic complete response (pCR) is a critical factor in determining whether patients with rectal cancer (RC) should have surgery after neoadjuvant chemoradiotherapy (nCRT). Currently, a pathologist's histological analysis of surgical specimens is necessary for a reliable assessment of pCR. Machine learning (ML) algorithms have the potential to be a non-invasive way for identifying appropriate candidates for non-operative therapy. However, these ML models' interpretability remains challenging. We propose using explainable boosting machine (EBM) to predict the pCR of RC patients following nCRT. METHODS: A total of 296 features were extracted, including clinical parameters (CPs), dose-volume histogram (DVH) parameters from gross tumor volume (GTV) and organs-at-risk, and radiomics (R) and dosiomics (D) features from GTV. R and D features were subcategorized into shape (S), first-order (L1), second-order (L2), and higher-order (L3) local texture features. Multi-view analysis was employed to determine the best set of input feature categories. Boruta was used to select all-relevant features for each input dataset. ML models were trained on 180 cases from our institution, with 37 cases from RTOG 0822 clinical trial serving as the independent dataset for model validation. The performance of EBM in predicting pCR on the test dataset was evaluated using ROC AUC and compared with that of three state-of-the-art black-box models: extreme gradient boosting (XGB), random forest (RF) and support vector machine (SVM). The predictions of all black-box models were interpreted using Shapley additive explanations. RESULTS: The best input feature categories were CP+DVH+S+R_L1+R_L2 for all models, from which Boruta-selected features enabled the EBM, XGB, RF, and SVM models to attain the AUCs of 0.820, 0.828, 0.828, and 0.774, respectively. Although EBM did not achieve the best performance, it provided the best capability for identifying critical turning points in response scores at distinct feature values, revealing that the bladder with maximum dose >50 Gy, and the tumor with maximum2DDiameterColumn >80 mm, elongation <0.55, leastAxisLength >50 mm and lower variance of CT intensities were associated with unfavorable outcomes. CONCLUSIONS: EBM has the potential to enhance the physician's ability to evaluate an ML-based prediction of pCR and has implications for selecting patients for a “watchful waiting” strategy to RC therapy.
format Online
Article
Text
id pubmed-9771385
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-97713852022-12-22 Interpretable machine learning for predicting pathologic complete response in patients treated with chemoradiation therapy for rectal adenocarcinoma Wang, Du Lee, Sang Ho Geng, Huaizhi Zhong, Haoyu Plastaras, John Wojcieszynski, Andrzej Caruana, Richard Xiao, Ying Front Artif Intell Artificial Intelligence PURPOSE: Pathologic complete response (pCR) is a critical factor in determining whether patients with rectal cancer (RC) should have surgery after neoadjuvant chemoradiotherapy (nCRT). Currently, a pathologist's histological analysis of surgical specimens is necessary for a reliable assessment of pCR. Machine learning (ML) algorithms have the potential to be a non-invasive way for identifying appropriate candidates for non-operative therapy. However, these ML models' interpretability remains challenging. We propose using explainable boosting machine (EBM) to predict the pCR of RC patients following nCRT. METHODS: A total of 296 features were extracted, including clinical parameters (CPs), dose-volume histogram (DVH) parameters from gross tumor volume (GTV) and organs-at-risk, and radiomics (R) and dosiomics (D) features from GTV. R and D features were subcategorized into shape (S), first-order (L1), second-order (L2), and higher-order (L3) local texture features. Multi-view analysis was employed to determine the best set of input feature categories. Boruta was used to select all-relevant features for each input dataset. ML models were trained on 180 cases from our institution, with 37 cases from RTOG 0822 clinical trial serving as the independent dataset for model validation. The performance of EBM in predicting pCR on the test dataset was evaluated using ROC AUC and compared with that of three state-of-the-art black-box models: extreme gradient boosting (XGB), random forest (RF) and support vector machine (SVM). The predictions of all black-box models were interpreted using Shapley additive explanations. RESULTS: The best input feature categories were CP+DVH+S+R_L1+R_L2 for all models, from which Boruta-selected features enabled the EBM, XGB, RF, and SVM models to attain the AUCs of 0.820, 0.828, 0.828, and 0.774, respectively. Although EBM did not achieve the best performance, it provided the best capability for identifying critical turning points in response scores at distinct feature values, revealing that the bladder with maximum dose >50 Gy, and the tumor with maximum2DDiameterColumn >80 mm, elongation <0.55, leastAxisLength >50 mm and lower variance of CT intensities were associated with unfavorable outcomes. CONCLUSIONS: EBM has the potential to enhance the physician's ability to evaluate an ML-based prediction of pCR and has implications for selecting patients for a “watchful waiting” strategy to RC therapy. Frontiers Media S.A. 2022-12-07 /pmc/articles/PMC9771385/ /pubmed/36568580 http://dx.doi.org/10.3389/frai.2022.1059033 Text en Copyright © 2022 Wang, Lee, Geng, Zhong, Plastaras, Wojcieszynski, Caruana and Xiao. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Artificial Intelligence
Wang, Du
Lee, Sang Ho
Geng, Huaizhi
Zhong, Haoyu
Plastaras, John
Wojcieszynski, Andrzej
Caruana, Richard
Xiao, Ying
Interpretable machine learning for predicting pathologic complete response in patients treated with chemoradiation therapy for rectal adenocarcinoma
title Interpretable machine learning for predicting pathologic complete response in patients treated with chemoradiation therapy for rectal adenocarcinoma
title_full Interpretable machine learning for predicting pathologic complete response in patients treated with chemoradiation therapy for rectal adenocarcinoma
title_fullStr Interpretable machine learning for predicting pathologic complete response in patients treated with chemoradiation therapy for rectal adenocarcinoma
title_full_unstemmed Interpretable machine learning for predicting pathologic complete response in patients treated with chemoradiation therapy for rectal adenocarcinoma
title_short Interpretable machine learning for predicting pathologic complete response in patients treated with chemoradiation therapy for rectal adenocarcinoma
title_sort interpretable machine learning for predicting pathologic complete response in patients treated with chemoradiation therapy for rectal adenocarcinoma
topic Artificial Intelligence
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9771385/
https://www.ncbi.nlm.nih.gov/pubmed/36568580
http://dx.doi.org/10.3389/frai.2022.1059033
work_keys_str_mv AT wangdu interpretablemachinelearningforpredictingpathologiccompleteresponseinpatientstreatedwithchemoradiationtherapyforrectaladenocarcinoma
AT leesangho interpretablemachinelearningforpredictingpathologiccompleteresponseinpatientstreatedwithchemoradiationtherapyforrectaladenocarcinoma
AT genghuaizhi interpretablemachinelearningforpredictingpathologiccompleteresponseinpatientstreatedwithchemoradiationtherapyforrectaladenocarcinoma
AT zhonghaoyu interpretablemachinelearningforpredictingpathologiccompleteresponseinpatientstreatedwithchemoradiationtherapyforrectaladenocarcinoma
AT plastarasjohn interpretablemachinelearningforpredictingpathologiccompleteresponseinpatientstreatedwithchemoradiationtherapyforrectaladenocarcinoma
AT wojcieszynskiandrzej interpretablemachinelearningforpredictingpathologiccompleteresponseinpatientstreatedwithchemoradiationtherapyforrectaladenocarcinoma
AT caruanarichard interpretablemachinelearningforpredictingpathologiccompleteresponseinpatientstreatedwithchemoradiationtherapyforrectaladenocarcinoma
AT xiaoying interpretablemachinelearningforpredictingpathologiccompleteresponseinpatientstreatedwithchemoradiationtherapyforrectaladenocarcinoma