Cargando…
Interpretable machine learning for predicting pathologic complete response in patients treated with chemoradiation therapy for rectal adenocarcinoma
PURPOSE: Pathologic complete response (pCR) is a critical factor in determining whether patients with rectal cancer (RC) should have surgery after neoadjuvant chemoradiotherapy (nCRT). Currently, a pathologist's histological analysis of surgical specimens is necessary for a reliable assessment...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9771385/ https://www.ncbi.nlm.nih.gov/pubmed/36568580 http://dx.doi.org/10.3389/frai.2022.1059033 |
_version_ | 1784854816199016448 |
---|---|
author | Wang, Du Lee, Sang Ho Geng, Huaizhi Zhong, Haoyu Plastaras, John Wojcieszynski, Andrzej Caruana, Richard Xiao, Ying |
author_facet | Wang, Du Lee, Sang Ho Geng, Huaizhi Zhong, Haoyu Plastaras, John Wojcieszynski, Andrzej Caruana, Richard Xiao, Ying |
author_sort | Wang, Du |
collection | PubMed |
description | PURPOSE: Pathologic complete response (pCR) is a critical factor in determining whether patients with rectal cancer (RC) should have surgery after neoadjuvant chemoradiotherapy (nCRT). Currently, a pathologist's histological analysis of surgical specimens is necessary for a reliable assessment of pCR. Machine learning (ML) algorithms have the potential to be a non-invasive way for identifying appropriate candidates for non-operative therapy. However, these ML models' interpretability remains challenging. We propose using explainable boosting machine (EBM) to predict the pCR of RC patients following nCRT. METHODS: A total of 296 features were extracted, including clinical parameters (CPs), dose-volume histogram (DVH) parameters from gross tumor volume (GTV) and organs-at-risk, and radiomics (R) and dosiomics (D) features from GTV. R and D features were subcategorized into shape (S), first-order (L1), second-order (L2), and higher-order (L3) local texture features. Multi-view analysis was employed to determine the best set of input feature categories. Boruta was used to select all-relevant features for each input dataset. ML models were trained on 180 cases from our institution, with 37 cases from RTOG 0822 clinical trial serving as the independent dataset for model validation. The performance of EBM in predicting pCR on the test dataset was evaluated using ROC AUC and compared with that of three state-of-the-art black-box models: extreme gradient boosting (XGB), random forest (RF) and support vector machine (SVM). The predictions of all black-box models were interpreted using Shapley additive explanations. RESULTS: The best input feature categories were CP+DVH+S+R_L1+R_L2 for all models, from which Boruta-selected features enabled the EBM, XGB, RF, and SVM models to attain the AUCs of 0.820, 0.828, 0.828, and 0.774, respectively. Although EBM did not achieve the best performance, it provided the best capability for identifying critical turning points in response scores at distinct feature values, revealing that the bladder with maximum dose >50 Gy, and the tumor with maximum2DDiameterColumn >80 mm, elongation <0.55, leastAxisLength >50 mm and lower variance of CT intensities were associated with unfavorable outcomes. CONCLUSIONS: EBM has the potential to enhance the physician's ability to evaluate an ML-based prediction of pCR and has implications for selecting patients for a “watchful waiting” strategy to RC therapy. |
format | Online Article Text |
id | pubmed-9771385 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-97713852022-12-22 Interpretable machine learning for predicting pathologic complete response in patients treated with chemoradiation therapy for rectal adenocarcinoma Wang, Du Lee, Sang Ho Geng, Huaizhi Zhong, Haoyu Plastaras, John Wojcieszynski, Andrzej Caruana, Richard Xiao, Ying Front Artif Intell Artificial Intelligence PURPOSE: Pathologic complete response (pCR) is a critical factor in determining whether patients with rectal cancer (RC) should have surgery after neoadjuvant chemoradiotherapy (nCRT). Currently, a pathologist's histological analysis of surgical specimens is necessary for a reliable assessment of pCR. Machine learning (ML) algorithms have the potential to be a non-invasive way for identifying appropriate candidates for non-operative therapy. However, these ML models' interpretability remains challenging. We propose using explainable boosting machine (EBM) to predict the pCR of RC patients following nCRT. METHODS: A total of 296 features were extracted, including clinical parameters (CPs), dose-volume histogram (DVH) parameters from gross tumor volume (GTV) and organs-at-risk, and radiomics (R) and dosiomics (D) features from GTV. R and D features were subcategorized into shape (S), first-order (L1), second-order (L2), and higher-order (L3) local texture features. Multi-view analysis was employed to determine the best set of input feature categories. Boruta was used to select all-relevant features for each input dataset. ML models were trained on 180 cases from our institution, with 37 cases from RTOG 0822 clinical trial serving as the independent dataset for model validation. The performance of EBM in predicting pCR on the test dataset was evaluated using ROC AUC and compared with that of three state-of-the-art black-box models: extreme gradient boosting (XGB), random forest (RF) and support vector machine (SVM). The predictions of all black-box models were interpreted using Shapley additive explanations. RESULTS: The best input feature categories were CP+DVH+S+R_L1+R_L2 for all models, from which Boruta-selected features enabled the EBM, XGB, RF, and SVM models to attain the AUCs of 0.820, 0.828, 0.828, and 0.774, respectively. Although EBM did not achieve the best performance, it provided the best capability for identifying critical turning points in response scores at distinct feature values, revealing that the bladder with maximum dose >50 Gy, and the tumor with maximum2DDiameterColumn >80 mm, elongation <0.55, leastAxisLength >50 mm and lower variance of CT intensities were associated with unfavorable outcomes. CONCLUSIONS: EBM has the potential to enhance the physician's ability to evaluate an ML-based prediction of pCR and has implications for selecting patients for a “watchful waiting” strategy to RC therapy. Frontiers Media S.A. 2022-12-07 /pmc/articles/PMC9771385/ /pubmed/36568580 http://dx.doi.org/10.3389/frai.2022.1059033 Text en Copyright © 2022 Wang, Lee, Geng, Zhong, Plastaras, Wojcieszynski, Caruana and Xiao. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Artificial Intelligence Wang, Du Lee, Sang Ho Geng, Huaizhi Zhong, Haoyu Plastaras, John Wojcieszynski, Andrzej Caruana, Richard Xiao, Ying Interpretable machine learning for predicting pathologic complete response in patients treated with chemoradiation therapy for rectal adenocarcinoma |
title | Interpretable machine learning for predicting pathologic complete response in patients treated with chemoradiation therapy for rectal adenocarcinoma |
title_full | Interpretable machine learning for predicting pathologic complete response in patients treated with chemoradiation therapy for rectal adenocarcinoma |
title_fullStr | Interpretable machine learning for predicting pathologic complete response in patients treated with chemoradiation therapy for rectal adenocarcinoma |
title_full_unstemmed | Interpretable machine learning for predicting pathologic complete response in patients treated with chemoradiation therapy for rectal adenocarcinoma |
title_short | Interpretable machine learning for predicting pathologic complete response in patients treated with chemoradiation therapy for rectal adenocarcinoma |
title_sort | interpretable machine learning for predicting pathologic complete response in patients treated with chemoradiation therapy for rectal adenocarcinoma |
topic | Artificial Intelligence |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9771385/ https://www.ncbi.nlm.nih.gov/pubmed/36568580 http://dx.doi.org/10.3389/frai.2022.1059033 |
work_keys_str_mv | AT wangdu interpretablemachinelearningforpredictingpathologiccompleteresponseinpatientstreatedwithchemoradiationtherapyforrectaladenocarcinoma AT leesangho interpretablemachinelearningforpredictingpathologiccompleteresponseinpatientstreatedwithchemoradiationtherapyforrectaladenocarcinoma AT genghuaizhi interpretablemachinelearningforpredictingpathologiccompleteresponseinpatientstreatedwithchemoradiationtherapyforrectaladenocarcinoma AT zhonghaoyu interpretablemachinelearningforpredictingpathologiccompleteresponseinpatientstreatedwithchemoradiationtherapyforrectaladenocarcinoma AT plastarasjohn interpretablemachinelearningforpredictingpathologiccompleteresponseinpatientstreatedwithchemoradiationtherapyforrectaladenocarcinoma AT wojcieszynskiandrzej interpretablemachinelearningforpredictingpathologiccompleteresponseinpatientstreatedwithchemoradiationtherapyforrectaladenocarcinoma AT caruanarichard interpretablemachinelearningforpredictingpathologiccompleteresponseinpatientstreatedwithchemoradiationtherapyforrectaladenocarcinoma AT xiaoying interpretablemachinelearningforpredictingpathologiccompleteresponseinpatientstreatedwithchemoradiationtherapyforrectaladenocarcinoma |