Cargando…
Segmentation Uncertainty Estimation as a Sanity Check for Image Biomarker Studies
SIMPLE SUMMARY: Radiomics is referred to as quantitative image biomarker analysis. Due to the uncertainty in image acquisition, processing, and segmentation (delineation) protocols, the radiomic biomarkers lack reproducibility. In this manuscript, we show how this protocol-induced uncertainty can dr...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8909427/ https://www.ncbi.nlm.nih.gov/pubmed/35267597 http://dx.doi.org/10.3390/cancers14051288 |
_version_ | 1784666146881929216 |
---|---|
author | Zhovannik, Ivan Bontempi, Dennis Romita, Alessio Pfaehler, Elisabeth Primakov, Sergey Dekker, Andre Bussink, Johan Traverso, Alberto Monshouwer, René |
author_facet | Zhovannik, Ivan Bontempi, Dennis Romita, Alessio Pfaehler, Elisabeth Primakov, Sergey Dekker, Andre Bussink, Johan Traverso, Alberto Monshouwer, René |
author_sort | Zhovannik, Ivan |
collection | PubMed |
description | SIMPLE SUMMARY: Radiomics is referred to as quantitative image biomarker analysis. Due to the uncertainty in image acquisition, processing, and segmentation (delineation) protocols, the radiomic biomarkers lack reproducibility. In this manuscript, we show how this protocol-induced uncertainty can drastically reduce prognostic model performance and propose some insights on how to use it for developing better prognostic models. ABSTRACT: Problem. Image biomarker analysis, also known as radiomics, is a tool for tissue characterization and treatment prognosis that relies on routinely acquired clinical images and delineations. Due to the uncertainty in image acquisition, processing, and segmentation (delineation) protocols, radiomics often lack reproducibility. Radiomics harmonization techniques have been proposed as a solution to reduce these sources of uncertainty and/or their influence on the prognostic model performance. A relevant question is how to estimate the protocol-induced uncertainty of a specific image biomarker, what the effect is on the model performance, and how to optimize the model given the uncertainty. Methods. Two non-small cell lung cancer (NSCLC) cohorts, composed of 421 and 240 patients, respectively, were used for training and testing. Per patient, a Monte Carlo algorithm was used to generate three hundred synthetic contours with a surface dice tolerance measure of less than 1.18 mm with respect to the original GTV. These contours were subsequently used to derive 104 radiomic features, which were ranked on their relative sensitivity to contour perturbation, expressed in the parameter η. The top four (low η) and the bottom four (high η) features were selected for two models based on the Cox proportional hazards model. To investigate the influence of segmentation uncertainty on the prognostic model, we trained and tested the setup in 5000 augmented realizations (using a Monte Carlo sampling method); the log-rank test was used to assess the stratification performance and stability of segmentation uncertainty. Results. Although both low and high η setup showed significant testing set log-rank p-values (p = 0.01) in the original GTV delineations (without segmentation uncertainty introduced), in the model with high uncertainty, to effect ratio, only around 30% of the augmented realizations resulted in model performance with p < 0.05 in the test set. In contrast, the low η setup performed with a log-rank p < 0.05 in 90% of the augmented realizations. Moreover, the high η setup classification was uncertain in its predictions for 50% of the subjects in the testing set (for 80% agreement rate), whereas the low η setup was uncertain only in 10% of the cases. Discussion. Estimating image biomarker model performance based only on the original GTV segmentation, without considering segmentation, uncertainty may be deceiving. The model might result in a significant stratification performance, but can be unstable for delineation variations, which are inherent to manual segmentation. Simulating segmentation uncertainty using the method described allows for more stable image biomarker estimation, selection, and model development. The segmentation uncertainty estimation method described here is universal and can be extended to estimate other protocol uncertainties (such as image acquisition and pre-processing). |
format | Online Article Text |
id | pubmed-8909427 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-89094272022-03-11 Segmentation Uncertainty Estimation as a Sanity Check for Image Biomarker Studies Zhovannik, Ivan Bontempi, Dennis Romita, Alessio Pfaehler, Elisabeth Primakov, Sergey Dekker, Andre Bussink, Johan Traverso, Alberto Monshouwer, René Cancers (Basel) Article SIMPLE SUMMARY: Radiomics is referred to as quantitative image biomarker analysis. Due to the uncertainty in image acquisition, processing, and segmentation (delineation) protocols, the radiomic biomarkers lack reproducibility. In this manuscript, we show how this protocol-induced uncertainty can drastically reduce prognostic model performance and propose some insights on how to use it for developing better prognostic models. ABSTRACT: Problem. Image biomarker analysis, also known as radiomics, is a tool for tissue characterization and treatment prognosis that relies on routinely acquired clinical images and delineations. Due to the uncertainty in image acquisition, processing, and segmentation (delineation) protocols, radiomics often lack reproducibility. Radiomics harmonization techniques have been proposed as a solution to reduce these sources of uncertainty and/or their influence on the prognostic model performance. A relevant question is how to estimate the protocol-induced uncertainty of a specific image biomarker, what the effect is on the model performance, and how to optimize the model given the uncertainty. Methods. Two non-small cell lung cancer (NSCLC) cohorts, composed of 421 and 240 patients, respectively, were used for training and testing. Per patient, a Monte Carlo algorithm was used to generate three hundred synthetic contours with a surface dice tolerance measure of less than 1.18 mm with respect to the original GTV. These contours were subsequently used to derive 104 radiomic features, which were ranked on their relative sensitivity to contour perturbation, expressed in the parameter η. The top four (low η) and the bottom four (high η) features were selected for two models based on the Cox proportional hazards model. To investigate the influence of segmentation uncertainty on the prognostic model, we trained and tested the setup in 5000 augmented realizations (using a Monte Carlo sampling method); the log-rank test was used to assess the stratification performance and stability of segmentation uncertainty. Results. Although both low and high η setup showed significant testing set log-rank p-values (p = 0.01) in the original GTV delineations (without segmentation uncertainty introduced), in the model with high uncertainty, to effect ratio, only around 30% of the augmented realizations resulted in model performance with p < 0.05 in the test set. In contrast, the low η setup performed with a log-rank p < 0.05 in 90% of the augmented realizations. Moreover, the high η setup classification was uncertain in its predictions for 50% of the subjects in the testing set (for 80% agreement rate), whereas the low η setup was uncertain only in 10% of the cases. Discussion. Estimating image biomarker model performance based only on the original GTV segmentation, without considering segmentation, uncertainty may be deceiving. The model might result in a significant stratification performance, but can be unstable for delineation variations, which are inherent to manual segmentation. Simulating segmentation uncertainty using the method described allows for more stable image biomarker estimation, selection, and model development. The segmentation uncertainty estimation method described here is universal and can be extended to estimate other protocol uncertainties (such as image acquisition and pre-processing). MDPI 2022-03-02 /pmc/articles/PMC8909427/ /pubmed/35267597 http://dx.doi.org/10.3390/cancers14051288 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Zhovannik, Ivan Bontempi, Dennis Romita, Alessio Pfaehler, Elisabeth Primakov, Sergey Dekker, Andre Bussink, Johan Traverso, Alberto Monshouwer, René Segmentation Uncertainty Estimation as a Sanity Check for Image Biomarker Studies |
title | Segmentation Uncertainty Estimation as a Sanity Check for Image Biomarker Studies |
title_full | Segmentation Uncertainty Estimation as a Sanity Check for Image Biomarker Studies |
title_fullStr | Segmentation Uncertainty Estimation as a Sanity Check for Image Biomarker Studies |
title_full_unstemmed | Segmentation Uncertainty Estimation as a Sanity Check for Image Biomarker Studies |
title_short | Segmentation Uncertainty Estimation as a Sanity Check for Image Biomarker Studies |
title_sort | segmentation uncertainty estimation as a sanity check for image biomarker studies |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8909427/ https://www.ncbi.nlm.nih.gov/pubmed/35267597 http://dx.doi.org/10.3390/cancers14051288 |
work_keys_str_mv | AT zhovannikivan segmentationuncertaintyestimationasasanitycheckforimagebiomarkerstudies AT bontempidennis segmentationuncertaintyestimationasasanitycheckforimagebiomarkerstudies AT romitaalessio segmentationuncertaintyestimationasasanitycheckforimagebiomarkerstudies AT pfaehlerelisabeth segmentationuncertaintyestimationasasanitycheckforimagebiomarkerstudies AT primakovsergey segmentationuncertaintyestimationasasanitycheckforimagebiomarkerstudies AT dekkerandre segmentationuncertaintyestimationasasanitycheckforimagebiomarkerstudies AT bussinkjohan segmentationuncertaintyestimationasasanitycheckforimagebiomarkerstudies AT traversoalberto segmentationuncertaintyestimationasasanitycheckforimagebiomarkerstudies AT monshouwerrene segmentationuncertaintyestimationasasanitycheckforimagebiomarkerstudies |