Cargando…

Segmentation Uncertainty Estimation as a Sanity Check for Image Biomarker Studies

SIMPLE SUMMARY: Radiomics is referred to as quantitative image biomarker analysis. Due to the uncertainty in image acquisition, processing, and segmentation (delineation) protocols, the radiomic biomarkers lack reproducibility. In this manuscript, we show how this protocol-induced uncertainty can dr...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhovannik, Ivan, Bontempi, Dennis, Romita, Alessio, Pfaehler, Elisabeth, Primakov, Sergey, Dekker, Andre, Bussink, Johan, Traverso, Alberto, Monshouwer, René
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8909427/
https://www.ncbi.nlm.nih.gov/pubmed/35267597
http://dx.doi.org/10.3390/cancers14051288
_version_ 1784666146881929216
author Zhovannik, Ivan
Bontempi, Dennis
Romita, Alessio
Pfaehler, Elisabeth
Primakov, Sergey
Dekker, Andre
Bussink, Johan
Traverso, Alberto
Monshouwer, René
author_facet Zhovannik, Ivan
Bontempi, Dennis
Romita, Alessio
Pfaehler, Elisabeth
Primakov, Sergey
Dekker, Andre
Bussink, Johan
Traverso, Alberto
Monshouwer, René
author_sort Zhovannik, Ivan
collection PubMed
description SIMPLE SUMMARY: Radiomics is referred to as quantitative image biomarker analysis. Due to the uncertainty in image acquisition, processing, and segmentation (delineation) protocols, the radiomic biomarkers lack reproducibility. In this manuscript, we show how this protocol-induced uncertainty can drastically reduce prognostic model performance and propose some insights on how to use it for developing better prognostic models. ABSTRACT: Problem. Image biomarker analysis, also known as radiomics, is a tool for tissue characterization and treatment prognosis that relies on routinely acquired clinical images and delineations. Due to the uncertainty in image acquisition, processing, and segmentation (delineation) protocols, radiomics often lack reproducibility. Radiomics harmonization techniques have been proposed as a solution to reduce these sources of uncertainty and/or their influence on the prognostic model performance. A relevant question is how to estimate the protocol-induced uncertainty of a specific image biomarker, what the effect is on the model performance, and how to optimize the model given the uncertainty. Methods. Two non-small cell lung cancer (NSCLC) cohorts, composed of 421 and 240 patients, respectively, were used for training and testing. Per patient, a Monte Carlo algorithm was used to generate three hundred synthetic contours with a surface dice tolerance measure of less than 1.18 mm with respect to the original GTV. These contours were subsequently used to derive 104 radiomic features, which were ranked on their relative sensitivity to contour perturbation, expressed in the parameter η. The top four (low η) and the bottom four (high η) features were selected for two models based on the Cox proportional hazards model. To investigate the influence of segmentation uncertainty on the prognostic model, we trained and tested the setup in 5000 augmented realizations (using a Monte Carlo sampling method); the log-rank test was used to assess the stratification performance and stability of segmentation uncertainty. Results. Although both low and high η setup showed significant testing set log-rank p-values (p = 0.01) in the original GTV delineations (without segmentation uncertainty introduced), in the model with high uncertainty, to effect ratio, only around 30% of the augmented realizations resulted in model performance with p < 0.05 in the test set. In contrast, the low η setup performed with a log-rank p < 0.05 in 90% of the augmented realizations. Moreover, the high η setup classification was uncertain in its predictions for 50% of the subjects in the testing set (for 80% agreement rate), whereas the low η setup was uncertain only in 10% of the cases. Discussion. Estimating image biomarker model performance based only on the original GTV segmentation, without considering segmentation, uncertainty may be deceiving. The model might result in a significant stratification performance, but can be unstable for delineation variations, which are inherent to manual segmentation. Simulating segmentation uncertainty using the method described allows for more stable image biomarker estimation, selection, and model development. The segmentation uncertainty estimation method described here is universal and can be extended to estimate other protocol uncertainties (such as image acquisition and pre-processing).
format Online
Article
Text
id pubmed-8909427
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-89094272022-03-11 Segmentation Uncertainty Estimation as a Sanity Check for Image Biomarker Studies Zhovannik, Ivan Bontempi, Dennis Romita, Alessio Pfaehler, Elisabeth Primakov, Sergey Dekker, Andre Bussink, Johan Traverso, Alberto Monshouwer, René Cancers (Basel) Article SIMPLE SUMMARY: Radiomics is referred to as quantitative image biomarker analysis. Due to the uncertainty in image acquisition, processing, and segmentation (delineation) protocols, the radiomic biomarkers lack reproducibility. In this manuscript, we show how this protocol-induced uncertainty can drastically reduce prognostic model performance and propose some insights on how to use it for developing better prognostic models. ABSTRACT: Problem. Image biomarker analysis, also known as radiomics, is a tool for tissue characterization and treatment prognosis that relies on routinely acquired clinical images and delineations. Due to the uncertainty in image acquisition, processing, and segmentation (delineation) protocols, radiomics often lack reproducibility. Radiomics harmonization techniques have been proposed as a solution to reduce these sources of uncertainty and/or their influence on the prognostic model performance. A relevant question is how to estimate the protocol-induced uncertainty of a specific image biomarker, what the effect is on the model performance, and how to optimize the model given the uncertainty. Methods. Two non-small cell lung cancer (NSCLC) cohorts, composed of 421 and 240 patients, respectively, were used for training and testing. Per patient, a Monte Carlo algorithm was used to generate three hundred synthetic contours with a surface dice tolerance measure of less than 1.18 mm with respect to the original GTV. These contours were subsequently used to derive 104 radiomic features, which were ranked on their relative sensitivity to contour perturbation, expressed in the parameter η. The top four (low η) and the bottom four (high η) features were selected for two models based on the Cox proportional hazards model. To investigate the influence of segmentation uncertainty on the prognostic model, we trained and tested the setup in 5000 augmented realizations (using a Monte Carlo sampling method); the log-rank test was used to assess the stratification performance and stability of segmentation uncertainty. Results. Although both low and high η setup showed significant testing set log-rank p-values (p = 0.01) in the original GTV delineations (without segmentation uncertainty introduced), in the model with high uncertainty, to effect ratio, only around 30% of the augmented realizations resulted in model performance with p < 0.05 in the test set. In contrast, the low η setup performed with a log-rank p < 0.05 in 90% of the augmented realizations. Moreover, the high η setup classification was uncertain in its predictions for 50% of the subjects in the testing set (for 80% agreement rate), whereas the low η setup was uncertain only in 10% of the cases. Discussion. Estimating image biomarker model performance based only on the original GTV segmentation, without considering segmentation, uncertainty may be deceiving. The model might result in a significant stratification performance, but can be unstable for delineation variations, which are inherent to manual segmentation. Simulating segmentation uncertainty using the method described allows for more stable image biomarker estimation, selection, and model development. The segmentation uncertainty estimation method described here is universal and can be extended to estimate other protocol uncertainties (such as image acquisition and pre-processing). MDPI 2022-03-02 /pmc/articles/PMC8909427/ /pubmed/35267597 http://dx.doi.org/10.3390/cancers14051288 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zhovannik, Ivan
Bontempi, Dennis
Romita, Alessio
Pfaehler, Elisabeth
Primakov, Sergey
Dekker, Andre
Bussink, Johan
Traverso, Alberto
Monshouwer, René
Segmentation Uncertainty Estimation as a Sanity Check for Image Biomarker Studies
title Segmentation Uncertainty Estimation as a Sanity Check for Image Biomarker Studies
title_full Segmentation Uncertainty Estimation as a Sanity Check for Image Biomarker Studies
title_fullStr Segmentation Uncertainty Estimation as a Sanity Check for Image Biomarker Studies
title_full_unstemmed Segmentation Uncertainty Estimation as a Sanity Check for Image Biomarker Studies
title_short Segmentation Uncertainty Estimation as a Sanity Check for Image Biomarker Studies
title_sort segmentation uncertainty estimation as a sanity check for image biomarker studies
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8909427/
https://www.ncbi.nlm.nih.gov/pubmed/35267597
http://dx.doi.org/10.3390/cancers14051288
work_keys_str_mv AT zhovannikivan segmentationuncertaintyestimationasasanitycheckforimagebiomarkerstudies
AT bontempidennis segmentationuncertaintyestimationasasanitycheckforimagebiomarkerstudies
AT romitaalessio segmentationuncertaintyestimationasasanitycheckforimagebiomarkerstudies
AT pfaehlerelisabeth segmentationuncertaintyestimationasasanitycheckforimagebiomarkerstudies
AT primakovsergey segmentationuncertaintyestimationasasanitycheckforimagebiomarkerstudies
AT dekkerandre segmentationuncertaintyestimationasasanitycheckforimagebiomarkerstudies
AT bussinkjohan segmentationuncertaintyestimationasasanitycheckforimagebiomarkerstudies
AT traversoalberto segmentationuncertaintyestimationasasanitycheckforimagebiomarkerstudies
AT monshouwerrene segmentationuncertaintyestimationasasanitycheckforimagebiomarkerstudies