Cargando…

Comparison of the output of a deep learning segmentation model for locoregional breast cancer radiotherapy trained on 2 different datasets

INTRODUCTION: The development of deep learning (DL) models for auto-segmentation is increasing and more models become commercially available. Mostly, commercial models are trained on external data. To study the effect of using a model trained on external data, compared to the same model trained on i...

Descripción completa

Detalles Bibliográficos
Autores principales: Bakx, Nienke, van der Sangen, Maurice, Theuws, Jacqueline, Bluemink, Hanneke, Hurkmans, Coen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10199413/
https://www.ncbi.nlm.nih.gov/pubmed/37213441
http://dx.doi.org/10.1016/j.tipsro.2023.100209
_version_ 1785044928581074944
author Bakx, Nienke
van der Sangen, Maurice
Theuws, Jacqueline
Bluemink, Hanneke
Hurkmans, Coen
author_facet Bakx, Nienke
van der Sangen, Maurice
Theuws, Jacqueline
Bluemink, Hanneke
Hurkmans, Coen
author_sort Bakx, Nienke
collection PubMed
description INTRODUCTION: The development of deep learning (DL) models for auto-segmentation is increasing and more models become commercially available. Mostly, commercial models are trained on external data. To study the effect of using a model trained on external data, compared to the same model trained on in-house collected data, the performance of these two DL models was evaluated. METHODS: The evaluation was performed using in-house collected data of 30 breast cancer patients. Quantitative analysis was performed using Dice similarity coefficient (DSC), surface DSC (sDSC) and 95th percentile of Hausdorff Distance (95% HD). These values were compared with previously reported inter-observer variations (IOV). RESULTS: For a number of structures, statistically significant differences were found between the two models. For organs at risk, mean values for DSC ranged from 0.63 to 0.98 and 0.71 to 0.96 for the in-house and external model, respectively. For target volumes, mean DSC values of 0.57 to 0.94 and 0.33 to 0.92 were found. The difference of 95% HD values ranged 0.08 to 3.23 mm between the two models, except for CTVn4 with 9.95 mm. For the external model, both DSC and 95% HD are outside the range of IOV for CTVn4, whereas this is the case for the DSC found for the thyroid of the in-house model. CONCLUSIONS: Statistically significant differences were found between both models, which were mostly within published inter-observer variations, showing clinical usefulness of both models. Our findings could encourage discussion and revision of existing guidelines, to further decrease inter-observer, but also inter-institute variability.
format Online
Article
Text
id pubmed-10199413
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-101994132023-05-21 Comparison of the output of a deep learning segmentation model for locoregional breast cancer radiotherapy trained on 2 different datasets Bakx, Nienke van der Sangen, Maurice Theuws, Jacqueline Bluemink, Hanneke Hurkmans, Coen Tech Innov Patient Support Radiat Oncol Research article INTRODUCTION: The development of deep learning (DL) models for auto-segmentation is increasing and more models become commercially available. Mostly, commercial models are trained on external data. To study the effect of using a model trained on external data, compared to the same model trained on in-house collected data, the performance of these two DL models was evaluated. METHODS: The evaluation was performed using in-house collected data of 30 breast cancer patients. Quantitative analysis was performed using Dice similarity coefficient (DSC), surface DSC (sDSC) and 95th percentile of Hausdorff Distance (95% HD). These values were compared with previously reported inter-observer variations (IOV). RESULTS: For a number of structures, statistically significant differences were found between the two models. For organs at risk, mean values for DSC ranged from 0.63 to 0.98 and 0.71 to 0.96 for the in-house and external model, respectively. For target volumes, mean DSC values of 0.57 to 0.94 and 0.33 to 0.92 were found. The difference of 95% HD values ranged 0.08 to 3.23 mm between the two models, except for CTVn4 with 9.95 mm. For the external model, both DSC and 95% HD are outside the range of IOV for CTVn4, whereas this is the case for the DSC found for the thyroid of the in-house model. CONCLUSIONS: Statistically significant differences were found between both models, which were mostly within published inter-observer variations, showing clinical usefulness of both models. Our findings could encourage discussion and revision of existing guidelines, to further decrease inter-observer, but also inter-institute variability. Elsevier 2023-05-13 /pmc/articles/PMC10199413/ /pubmed/37213441 http://dx.doi.org/10.1016/j.tipsro.2023.100209 Text en © 2023 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Research article
Bakx, Nienke
van der Sangen, Maurice
Theuws, Jacqueline
Bluemink, Hanneke
Hurkmans, Coen
Comparison of the output of a deep learning segmentation model for locoregional breast cancer radiotherapy trained on 2 different datasets
title Comparison of the output of a deep learning segmentation model for locoregional breast cancer radiotherapy trained on 2 different datasets
title_full Comparison of the output of a deep learning segmentation model for locoregional breast cancer radiotherapy trained on 2 different datasets
title_fullStr Comparison of the output of a deep learning segmentation model for locoregional breast cancer radiotherapy trained on 2 different datasets
title_full_unstemmed Comparison of the output of a deep learning segmentation model for locoregional breast cancer radiotherapy trained on 2 different datasets
title_short Comparison of the output of a deep learning segmentation model for locoregional breast cancer radiotherapy trained on 2 different datasets
title_sort comparison of the output of a deep learning segmentation model for locoregional breast cancer radiotherapy trained on 2 different datasets
topic Research article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10199413/
https://www.ncbi.nlm.nih.gov/pubmed/37213441
http://dx.doi.org/10.1016/j.tipsro.2023.100209
work_keys_str_mv AT bakxnienke comparisonoftheoutputofadeeplearningsegmentationmodelforlocoregionalbreastcancerradiotherapytrainedon2differentdatasets
AT vandersangenmaurice comparisonoftheoutputofadeeplearningsegmentationmodelforlocoregionalbreastcancerradiotherapytrainedon2differentdatasets
AT theuwsjacqueline comparisonoftheoutputofadeeplearningsegmentationmodelforlocoregionalbreastcancerradiotherapytrainedon2differentdatasets
AT blueminkhanneke comparisonoftheoutputofadeeplearningsegmentationmodelforlocoregionalbreastcancerradiotherapytrainedon2differentdatasets
AT hurkmanscoen comparisonoftheoutputofadeeplearningsegmentationmodelforlocoregionalbreastcancerradiotherapytrainedon2differentdatasets