Cargando…

The suitability of common metrics for assessing parotid and larynx autosegmentation accuracy

Contouring structures in the head and neck is time‐consuming, and automatic segmentation is an important part of an adaptive radiotherapy workflow. Geometric accuracy of automatic segmentation algorithms has been widely reported, but there is no consensus as to which metrics provide clinically meani...

Descripción completa

Detalles Bibliográficos
Autores principales: Beasley, William J., McWilliam, Alan, Aitkenhead, Adam, Mackay, Ranald I., Rowbottom, Carl G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5875550/
https://www.ncbi.nlm.nih.gov/pubmed/27074471
http://dx.doi.org/10.1120/jacmp.v17i2.5889
_version_ 1783310367585206272
author Beasley, William J.
McWilliam, Alan
Aitkenhead, Adam
Mackay, Ranald I.
Rowbottom, Carl G.
author_facet Beasley, William J.
McWilliam, Alan
Aitkenhead, Adam
Mackay, Ranald I.
Rowbottom, Carl G.
author_sort Beasley, William J.
collection PubMed
description Contouring structures in the head and neck is time‐consuming, and automatic segmentation is an important part of an adaptive radiotherapy workflow. Geometric accuracy of automatic segmentation algorithms has been widely reported, but there is no consensus as to which metrics provide clinically meaningful results. This study investigated whether geometric accuracy (as quantified by several commonly used metrics) was associated with dosimetric differences for the parotid and larynx, comparing automatically generated contours against manually drawn ground truth contours. This enabled the suitability of different commonly used metrics to be assessed for measuring automatic segmentation accuracy of the parotid and larynx. Parotid and larynx structures for 10 head and neck patients were outlined by five clinicians to create ground truth structures. An automatic segmentation algorithm was used to create automatically generated normal structures, which were then used to create volumetric‐modulated arc therapy plans. The mean doses to the automatically generated structures were compared with those of the corresponding ground truth structures, and the relative difference in mean dose was calculated for each structure. It was found that this difference did not correlate with the geometric accuracy provided by several metrics, notably the Dice similarity coefficient, which is a commonly used measure of spatial overlap. Surface‐based metrics provided stronger correlation and are, therefore, more suitable for assessing automatic segmentation of the parotid and larynx. PACS number(s): 87.57.nm, 87.55.D, 87.55.Qr
format Online
Article
Text
id pubmed-5875550
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-58755502018-04-02 The suitability of common metrics for assessing parotid and larynx autosegmentation accuracy Beasley, William J. McWilliam, Alan Aitkenhead, Adam Mackay, Ranald I. Rowbottom, Carl G. J Appl Clin Med Phys Radiation Oncology Physics Contouring structures in the head and neck is time‐consuming, and automatic segmentation is an important part of an adaptive radiotherapy workflow. Geometric accuracy of automatic segmentation algorithms has been widely reported, but there is no consensus as to which metrics provide clinically meaningful results. This study investigated whether geometric accuracy (as quantified by several commonly used metrics) was associated with dosimetric differences for the parotid and larynx, comparing automatically generated contours against manually drawn ground truth contours. This enabled the suitability of different commonly used metrics to be assessed for measuring automatic segmentation accuracy of the parotid and larynx. Parotid and larynx structures for 10 head and neck patients were outlined by five clinicians to create ground truth structures. An automatic segmentation algorithm was used to create automatically generated normal structures, which were then used to create volumetric‐modulated arc therapy plans. The mean doses to the automatically generated structures were compared with those of the corresponding ground truth structures, and the relative difference in mean dose was calculated for each structure. It was found that this difference did not correlate with the geometric accuracy provided by several metrics, notably the Dice similarity coefficient, which is a commonly used measure of spatial overlap. Surface‐based metrics provided stronger correlation and are, therefore, more suitable for assessing automatic segmentation of the parotid and larynx. PACS number(s): 87.57.nm, 87.55.D, 87.55.Qr John Wiley and Sons Inc. 2016-03-08 /pmc/articles/PMC5875550/ /pubmed/27074471 http://dx.doi.org/10.1120/jacmp.v17i2.5889 Text en © 2016 The Authors. This is an open access article under the terms of the http://creativecommons.org/licenses/by/3.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Radiation Oncology Physics
Beasley, William J.
McWilliam, Alan
Aitkenhead, Adam
Mackay, Ranald I.
Rowbottom, Carl G.
The suitability of common metrics for assessing parotid and larynx autosegmentation accuracy
title The suitability of common metrics for assessing parotid and larynx autosegmentation accuracy
title_full The suitability of common metrics for assessing parotid and larynx autosegmentation accuracy
title_fullStr The suitability of common metrics for assessing parotid and larynx autosegmentation accuracy
title_full_unstemmed The suitability of common metrics for assessing parotid and larynx autosegmentation accuracy
title_short The suitability of common metrics for assessing parotid and larynx autosegmentation accuracy
title_sort suitability of common metrics for assessing parotid and larynx autosegmentation accuracy
topic Radiation Oncology Physics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5875550/
https://www.ncbi.nlm.nih.gov/pubmed/27074471
http://dx.doi.org/10.1120/jacmp.v17i2.5889
work_keys_str_mv AT beasleywilliamj thesuitabilityofcommonmetricsforassessingparotidandlarynxautosegmentationaccuracy
AT mcwilliamalan thesuitabilityofcommonmetricsforassessingparotidandlarynxautosegmentationaccuracy
AT aitkenheadadam thesuitabilityofcommonmetricsforassessingparotidandlarynxautosegmentationaccuracy
AT mackayranaldi thesuitabilityofcommonmetricsforassessingparotidandlarynxautosegmentationaccuracy
AT rowbottomcarlg thesuitabilityofcommonmetricsforassessingparotidandlarynxautosegmentationaccuracy
AT beasleywilliamj suitabilityofcommonmetricsforassessingparotidandlarynxautosegmentationaccuracy
AT mcwilliamalan suitabilityofcommonmetricsforassessingparotidandlarynxautosegmentationaccuracy
AT aitkenheadadam suitabilityofcommonmetricsforassessingparotidandlarynxautosegmentationaccuracy
AT mackayranaldi suitabilityofcommonmetricsforassessingparotidandlarynxautosegmentationaccuracy
AT rowbottomcarlg suitabilityofcommonmetricsforassessingparotidandlarynxautosegmentationaccuracy