Cargando…

Toward a standard for the evaluation of PET‐Auto‐Segmentation methods following the recommendations of AAPM task group No. 211: Requirements and implementation

PURPOSE: The aim of this paper is to define the requirements and describe the design and implementation of a standard benchmark tool for evaluation and validation of PET‐auto‐segmentation (PET‐AS) algorithms. This work follows the recommendations of Task Group 211 (TG211) appointed by the American A...

Descripción completa

Detalles Bibliográficos
Autores principales:	Berthon, Beatrice, Spezi, Emiliano, Galavis, Paulina, Shepherd, Tony, Apte, Aditya, Hatt, Mathieu, Fayad, Hadi, De Bernardi, Elisabetta, Soffientini, Chiara D., Ross Schmidtlein, C., El Naqa, Issam, Jeraj, Robert, Lu, Wei, Das, Shiva, Zaidi, Habib, Mawlawi, Osama R., Visvikis, Dimitris, Lee, John A., Kirov, Assen S.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	John Wiley and Sons Inc. 2017
Materias:	QUANTITATIVE IMAGING AND IMAGE PROCESSING
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5575543/ https://www.ncbi.nlm.nih.gov/pubmed/28474819 http://dx.doi.org/10.1002/mp.12312

_version_	1783260068980981760
author	Berthon, Beatrice Spezi, Emiliano Galavis, Paulina Shepherd, Tony Apte, Aditya Hatt, Mathieu Fayad, Hadi De Bernardi, Elisabetta Soffientini, Chiara D. Ross Schmidtlein, C. El Naqa, Issam Jeraj, Robert Lu, Wei Das, Shiva Zaidi, Habib Mawlawi, Osama R. Visvikis, Dimitris Lee, John A. Kirov, Assen S.
author_facet	Berthon, Beatrice Spezi, Emiliano Galavis, Paulina Shepherd, Tony Apte, Aditya Hatt, Mathieu Fayad, Hadi De Bernardi, Elisabetta Soffientini, Chiara D. Ross Schmidtlein, C. El Naqa, Issam Jeraj, Robert Lu, Wei Das, Shiva Zaidi, Habib Mawlawi, Osama R. Visvikis, Dimitris Lee, John A. Kirov, Assen S.
author_sort	Berthon, Beatrice
collection	PubMed
description	PURPOSE: The aim of this paper is to define the requirements and describe the design and implementation of a standard benchmark tool for evaluation and validation of PET‐auto‐segmentation (PET‐AS) algorithms. This work follows the recommendations of Task Group 211 (TG211) appointed by the American Association of Physicists in Medicine (AAPM). METHODS: The recommendations published in the AAPM TG211 report were used to derive a set of required features and to guide the design and structure of a benchmarking software tool. These items included the selection of appropriate representative data and reference contours obtained from established approaches and the description of available metrics. The benchmark was designed in a way that it could be extendable by inclusion of bespoke segmentation methods, while maintaining its main purpose of being a standard testing platform for newly developed PET‐AS methods. An example of implementation of the proposed framework, named PETASset, was built. In this work, a selection of PET‐AS methods representing common approaches to PET image segmentation was evaluated within PETASset for the purpose of testing and demonstrating the capabilities of the software as a benchmark platform. RESULTS: A selection of clinical, physical, and simulated phantom data, including “best estimates” reference contours from macroscopic specimens, simulation template, and CT scans was built into the PETASset application database. Specific metrics such as Dice Similarity Coefficient (DSC), Positive Predictive Value (PPV), and Sensitivity (S), were included to allow the user to compare the results of any given PET‐AS algorithm to the reference contours. In addition, a tool to generate structured reports on the evaluation of the performance of PET‐AS algorithms against the reference contours was built. The variation of the metric agreement values with the reference contours across the PET‐AS methods evaluated for demonstration were between 0.51 and 0.83, 0.44 and 0.86, and 0.61 and 1.00 for DSC, PPV, and the S metric, respectively. Examples of agreement limits were provided to show how the software could be used to evaluate a new algorithm against the existing state‐of‐the art. CONCLUSIONS: PETASset provides a platform that allows standardizing the evaluation and comparison of different PET‐AS methods on a wide range of PET datasets. The developed platform will be available to users willing to evaluate their PET‐AS methods and contribute with more evaluation datasets.
format	Online Article Text
id	pubmed-5575543
institution	National Center for Biotechnology Information
language	English
publishDate	2017
publisher	John Wiley and Sons Inc.
record_format	MEDLINE/PubMed
spelling	pubmed-55755432017-09-18 Toward a standard for the evaluation of PET‐Auto‐Segmentation methods following the recommendations of AAPM task group No. 211: Requirements and implementation Berthon, Beatrice Spezi, Emiliano Galavis, Paulina Shepherd, Tony Apte, Aditya Hatt, Mathieu Fayad, Hadi De Bernardi, Elisabetta Soffientini, Chiara D. Ross Schmidtlein, C. El Naqa, Issam Jeraj, Robert Lu, Wei Das, Shiva Zaidi, Habib Mawlawi, Osama R. Visvikis, Dimitris Lee, John A. Kirov, Assen S. Med Phys QUANTITATIVE IMAGING AND IMAGE PROCESSING PURPOSE: The aim of this paper is to define the requirements and describe the design and implementation of a standard benchmark tool for evaluation and validation of PET‐auto‐segmentation (PET‐AS) algorithms. This work follows the recommendations of Task Group 211 (TG211) appointed by the American Association of Physicists in Medicine (AAPM). METHODS: The recommendations published in the AAPM TG211 report were used to derive a set of required features and to guide the design and structure of a benchmarking software tool. These items included the selection of appropriate representative data and reference contours obtained from established approaches and the description of available metrics. The benchmark was designed in a way that it could be extendable by inclusion of bespoke segmentation methods, while maintaining its main purpose of being a standard testing platform for newly developed PET‐AS methods. An example of implementation of the proposed framework, named PETASset, was built. In this work, a selection of PET‐AS methods representing common approaches to PET image segmentation was evaluated within PETASset for the purpose of testing and demonstrating the capabilities of the software as a benchmark platform. RESULTS: A selection of clinical, physical, and simulated phantom data, including “best estimates” reference contours from macroscopic specimens, simulation template, and CT scans was built into the PETASset application database. Specific metrics such as Dice Similarity Coefficient (DSC), Positive Predictive Value (PPV), and Sensitivity (S), were included to allow the user to compare the results of any given PET‐AS algorithm to the reference contours. In addition, a tool to generate structured reports on the evaluation of the performance of PET‐AS algorithms against the reference contours was built. The variation of the metric agreement values with the reference contours across the PET‐AS methods evaluated for demonstration were between 0.51 and 0.83, 0.44 and 0.86, and 0.61 and 1.00 for DSC, PPV, and the S metric, respectively. Examples of agreement limits were provided to show how the software could be used to evaluate a new algorithm against the existing state‐of‐the art. CONCLUSIONS: PETASset provides a platform that allows standardizing the evaluation and comparison of different PET‐AS methods on a wide range of PET datasets. The developed platform will be available to users willing to evaluate their PET‐AS methods and contribute with more evaluation datasets. John Wiley and Sons Inc. 2017-07-02 2017-08 /pmc/articles/PMC5575543/ /pubmed/28474819 http://dx.doi.org/10.1002/mp.12312 Text en © 2017 The Authors. Medical Physics published by Wiley Periodicals, Inc. on behalf of American Association of Physicists in Medicine. This is an open access article under the terms of the Creative Commons Attribution (http://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle	QUANTITATIVE IMAGING AND IMAGE PROCESSING Berthon, Beatrice Spezi, Emiliano Galavis, Paulina Shepherd, Tony Apte, Aditya Hatt, Mathieu Fayad, Hadi De Bernardi, Elisabetta Soffientini, Chiara D. Ross Schmidtlein, C. El Naqa, Issam Jeraj, Robert Lu, Wei Das, Shiva Zaidi, Habib Mawlawi, Osama R. Visvikis, Dimitris Lee, John A. Kirov, Assen S. Toward a standard for the evaluation of PET‐Auto‐Segmentation methods following the recommendations of AAPM task group No. 211: Requirements and implementation
title	Toward a standard for the evaluation of PET‐Auto‐Segmentation methods following the recommendations of AAPM task group No. 211: Requirements and implementation
title_full	Toward a standard for the evaluation of PET‐Auto‐Segmentation methods following the recommendations of AAPM task group No. 211: Requirements and implementation
title_fullStr	Toward a standard for the evaluation of PET‐Auto‐Segmentation methods following the recommendations of AAPM task group No. 211: Requirements and implementation
title_full_unstemmed	Toward a standard for the evaluation of PET‐Auto‐Segmentation methods following the recommendations of AAPM task group No. 211: Requirements and implementation
title_short	Toward a standard for the evaluation of PET‐Auto‐Segmentation methods following the recommendations of AAPM task group No. 211: Requirements and implementation
title_sort	toward a standard for the evaluation of pet‐auto‐segmentation methods following the recommendations of aapm task group no. 211: requirements and implementation
topic	QUANTITATIVE IMAGING AND IMAGE PROCESSING
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5575543/ https://www.ncbi.nlm.nih.gov/pubmed/28474819 http://dx.doi.org/10.1002/mp.12312
work_keys_str_mv	AT berthonbeatrice towardastandardfortheevaluationofpetautosegmentationmethodsfollowingtherecommendationsofaapmtaskgroupno211requirementsandimplementation AT speziemiliano towardastandardfortheevaluationofpetautosegmentationmethodsfollowingtherecommendationsofaapmtaskgroupno211requirementsandimplementation AT galavispaulina towardastandardfortheevaluationofpetautosegmentationmethodsfollowingtherecommendationsofaapmtaskgroupno211requirementsandimplementation AT shepherdtony towardastandardfortheevaluationofpetautosegmentationmethodsfollowingtherecommendationsofaapmtaskgroupno211requirementsandimplementation AT apteaditya towardastandardfortheevaluationofpetautosegmentationmethodsfollowingtherecommendationsofaapmtaskgroupno211requirementsandimplementation AT hattmathieu towardastandardfortheevaluationofpetautosegmentationmethodsfollowingtherecommendationsofaapmtaskgroupno211requirementsandimplementation AT fayadhadi towardastandardfortheevaluationofpetautosegmentationmethodsfollowingtherecommendationsofaapmtaskgroupno211requirementsandimplementation AT debernardielisabetta towardastandardfortheevaluationofpetautosegmentationmethodsfollowingtherecommendationsofaapmtaskgroupno211requirementsandimplementation AT soffientinichiarad towardastandardfortheevaluationofpetautosegmentationmethodsfollowingtherecommendationsofaapmtaskgroupno211requirementsandimplementation AT rossschmidtleinc towardastandardfortheevaluationofpetautosegmentationmethodsfollowingtherecommendationsofaapmtaskgroupno211requirementsandimplementation AT elnaqaissam towardastandardfortheevaluationofpetautosegmentationmethodsfollowingtherecommendationsofaapmtaskgroupno211requirementsandimplementation AT jerajrobert towardastandardfortheevaluationofpetautosegmentationmethodsfollowingtherecommendationsofaapmtaskgroupno211requirementsandimplementation AT luwei towardastandardfortheevaluationofpetautosegmentationmethodsfollowingtherecommendationsofaapmtaskgroupno211requirementsandimplementation AT dasshiva towardastandardfortheevaluationofpetautosegmentationmethodsfollowingtherecommendationsofaapmtaskgroupno211requirementsandimplementation AT zaidihabib towardastandardfortheevaluationofpetautosegmentationmethodsfollowingtherecommendationsofaapmtaskgroupno211requirementsandimplementation AT mawlawiosamar towardastandardfortheevaluationofpetautosegmentationmethodsfollowingtherecommendationsofaapmtaskgroupno211requirementsandimplementation AT visvikisdimitris towardastandardfortheevaluationofpetautosegmentationmethodsfollowingtherecommendationsofaapmtaskgroupno211requirementsandimplementation AT leejohna towardastandardfortheevaluationofpetautosegmentationmethodsfollowingtherecommendationsofaapmtaskgroupno211requirementsandimplementation AT kirovassens towardastandardfortheevaluationofpetautosegmentationmethodsfollowingtherecommendationsofaapmtaskgroupno211requirementsandimplementation

Toward a standard for the evaluation of PET‐Auto‐Segmentation methods following the recommendations of AAPM task group No. 211: Requirements and implementation

Ejemplares similares