Cargando…
Segmentation stability of human head and neck cancer medical images for radiotherapy applications under de-identification conditions: Benchmarking data sharing and artificial intelligence use-cases
BACKGROUND: Demand for head and neck cancer (HNC) radiotherapy data in algorithmic development has prompted increased image dataset sharing. Medical images must comply with data protection requirements so that re-use is enabled without disclosing patient identifiers. Defacing, i.e., the removal of f...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10011442/ https://www.ncbi.nlm.nih.gov/pubmed/36925936 http://dx.doi.org/10.3389/fonc.2023.1120392 |
_version_ | 1784906393884557312 |
---|---|
author | Sahlsten, Jaakko Wahid, Kareem A. Glerean, Enrico Jaskari, Joel Naser, Mohamed A. He, Renjie Kann, Benjamin H. Mäkitie, Antti Fuller, Clifton D. Kaski, Kimmo |
author_facet | Sahlsten, Jaakko Wahid, Kareem A. Glerean, Enrico Jaskari, Joel Naser, Mohamed A. He, Renjie Kann, Benjamin H. Mäkitie, Antti Fuller, Clifton D. Kaski, Kimmo |
author_sort | Sahlsten, Jaakko |
collection | PubMed |
description | BACKGROUND: Demand for head and neck cancer (HNC) radiotherapy data in algorithmic development has prompted increased image dataset sharing. Medical images must comply with data protection requirements so that re-use is enabled without disclosing patient identifiers. Defacing, i.e., the removal of facial features from images, is often considered a reasonable compromise between data protection and re-usability for neuroimaging data. While defacing tools have been developed by the neuroimaging community, their acceptability for radiotherapy applications have not been explored. Therefore, this study systematically investigated the impact of available defacing algorithms on HNC organs at risk (OARs). METHODS: A publicly available dataset of magnetic resonance imaging scans for 55 HNC patients with eight segmented OARs (bilateral submandibular glands, parotid glands, level II neck lymph nodes, level III neck lymph nodes) was utilized. Eight publicly available defacing algorithms were investigated: afni_refacer, DeepDefacer, defacer, fsl_deface, mask_face, mri_deface, pydeface, and quickshear. Using a subset of scans where defacing succeeded (N=29), a 5-fold cross-validation 3D U-net based OAR auto-segmentation model was utilized to perform two main experiments: 1.) comparing original and defaced data for training when evaluated on original data; 2.) using original data for training and comparing the model evaluation on original and defaced data. Models were primarily assessed using the Dice similarity coefficient (DSC). RESULTS: Most defacing methods were unable to produce any usable images for evaluation, while mask_face, fsl_deface, and pydeface were unable to remove the face for 29%, 18%, and 24% of subjects, respectively. When using the original data for evaluation, the composite OAR DSC was statistically higher (p ≤ 0.05) for the model trained with the original data with a DSC of 0.760 compared to the mask_face, fsl_deface, and pydeface models with DSCs of 0.742, 0.736, and 0.449, respectively. Moreover, the model trained with original data had decreased performance (p ≤ 0.05) when evaluated on the defaced data with DSCs of 0.673, 0.693, and 0.406 for mask_face, fsl_deface, and pydeface, respectively. CONCLUSION: Defacing algorithms may have a significant impact on HNC OAR auto-segmentation model training and testing. This work highlights the need for further development of HNC-specific image anonymization methods. |
format | Online Article Text |
id | pubmed-10011442 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-100114422023-03-15 Segmentation stability of human head and neck cancer medical images for radiotherapy applications under de-identification conditions: Benchmarking data sharing and artificial intelligence use-cases Sahlsten, Jaakko Wahid, Kareem A. Glerean, Enrico Jaskari, Joel Naser, Mohamed A. He, Renjie Kann, Benjamin H. Mäkitie, Antti Fuller, Clifton D. Kaski, Kimmo Front Oncol Oncology BACKGROUND: Demand for head and neck cancer (HNC) radiotherapy data in algorithmic development has prompted increased image dataset sharing. Medical images must comply with data protection requirements so that re-use is enabled without disclosing patient identifiers. Defacing, i.e., the removal of facial features from images, is often considered a reasonable compromise between data protection and re-usability for neuroimaging data. While defacing tools have been developed by the neuroimaging community, their acceptability for radiotherapy applications have not been explored. Therefore, this study systematically investigated the impact of available defacing algorithms on HNC organs at risk (OARs). METHODS: A publicly available dataset of magnetic resonance imaging scans for 55 HNC patients with eight segmented OARs (bilateral submandibular glands, parotid glands, level II neck lymph nodes, level III neck lymph nodes) was utilized. Eight publicly available defacing algorithms were investigated: afni_refacer, DeepDefacer, defacer, fsl_deface, mask_face, mri_deface, pydeface, and quickshear. Using a subset of scans where defacing succeeded (N=29), a 5-fold cross-validation 3D U-net based OAR auto-segmentation model was utilized to perform two main experiments: 1.) comparing original and defaced data for training when evaluated on original data; 2.) using original data for training and comparing the model evaluation on original and defaced data. Models were primarily assessed using the Dice similarity coefficient (DSC). RESULTS: Most defacing methods were unable to produce any usable images for evaluation, while mask_face, fsl_deface, and pydeface were unable to remove the face for 29%, 18%, and 24% of subjects, respectively. When using the original data for evaluation, the composite OAR DSC was statistically higher (p ≤ 0.05) for the model trained with the original data with a DSC of 0.760 compared to the mask_face, fsl_deface, and pydeface models with DSCs of 0.742, 0.736, and 0.449, respectively. Moreover, the model trained with original data had decreased performance (p ≤ 0.05) when evaluated on the defaced data with DSCs of 0.673, 0.693, and 0.406 for mask_face, fsl_deface, and pydeface, respectively. CONCLUSION: Defacing algorithms may have a significant impact on HNC OAR auto-segmentation model training and testing. This work highlights the need for further development of HNC-specific image anonymization methods. Frontiers Media S.A. 2023-02-28 /pmc/articles/PMC10011442/ /pubmed/36925936 http://dx.doi.org/10.3389/fonc.2023.1120392 Text en Copyright © 2023 Sahlsten, Wahid, Glerean, Jaskari, Naser, He, Kann, Mäkitie, Fuller and Kaski https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Oncology Sahlsten, Jaakko Wahid, Kareem A. Glerean, Enrico Jaskari, Joel Naser, Mohamed A. He, Renjie Kann, Benjamin H. Mäkitie, Antti Fuller, Clifton D. Kaski, Kimmo Segmentation stability of human head and neck cancer medical images for radiotherapy applications under de-identification conditions: Benchmarking data sharing and artificial intelligence use-cases |
title | Segmentation stability of human head and neck cancer medical images for radiotherapy applications under de-identification conditions: Benchmarking data sharing and artificial intelligence use-cases |
title_full | Segmentation stability of human head and neck cancer medical images for radiotherapy applications under de-identification conditions: Benchmarking data sharing and artificial intelligence use-cases |
title_fullStr | Segmentation stability of human head and neck cancer medical images for radiotherapy applications under de-identification conditions: Benchmarking data sharing and artificial intelligence use-cases |
title_full_unstemmed | Segmentation stability of human head and neck cancer medical images for radiotherapy applications under de-identification conditions: Benchmarking data sharing and artificial intelligence use-cases |
title_short | Segmentation stability of human head and neck cancer medical images for radiotherapy applications under de-identification conditions: Benchmarking data sharing and artificial intelligence use-cases |
title_sort | segmentation stability of human head and neck cancer medical images for radiotherapy applications under de-identification conditions: benchmarking data sharing and artificial intelligence use-cases |
topic | Oncology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10011442/ https://www.ncbi.nlm.nih.gov/pubmed/36925936 http://dx.doi.org/10.3389/fonc.2023.1120392 |
work_keys_str_mv | AT sahlstenjaakko segmentationstabilityofhumanheadandneckcancermedicalimagesforradiotherapyapplicationsunderdeidentificationconditionsbenchmarkingdatasharingandartificialintelligenceusecases AT wahidkareema segmentationstabilityofhumanheadandneckcancermedicalimagesforradiotherapyapplicationsunderdeidentificationconditionsbenchmarkingdatasharingandartificialintelligenceusecases AT glereanenrico segmentationstabilityofhumanheadandneckcancermedicalimagesforradiotherapyapplicationsunderdeidentificationconditionsbenchmarkingdatasharingandartificialintelligenceusecases AT jaskarijoel segmentationstabilityofhumanheadandneckcancermedicalimagesforradiotherapyapplicationsunderdeidentificationconditionsbenchmarkingdatasharingandartificialintelligenceusecases AT nasermohameda segmentationstabilityofhumanheadandneckcancermedicalimagesforradiotherapyapplicationsunderdeidentificationconditionsbenchmarkingdatasharingandartificialintelligenceusecases AT herenjie segmentationstabilityofhumanheadandneckcancermedicalimagesforradiotherapyapplicationsunderdeidentificationconditionsbenchmarkingdatasharingandartificialintelligenceusecases AT kannbenjaminh segmentationstabilityofhumanheadandneckcancermedicalimagesforradiotherapyapplicationsunderdeidentificationconditionsbenchmarkingdatasharingandartificialintelligenceusecases AT makitieantti segmentationstabilityofhumanheadandneckcancermedicalimagesforradiotherapyapplicationsunderdeidentificationconditionsbenchmarkingdatasharingandartificialintelligenceusecases AT fullercliftond segmentationstabilityofhumanheadandneckcancermedicalimagesforradiotherapyapplicationsunderdeidentificationconditionsbenchmarkingdatasharingandartificialintelligenceusecases AT kaskikimmo segmentationstabilityofhumanheadandneckcancermedicalimagesforradiotherapyapplicationsunderdeidentificationconditionsbenchmarkingdatasharingandartificialintelligenceusecases |