Cargando…

Multisite Comparison of MRI Defacing Software Across Multiple Cohorts

With improvements to both scan quality and facial recognition software, there is an increased risk of participants being identified by a 3D render of their structural neuroimaging scans, even when all other personal information has been removed. To prevent this, facial features should be removed bef...

Descripción completa

Detalles Bibliográficos
Autores principales: Theyers, Athena E., Zamyadi, Mojdeh, O'Reilly, Mark, Bartha, Robert, Symons, Sean, MacQueen, Glenda M., Hassel, Stefanie, Lerch, Jason P., Anagnostou, Evdokia, Lam, Raymond W., Frey, Benicio N., Milev, Roumen, Müller, Daniel J., Kennedy, Sidney H., Scott, Christopher J. M., Strother, Stephen C., Arnott, Stephen R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7943842/
https://www.ncbi.nlm.nih.gov/pubmed/33716819
http://dx.doi.org/10.3389/fpsyt.2021.617997
_version_ 1783662578469175296
author Theyers, Athena E.
Zamyadi, Mojdeh
O'Reilly, Mark
Bartha, Robert
Symons, Sean
MacQueen, Glenda M.
Hassel, Stefanie
Lerch, Jason P.
Anagnostou, Evdokia
Lam, Raymond W.
Frey, Benicio N.
Milev, Roumen
Müller, Daniel J.
Kennedy, Sidney H.
Scott, Christopher J. M.
Strother, Stephen C.
Arnott, Stephen R.
author_facet Theyers, Athena E.
Zamyadi, Mojdeh
O'Reilly, Mark
Bartha, Robert
Symons, Sean
MacQueen, Glenda M.
Hassel, Stefanie
Lerch, Jason P.
Anagnostou, Evdokia
Lam, Raymond W.
Frey, Benicio N.
Milev, Roumen
Müller, Daniel J.
Kennedy, Sidney H.
Scott, Christopher J. M.
Strother, Stephen C.
Arnott, Stephen R.
author_sort Theyers, Athena E.
collection PubMed
description With improvements to both scan quality and facial recognition software, there is an increased risk of participants being identified by a 3D render of their structural neuroimaging scans, even when all other personal information has been removed. To prevent this, facial features should be removed before data are shared or openly released, but while there are several publicly available software algorithms to do this, there has been no comprehensive review of their accuracy within the general population. To address this, we tested multiple algorithms on 300 scans from three neuroscience research projects, funded in part by the Ontario Brain Institute, to cover a wide range of ages (3–85 years) and multiple patient cohorts. While skull stripping is more thorough at removing identifiable features, we focused mainly on defacing software, as skull stripping also removes potentially useful information, which may be required for future analyses. We tested six publicly available algorithms (afni_refacer, deepdefacer, mri_deface, mridefacer, pydeface, quickshear), with one skull stripper (FreeSurfer) included for comparison. Accuracy was measured through a pass/fail system with two criteria; one, that all facial features had been removed and two, that no brain tissue was removed in the process. A subset of defaced scans were also run through several preprocessing pipelines to ensure that none of the algorithms would alter the resulting outputs. We found that the success rates varied strongly between defacers, with afni_refacer (89%) and pydeface (83%) having the highest rates, overall. In both cases, the primary source of failure came from a single dataset that the defacer appeared to struggle with - the youngest cohort (3–20 years) for afni_refacer and the oldest (44–85 years) for pydeface, demonstrating that defacer performance not only depends on the data provided, but that this effect varies between algorithms. While there were some very minor differences between the preprocessing results for defaced and original scans, none of these were significant and were within the range of variation between using different NIfTI converters, or using raw DICOM files.
format Online
Article
Text
id pubmed-7943842
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-79438422021-03-11 Multisite Comparison of MRI Defacing Software Across Multiple Cohorts Theyers, Athena E. Zamyadi, Mojdeh O'Reilly, Mark Bartha, Robert Symons, Sean MacQueen, Glenda M. Hassel, Stefanie Lerch, Jason P. Anagnostou, Evdokia Lam, Raymond W. Frey, Benicio N. Milev, Roumen Müller, Daniel J. Kennedy, Sidney H. Scott, Christopher J. M. Strother, Stephen C. Arnott, Stephen R. Front Psychiatry Psychiatry With improvements to both scan quality and facial recognition software, there is an increased risk of participants being identified by a 3D render of their structural neuroimaging scans, even when all other personal information has been removed. To prevent this, facial features should be removed before data are shared or openly released, but while there are several publicly available software algorithms to do this, there has been no comprehensive review of their accuracy within the general population. To address this, we tested multiple algorithms on 300 scans from three neuroscience research projects, funded in part by the Ontario Brain Institute, to cover a wide range of ages (3–85 years) and multiple patient cohorts. While skull stripping is more thorough at removing identifiable features, we focused mainly on defacing software, as skull stripping also removes potentially useful information, which may be required for future analyses. We tested six publicly available algorithms (afni_refacer, deepdefacer, mri_deface, mridefacer, pydeface, quickshear), with one skull stripper (FreeSurfer) included for comparison. Accuracy was measured through a pass/fail system with two criteria; one, that all facial features had been removed and two, that no brain tissue was removed in the process. A subset of defaced scans were also run through several preprocessing pipelines to ensure that none of the algorithms would alter the resulting outputs. We found that the success rates varied strongly between defacers, with afni_refacer (89%) and pydeface (83%) having the highest rates, overall. In both cases, the primary source of failure came from a single dataset that the defacer appeared to struggle with - the youngest cohort (3–20 years) for afni_refacer and the oldest (44–85 years) for pydeface, demonstrating that defacer performance not only depends on the data provided, but that this effect varies between algorithms. While there were some very minor differences between the preprocessing results for defaced and original scans, none of these were significant and were within the range of variation between using different NIfTI converters, or using raw DICOM files. Frontiers Media S.A. 2021-02-24 /pmc/articles/PMC7943842/ /pubmed/33716819 http://dx.doi.org/10.3389/fpsyt.2021.617997 Text en Copyright © 2021 Theyers, Zamyadi, O'Reilly, Bartha, Symons, MacQueen, Hassel, Lerch, Anagnostou, Lam, Frey, Milev, Müller, Kennedy, Scott, Strother and Arnott. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Psychiatry
Theyers, Athena E.
Zamyadi, Mojdeh
O'Reilly, Mark
Bartha, Robert
Symons, Sean
MacQueen, Glenda M.
Hassel, Stefanie
Lerch, Jason P.
Anagnostou, Evdokia
Lam, Raymond W.
Frey, Benicio N.
Milev, Roumen
Müller, Daniel J.
Kennedy, Sidney H.
Scott, Christopher J. M.
Strother, Stephen C.
Arnott, Stephen R.
Multisite Comparison of MRI Defacing Software Across Multiple Cohorts
title Multisite Comparison of MRI Defacing Software Across Multiple Cohorts
title_full Multisite Comparison of MRI Defacing Software Across Multiple Cohorts
title_fullStr Multisite Comparison of MRI Defacing Software Across Multiple Cohorts
title_full_unstemmed Multisite Comparison of MRI Defacing Software Across Multiple Cohorts
title_short Multisite Comparison of MRI Defacing Software Across Multiple Cohorts
title_sort multisite comparison of mri defacing software across multiple cohorts
topic Psychiatry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7943842/
https://www.ncbi.nlm.nih.gov/pubmed/33716819
http://dx.doi.org/10.3389/fpsyt.2021.617997
work_keys_str_mv AT theyersathenae multisitecomparisonofmridefacingsoftwareacrossmultiplecohorts
AT zamyadimojdeh multisitecomparisonofmridefacingsoftwareacrossmultiplecohorts
AT oreillymark multisitecomparisonofmridefacingsoftwareacrossmultiplecohorts
AT bartharobert multisitecomparisonofmridefacingsoftwareacrossmultiplecohorts
AT symonssean multisitecomparisonofmridefacingsoftwareacrossmultiplecohorts
AT macqueenglendam multisitecomparisonofmridefacingsoftwareacrossmultiplecohorts
AT hasselstefanie multisitecomparisonofmridefacingsoftwareacrossmultiplecohorts
AT lerchjasonp multisitecomparisonofmridefacingsoftwareacrossmultiplecohorts
AT anagnostouevdokia multisitecomparisonofmridefacingsoftwareacrossmultiplecohorts
AT lamraymondw multisitecomparisonofmridefacingsoftwareacrossmultiplecohorts
AT freybenicion multisitecomparisonofmridefacingsoftwareacrossmultiplecohorts
AT milevroumen multisitecomparisonofmridefacingsoftwareacrossmultiplecohorts
AT mullerdanielj multisitecomparisonofmridefacingsoftwareacrossmultiplecohorts
AT kennedysidneyh multisitecomparisonofmridefacingsoftwareacrossmultiplecohorts
AT scottchristopherjm multisitecomparisonofmridefacingsoftwareacrossmultiplecohorts
AT strotherstephenc multisitecomparisonofmridefacingsoftwareacrossmultiplecohorts
AT arnottstephenr multisitecomparisonofmridefacingsoftwareacrossmultiplecohorts