Cargando…

Deep Learning in Large and Multi-Site Structural Brain MR Imaging Datasets

Large, multi-site, heterogeneous brain imaging datasets are increasingly required for the training, validation, and testing of advanced deep learning (DL)-based automated tools, including structural magnetic resonance (MR) image-based diagnostic and treatment monitoring approaches. When assembling a...

Descripción completa

Detalles Bibliográficos
Autores principales: Bento, Mariana, Fantini, Irene, Park, Justin, Rittner, Leticia, Frayne, Richard
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8811356/
https://www.ncbi.nlm.nih.gov/pubmed/35126080
http://dx.doi.org/10.3389/fninf.2021.805669
_version_ 1784644416385843200
author Bento, Mariana
Fantini, Irene
Park, Justin
Rittner, Leticia
Frayne, Richard
author_facet Bento, Mariana
Fantini, Irene
Park, Justin
Rittner, Leticia
Frayne, Richard
author_sort Bento, Mariana
collection PubMed
description Large, multi-site, heterogeneous brain imaging datasets are increasingly required for the training, validation, and testing of advanced deep learning (DL)-based automated tools, including structural magnetic resonance (MR) image-based diagnostic and treatment monitoring approaches. When assembling a number of smaller datasets to form a larger dataset, understanding the underlying variability between different acquisition and processing protocols across the aggregated dataset (termed “batch effects”) is critical. The presence of variation in the training dataset is important as it more closely reflects the true underlying data distribution and, thus, may enhance the overall generalizability of the tool. However, the impact of batch effects must be carefully evaluated in order to avoid undesirable effects that, for example, may reduce performance measures. Batch effects can result from many sources, including differences in acquisition equipment, imaging technique and parameters, as well as applied processing methodologies. Their impact, both beneficial and adversarial, must be considered when developing tools to ensure that their outputs are related to the proposed clinical or research question (i.e., actual disease-related or pathological changes) and are not simply due to the peculiarities of underlying batch effects in the aggregated dataset. We reviewed applications of DL in structural brain MR imaging that aggregated images from neuroimaging datasets, typically acquired at multiple sites. We examined datasets containing both healthy control participants and patients that were acquired using varying acquisition protocols. First, we discussed issues around Data Access and enumerated the key characteristics of some commonly used publicly available brain datasets. Then we reviewed methods for correcting batch effects by exploring the two main classes of approaches: Data Harmonization that uses data standardization, quality control protocols or other similar algorithms and procedures to explicitly understand and minimize unwanted batch effects; and Domain Adaptation that develops DL tools that implicitly handle the batch effects by using approaches to achieve reliable and robust results. In this narrative review, we highlighted the advantages and disadvantages of both classes of DL approaches, and described key challenges to be addressed in future studies.
format Online
Article
Text
id pubmed-8811356
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-88113562022-02-04 Deep Learning in Large and Multi-Site Structural Brain MR Imaging Datasets Bento, Mariana Fantini, Irene Park, Justin Rittner, Leticia Frayne, Richard Front Neuroinform Neuroscience Large, multi-site, heterogeneous brain imaging datasets are increasingly required for the training, validation, and testing of advanced deep learning (DL)-based automated tools, including structural magnetic resonance (MR) image-based diagnostic and treatment monitoring approaches. When assembling a number of smaller datasets to form a larger dataset, understanding the underlying variability between different acquisition and processing protocols across the aggregated dataset (termed “batch effects”) is critical. The presence of variation in the training dataset is important as it more closely reflects the true underlying data distribution and, thus, may enhance the overall generalizability of the tool. However, the impact of batch effects must be carefully evaluated in order to avoid undesirable effects that, for example, may reduce performance measures. Batch effects can result from many sources, including differences in acquisition equipment, imaging technique and parameters, as well as applied processing methodologies. Their impact, both beneficial and adversarial, must be considered when developing tools to ensure that their outputs are related to the proposed clinical or research question (i.e., actual disease-related or pathological changes) and are not simply due to the peculiarities of underlying batch effects in the aggregated dataset. We reviewed applications of DL in structural brain MR imaging that aggregated images from neuroimaging datasets, typically acquired at multiple sites. We examined datasets containing both healthy control participants and patients that were acquired using varying acquisition protocols. First, we discussed issues around Data Access and enumerated the key characteristics of some commonly used publicly available brain datasets. Then we reviewed methods for correcting batch effects by exploring the two main classes of approaches: Data Harmonization that uses data standardization, quality control protocols or other similar algorithms and procedures to explicitly understand and minimize unwanted batch effects; and Domain Adaptation that develops DL tools that implicitly handle the batch effects by using approaches to achieve reliable and robust results. In this narrative review, we highlighted the advantages and disadvantages of both classes of DL approaches, and described key challenges to be addressed in future studies. Frontiers Media S.A. 2022-01-20 /pmc/articles/PMC8811356/ /pubmed/35126080 http://dx.doi.org/10.3389/fninf.2021.805669 Text en Copyright © 2022 Bento, Fantini, Park, Rittner and Frayne. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Bento, Mariana
Fantini, Irene
Park, Justin
Rittner, Leticia
Frayne, Richard
Deep Learning in Large and Multi-Site Structural Brain MR Imaging Datasets
title Deep Learning in Large and Multi-Site Structural Brain MR Imaging Datasets
title_full Deep Learning in Large and Multi-Site Structural Brain MR Imaging Datasets
title_fullStr Deep Learning in Large and Multi-Site Structural Brain MR Imaging Datasets
title_full_unstemmed Deep Learning in Large and Multi-Site Structural Brain MR Imaging Datasets
title_short Deep Learning in Large and Multi-Site Structural Brain MR Imaging Datasets
title_sort deep learning in large and multi-site structural brain mr imaging datasets
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8811356/
https://www.ncbi.nlm.nih.gov/pubmed/35126080
http://dx.doi.org/10.3389/fninf.2021.805669
work_keys_str_mv AT bentomariana deeplearninginlargeandmultisitestructuralbrainmrimagingdatasets
AT fantiniirene deeplearninginlargeandmultisitestructuralbrainmrimagingdatasets
AT parkjustin deeplearninginlargeandmultisitestructuralbrainmrimagingdatasets
AT rittnerleticia deeplearninginlargeandmultisitestructuralbrainmrimagingdatasets
AT fraynerichard deeplearninginlargeandmultisitestructuralbrainmrimagingdatasets