Cargando…

Mitigating site effects in covariance for machine learning in neuroimaging data

To acquire larger samples for answering complex questions in neuroscience, researchers have increasingly turned to multi‐site neuroimaging studies. However, these studies are hindered by differences in images acquired across multiple sites. These effects have been shown to bias comparison between si...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Andrew A., Beer, Joanne C., Tustison, Nicholas J., Cook, Philip A., Shinohara, Russell T., Shou, Haochang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley & Sons, Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8837590/
https://www.ncbi.nlm.nih.gov/pubmed/34904312
http://dx.doi.org/10.1002/hbm.25688
_version_ 1784649946936377344
author Chen, Andrew A.
Beer, Joanne C.
Tustison, Nicholas J.
Cook, Philip A.
Shinohara, Russell T.
Shou, Haochang
author_facet Chen, Andrew A.
Beer, Joanne C.
Tustison, Nicholas J.
Cook, Philip A.
Shinohara, Russell T.
Shou, Haochang
author_sort Chen, Andrew A.
collection PubMed
description To acquire larger samples for answering complex questions in neuroscience, researchers have increasingly turned to multi‐site neuroimaging studies. However, these studies are hindered by differences in images acquired across multiple sites. These effects have been shown to bias comparison between sites, mask biologically meaningful associations, and even introduce spurious associations. To address this, the field has focused on harmonizing data by removing site‐related effects in the mean and variance of measurements. Contemporaneously with the increase in popularity of multi‐center imaging, the use of machine learning (ML) in neuroimaging has also become commonplace. These approaches have been shown to provide improved sensitivity, specificity, and power due to their modeling the joint relationship across measurements in the brain. In this work, we demonstrate that methods for removing site effects in mean and variance may not be sufficient for ML. This stems from the fact that such methods fail to address how correlations between measurements can vary across sites. Data from the Alzheimer's Disease Neuroimaging Initiative is used to show that considerable differences in covariance exist across sites and that popular harmonization techniques do not address this issue. We then propose a novel harmonization method called Correcting Covariance Batch Effects (CovBat) that removes site effects in mean, variance, and covariance. We apply CovBat and show that within‐site correlation matrices are successfully harmonized. Furthermore, we find that ML methods are unable to distinguish scanner manufacturer after our proposed harmonization is applied, and that the CovBat‐harmonized data retain accurate prediction of disease group.
format Online
Article
Text
id pubmed-8837590
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley & Sons, Inc.
record_format MEDLINE/PubMed
spelling pubmed-88375902022-02-14 Mitigating site effects in covariance for machine learning in neuroimaging data Chen, Andrew A. Beer, Joanne C. Tustison, Nicholas J. Cook, Philip A. Shinohara, Russell T. Shou, Haochang Hum Brain Mapp Research Articles To acquire larger samples for answering complex questions in neuroscience, researchers have increasingly turned to multi‐site neuroimaging studies. However, these studies are hindered by differences in images acquired across multiple sites. These effects have been shown to bias comparison between sites, mask biologically meaningful associations, and even introduce spurious associations. To address this, the field has focused on harmonizing data by removing site‐related effects in the mean and variance of measurements. Contemporaneously with the increase in popularity of multi‐center imaging, the use of machine learning (ML) in neuroimaging has also become commonplace. These approaches have been shown to provide improved sensitivity, specificity, and power due to their modeling the joint relationship across measurements in the brain. In this work, we demonstrate that methods for removing site effects in mean and variance may not be sufficient for ML. This stems from the fact that such methods fail to address how correlations between measurements can vary across sites. Data from the Alzheimer's Disease Neuroimaging Initiative is used to show that considerable differences in covariance exist across sites and that popular harmonization techniques do not address this issue. We then propose a novel harmonization method called Correcting Covariance Batch Effects (CovBat) that removes site effects in mean, variance, and covariance. We apply CovBat and show that within‐site correlation matrices are successfully harmonized. Furthermore, we find that ML methods are unable to distinguish scanner manufacturer after our proposed harmonization is applied, and that the CovBat‐harmonized data retain accurate prediction of disease group. John Wiley & Sons, Inc. 2021-12-14 /pmc/articles/PMC8837590/ /pubmed/34904312 http://dx.doi.org/10.1002/hbm.25688 Text en © 2021 The Authors. Human Brain Mapping published by Wiley Periodicals LLC. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Articles
Chen, Andrew A.
Beer, Joanne C.
Tustison, Nicholas J.
Cook, Philip A.
Shinohara, Russell T.
Shou, Haochang
Mitigating site effects in covariance for machine learning in neuroimaging data
title Mitigating site effects in covariance for machine learning in neuroimaging data
title_full Mitigating site effects in covariance for machine learning in neuroimaging data
title_fullStr Mitigating site effects in covariance for machine learning in neuroimaging data
title_full_unstemmed Mitigating site effects in covariance for machine learning in neuroimaging data
title_short Mitigating site effects in covariance for machine learning in neuroimaging data
title_sort mitigating site effects in covariance for machine learning in neuroimaging data
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8837590/
https://www.ncbi.nlm.nih.gov/pubmed/34904312
http://dx.doi.org/10.1002/hbm.25688
work_keys_str_mv AT chenandrewa mitigatingsiteeffectsincovarianceformachinelearninginneuroimagingdata
AT beerjoannec mitigatingsiteeffectsincovarianceformachinelearninginneuroimagingdata
AT tustisonnicholasj mitigatingsiteeffectsincovarianceformachinelearninginneuroimagingdata
AT cookphilipa mitigatingsiteeffectsincovarianceformachinelearninginneuroimagingdata
AT shinohararussellt mitigatingsiteeffectsincovarianceformachinelearninginneuroimagingdata
AT shouhaochang mitigatingsiteeffectsincovarianceformachinelearninginneuroimagingdata
AT mitigatingsiteeffectsincovarianceformachinelearninginneuroimagingdata