Cargando…

Sample size requirement for achieving multisite harmonization using structural brain MRI features

When data is pooled across multiple sites, the extracted features are confounded by site effects. Harmonization methods attempt to correct these site effects while preserving the biological variability within the features. However, little is known about the sample size requirement for effectively le...

Descripción completa

Detalles Bibliográficos
Autores principales: Parekh, Pravesh, Bhalerao, Gaurav Vivek, John, John P., Venkatasubramanian, G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7615107/
https://www.ncbi.nlm.nih.gov/pubmed/36435343
http://dx.doi.org/10.1016/j.neuroimage.2022.119768
_version_ 1783605698068742144
author Parekh, Pravesh
Bhalerao, Gaurav Vivek
John, John P.
Venkatasubramanian, G.
author_facet Parekh, Pravesh
Bhalerao, Gaurav Vivek
John, John P.
Venkatasubramanian, G.
author_sort Parekh, Pravesh
collection PubMed
description When data is pooled across multiple sites, the extracted features are confounded by site effects. Harmonization methods attempt to correct these site effects while preserving the biological variability within the features. However, little is known about the sample size requirement for effectively learning the harmonization parameters and their relationship with the increasing number of sites. In this study, we performed experiments to find the minimum sample size required to achieve multisite harmonization (using neuroHarmonize) using volumetric and surface features by leveraging the concept of learning curves. Our first two experiments show that site-effects are effectively removed in a univariate and multivariate manner; however, it is essential to regress the effect of covariates from the harmonized data additionally. Our following two experiments with actual and simulated data showed that the minimum sample size required for achieving harmonization grows with the increasing average Mahalanobis distances between the sites and their reference distribution. We conclude by positing a general framework to understand the site effects using the Mahalanobis distance. Further, we provide insights on the various factors in a cross-validation design to achieve optimal inter-site harmonization.
format Online
Article
Text
id pubmed-7615107
institution National Center for Biotechnology Information
language English
publishDate 2022
record_format MEDLINE/PubMed
spelling pubmed-76151072023-09-19 Sample size requirement for achieving multisite harmonization using structural brain MRI features Parekh, Pravesh Bhalerao, Gaurav Vivek John, John P. Venkatasubramanian, G. Neuroimage Article When data is pooled across multiple sites, the extracted features are confounded by site effects. Harmonization methods attempt to correct these site effects while preserving the biological variability within the features. However, little is known about the sample size requirement for effectively learning the harmonization parameters and their relationship with the increasing number of sites. In this study, we performed experiments to find the minimum sample size required to achieve multisite harmonization (using neuroHarmonize) using volumetric and surface features by leveraging the concept of learning curves. Our first two experiments show that site-effects are effectively removed in a univariate and multivariate manner; however, it is essential to regress the effect of covariates from the harmonized data additionally. Our following two experiments with actual and simulated data showed that the minimum sample size required for achieving harmonization grows with the increasing average Mahalanobis distances between the sites and their reference distribution. We conclude by positing a general framework to understand the site effects using the Mahalanobis distance. Further, we provide insights on the various factors in a cross-validation design to achieve optimal inter-site harmonization. 2022-12-01 2022-11-24 /pmc/articles/PMC7615107/ /pubmed/36435343 http://dx.doi.org/10.1016/j.neuroimage.2022.119768 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (https://creativecommons.org/licenses/by-nc-nd/4.0/)
spellingShingle Article
Parekh, Pravesh
Bhalerao, Gaurav Vivek
John, John P.
Venkatasubramanian, G.
Sample size requirement for achieving multisite harmonization using structural brain MRI features
title Sample size requirement for achieving multisite harmonization using structural brain MRI features
title_full Sample size requirement for achieving multisite harmonization using structural brain MRI features
title_fullStr Sample size requirement for achieving multisite harmonization using structural brain MRI features
title_full_unstemmed Sample size requirement for achieving multisite harmonization using structural brain MRI features
title_short Sample size requirement for achieving multisite harmonization using structural brain MRI features
title_sort sample size requirement for achieving multisite harmonization using structural brain mri features
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7615107/
https://www.ncbi.nlm.nih.gov/pubmed/36435343
http://dx.doi.org/10.1016/j.neuroimage.2022.119768
work_keys_str_mv AT parekhpravesh samplesizerequirementforachievingmultisiteharmonizationusingstructuralbrainmrifeatures
AT bhaleraogauravvivek samplesizerequirementforachievingmultisiteharmonizationusingstructuralbrainmrifeatures
AT samplesizerequirementforachievingmultisiteharmonizationusingstructuralbrainmrifeatures
AT johnjohnp samplesizerequirementforachievingmultisiteharmonizationusingstructuralbrainmrifeatures
AT venkatasubramaniang samplesizerequirementforachievingmultisiteharmonizationusingstructuralbrainmrifeatures