Cargando…

Harmonizing Healthy Cohorts to Support Multicenter Studies on Migraine Classification using Brain MRI Data

Multicenter and multi-scanner imaging studies might be needed to provide sample sizes large enough for developing accurate predictive models. However, multicenter studies, which likely include confounding factors due to subtle differences in research participant characteristics, MRI scanners, and im...

Descripción completa

Detalles Bibliográficos
Autores principales: Yoon, Hyunsoo, Schwedt, Todd J., Chong, Catherine D., Olatunde, Oyekanmi, Wu, Teresa
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10327280/
https://www.ncbi.nlm.nih.gov/pubmed/37425905
http://dx.doi.org/10.1101/2023.06.26.23291909
_version_ 1785069590496149504
author Yoon, Hyunsoo
Schwedt, Todd J.
Chong, Catherine D.
Olatunde, Oyekanmi
Wu, Teresa
author_facet Yoon, Hyunsoo
Schwedt, Todd J.
Chong, Catherine D.
Olatunde, Oyekanmi
Wu, Teresa
author_sort Yoon, Hyunsoo
collection PubMed
description Multicenter and multi-scanner imaging studies might be needed to provide sample sizes large enough for developing accurate predictive models. However, multicenter studies, which likely include confounding factors due to subtle differences in research participant characteristics, MRI scanners, and imaging acquisition protocols, might not yield generalizable machine learning models, that is, models developed using one dataset may not be applicable to a different dataset. The generalizability of classification models is key for multi-scanner and multicenter studies, and for providing reproducible results. This study developed a data harmonization strategy to identify healthy controls with similar (homogenous) characteristics from multicenter studies to validate the generalization of machine-learning techniques for classifying individual migraine patients and healthy controls using brain MRI data. The Maximum Mean Discrepancy (MMD) was used to compare the two datasets represented in Geodesic Flow Kernel (GFK) space, capturing the data variabilities for identifying a “healthy core”. A set of homogeneous healthy controls can assist in overcoming some of the unwanted heterogeneity and allow for the development of classification models that have high accuracy when applied to new datasets. Extensive experimental results show the utilization of a healthy core. One dataset consists of 120 individuals (66 with migraine and 54 healthy controls) and another dataset consists of 76 (34 with migraine and 42 healthy controls) individuals. A homogeneous dataset derived from a cohort of healthy controls improves the performance of classification models by about 25% accuracy improvements for both episodic and chronic migraineurs.
format Online
Article
Text
id pubmed-10327280
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-103272802023-07-08 Harmonizing Healthy Cohorts to Support Multicenter Studies on Migraine Classification using Brain MRI Data Yoon, Hyunsoo Schwedt, Todd J. Chong, Catherine D. Olatunde, Oyekanmi Wu, Teresa medRxiv Article Multicenter and multi-scanner imaging studies might be needed to provide sample sizes large enough for developing accurate predictive models. However, multicenter studies, which likely include confounding factors due to subtle differences in research participant characteristics, MRI scanners, and imaging acquisition protocols, might not yield generalizable machine learning models, that is, models developed using one dataset may not be applicable to a different dataset. The generalizability of classification models is key for multi-scanner and multicenter studies, and for providing reproducible results. This study developed a data harmonization strategy to identify healthy controls with similar (homogenous) characteristics from multicenter studies to validate the generalization of machine-learning techniques for classifying individual migraine patients and healthy controls using brain MRI data. The Maximum Mean Discrepancy (MMD) was used to compare the two datasets represented in Geodesic Flow Kernel (GFK) space, capturing the data variabilities for identifying a “healthy core”. A set of homogeneous healthy controls can assist in overcoming some of the unwanted heterogeneity and allow for the development of classification models that have high accuracy when applied to new datasets. Extensive experimental results show the utilization of a healthy core. One dataset consists of 120 individuals (66 with migraine and 54 healthy controls) and another dataset consists of 76 (34 with migraine and 42 healthy controls) individuals. A homogeneous dataset derived from a cohort of healthy controls improves the performance of classification models by about 25% accuracy improvements for both episodic and chronic migraineurs. Cold Spring Harbor Laboratory 2023-06-28 /pmc/articles/PMC10327280/ /pubmed/37425905 http://dx.doi.org/10.1101/2023.06.26.23291909 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle Article
Yoon, Hyunsoo
Schwedt, Todd J.
Chong, Catherine D.
Olatunde, Oyekanmi
Wu, Teresa
Harmonizing Healthy Cohorts to Support Multicenter Studies on Migraine Classification using Brain MRI Data
title Harmonizing Healthy Cohorts to Support Multicenter Studies on Migraine Classification using Brain MRI Data
title_full Harmonizing Healthy Cohorts to Support Multicenter Studies on Migraine Classification using Brain MRI Data
title_fullStr Harmonizing Healthy Cohorts to Support Multicenter Studies on Migraine Classification using Brain MRI Data
title_full_unstemmed Harmonizing Healthy Cohorts to Support Multicenter Studies on Migraine Classification using Brain MRI Data
title_short Harmonizing Healthy Cohorts to Support Multicenter Studies on Migraine Classification using Brain MRI Data
title_sort harmonizing healthy cohorts to support multicenter studies on migraine classification using brain mri data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10327280/
https://www.ncbi.nlm.nih.gov/pubmed/37425905
http://dx.doi.org/10.1101/2023.06.26.23291909
work_keys_str_mv AT yoonhyunsoo harmonizinghealthycohortstosupportmulticenterstudiesonmigraineclassificationusingbrainmridata
AT schwedttoddj harmonizinghealthycohortstosupportmulticenterstudiesonmigraineclassificationusingbrainmridata
AT chongcatherined harmonizinghealthycohortstosupportmulticenterstudiesonmigraineclassificationusingbrainmridata
AT olatundeoyekanmi harmonizinghealthycohortstosupportmulticenterstudiesonmigraineclassificationusingbrainmridata
AT wuteresa harmonizinghealthycohortstosupportmulticenterstudiesonmigraineclassificationusingbrainmridata