Cargando…
Harmonizing Healthy Cohorts to Support Multicenter Studies on Migraine Classification using Brain MRI Data
Multicenter and multi-scanner imaging studies might be needed to provide sample sizes large enough for developing accurate predictive models. However, multicenter studies, which likely include confounding factors due to subtle differences in research participant characteristics, MRI scanners, and im...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10327280/ https://www.ncbi.nlm.nih.gov/pubmed/37425905 http://dx.doi.org/10.1101/2023.06.26.23291909 |
_version_ | 1785069590496149504 |
---|---|
author | Yoon, Hyunsoo Schwedt, Todd J. Chong, Catherine D. Olatunde, Oyekanmi Wu, Teresa |
author_facet | Yoon, Hyunsoo Schwedt, Todd J. Chong, Catherine D. Olatunde, Oyekanmi Wu, Teresa |
author_sort | Yoon, Hyunsoo |
collection | PubMed |
description | Multicenter and multi-scanner imaging studies might be needed to provide sample sizes large enough for developing accurate predictive models. However, multicenter studies, which likely include confounding factors due to subtle differences in research participant characteristics, MRI scanners, and imaging acquisition protocols, might not yield generalizable machine learning models, that is, models developed using one dataset may not be applicable to a different dataset. The generalizability of classification models is key for multi-scanner and multicenter studies, and for providing reproducible results. This study developed a data harmonization strategy to identify healthy controls with similar (homogenous) characteristics from multicenter studies to validate the generalization of machine-learning techniques for classifying individual migraine patients and healthy controls using brain MRI data. The Maximum Mean Discrepancy (MMD) was used to compare the two datasets represented in Geodesic Flow Kernel (GFK) space, capturing the data variabilities for identifying a “healthy core”. A set of homogeneous healthy controls can assist in overcoming some of the unwanted heterogeneity and allow for the development of classification models that have high accuracy when applied to new datasets. Extensive experimental results show the utilization of a healthy core. One dataset consists of 120 individuals (66 with migraine and 54 healthy controls) and another dataset consists of 76 (34 with migraine and 42 healthy controls) individuals. A homogeneous dataset derived from a cohort of healthy controls improves the performance of classification models by about 25% accuracy improvements for both episodic and chronic migraineurs. |
format | Online Article Text |
id | pubmed-10327280 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-103272802023-07-08 Harmonizing Healthy Cohorts to Support Multicenter Studies on Migraine Classification using Brain MRI Data Yoon, Hyunsoo Schwedt, Todd J. Chong, Catherine D. Olatunde, Oyekanmi Wu, Teresa medRxiv Article Multicenter and multi-scanner imaging studies might be needed to provide sample sizes large enough for developing accurate predictive models. However, multicenter studies, which likely include confounding factors due to subtle differences in research participant characteristics, MRI scanners, and imaging acquisition protocols, might not yield generalizable machine learning models, that is, models developed using one dataset may not be applicable to a different dataset. The generalizability of classification models is key for multi-scanner and multicenter studies, and for providing reproducible results. This study developed a data harmonization strategy to identify healthy controls with similar (homogenous) characteristics from multicenter studies to validate the generalization of machine-learning techniques for classifying individual migraine patients and healthy controls using brain MRI data. The Maximum Mean Discrepancy (MMD) was used to compare the two datasets represented in Geodesic Flow Kernel (GFK) space, capturing the data variabilities for identifying a “healthy core”. A set of homogeneous healthy controls can assist in overcoming some of the unwanted heterogeneity and allow for the development of classification models that have high accuracy when applied to new datasets. Extensive experimental results show the utilization of a healthy core. One dataset consists of 120 individuals (66 with migraine and 54 healthy controls) and another dataset consists of 76 (34 with migraine and 42 healthy controls) individuals. A homogeneous dataset derived from a cohort of healthy controls improves the performance of classification models by about 25% accuracy improvements for both episodic and chronic migraineurs. Cold Spring Harbor Laboratory 2023-06-28 /pmc/articles/PMC10327280/ /pubmed/37425905 http://dx.doi.org/10.1101/2023.06.26.23291909 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use. |
spellingShingle | Article Yoon, Hyunsoo Schwedt, Todd J. Chong, Catherine D. Olatunde, Oyekanmi Wu, Teresa Harmonizing Healthy Cohorts to Support Multicenter Studies on Migraine Classification using Brain MRI Data |
title | Harmonizing Healthy Cohorts to Support Multicenter Studies on Migraine Classification using Brain MRI Data |
title_full | Harmonizing Healthy Cohorts to Support Multicenter Studies on Migraine Classification using Brain MRI Data |
title_fullStr | Harmonizing Healthy Cohorts to Support Multicenter Studies on Migraine Classification using Brain MRI Data |
title_full_unstemmed | Harmonizing Healthy Cohorts to Support Multicenter Studies on Migraine Classification using Brain MRI Data |
title_short | Harmonizing Healthy Cohorts to Support Multicenter Studies on Migraine Classification using Brain MRI Data |
title_sort | harmonizing healthy cohorts to support multicenter studies on migraine classification using brain mri data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10327280/ https://www.ncbi.nlm.nih.gov/pubmed/37425905 http://dx.doi.org/10.1101/2023.06.26.23291909 |
work_keys_str_mv | AT yoonhyunsoo harmonizinghealthycohortstosupportmulticenterstudiesonmigraineclassificationusingbrainmridata AT schwedttoddj harmonizinghealthycohortstosupportmulticenterstudiesonmigraineclassificationusingbrainmridata AT chongcatherined harmonizinghealthycohortstosupportmulticenterstudiesonmigraineclassificationusingbrainmridata AT olatundeoyekanmi harmonizinghealthycohortstosupportmulticenterstudiesonmigraineclassificationusingbrainmridata AT wuteresa harmonizinghealthycohortstosupportmulticenterstudiesonmigraineclassificationusingbrainmridata |