Cargando…

Multi-site harmonization of MRI data uncovers machine-learning discrimination capability in barely separable populations: An example from the ABIDE dataset

Machine Learning (ML) techniques have been widely used in Neuroimaging studies of Autism Spectrum Disorders (ASD) both to identify possible brain alterations related to this condition and to evaluate the predictive power of brain imaging modalities. The collection and public sharing of large imaging...

Descripción completa

Detalles Bibliográficos
Autores principales: Saponaro, Sara, Giuliano, Alessia, Bellotti, Roberto, Lombardi, Angela, Tangaro, Sabina, Oliva, Piernicola, Calderoni, Sara, Retico, Alessandra
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9198380/
https://www.ncbi.nlm.nih.gov/pubmed/35700598
http://dx.doi.org/10.1016/j.nicl.2022.103082
_version_ 1784727602104107008
author Saponaro, Sara
Giuliano, Alessia
Bellotti, Roberto
Lombardi, Angela
Tangaro, Sabina
Oliva, Piernicola
Calderoni, Sara
Retico, Alessandra
author_facet Saponaro, Sara
Giuliano, Alessia
Bellotti, Roberto
Lombardi, Angela
Tangaro, Sabina
Oliva, Piernicola
Calderoni, Sara
Retico, Alessandra
author_sort Saponaro, Sara
collection PubMed
description Machine Learning (ML) techniques have been widely used in Neuroimaging studies of Autism Spectrum Disorders (ASD) both to identify possible brain alterations related to this condition and to evaluate the predictive power of brain imaging modalities. The collection and public sharing of large imaging samples has favored an even greater diffusion of the use of ML-based analyses. However, multi-center data collections may suffer the batch effect, which, especially in case of Magnetic Resonance Imaging (MRI) studies, should be curated to avoid confounding effects for ML classifiers and masking biases. This is particularly important in the study of barely separable populations according to MRI data, such as subjects with ASD compared to controls with typical development (TD). Here, we show how the implementation of a harmo- nization protocol on brain structural features unlocks the case-control ML separation capability in the analysis of a multi-center MRI dataset. This effect is demonstrated on the ABIDE data collection, involving subjects encompassing a wide age range. After data harmonization, the overall ASD vs. TD discrimination capability by a Random Forest (RF) classifier improves from a very low performance (AUC = 0.58 ± 0.04) to a still low, but reasonably significant AUC = 0.67 ± 0.03. The performances of the RF classifier have been evaluated also in the age-specific subgroups of children, adolescents and adults, obtaining AUC = 0.62 ± 0.02, AUC = 0.65 ± 0.03 and AUC = 0.69 ± 0.06, respectively. Specific and consistent patterns of anatomical differences related to the ASD condition have been identified for the three different age subgroups.
format Online
Article
Text
id pubmed-9198380
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-91983802022-06-16 Multi-site harmonization of MRI data uncovers machine-learning discrimination capability in barely separable populations: An example from the ABIDE dataset Saponaro, Sara Giuliano, Alessia Bellotti, Roberto Lombardi, Angela Tangaro, Sabina Oliva, Piernicola Calderoni, Sara Retico, Alessandra Neuroimage Clin Regular Article Machine Learning (ML) techniques have been widely used in Neuroimaging studies of Autism Spectrum Disorders (ASD) both to identify possible brain alterations related to this condition and to evaluate the predictive power of brain imaging modalities. The collection and public sharing of large imaging samples has favored an even greater diffusion of the use of ML-based analyses. However, multi-center data collections may suffer the batch effect, which, especially in case of Magnetic Resonance Imaging (MRI) studies, should be curated to avoid confounding effects for ML classifiers and masking biases. This is particularly important in the study of barely separable populations according to MRI data, such as subjects with ASD compared to controls with typical development (TD). Here, we show how the implementation of a harmo- nization protocol on brain structural features unlocks the case-control ML separation capability in the analysis of a multi-center MRI dataset. This effect is demonstrated on the ABIDE data collection, involving subjects encompassing a wide age range. After data harmonization, the overall ASD vs. TD discrimination capability by a Random Forest (RF) classifier improves from a very low performance (AUC = 0.58 ± 0.04) to a still low, but reasonably significant AUC = 0.67 ± 0.03. The performances of the RF classifier have been evaluated also in the age-specific subgroups of children, adolescents and adults, obtaining AUC = 0.62 ± 0.02, AUC = 0.65 ± 0.03 and AUC = 0.69 ± 0.06, respectively. Specific and consistent patterns of anatomical differences related to the ASD condition have been identified for the three different age subgroups. Elsevier 2022-06-08 /pmc/articles/PMC9198380/ /pubmed/35700598 http://dx.doi.org/10.1016/j.nicl.2022.103082 Text en © 2022 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Regular Article
Saponaro, Sara
Giuliano, Alessia
Bellotti, Roberto
Lombardi, Angela
Tangaro, Sabina
Oliva, Piernicola
Calderoni, Sara
Retico, Alessandra
Multi-site harmonization of MRI data uncovers machine-learning discrimination capability in barely separable populations: An example from the ABIDE dataset
title Multi-site harmonization of MRI data uncovers machine-learning discrimination capability in barely separable populations: An example from the ABIDE dataset
title_full Multi-site harmonization of MRI data uncovers machine-learning discrimination capability in barely separable populations: An example from the ABIDE dataset
title_fullStr Multi-site harmonization of MRI data uncovers machine-learning discrimination capability in barely separable populations: An example from the ABIDE dataset
title_full_unstemmed Multi-site harmonization of MRI data uncovers machine-learning discrimination capability in barely separable populations: An example from the ABIDE dataset
title_short Multi-site harmonization of MRI data uncovers machine-learning discrimination capability in barely separable populations: An example from the ABIDE dataset
title_sort multi-site harmonization of mri data uncovers machine-learning discrimination capability in barely separable populations: an example from the abide dataset
topic Regular Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9198380/
https://www.ncbi.nlm.nih.gov/pubmed/35700598
http://dx.doi.org/10.1016/j.nicl.2022.103082
work_keys_str_mv AT saponarosara multisiteharmonizationofmridatauncoversmachinelearningdiscriminationcapabilityinbarelyseparablepopulationsanexamplefromtheabidedataset
AT giulianoalessia multisiteharmonizationofmridatauncoversmachinelearningdiscriminationcapabilityinbarelyseparablepopulationsanexamplefromtheabidedataset
AT bellottiroberto multisiteharmonizationofmridatauncoversmachinelearningdiscriminationcapabilityinbarelyseparablepopulationsanexamplefromtheabidedataset
AT lombardiangela multisiteharmonizationofmridatauncoversmachinelearningdiscriminationcapabilityinbarelyseparablepopulationsanexamplefromtheabidedataset
AT tangarosabina multisiteharmonizationofmridatauncoversmachinelearningdiscriminationcapabilityinbarelyseparablepopulationsanexamplefromtheabidedataset
AT olivapiernicola multisiteharmonizationofmridatauncoversmachinelearningdiscriminationcapabilityinbarelyseparablepopulationsanexamplefromtheabidedataset
AT calderonisara multisiteharmonizationofmridatauncoversmachinelearningdiscriminationcapabilityinbarelyseparablepopulationsanexamplefromtheabidedataset
AT reticoalessandra multisiteharmonizationofmridatauncoversmachinelearningdiscriminationcapabilityinbarelyseparablepopulationsanexamplefromtheabidedataset