Cargando…

Two-step clustering-based pipeline for big dynamic functional network connectivity data

BACKGROUND: Dynamic functional network connectivity (dFNC) estimated from resting-state functional magnetic imaging (rs-fMRI) studies the temporally varying functional integration between brain networks. In a conventional dFNC pipeline, a clustering stage to summarize the connectivity patterns that...

Descripción completa

Detalles Bibliográficos
Autores principales: Sendi, Mohammad S. E., Salat, David H., Miller, Robyn L., Calhoun, Vince D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9358255/
https://www.ncbi.nlm.nih.gov/pubmed/35958983
http://dx.doi.org/10.3389/fnins.2022.895637
_version_ 1784763889926275072
author Sendi, Mohammad S. E.
Salat, David H.
Miller, Robyn L.
Calhoun, Vince D.
author_facet Sendi, Mohammad S. E.
Salat, David H.
Miller, Robyn L.
Calhoun, Vince D.
author_sort Sendi, Mohammad S. E.
collection PubMed
description BACKGROUND: Dynamic functional network connectivity (dFNC) estimated from resting-state functional magnetic imaging (rs-fMRI) studies the temporally varying functional integration between brain networks. In a conventional dFNC pipeline, a clustering stage to summarize the connectivity patterns that are transiently but reliably realized over the course of a scanning session. However, identifying the right number of clusters (or states) through a conventional clustering criterion computed by running the algorithm repeatedly over a large range of cluster numbers is time-consuming and requires substantial computational power even for typical dFNC datasets, and the computational demands become prohibitive as datasets become larger and scans longer. Here we developed a new dFNC pipeline based on a two-step clustering approach to analyze large dFNC data without having access to huge computational power. METHODS: In the proposed dFNC pipeline, we implement two-step clustering. In the first step, we randomly use a sub-sample dFNC data and identify several sets of states at different model orders. In the second step, we aggregate all dFNC states estimated from all iterations in the first step and use this to identify the optimum number of clusters using the elbow criteria. Additionally, we use this new reduced dataset and estimate a final set of states by performing a second kmeans clustering on the aggregated dFNC states from the first k-means clustering. To validate the reproducibility of results in the new pipeline, we analyzed four dFNC datasets from the human connectome project (HCP). RESULTS: We found that both conventional and proposed dFNC pipelines generate similar brain dFNC states across all four sessions with more than 99% similarity. We found that the conventional dFNC pipeline evaluates the clustering order and finds the final dFNC state in 275 min, while this process takes only 11 min for the proposed dFNC pipeline. In other words, the new pipeline is 25 times faster than the traditional method in finding the optimum number of clusters and finding the final dFNC states. We also found that the new method results in better clustering quality than the conventional approach (p < 0.001). We show that the results are replicated across four different datasets from HCP. CONCLUSION: We developed a new analytic pipeline that facilitates the analysis of large dFNC datasets without having access to a huge computational power source. We validated the reproducibility of the result across multiple datasets.
format Online
Article
Text
id pubmed-9358255
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-93582552022-08-10 Two-step clustering-based pipeline for big dynamic functional network connectivity data Sendi, Mohammad S. E. Salat, David H. Miller, Robyn L. Calhoun, Vince D. Front Neurosci Neuroscience BACKGROUND: Dynamic functional network connectivity (dFNC) estimated from resting-state functional magnetic imaging (rs-fMRI) studies the temporally varying functional integration between brain networks. In a conventional dFNC pipeline, a clustering stage to summarize the connectivity patterns that are transiently but reliably realized over the course of a scanning session. However, identifying the right number of clusters (or states) through a conventional clustering criterion computed by running the algorithm repeatedly over a large range of cluster numbers is time-consuming and requires substantial computational power even for typical dFNC datasets, and the computational demands become prohibitive as datasets become larger and scans longer. Here we developed a new dFNC pipeline based on a two-step clustering approach to analyze large dFNC data without having access to huge computational power. METHODS: In the proposed dFNC pipeline, we implement two-step clustering. In the first step, we randomly use a sub-sample dFNC data and identify several sets of states at different model orders. In the second step, we aggregate all dFNC states estimated from all iterations in the first step and use this to identify the optimum number of clusters using the elbow criteria. Additionally, we use this new reduced dataset and estimate a final set of states by performing a second kmeans clustering on the aggregated dFNC states from the first k-means clustering. To validate the reproducibility of results in the new pipeline, we analyzed four dFNC datasets from the human connectome project (HCP). RESULTS: We found that both conventional and proposed dFNC pipelines generate similar brain dFNC states across all four sessions with more than 99% similarity. We found that the conventional dFNC pipeline evaluates the clustering order and finds the final dFNC state in 275 min, while this process takes only 11 min for the proposed dFNC pipeline. In other words, the new pipeline is 25 times faster than the traditional method in finding the optimum number of clusters and finding the final dFNC states. We also found that the new method results in better clustering quality than the conventional approach (p < 0.001). We show that the results are replicated across four different datasets from HCP. CONCLUSION: We developed a new analytic pipeline that facilitates the analysis of large dFNC datasets without having access to a huge computational power source. We validated the reproducibility of the result across multiple datasets. Frontiers Media S.A. 2022-07-25 /pmc/articles/PMC9358255/ /pubmed/35958983 http://dx.doi.org/10.3389/fnins.2022.895637 Text en Copyright © 2022 Sendi, Salat, Miller and Calhoun. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Sendi, Mohammad S. E.
Salat, David H.
Miller, Robyn L.
Calhoun, Vince D.
Two-step clustering-based pipeline for big dynamic functional network connectivity data
title Two-step clustering-based pipeline for big dynamic functional network connectivity data
title_full Two-step clustering-based pipeline for big dynamic functional network connectivity data
title_fullStr Two-step clustering-based pipeline for big dynamic functional network connectivity data
title_full_unstemmed Two-step clustering-based pipeline for big dynamic functional network connectivity data
title_short Two-step clustering-based pipeline for big dynamic functional network connectivity data
title_sort two-step clustering-based pipeline for big dynamic functional network connectivity data
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9358255/
https://www.ncbi.nlm.nih.gov/pubmed/35958983
http://dx.doi.org/10.3389/fnins.2022.895637
work_keys_str_mv AT sendimohammadse twostepclusteringbasedpipelineforbigdynamicfunctionalnetworkconnectivitydata
AT salatdavidh twostepclusteringbasedpipelineforbigdynamicfunctionalnetworkconnectivitydata
AT millerrobynl twostepclusteringbasedpipelineforbigdynamicfunctionalnetworkconnectivitydata
AT calhounvinced twostepclusteringbasedpipelineforbigdynamicfunctionalnetworkconnectivitydata