Cargando…

Fiber Clustering Acceleration With a Modified Kmeans++ Algorithm Using Data Parallelism

Fiber clustering methods are typically used in brain research to study the organization of white matter bundles from large diffusion MRI tractography datasets. These methods enable exploratory bundle inspection using visualization and other methods that require identifying brain white matter structu...

Descripción completa

Detalles Bibliográficos
Autores principales: Goicovich, Isaac, Olivares, Paulo, Román, Claudio, Vázquez, Andrea, Poupon, Cyril, Mangin, Jean-François, Guevara, Pamela, Hernández, Cecilia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8445177/
https://www.ncbi.nlm.nih.gov/pubmed/34539370
http://dx.doi.org/10.3389/fninf.2021.727859
_version_ 1784568599285858304
author Goicovich, Isaac
Olivares, Paulo
Román, Claudio
Vázquez, Andrea
Poupon, Cyril
Mangin, Jean-François
Guevara, Pamela
Hernández, Cecilia
author_facet Goicovich, Isaac
Olivares, Paulo
Román, Claudio
Vázquez, Andrea
Poupon, Cyril
Mangin, Jean-François
Guevara, Pamela
Hernández, Cecilia
author_sort Goicovich, Isaac
collection PubMed
description Fiber clustering methods are typically used in brain research to study the organization of white matter bundles from large diffusion MRI tractography datasets. These methods enable exploratory bundle inspection using visualization and other methods that require identifying brain white matter structures in individuals or a population. Some applications, such as real-time visualization and inter-subject clustering, need fast and high-quality intra-subject clustering algorithms. This work proposes a parallel algorithm using a General Purpose Graphics Processing Unit (GPGPU) for fiber clustering based on the FFClust algorithm. The proposed GPGPU implementation exploits data parallelism using both multicore and GPU fine-grained parallelism present in commodity architectures, including current laptops and desktop computers. Our approach implements all FFClust steps in parallel, improving execution times in all of them. In addition, our parallel approach includes a parallel Kmeans++ algorithm implementation and defines a new variant of Kmeans++ to reduce the impact of choosing outliers as initial centroids. The results show that our approach provides clustering quality results very similar to FFClust, and it requires an execution time of 3.5 s for processing about a million fibers, achieving a speedup of 11.5 times compared to FFClust.
format Online
Article
Text
id pubmed-8445177
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-84451772021-09-17 Fiber Clustering Acceleration With a Modified Kmeans++ Algorithm Using Data Parallelism Goicovich, Isaac Olivares, Paulo Román, Claudio Vázquez, Andrea Poupon, Cyril Mangin, Jean-François Guevara, Pamela Hernández, Cecilia Front Neuroinform Neuroscience Fiber clustering methods are typically used in brain research to study the organization of white matter bundles from large diffusion MRI tractography datasets. These methods enable exploratory bundle inspection using visualization and other methods that require identifying brain white matter structures in individuals or a population. Some applications, such as real-time visualization and inter-subject clustering, need fast and high-quality intra-subject clustering algorithms. This work proposes a parallel algorithm using a General Purpose Graphics Processing Unit (GPGPU) for fiber clustering based on the FFClust algorithm. The proposed GPGPU implementation exploits data parallelism using both multicore and GPU fine-grained parallelism present in commodity architectures, including current laptops and desktop computers. Our approach implements all FFClust steps in parallel, improving execution times in all of them. In addition, our parallel approach includes a parallel Kmeans++ algorithm implementation and defines a new variant of Kmeans++ to reduce the impact of choosing outliers as initial centroids. The results show that our approach provides clustering quality results very similar to FFClust, and it requires an execution time of 3.5 s for processing about a million fibers, achieving a speedup of 11.5 times compared to FFClust. Frontiers Media S.A. 2021-09-01 /pmc/articles/PMC8445177/ /pubmed/34539370 http://dx.doi.org/10.3389/fninf.2021.727859 Text en Copyright © 2021 Goicovich, Olivares, Román, Vázquez, Poupon, Mangin, Guevara and Hernández. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Goicovich, Isaac
Olivares, Paulo
Román, Claudio
Vázquez, Andrea
Poupon, Cyril
Mangin, Jean-François
Guevara, Pamela
Hernández, Cecilia
Fiber Clustering Acceleration With a Modified Kmeans++ Algorithm Using Data Parallelism
title Fiber Clustering Acceleration With a Modified Kmeans++ Algorithm Using Data Parallelism
title_full Fiber Clustering Acceleration With a Modified Kmeans++ Algorithm Using Data Parallelism
title_fullStr Fiber Clustering Acceleration With a Modified Kmeans++ Algorithm Using Data Parallelism
title_full_unstemmed Fiber Clustering Acceleration With a Modified Kmeans++ Algorithm Using Data Parallelism
title_short Fiber Clustering Acceleration With a Modified Kmeans++ Algorithm Using Data Parallelism
title_sort fiber clustering acceleration with a modified kmeans++ algorithm using data parallelism
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8445177/
https://www.ncbi.nlm.nih.gov/pubmed/34539370
http://dx.doi.org/10.3389/fninf.2021.727859
work_keys_str_mv AT goicovichisaac fiberclusteringaccelerationwithamodifiedkmeansalgorithmusingdataparallelism
AT olivarespaulo fiberclusteringaccelerationwithamodifiedkmeansalgorithmusingdataparallelism
AT romanclaudio fiberclusteringaccelerationwithamodifiedkmeansalgorithmusingdataparallelism
AT vazquezandrea fiberclusteringaccelerationwithamodifiedkmeansalgorithmusingdataparallelism
AT pouponcyril fiberclusteringaccelerationwithamodifiedkmeansalgorithmusingdataparallelism
AT manginjeanfrancois fiberclusteringaccelerationwithamodifiedkmeansalgorithmusingdataparallelism
AT guevarapamela fiberclusteringaccelerationwithamodifiedkmeansalgorithmusingdataparallelism
AT hernandezcecilia fiberclusteringaccelerationwithamodifiedkmeansalgorithmusingdataparallelism