Cargando…

On the Accuracy and Parallelism of GPGPU-Powered Incremental Clustering Algorithms

Incremental clustering algorithms play a vital role in various applications such as massive data analysis and real-time data processing. Typical application scenarios of incremental clustering raise high demand on computing power of the hardware platform. Parallel computing is a common solution to m...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Chunlei, He, Li, Zhang, Huixiang, Zheng, Hao, Wang, Lei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5662818/
https://www.ncbi.nlm.nih.gov/pubmed/29123546
http://dx.doi.org/10.1155/2017/2519782
_version_ 1783274712781029376
author Chen, Chunlei
He, Li
Zhang, Huixiang
Zheng, Hao
Wang, Lei
author_facet Chen, Chunlei
He, Li
Zhang, Huixiang
Zheng, Hao
Wang, Lei
author_sort Chen, Chunlei
collection PubMed
description Incremental clustering algorithms play a vital role in various applications such as massive data analysis and real-time data processing. Typical application scenarios of incremental clustering raise high demand on computing power of the hardware platform. Parallel computing is a common solution to meet this demand. Moreover, General Purpose Graphic Processing Unit (GPGPU) is a promising parallel computing device. Nevertheless, the incremental clustering algorithm is facing a dilemma between clustering accuracy and parallelism when they are powered by GPGPU. We formally analyzed the cause of this dilemma. First, we formalized concepts relevant to incremental clustering like evolving granularity. Second, we formally proved two theorems. The first theorem proves the relation between clustering accuracy and evolving granularity. Additionally, this theorem analyzes the upper and lower bounds of different-to-same mis-affiliation. Fewer occurrences of such mis-affiliation mean higher accuracy. The second theorem reveals the relation between parallelism and evolving granularity. Smaller work-depth means superior parallelism. Through the proofs, we conclude that accuracy of an incremental clustering algorithm is negatively related to evolving granularity while parallelism is positively related to the granularity. Thus the contradictory relations cause the dilemma. Finally, we validated the relations through a demo algorithm. Experiment results verified theoretical conclusions.
format Online
Article
Text
id pubmed-5662818
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-56628182017-11-09 On the Accuracy and Parallelism of GPGPU-Powered Incremental Clustering Algorithms Chen, Chunlei He, Li Zhang, Huixiang Zheng, Hao Wang, Lei Comput Intell Neurosci Research Article Incremental clustering algorithms play a vital role in various applications such as massive data analysis and real-time data processing. Typical application scenarios of incremental clustering raise high demand on computing power of the hardware platform. Parallel computing is a common solution to meet this demand. Moreover, General Purpose Graphic Processing Unit (GPGPU) is a promising parallel computing device. Nevertheless, the incremental clustering algorithm is facing a dilemma between clustering accuracy and parallelism when they are powered by GPGPU. We formally analyzed the cause of this dilemma. First, we formalized concepts relevant to incremental clustering like evolving granularity. Second, we formally proved two theorems. The first theorem proves the relation between clustering accuracy and evolving granularity. Additionally, this theorem analyzes the upper and lower bounds of different-to-same mis-affiliation. Fewer occurrences of such mis-affiliation mean higher accuracy. The second theorem reveals the relation between parallelism and evolving granularity. Smaller work-depth means superior parallelism. Through the proofs, we conclude that accuracy of an incremental clustering algorithm is negatively related to evolving granularity while parallelism is positively related to the granularity. Thus the contradictory relations cause the dilemma. Finally, we validated the relations through a demo algorithm. Experiment results verified theoretical conclusions. Hindawi 2017 2017-10-11 /pmc/articles/PMC5662818/ /pubmed/29123546 http://dx.doi.org/10.1155/2017/2519782 Text en Copyright © 2017 Chunlei Chen et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Chen, Chunlei
He, Li
Zhang, Huixiang
Zheng, Hao
Wang, Lei
On the Accuracy and Parallelism of GPGPU-Powered Incremental Clustering Algorithms
title On the Accuracy and Parallelism of GPGPU-Powered Incremental Clustering Algorithms
title_full On the Accuracy and Parallelism of GPGPU-Powered Incremental Clustering Algorithms
title_fullStr On the Accuracy and Parallelism of GPGPU-Powered Incremental Clustering Algorithms
title_full_unstemmed On the Accuracy and Parallelism of GPGPU-Powered Incremental Clustering Algorithms
title_short On the Accuracy and Parallelism of GPGPU-Powered Incremental Clustering Algorithms
title_sort on the accuracy and parallelism of gpgpu-powered incremental clustering algorithms
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5662818/
https://www.ncbi.nlm.nih.gov/pubmed/29123546
http://dx.doi.org/10.1155/2017/2519782
work_keys_str_mv AT chenchunlei ontheaccuracyandparallelismofgpgpupoweredincrementalclusteringalgorithms
AT heli ontheaccuracyandparallelismofgpgpupoweredincrementalclusteringalgorithms
AT zhanghuixiang ontheaccuracyandparallelismofgpgpupoweredincrementalclusteringalgorithms
AT zhenghao ontheaccuracyandparallelismofgpgpupoweredincrementalclusteringalgorithms
AT wanglei ontheaccuracyandparallelismofgpgpupoweredincrementalclusteringalgorithms