Cargando…
On the Accuracy and Parallelism of GPGPU-Powered Incremental Clustering Algorithms
Incremental clustering algorithms play a vital role in various applications such as massive data analysis and real-time data processing. Typical application scenarios of incremental clustering raise high demand on computing power of the hardware platform. Parallel computing is a common solution to m...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5662818/ https://www.ncbi.nlm.nih.gov/pubmed/29123546 http://dx.doi.org/10.1155/2017/2519782 |
_version_ | 1783274712781029376 |
---|---|
author | Chen, Chunlei He, Li Zhang, Huixiang Zheng, Hao Wang, Lei |
author_facet | Chen, Chunlei He, Li Zhang, Huixiang Zheng, Hao Wang, Lei |
author_sort | Chen, Chunlei |
collection | PubMed |
description | Incremental clustering algorithms play a vital role in various applications such as massive data analysis and real-time data processing. Typical application scenarios of incremental clustering raise high demand on computing power of the hardware platform. Parallel computing is a common solution to meet this demand. Moreover, General Purpose Graphic Processing Unit (GPGPU) is a promising parallel computing device. Nevertheless, the incremental clustering algorithm is facing a dilemma between clustering accuracy and parallelism when they are powered by GPGPU. We formally analyzed the cause of this dilemma. First, we formalized concepts relevant to incremental clustering like evolving granularity. Second, we formally proved two theorems. The first theorem proves the relation between clustering accuracy and evolving granularity. Additionally, this theorem analyzes the upper and lower bounds of different-to-same mis-affiliation. Fewer occurrences of such mis-affiliation mean higher accuracy. The second theorem reveals the relation between parallelism and evolving granularity. Smaller work-depth means superior parallelism. Through the proofs, we conclude that accuracy of an incremental clustering algorithm is negatively related to evolving granularity while parallelism is positively related to the granularity. Thus the contradictory relations cause the dilemma. Finally, we validated the relations through a demo algorithm. Experiment results verified theoretical conclusions. |
format | Online Article Text |
id | pubmed-5662818 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-56628182017-11-09 On the Accuracy and Parallelism of GPGPU-Powered Incremental Clustering Algorithms Chen, Chunlei He, Li Zhang, Huixiang Zheng, Hao Wang, Lei Comput Intell Neurosci Research Article Incremental clustering algorithms play a vital role in various applications such as massive data analysis and real-time data processing. Typical application scenarios of incremental clustering raise high demand on computing power of the hardware platform. Parallel computing is a common solution to meet this demand. Moreover, General Purpose Graphic Processing Unit (GPGPU) is a promising parallel computing device. Nevertheless, the incremental clustering algorithm is facing a dilemma between clustering accuracy and parallelism when they are powered by GPGPU. We formally analyzed the cause of this dilemma. First, we formalized concepts relevant to incremental clustering like evolving granularity. Second, we formally proved two theorems. The first theorem proves the relation between clustering accuracy and evolving granularity. Additionally, this theorem analyzes the upper and lower bounds of different-to-same mis-affiliation. Fewer occurrences of such mis-affiliation mean higher accuracy. The second theorem reveals the relation between parallelism and evolving granularity. Smaller work-depth means superior parallelism. Through the proofs, we conclude that accuracy of an incremental clustering algorithm is negatively related to evolving granularity while parallelism is positively related to the granularity. Thus the contradictory relations cause the dilemma. Finally, we validated the relations through a demo algorithm. Experiment results verified theoretical conclusions. Hindawi 2017 2017-10-11 /pmc/articles/PMC5662818/ /pubmed/29123546 http://dx.doi.org/10.1155/2017/2519782 Text en Copyright © 2017 Chunlei Chen et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Chen, Chunlei He, Li Zhang, Huixiang Zheng, Hao Wang, Lei On the Accuracy and Parallelism of GPGPU-Powered Incremental Clustering Algorithms |
title | On the Accuracy and Parallelism of GPGPU-Powered Incremental Clustering Algorithms |
title_full | On the Accuracy and Parallelism of GPGPU-Powered Incremental Clustering Algorithms |
title_fullStr | On the Accuracy and Parallelism of GPGPU-Powered Incremental Clustering Algorithms |
title_full_unstemmed | On the Accuracy and Parallelism of GPGPU-Powered Incremental Clustering Algorithms |
title_short | On the Accuracy and Parallelism of GPGPU-Powered Incremental Clustering Algorithms |
title_sort | on the accuracy and parallelism of gpgpu-powered incremental clustering algorithms |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5662818/ https://www.ncbi.nlm.nih.gov/pubmed/29123546 http://dx.doi.org/10.1155/2017/2519782 |
work_keys_str_mv | AT chenchunlei ontheaccuracyandparallelismofgpgpupoweredincrementalclusteringalgorithms AT heli ontheaccuracyandparallelismofgpgpupoweredincrementalclusteringalgorithms AT zhanghuixiang ontheaccuracyandparallelismofgpgpupoweredincrementalclusteringalgorithms AT zhenghao ontheaccuracyandparallelismofgpgpupoweredincrementalclusteringalgorithms AT wanglei ontheaccuracyandparallelismofgpgpupoweredincrementalclusteringalgorithms |