Cargando…

A physical model inspired density peak clustering

Clustering is an important technology of data mining, which plays a vital role in bioscience, social network and network analysis. As a clustering algorithm based on density and distance, density peak clustering is extensively used to solve practical problems. The algorithm assumes that the clusteri...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhuang, Hui, Cui, Jiancong, Liu, Taoran, Wang, Hong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7514087/
https://www.ncbi.nlm.nih.gov/pubmed/32970727
http://dx.doi.org/10.1371/journal.pone.0239406
_version_ 1783586506858823680
author Zhuang, Hui
Cui, Jiancong
Liu, Taoran
Wang, Hong
author_facet Zhuang, Hui
Cui, Jiancong
Liu, Taoran
Wang, Hong
author_sort Zhuang, Hui
collection PubMed
description Clustering is an important technology of data mining, which plays a vital role in bioscience, social network and network analysis. As a clustering algorithm based on density and distance, density peak clustering is extensively used to solve practical problems. The algorithm assumes that the clustering center has a larger local density and is farther away from the higher density points. However, the density peak clustering algorithm is highly sensitive to density and distance and cannot accurately identify clusters in a dataset having significant differences in cluster structure. In addition, the density peak clustering algorithm’s allocation strategy can easily cause attached allocation errors in data point allocation. To solve these problems, this study proposes a potential-field-diffusion-based density peak clustering. As compared to existing clustering algorithms, the advantages of the potential-field-diffusion-based density peak clustering algorithm is three-fold: 1) The potential field concept is introduced in the proposed algorithm, and a density measure based on the potential field’s diffusion is proposed. The cluster center can be accurately selected using this measure. 2) The potential-field-diffusion-based density peak clustering algorithm defines the judgment conditions of similar points and adopts different allocation strategies for dissimilar points to avoid attached errors in data point allocation. 3) This study conducted many experiments on synthetic and real-world datasets. Results demonstrate that the proposed potential-field-diffusion-based density peak clustering algorithm achieves excellent clustering effect and is suitable for complex datasets of different sizes, dimensions, and shapes. Besides, the proposed potential-field-diffusion-based density peak clustering algorithm shows particularly excellent performance on variable density and nonconvex datasets.
format Online
Article
Text
id pubmed-7514087
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-75140872020-10-01 A physical model inspired density peak clustering Zhuang, Hui Cui, Jiancong Liu, Taoran Wang, Hong PLoS One Research Article Clustering is an important technology of data mining, which plays a vital role in bioscience, social network and network analysis. As a clustering algorithm based on density and distance, density peak clustering is extensively used to solve practical problems. The algorithm assumes that the clustering center has a larger local density and is farther away from the higher density points. However, the density peak clustering algorithm is highly sensitive to density and distance and cannot accurately identify clusters in a dataset having significant differences in cluster structure. In addition, the density peak clustering algorithm’s allocation strategy can easily cause attached allocation errors in data point allocation. To solve these problems, this study proposes a potential-field-diffusion-based density peak clustering. As compared to existing clustering algorithms, the advantages of the potential-field-diffusion-based density peak clustering algorithm is three-fold: 1) The potential field concept is introduced in the proposed algorithm, and a density measure based on the potential field’s diffusion is proposed. The cluster center can be accurately selected using this measure. 2) The potential-field-diffusion-based density peak clustering algorithm defines the judgment conditions of similar points and adopts different allocation strategies for dissimilar points to avoid attached errors in data point allocation. 3) This study conducted many experiments on synthetic and real-world datasets. Results demonstrate that the proposed potential-field-diffusion-based density peak clustering algorithm achieves excellent clustering effect and is suitable for complex datasets of different sizes, dimensions, and shapes. Besides, the proposed potential-field-diffusion-based density peak clustering algorithm shows particularly excellent performance on variable density and nonconvex datasets. Public Library of Science 2020-09-24 /pmc/articles/PMC7514087/ /pubmed/32970727 http://dx.doi.org/10.1371/journal.pone.0239406 Text en © 2020 Zhuang et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Zhuang, Hui
Cui, Jiancong
Liu, Taoran
Wang, Hong
A physical model inspired density peak clustering
title A physical model inspired density peak clustering
title_full A physical model inspired density peak clustering
title_fullStr A physical model inspired density peak clustering
title_full_unstemmed A physical model inspired density peak clustering
title_short A physical model inspired density peak clustering
title_sort physical model inspired density peak clustering
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7514087/
https://www.ncbi.nlm.nih.gov/pubmed/32970727
http://dx.doi.org/10.1371/journal.pone.0239406
work_keys_str_mv AT zhuanghui aphysicalmodelinspireddensitypeakclustering
AT cuijiancong aphysicalmodelinspireddensitypeakclustering
AT liutaoran aphysicalmodelinspireddensitypeakclustering
AT wanghong aphysicalmodelinspireddensitypeakclustering
AT zhuanghui physicalmodelinspireddensitypeakclustering
AT cuijiancong physicalmodelinspireddensitypeakclustering
AT liutaoran physicalmodelinspireddensitypeakclustering
AT wanghong physicalmodelinspireddensitypeakclustering