Cargando…

Differential privacy fuzzy C-means clustering algorithm based on gaussian kernel function

Fuzzy C-means clustering algorithm is one of the typical clustering algorithms in data mining applications. However, due to the sensitive information in the dataset, there is a risk of user privacy being leaked during the clustering process. The fuzzy C-means clustering of differential privacy prote...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Yaling, Han, Jin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7987176/
https://www.ncbi.nlm.nih.gov/pubmed/33755689
http://dx.doi.org/10.1371/journal.pone.0248737
_version_ 1783668569964281856
author Zhang, Yaling
Han, Jin
author_facet Zhang, Yaling
Han, Jin
author_sort Zhang, Yaling
collection PubMed
description Fuzzy C-means clustering algorithm is one of the typical clustering algorithms in data mining applications. However, due to the sensitive information in the dataset, there is a risk of user privacy being leaked during the clustering process. The fuzzy C-means clustering of differential privacy protection can protect the user’s individual privacy while mining data rules, however, the decline in availability caused by data disturbances is a common problem of these algorithms. Aiming at the problem that the algorithm accuracy is reduced by randomly initializing the membership matrix of fuzzy C-means, in this paper, the maximum distance method is firstly used to determine the initial center point. Then, the gaussian value of the cluster center point is used to calculate the privacy budget allocation ratio. Additionally, Laplace noise is added to complete differential privacy protection. The experimental results demonstrate that the clustering accuracy and effectiveness of the proposed algorithm are higher than baselines under the same privacy protection intensity.
format Online
Article
Text
id pubmed-7987176
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-79871762021-04-02 Differential privacy fuzzy C-means clustering algorithm based on gaussian kernel function Zhang, Yaling Han, Jin PLoS One Research Article Fuzzy C-means clustering algorithm is one of the typical clustering algorithms in data mining applications. However, due to the sensitive information in the dataset, there is a risk of user privacy being leaked during the clustering process. The fuzzy C-means clustering of differential privacy protection can protect the user’s individual privacy while mining data rules, however, the decline in availability caused by data disturbances is a common problem of these algorithms. Aiming at the problem that the algorithm accuracy is reduced by randomly initializing the membership matrix of fuzzy C-means, in this paper, the maximum distance method is firstly used to determine the initial center point. Then, the gaussian value of the cluster center point is used to calculate the privacy budget allocation ratio. Additionally, Laplace noise is added to complete differential privacy protection. The experimental results demonstrate that the clustering accuracy and effectiveness of the proposed algorithm are higher than baselines under the same privacy protection intensity. Public Library of Science 2021-03-23 /pmc/articles/PMC7987176/ /pubmed/33755689 http://dx.doi.org/10.1371/journal.pone.0248737 Text en © 2021 Zhang, Han http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Zhang, Yaling
Han, Jin
Differential privacy fuzzy C-means clustering algorithm based on gaussian kernel function
title Differential privacy fuzzy C-means clustering algorithm based on gaussian kernel function
title_full Differential privacy fuzzy C-means clustering algorithm based on gaussian kernel function
title_fullStr Differential privacy fuzzy C-means clustering algorithm based on gaussian kernel function
title_full_unstemmed Differential privacy fuzzy C-means clustering algorithm based on gaussian kernel function
title_short Differential privacy fuzzy C-means clustering algorithm based on gaussian kernel function
title_sort differential privacy fuzzy c-means clustering algorithm based on gaussian kernel function
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7987176/
https://www.ncbi.nlm.nih.gov/pubmed/33755689
http://dx.doi.org/10.1371/journal.pone.0248737
work_keys_str_mv AT zhangyaling differentialprivacyfuzzycmeansclusteringalgorithmbasedongaussiankernelfunction
AT hanjin differentialprivacyfuzzycmeansclusteringalgorithmbasedongaussiankernelfunction