Cargando…
Differential privacy fuzzy C-means clustering algorithm based on gaussian kernel function
Fuzzy C-means clustering algorithm is one of the typical clustering algorithms in data mining applications. However, due to the sensitive information in the dataset, there is a risk of user privacy being leaked during the clustering process. The fuzzy C-means clustering of differential privacy prote...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7987176/ https://www.ncbi.nlm.nih.gov/pubmed/33755689 http://dx.doi.org/10.1371/journal.pone.0248737 |
_version_ | 1783668569964281856 |
---|---|
author | Zhang, Yaling Han, Jin |
author_facet | Zhang, Yaling Han, Jin |
author_sort | Zhang, Yaling |
collection | PubMed |
description | Fuzzy C-means clustering algorithm is one of the typical clustering algorithms in data mining applications. However, due to the sensitive information in the dataset, there is a risk of user privacy being leaked during the clustering process. The fuzzy C-means clustering of differential privacy protection can protect the user’s individual privacy while mining data rules, however, the decline in availability caused by data disturbances is a common problem of these algorithms. Aiming at the problem that the algorithm accuracy is reduced by randomly initializing the membership matrix of fuzzy C-means, in this paper, the maximum distance method is firstly used to determine the initial center point. Then, the gaussian value of the cluster center point is used to calculate the privacy budget allocation ratio. Additionally, Laplace noise is added to complete differential privacy protection. The experimental results demonstrate that the clustering accuracy and effectiveness of the proposed algorithm are higher than baselines under the same privacy protection intensity. |
format | Online Article Text |
id | pubmed-7987176 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-79871762021-04-02 Differential privacy fuzzy C-means clustering algorithm based on gaussian kernel function Zhang, Yaling Han, Jin PLoS One Research Article Fuzzy C-means clustering algorithm is one of the typical clustering algorithms in data mining applications. However, due to the sensitive information in the dataset, there is a risk of user privacy being leaked during the clustering process. The fuzzy C-means clustering of differential privacy protection can protect the user’s individual privacy while mining data rules, however, the decline in availability caused by data disturbances is a common problem of these algorithms. Aiming at the problem that the algorithm accuracy is reduced by randomly initializing the membership matrix of fuzzy C-means, in this paper, the maximum distance method is firstly used to determine the initial center point. Then, the gaussian value of the cluster center point is used to calculate the privacy budget allocation ratio. Additionally, Laplace noise is added to complete differential privacy protection. The experimental results demonstrate that the clustering accuracy and effectiveness of the proposed algorithm are higher than baselines under the same privacy protection intensity. Public Library of Science 2021-03-23 /pmc/articles/PMC7987176/ /pubmed/33755689 http://dx.doi.org/10.1371/journal.pone.0248737 Text en © 2021 Zhang, Han http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Zhang, Yaling Han, Jin Differential privacy fuzzy C-means clustering algorithm based on gaussian kernel function |
title | Differential privacy fuzzy C-means clustering algorithm based on gaussian kernel function |
title_full | Differential privacy fuzzy C-means clustering algorithm based on gaussian kernel function |
title_fullStr | Differential privacy fuzzy C-means clustering algorithm based on gaussian kernel function |
title_full_unstemmed | Differential privacy fuzzy C-means clustering algorithm based on gaussian kernel function |
title_short | Differential privacy fuzzy C-means clustering algorithm based on gaussian kernel function |
title_sort | differential privacy fuzzy c-means clustering algorithm based on gaussian kernel function |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7987176/ https://www.ncbi.nlm.nih.gov/pubmed/33755689 http://dx.doi.org/10.1371/journal.pone.0248737 |
work_keys_str_mv | AT zhangyaling differentialprivacyfuzzycmeansclusteringalgorithmbasedongaussiankernelfunction AT hanjin differentialprivacyfuzzycmeansclusteringalgorithmbasedongaussiankernelfunction |