Cargando…
A differential privacy protecting K-means clustering algorithm based on contour coefficients
This paper, based on differential privacy protecting K-means clustering algorithm, realizes privacy protection by adding data-disturbing Laplace noise to cluster center point. In order to solve the problem of Laplace noise randomness which causes the center point to deviate, especially when poor ava...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6248925/ https://www.ncbi.nlm.nih.gov/pubmed/30462662 http://dx.doi.org/10.1371/journal.pone.0206832 |
_version_ | 1783372651328176128 |
---|---|
author | Zhang, Yaling Liu, Na Wang, Shangping |
author_facet | Zhang, Yaling Liu, Na Wang, Shangping |
author_sort | Zhang, Yaling |
collection | PubMed |
description | This paper, based on differential privacy protecting K-means clustering algorithm, realizes privacy protection by adding data-disturbing Laplace noise to cluster center point. In order to solve the problem of Laplace noise randomness which causes the center point to deviate, especially when poor availability of clustering results appears because of small privacy budget parameters, an improved differential privacy protecting K-means clustering algorithm was raised in this paper. The improved algorithm uses the contour coefficients to quantitatively evaluate the clustering effect of each iteration and add different noise to different clusters. In order to be adapted to the huge number of data, this paper provides an algorithm design in MapReduce Framework. Experimental finding shows that the new algorithm improves the availability of the algorithm clustering results under the condition of ensuring individual privacy without significantly increasing its operating time. |
format | Online Article Text |
id | pubmed-6248925 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-62489252018-12-06 A differential privacy protecting K-means clustering algorithm based on contour coefficients Zhang, Yaling Liu, Na Wang, Shangping PLoS One Research Article This paper, based on differential privacy protecting K-means clustering algorithm, realizes privacy protection by adding data-disturbing Laplace noise to cluster center point. In order to solve the problem of Laplace noise randomness which causes the center point to deviate, especially when poor availability of clustering results appears because of small privacy budget parameters, an improved differential privacy protecting K-means clustering algorithm was raised in this paper. The improved algorithm uses the contour coefficients to quantitatively evaluate the clustering effect of each iteration and add different noise to different clusters. In order to be adapted to the huge number of data, this paper provides an algorithm design in MapReduce Framework. Experimental finding shows that the new algorithm improves the availability of the algorithm clustering results under the condition of ensuring individual privacy without significantly increasing its operating time. Public Library of Science 2018-11-21 /pmc/articles/PMC6248925/ /pubmed/30462662 http://dx.doi.org/10.1371/journal.pone.0206832 Text en © 2018 Zhang et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Zhang, Yaling Liu, Na Wang, Shangping A differential privacy protecting K-means clustering algorithm based on contour coefficients |
title | A differential privacy protecting K-means clustering algorithm based on contour coefficients |
title_full | A differential privacy protecting K-means clustering algorithm based on contour coefficients |
title_fullStr | A differential privacy protecting K-means clustering algorithm based on contour coefficients |
title_full_unstemmed | A differential privacy protecting K-means clustering algorithm based on contour coefficients |
title_short | A differential privacy protecting K-means clustering algorithm based on contour coefficients |
title_sort | differential privacy protecting k-means clustering algorithm based on contour coefficients |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6248925/ https://www.ncbi.nlm.nih.gov/pubmed/30462662 http://dx.doi.org/10.1371/journal.pone.0206832 |
work_keys_str_mv | AT zhangyaling adifferentialprivacyprotectingkmeansclusteringalgorithmbasedoncontourcoefficients AT liuna adifferentialprivacyprotectingkmeansclusteringalgorithmbasedoncontourcoefficients AT wangshangping adifferentialprivacyprotectingkmeansclusteringalgorithmbasedoncontourcoefficients AT zhangyaling differentialprivacyprotectingkmeansclusteringalgorithmbasedoncontourcoefficients AT liuna differentialprivacyprotectingkmeansclusteringalgorithmbasedoncontourcoefficients AT wangshangping differentialprivacyprotectingkmeansclusteringalgorithmbasedoncontourcoefficients |