Cargando…

A differential privacy protecting K-means clustering algorithm based on contour coefficients

This paper, based on differential privacy protecting K-means clustering algorithm, realizes privacy protection by adding data-disturbing Laplace noise to cluster center point. In order to solve the problem of Laplace noise randomness which causes the center point to deviate, especially when poor ava...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Yaling, Liu, Na, Wang, Shangping
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6248925/
https://www.ncbi.nlm.nih.gov/pubmed/30462662
http://dx.doi.org/10.1371/journal.pone.0206832
_version_ 1783372651328176128
author Zhang, Yaling
Liu, Na
Wang, Shangping
author_facet Zhang, Yaling
Liu, Na
Wang, Shangping
author_sort Zhang, Yaling
collection PubMed
description This paper, based on differential privacy protecting K-means clustering algorithm, realizes privacy protection by adding data-disturbing Laplace noise to cluster center point. In order to solve the problem of Laplace noise randomness which causes the center point to deviate, especially when poor availability of clustering results appears because of small privacy budget parameters, an improved differential privacy protecting K-means clustering algorithm was raised in this paper. The improved algorithm uses the contour coefficients to quantitatively evaluate the clustering effect of each iteration and add different noise to different clusters. In order to be adapted to the huge number of data, this paper provides an algorithm design in MapReduce Framework. Experimental finding shows that the new algorithm improves the availability of the algorithm clustering results under the condition of ensuring individual privacy without significantly increasing its operating time.
format Online
Article
Text
id pubmed-6248925
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-62489252018-12-06 A differential privacy protecting K-means clustering algorithm based on contour coefficients Zhang, Yaling Liu, Na Wang, Shangping PLoS One Research Article This paper, based on differential privacy protecting K-means clustering algorithm, realizes privacy protection by adding data-disturbing Laplace noise to cluster center point. In order to solve the problem of Laplace noise randomness which causes the center point to deviate, especially when poor availability of clustering results appears because of small privacy budget parameters, an improved differential privacy protecting K-means clustering algorithm was raised in this paper. The improved algorithm uses the contour coefficients to quantitatively evaluate the clustering effect of each iteration and add different noise to different clusters. In order to be adapted to the huge number of data, this paper provides an algorithm design in MapReduce Framework. Experimental finding shows that the new algorithm improves the availability of the algorithm clustering results under the condition of ensuring individual privacy without significantly increasing its operating time. Public Library of Science 2018-11-21 /pmc/articles/PMC6248925/ /pubmed/30462662 http://dx.doi.org/10.1371/journal.pone.0206832 Text en © 2018 Zhang et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Zhang, Yaling
Liu, Na
Wang, Shangping
A differential privacy protecting K-means clustering algorithm based on contour coefficients
title A differential privacy protecting K-means clustering algorithm based on contour coefficients
title_full A differential privacy protecting K-means clustering algorithm based on contour coefficients
title_fullStr A differential privacy protecting K-means clustering algorithm based on contour coefficients
title_full_unstemmed A differential privacy protecting K-means clustering algorithm based on contour coefficients
title_short A differential privacy protecting K-means clustering algorithm based on contour coefficients
title_sort differential privacy protecting k-means clustering algorithm based on contour coefficients
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6248925/
https://www.ncbi.nlm.nih.gov/pubmed/30462662
http://dx.doi.org/10.1371/journal.pone.0206832
work_keys_str_mv AT zhangyaling adifferentialprivacyprotectingkmeansclusteringalgorithmbasedoncontourcoefficients
AT liuna adifferentialprivacyprotectingkmeansclusteringalgorithmbasedoncontourcoefficients
AT wangshangping adifferentialprivacyprotectingkmeansclusteringalgorithmbasedoncontourcoefficients
AT zhangyaling differentialprivacyprotectingkmeansclusteringalgorithmbasedoncontourcoefficients
AT liuna differentialprivacyprotectingkmeansclusteringalgorithmbasedoncontourcoefficients
AT wangshangping differentialprivacyprotectingkmeansclusteringalgorithmbasedoncontourcoefficients