Cargando…

Deconvoluting kernel density estimation and regression for locally differentially private data

Local differential privacy has become the gold-standard of privacy literature for gathering or releasing sensitive individual data points in a privacy-preserving manner. However, locally differential data can twist the probability density of the data because of the additive noise used to ensure priv...

Descripción completa

Detalles Bibliográficos
Autor principal: Farokhi, Farhad
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7721740/
https://www.ncbi.nlm.nih.gov/pubmed/33288799
http://dx.doi.org/10.1038/s41598-020-78323-0
_version_ 1783620086785900544
author Farokhi, Farhad
author_facet Farokhi, Farhad
author_sort Farokhi, Farhad
collection PubMed
description Local differential privacy has become the gold-standard of privacy literature for gathering or releasing sensitive individual data points in a privacy-preserving manner. However, locally differential data can twist the probability density of the data because of the additive noise used to ensure privacy. In fact, the density of privacy-preserving data (no matter how many samples we gather) is always flatter in comparison with the density function of the original data points due to convolution with privacy-preserving noise density function. The effect is especially more pronounced when using slow-decaying privacy-preserving noises, such as the Laplace noise. This can result in under/over-estimation of the heavy-hitters. This is an important challenge facing social scientists due to the use of differential privacy in the 2020 Census in the United States. In this paper, we develop density estimation methods using smoothing kernels. We use the framework of deconvoluting kernel density estimators to remove the effect of privacy-preserving noise. This approach also allows us to adapt the results from non-parametric regression with errors-in-variables to develop regression models based on locally differentially private data. We demonstrate the performance of the developed methods on financial and demographic datasets.
format Online
Article
Text
id pubmed-7721740
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-77217402020-12-08 Deconvoluting kernel density estimation and regression for locally differentially private data Farokhi, Farhad Sci Rep Article Local differential privacy has become the gold-standard of privacy literature for gathering or releasing sensitive individual data points in a privacy-preserving manner. However, locally differential data can twist the probability density of the data because of the additive noise used to ensure privacy. In fact, the density of privacy-preserving data (no matter how many samples we gather) is always flatter in comparison with the density function of the original data points due to convolution with privacy-preserving noise density function. The effect is especially more pronounced when using slow-decaying privacy-preserving noises, such as the Laplace noise. This can result in under/over-estimation of the heavy-hitters. This is an important challenge facing social scientists due to the use of differential privacy in the 2020 Census in the United States. In this paper, we develop density estimation methods using smoothing kernels. We use the framework of deconvoluting kernel density estimators to remove the effect of privacy-preserving noise. This approach also allows us to adapt the results from non-parametric regression with errors-in-variables to develop regression models based on locally differentially private data. We demonstrate the performance of the developed methods on financial and demographic datasets. Nature Publishing Group UK 2020-12-07 /pmc/articles/PMC7721740/ /pubmed/33288799 http://dx.doi.org/10.1038/s41598-020-78323-0 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Farokhi, Farhad
Deconvoluting kernel density estimation and regression for locally differentially private data
title Deconvoluting kernel density estimation and regression for locally differentially private data
title_full Deconvoluting kernel density estimation and regression for locally differentially private data
title_fullStr Deconvoluting kernel density estimation and regression for locally differentially private data
title_full_unstemmed Deconvoluting kernel density estimation and regression for locally differentially private data
title_short Deconvoluting kernel density estimation and regression for locally differentially private data
title_sort deconvoluting kernel density estimation and regression for locally differentially private data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7721740/
https://www.ncbi.nlm.nih.gov/pubmed/33288799
http://dx.doi.org/10.1038/s41598-020-78323-0
work_keys_str_mv AT farokhifarhad deconvolutingkerneldensityestimationandregressionforlocallydifferentiallyprivatedata