Cargando…
You Can't See Me: Anonymizing Graphs Using the Szemerédi Regularity Lemma
Complex networks gathered from our online interactions provide a rich source of information that can be used to try to model and predict our behavior. While this has very tangible benefits that we have all grown accustomed to, there is a concrete privacy risk in sharing potentially sensitive data ab...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7931930/ https://www.ncbi.nlm.nih.gov/pubmed/33693330 http://dx.doi.org/10.3389/fdata.2019.00007 |
_version_ | 1783660385502494720 |
---|---|
author | Foffano, Daniele Rossi, Luca Torsello, Andrea |
author_facet | Foffano, Daniele Rossi, Luca Torsello, Andrea |
author_sort | Foffano, Daniele |
collection | PubMed |
description | Complex networks gathered from our online interactions provide a rich source of information that can be used to try to model and predict our behavior. While this has very tangible benefits that we have all grown accustomed to, there is a concrete privacy risk in sharing potentially sensitive data about ourselves and the people we interact with, especially when this data is publicly available online and unprotected from malicious attacks. k-anonymity is a technique aimed at reducing this risk by obfuscating the topological information of a graph that can be used to infer the nodes' identity. In this paper we propose a novel algorithm to enforce k-anonymity based on a well-known result in extremal graph theory, the Szemerédi regularity lemma. Given a graph, we start by computing a regular partition of its nodes. The Szemerédi regularity lemma ensures that such a partition exists and that the edges between the sets of nodes behave almost randomly. With this partition, we anonymize the graph by randomizing the edges within each set, obtaining a graph that is structurally similar to the original one yet the nodes within each set are structurally indistinguishable. We test the proposed approach on real-world networks extracted from Facebook. Our experimental results show that the proposed approach is able to anonymize a graph while retaining most of its structural information. |
format | Online Article Text |
id | pubmed-7931930 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-79319302021-03-09 You Can't See Me: Anonymizing Graphs Using the Szemerédi Regularity Lemma Foffano, Daniele Rossi, Luca Torsello, Andrea Front Big Data Big Data Complex networks gathered from our online interactions provide a rich source of information that can be used to try to model and predict our behavior. While this has very tangible benefits that we have all grown accustomed to, there is a concrete privacy risk in sharing potentially sensitive data about ourselves and the people we interact with, especially when this data is publicly available online and unprotected from malicious attacks. k-anonymity is a technique aimed at reducing this risk by obfuscating the topological information of a graph that can be used to infer the nodes' identity. In this paper we propose a novel algorithm to enforce k-anonymity based on a well-known result in extremal graph theory, the Szemerédi regularity lemma. Given a graph, we start by computing a regular partition of its nodes. The Szemerédi regularity lemma ensures that such a partition exists and that the edges between the sets of nodes behave almost randomly. With this partition, we anonymize the graph by randomizing the edges within each set, obtaining a graph that is structurally similar to the original one yet the nodes within each set are structurally indistinguishable. We test the proposed approach on real-world networks extracted from Facebook. Our experimental results show that the proposed approach is able to anonymize a graph while retaining most of its structural information. Frontiers Media S.A. 2019-05-31 /pmc/articles/PMC7931930/ /pubmed/33693330 http://dx.doi.org/10.3389/fdata.2019.00007 Text en Copyright © 2019 Foffano, Rossi and Torsello. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Big Data Foffano, Daniele Rossi, Luca Torsello, Andrea You Can't See Me: Anonymizing Graphs Using the Szemerédi Regularity Lemma |
title | You Can't See Me: Anonymizing Graphs Using the Szemerédi Regularity Lemma |
title_full | You Can't See Me: Anonymizing Graphs Using the Szemerédi Regularity Lemma |
title_fullStr | You Can't See Me: Anonymizing Graphs Using the Szemerédi Regularity Lemma |
title_full_unstemmed | You Can't See Me: Anonymizing Graphs Using the Szemerédi Regularity Lemma |
title_short | You Can't See Me: Anonymizing Graphs Using the Szemerédi Regularity Lemma |
title_sort | you can't see me: anonymizing graphs using the szemerédi regularity lemma |
topic | Big Data |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7931930/ https://www.ncbi.nlm.nih.gov/pubmed/33693330 http://dx.doi.org/10.3389/fdata.2019.00007 |
work_keys_str_mv | AT foffanodaniele youcantseemeanonymizinggraphsusingtheszemerediregularitylemma AT rossiluca youcantseemeanonymizinggraphsusingtheszemerediregularitylemma AT torselloandrea youcantseemeanonymizinggraphsusingtheszemerediregularitylemma |