Cargando…

Yahtzee: An Anonymized Group Level Matching Procedure

Researchers often face the problem of needing to protect the privacy of subjects while also needing to integrate data that contains personal information from diverse data sources. The advent of computational social science and the enormous amount of data about people that is being collected makes pr...

Descripción completa

Detalles Bibliográficos
Autores principales: Jones, Jason J., Bond, Robert M., Fariss, Christopher J., Settle, Jaime E., Kramer, Adam D. I., Marlow, Cameron, Fowler, James H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3564933/
https://www.ncbi.nlm.nih.gov/pubmed/23441156
http://dx.doi.org/10.1371/journal.pone.0055760
_version_ 1782258387440893952
author Jones, Jason J.
Bond, Robert M.
Fariss, Christopher J.
Settle, Jaime E.
Kramer, Adam D. I.
Marlow, Cameron
Fowler, James H.
author_facet Jones, Jason J.
Bond, Robert M.
Fariss, Christopher J.
Settle, Jaime E.
Kramer, Adam D. I.
Marlow, Cameron
Fowler, James H.
author_sort Jones, Jason J.
collection PubMed
description Researchers often face the problem of needing to protect the privacy of subjects while also needing to integrate data that contains personal information from diverse data sources. The advent of computational social science and the enormous amount of data about people that is being collected makes protecting the privacy of research subjects ever more important. However, strict privacy procedures can hinder the process of joining diverse sources of data that contain information about specific individual behaviors. In this paper we present a procedure to keep information about specific individuals from being “leaked” or shared in either direction between two sources of data without need of a trusted third party. To achieve this goal, we randomly assign individuals to anonymous groups before combining the anonymized information between the two sources of data. We refer to this method as the Yahtzee procedure, and show that it performs as predicted by theoretical analysis when we apply it to data from Facebook and public voter records.
format Online
Article
Text
id pubmed-3564933
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-35649332013-02-25 Yahtzee: An Anonymized Group Level Matching Procedure Jones, Jason J. Bond, Robert M. Fariss, Christopher J. Settle, Jaime E. Kramer, Adam D. I. Marlow, Cameron Fowler, James H. PLoS One Research Article Researchers often face the problem of needing to protect the privacy of subjects while also needing to integrate data that contains personal information from diverse data sources. The advent of computational social science and the enormous amount of data about people that is being collected makes protecting the privacy of research subjects ever more important. However, strict privacy procedures can hinder the process of joining diverse sources of data that contain information about specific individual behaviors. In this paper we present a procedure to keep information about specific individuals from being “leaked” or shared in either direction between two sources of data without need of a trusted third party. To achieve this goal, we randomly assign individuals to anonymous groups before combining the anonymized information between the two sources of data. We refer to this method as the Yahtzee procedure, and show that it performs as predicted by theoretical analysis when we apply it to data from Facebook and public voter records. Public Library of Science 2013-02-05 /pmc/articles/PMC3564933/ /pubmed/23441156 http://dx.doi.org/10.1371/journal.pone.0055760 Text en © 2013 Jones et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Jones, Jason J.
Bond, Robert M.
Fariss, Christopher J.
Settle, Jaime E.
Kramer, Adam D. I.
Marlow, Cameron
Fowler, James H.
Yahtzee: An Anonymized Group Level Matching Procedure
title Yahtzee: An Anonymized Group Level Matching Procedure
title_full Yahtzee: An Anonymized Group Level Matching Procedure
title_fullStr Yahtzee: An Anonymized Group Level Matching Procedure
title_full_unstemmed Yahtzee: An Anonymized Group Level Matching Procedure
title_short Yahtzee: An Anonymized Group Level Matching Procedure
title_sort yahtzee: an anonymized group level matching procedure
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3564933/
https://www.ncbi.nlm.nih.gov/pubmed/23441156
http://dx.doi.org/10.1371/journal.pone.0055760
work_keys_str_mv AT jonesjasonj yahtzeeananonymizedgrouplevelmatchingprocedure
AT bondrobertm yahtzeeananonymizedgrouplevelmatchingprocedure
AT farisschristopherj yahtzeeananonymizedgrouplevelmatchingprocedure
AT settlejaimee yahtzeeananonymizedgrouplevelmatchingprocedure
AT krameradamdi yahtzeeananonymizedgrouplevelmatchingprocedure
AT marlowcameron yahtzeeananonymizedgrouplevelmatchingprocedure
AT fowlerjamesh yahtzeeananonymizedgrouplevelmatchingprocedure