Cargando…

New threats to health data privacy

BACKGROUND: Along with the rapid digitalization of health data (e.g. Electronic Health Records), there is an increasing concern on maintaining data privacy while garnering the benefits, especially when the data are required to be published for secondary use. Most of the current research on protectin...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Fengjun, Zou, Xukai, Liu, Peng, Chen, Jake Y
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3247088/
https://www.ncbi.nlm.nih.gov/pubmed/22168526
http://dx.doi.org/10.1186/1471-2105-12-S12-S7
_version_ 1782220039519207424
author Li, Fengjun
Zou, Xukai
Liu, Peng
Chen, Jake Y
author_facet Li, Fengjun
Zou, Xukai
Liu, Peng
Chen, Jake Y
author_sort Li, Fengjun
collection PubMed
description BACKGROUND: Along with the rapid digitalization of health data (e.g. Electronic Health Records), there is an increasing concern on maintaining data privacy while garnering the benefits, especially when the data are required to be published for secondary use. Most of the current research on protecting health data privacy is centered around data de-identification and data anonymization, which removes the identifiable information from the published health data to prevent an adversary from reasoning about the privacy of the patients. However, published health data is not the only source that the adversaries can count on: with a large amount of information that people voluntarily share on the Web, sophisticated attacks that join disparate information pieces from multiple sources against health data privacy become practical. Limited efforts have been devoted to studying these attacks yet. RESULTS: We study how patient privacy could be compromised with the help of today’s information technologies. In particular, we show that private healthcare information could be collected by aggregating and associating disparate pieces of information from multiple online data sources including online social networks, public records and search engine results. We demonstrate a real-world case study to show user identity and privacy are highly vulnerable to the attribution, inference and aggregation attacks. We also show that people are highly identifiable to adversaries even with inaccurate information pieces about the target, with real data analysis. CONCLUSION: We claim that too much information has been made available electronic and available online that people are very vulnerable without effective privacy protection.
format Online
Article
Text
id pubmed-3247088
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32470882011-12-29 New threats to health data privacy Li, Fengjun Zou, Xukai Liu, Peng Chen, Jake Y BMC Bioinformatics Proceedings BACKGROUND: Along with the rapid digitalization of health data (e.g. Electronic Health Records), there is an increasing concern on maintaining data privacy while garnering the benefits, especially when the data are required to be published for secondary use. Most of the current research on protecting health data privacy is centered around data de-identification and data anonymization, which removes the identifiable information from the published health data to prevent an adversary from reasoning about the privacy of the patients. However, published health data is not the only source that the adversaries can count on: with a large amount of information that people voluntarily share on the Web, sophisticated attacks that join disparate information pieces from multiple sources against health data privacy become practical. Limited efforts have been devoted to studying these attacks yet. RESULTS: We study how patient privacy could be compromised with the help of today’s information technologies. In particular, we show that private healthcare information could be collected by aggregating and associating disparate pieces of information from multiple online data sources including online social networks, public records and search engine results. We demonstrate a real-world case study to show user identity and privacy are highly vulnerable to the attribution, inference and aggregation attacks. We also show that people are highly identifiable to adversaries even with inaccurate information pieces about the target, with real data analysis. CONCLUSION: We claim that too much information has been made available electronic and available online that people are very vulnerable without effective privacy protection. BioMed Central 2011-11-24 /pmc/articles/PMC3247088/ /pubmed/22168526 http://dx.doi.org/10.1186/1471-2105-12-S12-S7 Text en Copyright ©2011 Li et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Li, Fengjun
Zou, Xukai
Liu, Peng
Chen, Jake Y
New threats to health data privacy
title New threats to health data privacy
title_full New threats to health data privacy
title_fullStr New threats to health data privacy
title_full_unstemmed New threats to health data privacy
title_short New threats to health data privacy
title_sort new threats to health data privacy
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3247088/
https://www.ncbi.nlm.nih.gov/pubmed/22168526
http://dx.doi.org/10.1186/1471-2105-12-S12-S7
work_keys_str_mv AT lifengjun newthreatstohealthdataprivacy
AT zouxukai newthreatstohealthdataprivacy
AT liupeng newthreatstohealthdataprivacy
AT chenjakey newthreatstohealthdataprivacy