Cargando…

Analysis of sensitive information leakage in functional genomics signal profiles through genomic deletions

Functional genomics experiments, such as RNA-seq, provide non-individual specific information about gene expression under different conditions such as disease and normal. There is great desire to share these data. However, privacy concerns often preclude sharing of the raw reads. To enable safe shar...

Descripción completa

Detalles Bibliográficos
Autores principales: Harmanci, Arif, Gerstein, Mark
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6015012/
https://www.ncbi.nlm.nih.gov/pubmed/29934598
http://dx.doi.org/10.1038/s41467-018-04875-5
_version_ 1783334307576676352
author Harmanci, Arif
Gerstein, Mark
author_facet Harmanci, Arif
Gerstein, Mark
author_sort Harmanci, Arif
collection PubMed
description Functional genomics experiments, such as RNA-seq, provide non-individual specific information about gene expression under different conditions such as disease and normal. There is great desire to share these data. However, privacy concerns often preclude sharing of the raw reads. To enable safe sharing, aggregated summaries such as read-depth signal profiles and levels of gene expression are used. Projects such as GTEx and ENCODE share these because they ostensibly do not leak much identifying information. Here, we attempt to quantify the validity of this statement, measuring the leakage of genomic deletions from signal profiles. We present information theoretic measures for the degree to which one can genotype these deletions. We then develop practical genotyping approaches and demonstrate how to use these to identify an individual within a large cohort in the context of linking attacks. Finally, we present an anonymization method removing much of the leakage from signal profiles.
format Online
Article
Text
id pubmed-6015012
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-60150122018-06-25 Analysis of sensitive information leakage in functional genomics signal profiles through genomic deletions Harmanci, Arif Gerstein, Mark Nat Commun Article Functional genomics experiments, such as RNA-seq, provide non-individual specific information about gene expression under different conditions such as disease and normal. There is great desire to share these data. However, privacy concerns often preclude sharing of the raw reads. To enable safe sharing, aggregated summaries such as read-depth signal profiles and levels of gene expression are used. Projects such as GTEx and ENCODE share these because they ostensibly do not leak much identifying information. Here, we attempt to quantify the validity of this statement, measuring the leakage of genomic deletions from signal profiles. We present information theoretic measures for the degree to which one can genotype these deletions. We then develop practical genotyping approaches and demonstrate how to use these to identify an individual within a large cohort in the context of linking attacks. Finally, we present an anonymization method removing much of the leakage from signal profiles. Nature Publishing Group UK 2018-06-22 /pmc/articles/PMC6015012/ /pubmed/29934598 http://dx.doi.org/10.1038/s41467-018-04875-5 Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Harmanci, Arif
Gerstein, Mark
Analysis of sensitive information leakage in functional genomics signal profiles through genomic deletions
title Analysis of sensitive information leakage in functional genomics signal profiles through genomic deletions
title_full Analysis of sensitive information leakage in functional genomics signal profiles through genomic deletions
title_fullStr Analysis of sensitive information leakage in functional genomics signal profiles through genomic deletions
title_full_unstemmed Analysis of sensitive information leakage in functional genomics signal profiles through genomic deletions
title_short Analysis of sensitive information leakage in functional genomics signal profiles through genomic deletions
title_sort analysis of sensitive information leakage in functional genomics signal profiles through genomic deletions
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6015012/
https://www.ncbi.nlm.nih.gov/pubmed/29934598
http://dx.doi.org/10.1038/s41467-018-04875-5
work_keys_str_mv AT harmanciarif analysisofsensitiveinformationleakageinfunctionalgenomicssignalprofilesthroughgenomicdeletions
AT gersteinmark analysisofsensitiveinformationleakageinfunctionalgenomicssignalprofilesthroughgenomicdeletions