Cargando…

Realizing privacy preserving genome-wide association studies

Motivation: As genomics moves into the clinic, there has been much interest in using this medical data for research. At the same time the use of such data raises many privacy concerns. These circumstances have led to the development of various methods to perform genome-wide association studies (GWAS...

Descripción completa

Detalles Bibliográficos
Autores principales: Simmons, Sean, Berger, Bonnie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4848404/
https://www.ncbi.nlm.nih.gov/pubmed/26769317
http://dx.doi.org/10.1093/bioinformatics/btw009
_version_ 1782429335753326592
author Simmons, Sean
Berger, Bonnie
author_facet Simmons, Sean
Berger, Bonnie
author_sort Simmons, Sean
collection PubMed
description Motivation: As genomics moves into the clinic, there has been much interest in using this medical data for research. At the same time the use of such data raises many privacy concerns. These circumstances have led to the development of various methods to perform genome-wide association studies (GWAS) on patient records while ensuring privacy. In particular, there has been growing interest in applying differentially private techniques to this challenge. Unfortunately, up until now all methods for finding high scoring SNPs in a differentially private manner have had major drawbacks in terms of either accuracy or computational efficiency. Results: Here we overcome these limitations with a substantially modified version of the neighbor distance method for performing differentially private GWAS, and thus are able to produce a more viable mechanism. Specifically, we use input perturbation and an adaptive boundary method to overcome accuracy issues. We also design and implement a convex analysis based algorithm to calculate the neighbor distance for each SNP in constant time, overcoming the major computational bottleneck in the neighbor distance method. It is our hope that methods such as ours will pave the way for more widespread use of patient data in biomedical research. Availability and implementation: A python implementation is available at http://groups.csail.mit.edu/cb/DiffPriv/. Contact: bab@csail.mit.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-4848404
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-48484042016-04-29 Realizing privacy preserving genome-wide association studies Simmons, Sean Berger, Bonnie Bioinformatics Original Papers Motivation: As genomics moves into the clinic, there has been much interest in using this medical data for research. At the same time the use of such data raises many privacy concerns. These circumstances have led to the development of various methods to perform genome-wide association studies (GWAS) on patient records while ensuring privacy. In particular, there has been growing interest in applying differentially private techniques to this challenge. Unfortunately, up until now all methods for finding high scoring SNPs in a differentially private manner have had major drawbacks in terms of either accuracy or computational efficiency. Results: Here we overcome these limitations with a substantially modified version of the neighbor distance method for performing differentially private GWAS, and thus are able to produce a more viable mechanism. Specifically, we use input perturbation and an adaptive boundary method to overcome accuracy issues. We also design and implement a convex analysis based algorithm to calculate the neighbor distance for each SNP in constant time, overcoming the major computational bottleneck in the neighbor distance method. It is our hope that methods such as ours will pave the way for more widespread use of patient data in biomedical research. Availability and implementation: A python implementation is available at http://groups.csail.mit.edu/cb/DiffPriv/. Contact: bab@csail.mit.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2016-05-01 2016-01-14 /pmc/articles/PMC4848404/ /pubmed/26769317 http://dx.doi.org/10.1093/bioinformatics/btw009 Text en © The Author 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Simmons, Sean
Berger, Bonnie
Realizing privacy preserving genome-wide association studies
title Realizing privacy preserving genome-wide association studies
title_full Realizing privacy preserving genome-wide association studies
title_fullStr Realizing privacy preserving genome-wide association studies
title_full_unstemmed Realizing privacy preserving genome-wide association studies
title_short Realizing privacy preserving genome-wide association studies
title_sort realizing privacy preserving genome-wide association studies
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4848404/
https://www.ncbi.nlm.nih.gov/pubmed/26769317
http://dx.doi.org/10.1093/bioinformatics/btw009
work_keys_str_mv AT simmonssean realizingprivacypreservinggenomewideassociationstudies
AT bergerbonnie realizingprivacypreservinggenomewideassociationstudies