Cargando…
Realizing privacy preserving genome-wide association studies
Motivation: As genomics moves into the clinic, there has been much interest in using this medical data for research. At the same time the use of such data raises many privacy concerns. These circumstances have led to the development of various methods to perform genome-wide association studies (GWAS...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4848404/ https://www.ncbi.nlm.nih.gov/pubmed/26769317 http://dx.doi.org/10.1093/bioinformatics/btw009 |
_version_ | 1782429335753326592 |
---|---|
author | Simmons, Sean Berger, Bonnie |
author_facet | Simmons, Sean Berger, Bonnie |
author_sort | Simmons, Sean |
collection | PubMed |
description | Motivation: As genomics moves into the clinic, there has been much interest in using this medical data for research. At the same time the use of such data raises many privacy concerns. These circumstances have led to the development of various methods to perform genome-wide association studies (GWAS) on patient records while ensuring privacy. In particular, there has been growing interest in applying differentially private techniques to this challenge. Unfortunately, up until now all methods for finding high scoring SNPs in a differentially private manner have had major drawbacks in terms of either accuracy or computational efficiency. Results: Here we overcome these limitations with a substantially modified version of the neighbor distance method for performing differentially private GWAS, and thus are able to produce a more viable mechanism. Specifically, we use input perturbation and an adaptive boundary method to overcome accuracy issues. We also design and implement a convex analysis based algorithm to calculate the neighbor distance for each SNP in constant time, overcoming the major computational bottleneck in the neighbor distance method. It is our hope that methods such as ours will pave the way for more widespread use of patient data in biomedical research. Availability and implementation: A python implementation is available at http://groups.csail.mit.edu/cb/DiffPriv/. Contact: bab@csail.mit.edu Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-4848404 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-48484042016-04-29 Realizing privacy preserving genome-wide association studies Simmons, Sean Berger, Bonnie Bioinformatics Original Papers Motivation: As genomics moves into the clinic, there has been much interest in using this medical data for research. At the same time the use of such data raises many privacy concerns. These circumstances have led to the development of various methods to perform genome-wide association studies (GWAS) on patient records while ensuring privacy. In particular, there has been growing interest in applying differentially private techniques to this challenge. Unfortunately, up until now all methods for finding high scoring SNPs in a differentially private manner have had major drawbacks in terms of either accuracy or computational efficiency. Results: Here we overcome these limitations with a substantially modified version of the neighbor distance method for performing differentially private GWAS, and thus are able to produce a more viable mechanism. Specifically, we use input perturbation and an adaptive boundary method to overcome accuracy issues. We also design and implement a convex analysis based algorithm to calculate the neighbor distance for each SNP in constant time, overcoming the major computational bottleneck in the neighbor distance method. It is our hope that methods such as ours will pave the way for more widespread use of patient data in biomedical research. Availability and implementation: A python implementation is available at http://groups.csail.mit.edu/cb/DiffPriv/. Contact: bab@csail.mit.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2016-05-01 2016-01-14 /pmc/articles/PMC4848404/ /pubmed/26769317 http://dx.doi.org/10.1093/bioinformatics/btw009 Text en © The Author 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Simmons, Sean Berger, Bonnie Realizing privacy preserving genome-wide association studies |
title | Realizing privacy preserving genome-wide association studies |
title_full | Realizing privacy preserving genome-wide association studies |
title_fullStr | Realizing privacy preserving genome-wide association studies |
title_full_unstemmed | Realizing privacy preserving genome-wide association studies |
title_short | Realizing privacy preserving genome-wide association studies |
title_sort | realizing privacy preserving genome-wide association studies |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4848404/ https://www.ncbi.nlm.nih.gov/pubmed/26769317 http://dx.doi.org/10.1093/bioinformatics/btw009 |
work_keys_str_mv | AT simmonssean realizingprivacypreservinggenomewideassociationstudies AT bergerbonnie realizingprivacypreservinggenomewideassociationstudies |