Cargando…

Secure genome-wide association analysis using multiparty computation

Most sequenced genomes are currently stored in strict access-controlled repositories(1–3). Free access to these data could improve the power of genome-wide association studies (GWAS) to identify disease-causing genetic variants and may aid in the discovery of new drug targets(4,5). However, concerns...

Descripción completa

Detalles Bibliográficos
Autores principales: Cho, Hyunghoon, Wu, David J., Berger, Bonnie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5990440/
https://www.ncbi.nlm.nih.gov/pubmed/29734293
http://dx.doi.org/10.1038/nbt.4108
_version_ 1783329580058148864
author Cho, Hyunghoon
Wu, David J.
Berger, Bonnie
author_facet Cho, Hyunghoon
Wu, David J.
Berger, Bonnie
author_sort Cho, Hyunghoon
collection PubMed
description Most sequenced genomes are currently stored in strict access-controlled repositories(1–3). Free access to these data could improve the power of genome-wide association studies (GWAS) to identify disease-causing genetic variants and may aid in the discovery of new drug targets(4,5). However, concerns over genetic data privacy(6–9) may deter individuals from contributing their genomes to scientific studies(10) and in many cases, prevent researchers from sharing data with the scientific community(11). Although several cryptographic techniques for secure data analysis exist(12–14), none scales to computationally intensive analyses, such as GWAS. Here we describe an end-to-end protocol for large-scale genome-wide analysis that facilitates quality control and population stratification correction in 9K, 13K, and 23K individuals while maintaining the confidentiality of underlying genotypes and phenotypes. We show the protocol could feasibly scale to a million individuals. This approach may help to make currently restricted data available to the scientific community and could potentially enable ‘secure genome crowdsourcing,’ allowing individuals to contribute their genomes to a study without compromising their privacy.
format Online
Article
Text
id pubmed-5990440
institution National Center for Biotechnology Information
language English
publishDate 2018
record_format MEDLINE/PubMed
spelling pubmed-59904402018-11-07 Secure genome-wide association analysis using multiparty computation Cho, Hyunghoon Wu, David J. Berger, Bonnie Nat Biotechnol Article Most sequenced genomes are currently stored in strict access-controlled repositories(1–3). Free access to these data could improve the power of genome-wide association studies (GWAS) to identify disease-causing genetic variants and may aid in the discovery of new drug targets(4,5). However, concerns over genetic data privacy(6–9) may deter individuals from contributing their genomes to scientific studies(10) and in many cases, prevent researchers from sharing data with the scientific community(11). Although several cryptographic techniques for secure data analysis exist(12–14), none scales to computationally intensive analyses, such as GWAS. Here we describe an end-to-end protocol for large-scale genome-wide analysis that facilitates quality control and population stratification correction in 9K, 13K, and 23K individuals while maintaining the confidentiality of underlying genotypes and phenotypes. We show the protocol could feasibly scale to a million individuals. This approach may help to make currently restricted data available to the scientific community and could potentially enable ‘secure genome crowdsourcing,’ allowing individuals to contribute their genomes to a study without compromising their privacy. 2018-05-07 2018-07 /pmc/articles/PMC5990440/ /pubmed/29734293 http://dx.doi.org/10.1038/nbt.4108 Text en Users may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use: http://www.nature.com/authors/editorial_policies/license.html#terms
spellingShingle Article
Cho, Hyunghoon
Wu, David J.
Berger, Bonnie
Secure genome-wide association analysis using multiparty computation
title Secure genome-wide association analysis using multiparty computation
title_full Secure genome-wide association analysis using multiparty computation
title_fullStr Secure genome-wide association analysis using multiparty computation
title_full_unstemmed Secure genome-wide association analysis using multiparty computation
title_short Secure genome-wide association analysis using multiparty computation
title_sort secure genome-wide association analysis using multiparty computation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5990440/
https://www.ncbi.nlm.nih.gov/pubmed/29734293
http://dx.doi.org/10.1038/nbt.4108
work_keys_str_mv AT chohyunghoon securegenomewideassociationanalysisusingmultipartycomputation
AT wudavidj securegenomewideassociationanalysisusingmultipartycomputation
AT bergerbonnie securegenomewideassociationanalysisusingmultipartycomputation