Cargando…

Private Genomes and Public SNPs: Homomorphic Encryption of Genotypes and Phenotypes for Shared Quantitative Genetics

Sharing human genotype and phenotype data is essential to discover otherwise inaccessible genetic associations, but is a challenge because of privacy concerns. Here, we present a method of homomorphic encryption that obscures individuals’ genotypes and phenotypes, and is suited to quantitative genet...

Descripción completa

Detalles Bibliográficos
Autores principales: Mott, Richard, Fischer, Christian, Prins, Pjotr, Davies, Robert William
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7268998/
https://www.ncbi.nlm.nih.gov/pubmed/32327562
http://dx.doi.org/10.1534/genetics.120.303153
_version_ 1783541706933665792
author Mott, Richard
Fischer, Christian
Prins, Pjotr
Davies, Robert William
author_facet Mott, Richard
Fischer, Christian
Prins, Pjotr
Davies, Robert William
author_sort Mott, Richard
collection PubMed
description Sharing human genotype and phenotype data is essential to discover otherwise inaccessible genetic associations, but is a challenge because of privacy concerns. Here, we present a method of homomorphic encryption that obscures individuals’ genotypes and phenotypes, and is suited to quantitative genetic association analysis. Encrypted ciphertext and unencrypted plaintext are analytically interchangeable. The encryption uses a high-dimensional random linear orthogonal transformation key that leaves the likelihood of quantitative trait data unchanged under a linear model with normally distributed errors. It also preserves linkage disequilibrium between genetic variants and associations between variants and phenotypes. It scrambles relationships between individuals: encrypted genotype dosages closely resemble Gaussian deviates, and can be replaced by quantiles from a Gaussian with negligible effects on accuracy. Likelihood-based inferences are unaffected by orthogonal encryption. These include linear mixed models to control for unequal relatedness between individuals, heritability estimation, and including covariates when testing association. Orthogonal transformations can be applied in a modular fashion for multiparty federated mega-analyses where the parties first agree to share a common set of genotype sites and covariates prior to encryption. Each then privately encrypts and shares their own ciphertext, and analyses all parties’ ciphertexts. In the absence of private variants, or knowledge of the key, we show that it is infeasible to decrypt ciphertext using existing brute-force or noise-reduction attacks. We present the method as a challenge to the community to determine its security.
format Online
Article
Text
id pubmed-7268998
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-72689982020-06-30 Private Genomes and Public SNPs: Homomorphic Encryption of Genotypes and Phenotypes for Shared Quantitative Genetics Mott, Richard Fischer, Christian Prins, Pjotr Davies, Robert William Genetics Investigations Sharing human genotype and phenotype data is essential to discover otherwise inaccessible genetic associations, but is a challenge because of privacy concerns. Here, we present a method of homomorphic encryption that obscures individuals’ genotypes and phenotypes, and is suited to quantitative genetic association analysis. Encrypted ciphertext and unencrypted plaintext are analytically interchangeable. The encryption uses a high-dimensional random linear orthogonal transformation key that leaves the likelihood of quantitative trait data unchanged under a linear model with normally distributed errors. It also preserves linkage disequilibrium between genetic variants and associations between variants and phenotypes. It scrambles relationships between individuals: encrypted genotype dosages closely resemble Gaussian deviates, and can be replaced by quantiles from a Gaussian with negligible effects on accuracy. Likelihood-based inferences are unaffected by orthogonal encryption. These include linear mixed models to control for unequal relatedness between individuals, heritability estimation, and including covariates when testing association. Orthogonal transformations can be applied in a modular fashion for multiparty federated mega-analyses where the parties first agree to share a common set of genotype sites and covariates prior to encryption. Each then privately encrypts and shares their own ciphertext, and analyses all parties’ ciphertexts. In the absence of private variants, or knowledge of the key, we show that it is infeasible to decrypt ciphertext using existing brute-force or noise-reduction attacks. We present the method as a challenge to the community to determine its security. Genetics Society of America 2020-06 2020-04-22 /pmc/articles/PMC7268998/ /pubmed/32327562 http://dx.doi.org/10.1534/genetics.120.303153 Text en Copyright © 2020 Mott et al. Available freely online through the author-supported open access option. This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Investigations
Mott, Richard
Fischer, Christian
Prins, Pjotr
Davies, Robert William
Private Genomes and Public SNPs: Homomorphic Encryption of Genotypes and Phenotypes for Shared Quantitative Genetics
title Private Genomes and Public SNPs: Homomorphic Encryption of Genotypes and Phenotypes for Shared Quantitative Genetics
title_full Private Genomes and Public SNPs: Homomorphic Encryption of Genotypes and Phenotypes for Shared Quantitative Genetics
title_fullStr Private Genomes and Public SNPs: Homomorphic Encryption of Genotypes and Phenotypes for Shared Quantitative Genetics
title_full_unstemmed Private Genomes and Public SNPs: Homomorphic Encryption of Genotypes and Phenotypes for Shared Quantitative Genetics
title_short Private Genomes and Public SNPs: Homomorphic Encryption of Genotypes and Phenotypes for Shared Quantitative Genetics
title_sort private genomes and public snps: homomorphic encryption of genotypes and phenotypes for shared quantitative genetics
topic Investigations
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7268998/
https://www.ncbi.nlm.nih.gov/pubmed/32327562
http://dx.doi.org/10.1534/genetics.120.303153
work_keys_str_mv AT mottrichard privategenomesandpublicsnpshomomorphicencryptionofgenotypesandphenotypesforsharedquantitativegenetics
AT fischerchristian privategenomesandpublicsnpshomomorphicencryptionofgenotypesandphenotypesforsharedquantitativegenetics
AT prinspjotr privategenomesandpublicsnpshomomorphicencryptionofgenotypesandphenotypesforsharedquantitativegenetics
AT daviesrobertwilliam privategenomesandpublicsnpshomomorphicencryptionofgenotypesandphenotypesforsharedquantitativegenetics