Cargando…

Secure distributed genome analysis for GWAS and sequence comparison computation

BACKGROUND: The rapid increase in the availability and volume of genomic data makes significant advances in biomedical research possible, but sharing of genomic data poses challenges due to the highly sensitive nature of such data. To address the challenges, a competition for secure distributed proc...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Yihua, Blanton, Marina, Almashaqbeh, Ghada
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4699166/
https://www.ncbi.nlm.nih.gov/pubmed/26733307
http://dx.doi.org/10.1186/1472-6947-15-S5-S4
_version_ 1782408152235376640
author Zhang, Yihua
Blanton, Marina
Almashaqbeh, Ghada
author_facet Zhang, Yihua
Blanton, Marina
Almashaqbeh, Ghada
author_sort Zhang, Yihua
collection PubMed
description BACKGROUND: The rapid increase in the availability and volume of genomic data makes significant advances in biomedical research possible, but sharing of genomic data poses challenges due to the highly sensitive nature of such data. To address the challenges, a competition for secure distributed processing of genomic data was organized by the iDASH research center. METHODS: In this work we propose techniques for securing computation with real-life genomic data for minor allele frequency and chi-squared statistics computation, as well as distance computation between two genomic sequences, as specified by the iDASH competition tasks. We put forward novel optimizations, including a generalization of a version of mergesort, which might be of independent interest. RESULTS: We provide implementation results of our techniques based on secret sharing that demonstrate practicality of the suggested protocols and also report on performance improvements due to our optimization techniques. CONCLUSIONS: This work describes our techniques, findings, and experimental results developed and obtained as part of iDASH 2015 research competition to secure real-life genomic computations and shows feasibility of securely computing with genomic data in practice.
format Online
Article
Text
id pubmed-4699166
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-46991662016-01-13 Secure distributed genome analysis for GWAS and sequence comparison computation Zhang, Yihua Blanton, Marina Almashaqbeh, Ghada BMC Med Inform Decis Mak Proceedings BACKGROUND: The rapid increase in the availability and volume of genomic data makes significant advances in biomedical research possible, but sharing of genomic data poses challenges due to the highly sensitive nature of such data. To address the challenges, a competition for secure distributed processing of genomic data was organized by the iDASH research center. METHODS: In this work we propose techniques for securing computation with real-life genomic data for minor allele frequency and chi-squared statistics computation, as well as distance computation between two genomic sequences, as specified by the iDASH competition tasks. We put forward novel optimizations, including a generalization of a version of mergesort, which might be of independent interest. RESULTS: We provide implementation results of our techniques based on secret sharing that demonstrate practicality of the suggested protocols and also report on performance improvements due to our optimization techniques. CONCLUSIONS: This work describes our techniques, findings, and experimental results developed and obtained as part of iDASH 2015 research competition to secure real-life genomic computations and shows feasibility of securely computing with genomic data in practice. BioMed Central 2015-12-21 /pmc/articles/PMC4699166/ /pubmed/26733307 http://dx.doi.org/10.1186/1472-6947-15-S5-S4 Text en Copyright © 2015 Zhang et al. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Proceedings
Zhang, Yihua
Blanton, Marina
Almashaqbeh, Ghada
Secure distributed genome analysis for GWAS and sequence comparison computation
title Secure distributed genome analysis for GWAS and sequence comparison computation
title_full Secure distributed genome analysis for GWAS and sequence comparison computation
title_fullStr Secure distributed genome analysis for GWAS and sequence comparison computation
title_full_unstemmed Secure distributed genome analysis for GWAS and sequence comparison computation
title_short Secure distributed genome analysis for GWAS and sequence comparison computation
title_sort secure distributed genome analysis for gwas and sequence comparison computation
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4699166/
https://www.ncbi.nlm.nih.gov/pubmed/26733307
http://dx.doi.org/10.1186/1472-6947-15-S5-S4
work_keys_str_mv AT zhangyihua securedistributedgenomeanalysisforgwasandsequencecomparisoncomputation
AT blantonmarina securedistributedgenomeanalysisforgwasandsequencecomparisoncomputation
AT almashaqbehghada securedistributedgenomeanalysisforgwasandsequencecomparisoncomputation