Cargando…
Secure distributed genome analysis for GWAS and sequence comparison computation
BACKGROUND: The rapid increase in the availability and volume of genomic data makes significant advances in biomedical research possible, but sharing of genomic data poses challenges due to the highly sensitive nature of such data. To address the challenges, a competition for secure distributed proc...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4699166/ https://www.ncbi.nlm.nih.gov/pubmed/26733307 http://dx.doi.org/10.1186/1472-6947-15-S5-S4 |
_version_ | 1782408152235376640 |
---|---|
author | Zhang, Yihua Blanton, Marina Almashaqbeh, Ghada |
author_facet | Zhang, Yihua Blanton, Marina Almashaqbeh, Ghada |
author_sort | Zhang, Yihua |
collection | PubMed |
description | BACKGROUND: The rapid increase in the availability and volume of genomic data makes significant advances in biomedical research possible, but sharing of genomic data poses challenges due to the highly sensitive nature of such data. To address the challenges, a competition for secure distributed processing of genomic data was organized by the iDASH research center. METHODS: In this work we propose techniques for securing computation with real-life genomic data for minor allele frequency and chi-squared statistics computation, as well as distance computation between two genomic sequences, as specified by the iDASH competition tasks. We put forward novel optimizations, including a generalization of a version of mergesort, which might be of independent interest. RESULTS: We provide implementation results of our techniques based on secret sharing that demonstrate practicality of the suggested protocols and also report on performance improvements due to our optimization techniques. CONCLUSIONS: This work describes our techniques, findings, and experimental results developed and obtained as part of iDASH 2015 research competition to secure real-life genomic computations and shows feasibility of securely computing with genomic data in practice. |
format | Online Article Text |
id | pubmed-4699166 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-46991662016-01-13 Secure distributed genome analysis for GWAS and sequence comparison computation Zhang, Yihua Blanton, Marina Almashaqbeh, Ghada BMC Med Inform Decis Mak Proceedings BACKGROUND: The rapid increase in the availability and volume of genomic data makes significant advances in biomedical research possible, but sharing of genomic data poses challenges due to the highly sensitive nature of such data. To address the challenges, a competition for secure distributed processing of genomic data was organized by the iDASH research center. METHODS: In this work we propose techniques for securing computation with real-life genomic data for minor allele frequency and chi-squared statistics computation, as well as distance computation between two genomic sequences, as specified by the iDASH competition tasks. We put forward novel optimizations, including a generalization of a version of mergesort, which might be of independent interest. RESULTS: We provide implementation results of our techniques based on secret sharing that demonstrate practicality of the suggested protocols and also report on performance improvements due to our optimization techniques. CONCLUSIONS: This work describes our techniques, findings, and experimental results developed and obtained as part of iDASH 2015 research competition to secure real-life genomic computations and shows feasibility of securely computing with genomic data in practice. BioMed Central 2015-12-21 /pmc/articles/PMC4699166/ /pubmed/26733307 http://dx.doi.org/10.1186/1472-6947-15-S5-S4 Text en Copyright © 2015 Zhang et al. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Proceedings Zhang, Yihua Blanton, Marina Almashaqbeh, Ghada Secure distributed genome analysis for GWAS and sequence comparison computation |
title | Secure distributed genome analysis for GWAS and sequence comparison computation |
title_full | Secure distributed genome analysis for GWAS and sequence comparison computation |
title_fullStr | Secure distributed genome analysis for GWAS and sequence comparison computation |
title_full_unstemmed | Secure distributed genome analysis for GWAS and sequence comparison computation |
title_short | Secure distributed genome analysis for GWAS and sequence comparison computation |
title_sort | secure distributed genome analysis for gwas and sequence comparison computation |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4699166/ https://www.ncbi.nlm.nih.gov/pubmed/26733307 http://dx.doi.org/10.1186/1472-6947-15-S5-S4 |
work_keys_str_mv | AT zhangyihua securedistributedgenomeanalysisforgwasandsequencecomparisoncomputation AT blantonmarina securedistributedgenomeanalysisforgwasandsequencecomparisoncomputation AT almashaqbehghada securedistributedgenomeanalysisforgwasandsequencecomparisoncomputation |