Cargando…
Secure top most significant genome variants search: iDASH 2017 competition
BACKGROUND: One of the 3 tracks of iDASH Privacy & Security Workshop 2017 competition was to execute a whole genome variants search on private genomic data. Particularly, the search application was to find the top most significant SNPs (Single-Nucleotide Polymorphisms) in a database of genome re...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6180353/ https://www.ncbi.nlm.nih.gov/pubmed/30309361 http://dx.doi.org/10.1186/s12920-018-0399-x |
_version_ | 1783362182419841024 |
---|---|
author | Carpov, Sergiu Tortech, Thibaud |
author_facet | Carpov, Sergiu Tortech, Thibaud |
author_sort | Carpov, Sergiu |
collection | PubMed |
description | BACKGROUND: One of the 3 tracks of iDASH Privacy & Security Workshop 2017 competition was to execute a whole genome variants search on private genomic data. Particularly, the search application was to find the top most significant SNPs (Single-Nucleotide Polymorphisms) in a database of genome records labeled with control or case. In this paper we discuss the solution submitted by our team to this competition. METHODS: Privacy and confidentiality of genome data had to be ensured using Intel SGX enclaves. The typical use-case of this application is the multi-party computation (each party possessing one or several genome records) of the SNPs which statistically differentiate control and case genome datasets. RESULTS: Our solution consists of two applications: (i) compress and encrypt genome files and (ii) perform genome processing (top most important SNPs search). We have opted for a horizontal treatment of genome records and heavily used parallel processing. Rust programming language was employed to develop both applications. CONCLUSIONS: Execution performance of the processing applications scales well and very good performance metrics are obtained. Contest organizers selected it as the best submission amongst other received competition entries and our team was awarded the first prize on this track. |
format | Online Article Text |
id | pubmed-6180353 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-61803532018-10-18 Secure top most significant genome variants search: iDASH 2017 competition Carpov, Sergiu Tortech, Thibaud BMC Med Genomics Research BACKGROUND: One of the 3 tracks of iDASH Privacy & Security Workshop 2017 competition was to execute a whole genome variants search on private genomic data. Particularly, the search application was to find the top most significant SNPs (Single-Nucleotide Polymorphisms) in a database of genome records labeled with control or case. In this paper we discuss the solution submitted by our team to this competition. METHODS: Privacy and confidentiality of genome data had to be ensured using Intel SGX enclaves. The typical use-case of this application is the multi-party computation (each party possessing one or several genome records) of the SNPs which statistically differentiate control and case genome datasets. RESULTS: Our solution consists of two applications: (i) compress and encrypt genome files and (ii) perform genome processing (top most important SNPs search). We have opted for a horizontal treatment of genome records and heavily used parallel processing. Rust programming language was employed to develop both applications. CONCLUSIONS: Execution performance of the processing applications scales well and very good performance metrics are obtained. Contest organizers selected it as the best submission amongst other received competition entries and our team was awarded the first prize on this track. BioMed Central 2018-10-11 /pmc/articles/PMC6180353/ /pubmed/30309361 http://dx.doi.org/10.1186/s12920-018-0399-x Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Carpov, Sergiu Tortech, Thibaud Secure top most significant genome variants search: iDASH 2017 competition |
title | Secure top most significant genome variants search: iDASH 2017 competition |
title_full | Secure top most significant genome variants search: iDASH 2017 competition |
title_fullStr | Secure top most significant genome variants search: iDASH 2017 competition |
title_full_unstemmed | Secure top most significant genome variants search: iDASH 2017 competition |
title_short | Secure top most significant genome variants search: iDASH 2017 competition |
title_sort | secure top most significant genome variants search: idash 2017 competition |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6180353/ https://www.ncbi.nlm.nih.gov/pubmed/30309361 http://dx.doi.org/10.1186/s12920-018-0399-x |
work_keys_str_mv | AT carpovsergiu securetopmostsignificantgenomevariantssearchidash2017competition AT tortechthibaud securetopmostsignificantgenomevariantssearchidash2017competition |