Secure top most significant genome variants search: iDASH 2017 competition


Bibliographic Details
Main authors: Carpov, Sergiu; Tortech, Thibaud
Format: Online Article Text
Language: English
Published: BioMed Central 2018
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6180353/
https://www.ncbi.nlm.nih.gov/pubmed/30309361
http://dx.doi.org/10.1186/s12920-018-0399-x
collection PubMed
description BACKGROUND: One of the three tracks of the iDASH Privacy & Security Workshop 2017 competition was to execute a whole-genome variant search on private genomic data. Specifically, the search application had to find the top most significant SNPs (single-nucleotide polymorphisms) in a database of genome records labeled as control or case. In this paper we discuss the solution our team submitted to this competition.
METHODS: The privacy and confidentiality of the genome data had to be ensured using Intel SGX enclaves. The typical use case of this application is the multi-party computation (each party possessing one or several genome records) of the SNPs that statistically differentiate the control and case genome datasets.
RESULTS: Our solution consists of two applications: (i) one that compresses and encrypts genome files and (ii) one that performs the genome processing (the search for the top most important SNPs). We opted for a horizontal treatment of genome records and made heavy use of parallel processing. Both applications were developed in the Rust programming language.
CONCLUSIONS: The execution performance of the processing applications scales well, and very good performance metrics are obtained. The contest organizers selected our solution as the best submission among the entries received, and our team was awarded first prize on this track.
format Online
Article
Text
id pubmed-6180353
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-6180353 2018-10-18 BMC Med Genomics Research
BioMed Central 2018-10-11 /pmc/articles/PMC6180353/ /pubmed/30309361 http://dx.doi.org/10.1186/s12920-018-0399-x Text en
© The Author(s) 2018. Open Access. This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
topic Research