Cargando…

Algorithm for DNA sequence assembly by quantum annealing

BACKGROUND: The assembly task is an indispensable step in sequencing genomes of new organisms and studying structural genomic changes. In recent years, the dynamic development of next-generation sequencing (NGS) methods raises hopes for making whole-genome sequencing a fast and reliable tool used, f...

Descripción completa

Detalles Bibliográficos
Autores principales: Nałęcz-Charkiewicz, Katarzyna, Nowak, Robert M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8988116/
https://www.ncbi.nlm.nih.gov/pubmed/35392798
http://dx.doi.org/10.1186/s12859-022-04661-7
_version_ 1784682891155865600
author Nałęcz-Charkiewicz, Katarzyna
Nowak, Robert M.
author_facet Nałęcz-Charkiewicz, Katarzyna
Nowak, Robert M.
author_sort Nałęcz-Charkiewicz, Katarzyna
collection PubMed
description BACKGROUND: The assembly task is an indispensable step in sequencing genomes of new organisms and studying structural genomic changes. In recent years, the dynamic development of next-generation sequencing (NGS) methods raises hopes for making whole-genome sequencing a fast and reliable tool used, for example, in medical diagnostics. However, this is hampered by the slowness and computational requirements of the current processing algorithms, which raises the need to develop more efficient algorithms. One possible approach, still little explored, is the use of quantum computing. RESULTS: We present a proof of concept of de novo assembly algorithm, using the Genomic Signal Processing approach, detecting overlaps between DNA reads by calculating the Pearson correlation coefficient and formulating the assembly problem as an optimization task (Traveling Salesman Problem). Computations performed on a classic computer were compared with the results achieved by a hybrid method combining CPU and QPU calculations. For this purpose quantum annealer by D-Wave was used. The experiments were performed with artificially generated data and DNA reads coming from a simulator, with actual organism genomes used as input sequences. To our knowledge, this work is one of the few where actual sequences of organisms were used to study the de novo assembly task on quantum annealer. CONCLUSIONS: Proof of concept carried out by us showed that the use of quantum annealer (QA) for the de novo assembly task might be a promising alternative to the computations performed in the classical model. The current computing power of the available devices requires a hybrid approach (combining CPU and QPU computations). The next step may be developing a hybrid algorithm strictly dedicated to the de novo assembly task, using its specificity (e.g. the sparsity and bounded degree of the overlap-layout-consensus graph).
format Online
Article
Text
id pubmed-8988116
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-89881162022-04-07 Algorithm for DNA sequence assembly by quantum annealing Nałęcz-Charkiewicz, Katarzyna Nowak, Robert M. BMC Bioinformatics Research BACKGROUND: The assembly task is an indispensable step in sequencing genomes of new organisms and studying structural genomic changes. In recent years, the dynamic development of next-generation sequencing (NGS) methods raises hopes for making whole-genome sequencing a fast and reliable tool used, for example, in medical diagnostics. However, this is hampered by the slowness and computational requirements of the current processing algorithms, which raises the need to develop more efficient algorithms. One possible approach, still little explored, is the use of quantum computing. RESULTS: We present a proof of concept of de novo assembly algorithm, using the Genomic Signal Processing approach, detecting overlaps between DNA reads by calculating the Pearson correlation coefficient and formulating the assembly problem as an optimization task (Traveling Salesman Problem). Computations performed on a classic computer were compared with the results achieved by a hybrid method combining CPU and QPU calculations. For this purpose quantum annealer by D-Wave was used. The experiments were performed with artificially generated data and DNA reads coming from a simulator, with actual organism genomes used as input sequences. To our knowledge, this work is one of the few where actual sequences of organisms were used to study the de novo assembly task on quantum annealer. CONCLUSIONS: Proof of concept carried out by us showed that the use of quantum annealer (QA) for the de novo assembly task might be a promising alternative to the computations performed in the classical model. The current computing power of the available devices requires a hybrid approach (combining CPU and QPU computations). The next step may be developing a hybrid algorithm strictly dedicated to the de novo assembly task, using its specificity (e.g. the sparsity and bounded degree of the overlap-layout-consensus graph). BioMed Central 2022-04-07 /pmc/articles/PMC8988116/ /pubmed/35392798 http://dx.doi.org/10.1186/s12859-022-04661-7 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Nałęcz-Charkiewicz, Katarzyna
Nowak, Robert M.
Algorithm for DNA sequence assembly by quantum annealing
title Algorithm for DNA sequence assembly by quantum annealing
title_full Algorithm for DNA sequence assembly by quantum annealing
title_fullStr Algorithm for DNA sequence assembly by quantum annealing
title_full_unstemmed Algorithm for DNA sequence assembly by quantum annealing
title_short Algorithm for DNA sequence assembly by quantum annealing
title_sort algorithm for dna sequence assembly by quantum annealing
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8988116/
https://www.ncbi.nlm.nih.gov/pubmed/35392798
http://dx.doi.org/10.1186/s12859-022-04661-7
work_keys_str_mv AT nałeczcharkiewiczkatarzyna algorithmfordnasequenceassemblybyquantumannealing
AT nowakrobertm algorithmfordnasequenceassemblybyquantumannealing