Cargando…

GASAL2: a GPU accelerated sequence alignment library for high-throughput NGS data

BACKGROUND: Due the computational complexity of sequence alignment algorithms, various accelerated solutions have been proposed to speedup this analysis. NVBIO is the only available GPU library that accelerates sequence alignment of high-throughput NGS data, but has limited performance. In this arti...

Descripción completa

Detalles Bibliográficos
Autores principales: Ahmed, Nauman, Lévy, Jonathan, Ren, Shanshan, Mushtaq, Hamid, Bertels, Koen, Al-Ars, Zaid
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6815017/
https://www.ncbi.nlm.nih.gov/pubmed/31653208
http://dx.doi.org/10.1186/s12859-019-3086-9
_version_ 1783463113189752832
author Ahmed, Nauman
Lévy, Jonathan
Ren, Shanshan
Mushtaq, Hamid
Bertels, Koen
Al-Ars, Zaid
author_facet Ahmed, Nauman
Lévy, Jonathan
Ren, Shanshan
Mushtaq, Hamid
Bertels, Koen
Al-Ars, Zaid
author_sort Ahmed, Nauman
collection PubMed
description BACKGROUND: Due the computational complexity of sequence alignment algorithms, various accelerated solutions have been proposed to speedup this analysis. NVBIO is the only available GPU library that accelerates sequence alignment of high-throughput NGS data, but has limited performance. In this article we present GASAL2, a GPU library for aligning DNA and RNA sequences that outperforms existing CPU and GPU libraries. RESULTS: The GASAL2 library provides specialized, accelerated kernels for local, global and all types of semi-global alignment. Pairwise sequence alignment can be performed with and without traceback. GASAL2 outperforms the fastest CPU-optimized SIMD implementations such as SeqAn and Parasail, as well as NVIDIA’s own GPU-based library known as NVBIO. GASAL2 is unique in performing sequence packing on GPU, which is up to 750x faster than NVBIO. Overall on Geforce GTX 1080 Ti GPU, GASAL2 is up to 21x faster than Parasail on a dual socket hyper-threaded Intel Xeon system with 28 cores and up to 13x faster than NVBIO with a query length of up to 300 bases and 100 bases, respectively. GASAL2 alignment functions are asynchronous/non-blocking and allow full overlap of CPU and GPU execution. The paper shows how to use GASAL2 to accelerate BWA-MEM, speeding up the local alignment by 20x, which gives an overall application speedup of 1.3x vs. CPU with up to 12 threads. CONCLUSIONS: The library provides high performance APIs for local, global and semi-global alignment that can be easily integrated into various bioinformatics tools.
format Online
Article
Text
id pubmed-6815017
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-68150172019-10-31 GASAL2: a GPU accelerated sequence alignment library for high-throughput NGS data Ahmed, Nauman Lévy, Jonathan Ren, Shanshan Mushtaq, Hamid Bertels, Koen Al-Ars, Zaid BMC Bioinformatics Software BACKGROUND: Due the computational complexity of sequence alignment algorithms, various accelerated solutions have been proposed to speedup this analysis. NVBIO is the only available GPU library that accelerates sequence alignment of high-throughput NGS data, but has limited performance. In this article we present GASAL2, a GPU library for aligning DNA and RNA sequences that outperforms existing CPU and GPU libraries. RESULTS: The GASAL2 library provides specialized, accelerated kernels for local, global and all types of semi-global alignment. Pairwise sequence alignment can be performed with and without traceback. GASAL2 outperforms the fastest CPU-optimized SIMD implementations such as SeqAn and Parasail, as well as NVIDIA’s own GPU-based library known as NVBIO. GASAL2 is unique in performing sequence packing on GPU, which is up to 750x faster than NVBIO. Overall on Geforce GTX 1080 Ti GPU, GASAL2 is up to 21x faster than Parasail on a dual socket hyper-threaded Intel Xeon system with 28 cores and up to 13x faster than NVBIO with a query length of up to 300 bases and 100 bases, respectively. GASAL2 alignment functions are asynchronous/non-blocking and allow full overlap of CPU and GPU execution. The paper shows how to use GASAL2 to accelerate BWA-MEM, speeding up the local alignment by 20x, which gives an overall application speedup of 1.3x vs. CPU with up to 12 threads. CONCLUSIONS: The library provides high performance APIs for local, global and semi-global alignment that can be easily integrated into various bioinformatics tools. BioMed Central 2019-10-25 /pmc/articles/PMC6815017/ /pubmed/31653208 http://dx.doi.org/10.1186/s12859-019-3086-9 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Ahmed, Nauman
Lévy, Jonathan
Ren, Shanshan
Mushtaq, Hamid
Bertels, Koen
Al-Ars, Zaid
GASAL2: a GPU accelerated sequence alignment library for high-throughput NGS data
title GASAL2: a GPU accelerated sequence alignment library for high-throughput NGS data
title_full GASAL2: a GPU accelerated sequence alignment library for high-throughput NGS data
title_fullStr GASAL2: a GPU accelerated sequence alignment library for high-throughput NGS data
title_full_unstemmed GASAL2: a GPU accelerated sequence alignment library for high-throughput NGS data
title_short GASAL2: a GPU accelerated sequence alignment library for high-throughput NGS data
title_sort gasal2: a gpu accelerated sequence alignment library for high-throughput ngs data
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6815017/
https://www.ncbi.nlm.nih.gov/pubmed/31653208
http://dx.doi.org/10.1186/s12859-019-3086-9
work_keys_str_mv AT ahmednauman gasal2agpuacceleratedsequencealignmentlibraryforhighthroughputngsdata
AT levyjonathan gasal2agpuacceleratedsequencealignmentlibraryforhighthroughputngsdata
AT renshanshan gasal2agpuacceleratedsequencealignmentlibraryforhighthroughputngsdata
AT mushtaqhamid gasal2agpuacceleratedsequencealignmentlibraryforhighthroughputngsdata
AT bertelskoen gasal2agpuacceleratedsequencealignmentlibraryforhighthroughputngsdata
AT alarszaid gasal2agpuacceleratedsequencealignmentlibraryforhighthroughputngsdata