Cargando…

Accelerating Minimap2 for Accurate Long Read Alignment on GPUs

Long read sequencing technology is becoming increasingly popular for Precision Medicine applications like Whole Genome Sequencing (WGS) and microbial abundance estimation. Minimap2 is the state-of-the-art aligner and mapper used by the leading long read sequencing technologies, today. However, Minim...

Descripción completa

Detalles Bibliográficos
Autores principales: Sadasivan, Harisankar, Maric, Milos, Dawson, Eric, Iyer, Vishanth, Israeli, Johnny, Narayanasamy, Satish
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10018915/
https://www.ncbi.nlm.nih.gov/pubmed/36937168
http://dx.doi.org/10.26502/jbb.2642-91280067
_version_ 1784907912400863232
author Sadasivan, Harisankar
Maric, Milos
Dawson, Eric
Iyer, Vishanth
Israeli, Johnny
Narayanasamy, Satish
author_facet Sadasivan, Harisankar
Maric, Milos
Dawson, Eric
Iyer, Vishanth
Israeli, Johnny
Narayanasamy, Satish
author_sort Sadasivan, Harisankar
collection PubMed
description Long read sequencing technology is becoming increasingly popular for Precision Medicine applications like Whole Genome Sequencing (WGS) and microbial abundance estimation. Minimap2 is the state-of-the-art aligner and mapper used by the leading long read sequencing technologies, today. However, Minimap2 on CPUs is very slow for long noisy reads. ~60-70% of the run-time on a CPU comes from the highly sequential chaining step in Minimap2. On the other hand, most Point-of-Care computational workflows in long read sequencing use Graphics Processing Units (GPUs). We present minimap2-accelerated (mm2-ax), a heterogeneous design for sequence mapping and alignment where minimap2’s compute intensive chaining step is sped up on the GPU and demonstrate its time and cost benefits. We extract better intra-read parallelism from chaining without losing mapping accuracy by forward transforming Minimap2’s chaining algorithm. Moreover, we better utilize the high memory available on modern cloud instances apart from better workload balancing, data locality and minimal branch divergence on the GPU. We show mm2-ax on an NVIDIA A100 GPU improves the chaining step with 5.41 - 2.57X speedup and 4.07 - 1.93X speedup : costup over the fastest version of Minimap2, mm2-fast, benchmarked on a Google Cloud Platform instance of 30 SIMD cores.
format Online
Article
Text
id pubmed-10018915
institution National Center for Biotechnology Information
language English
publishDate 2023
record_format MEDLINE/PubMed
spelling pubmed-100189152023-03-16 Accelerating Minimap2 for Accurate Long Read Alignment on GPUs Sadasivan, Harisankar Maric, Milos Dawson, Eric Iyer, Vishanth Israeli, Johnny Narayanasamy, Satish J Biotechnol Biomed Article Long read sequencing technology is becoming increasingly popular for Precision Medicine applications like Whole Genome Sequencing (WGS) and microbial abundance estimation. Minimap2 is the state-of-the-art aligner and mapper used by the leading long read sequencing technologies, today. However, Minimap2 on CPUs is very slow for long noisy reads. ~60-70% of the run-time on a CPU comes from the highly sequential chaining step in Minimap2. On the other hand, most Point-of-Care computational workflows in long read sequencing use Graphics Processing Units (GPUs). We present minimap2-accelerated (mm2-ax), a heterogeneous design for sequence mapping and alignment where minimap2’s compute intensive chaining step is sped up on the GPU and demonstrate its time and cost benefits. We extract better intra-read parallelism from chaining without losing mapping accuracy by forward transforming Minimap2’s chaining algorithm. Moreover, we better utilize the high memory available on modern cloud instances apart from better workload balancing, data locality and minimal branch divergence on the GPU. We show mm2-ax on an NVIDIA A100 GPU improves the chaining step with 5.41 - 2.57X speedup and 4.07 - 1.93X speedup : costup over the fastest version of Minimap2, mm2-fast, benchmarked on a Google Cloud Platform instance of 30 SIMD cores. 2023 2023-01-20 /pmc/articles/PMC10018915/ /pubmed/36937168 http://dx.doi.org/10.26502/jbb.2642-91280067 Text en https://creativecommons.org/licenses/by/4.0/This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC-BY) license 4.0
spellingShingle Article
Sadasivan, Harisankar
Maric, Milos
Dawson, Eric
Iyer, Vishanth
Israeli, Johnny
Narayanasamy, Satish
Accelerating Minimap2 for Accurate Long Read Alignment on GPUs
title Accelerating Minimap2 for Accurate Long Read Alignment on GPUs
title_full Accelerating Minimap2 for Accurate Long Read Alignment on GPUs
title_fullStr Accelerating Minimap2 for Accurate Long Read Alignment on GPUs
title_full_unstemmed Accelerating Minimap2 for Accurate Long Read Alignment on GPUs
title_short Accelerating Minimap2 for Accurate Long Read Alignment on GPUs
title_sort accelerating minimap2 for accurate long read alignment on gpus
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10018915/
https://www.ncbi.nlm.nih.gov/pubmed/36937168
http://dx.doi.org/10.26502/jbb.2642-91280067
work_keys_str_mv AT sadasivanharisankar acceleratingminimap2foraccuratelongreadalignmentongpus
AT maricmilos acceleratingminimap2foraccuratelongreadalignmentongpus
AT dawsoneric acceleratingminimap2foraccuratelongreadalignmentongpus
AT iyervishanth acceleratingminimap2foraccuratelongreadalignmentongpus
AT israelijohnny acceleratingminimap2foraccuratelongreadalignmentongpus
AT narayanasamysatish acceleratingminimap2foraccuratelongreadalignmentongpus