Cargando…

GPU-accelerated and pipelined methylation calling

MOTIVATION: The third-generation DNA sequencing technologies, such as Nanopore Sequencing, can operate at very high speeds and produce longer reads, which in turn results in a challenge for the computational analysis of such massive data. Nanopolish is a software package for signal-level analysis of...

Descripción completa

Detalles Bibliográficos
Autores principales: Feng, Yilin, Gudukbay Akbulut, Gulsum, Tang, Xulong, Gunasekaran, Jashwant Raj, Rahman, Amatur, Medvedev, Paul, Kandemir, Mahmut
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9757827/
https://www.ncbi.nlm.nih.gov/pubmed/36699365
http://dx.doi.org/10.1093/bioadv/vbac088
_version_ 1784851907143008256
author Feng, Yilin
Gudukbay Akbulut, Gulsum
Tang, Xulong
Gunasekaran, Jashwant Raj
Rahman, Amatur
Medvedev, Paul
Kandemir, Mahmut
author_facet Feng, Yilin
Gudukbay Akbulut, Gulsum
Tang, Xulong
Gunasekaran, Jashwant Raj
Rahman, Amatur
Medvedev, Paul
Kandemir, Mahmut
author_sort Feng, Yilin
collection PubMed
description MOTIVATION: The third-generation DNA sequencing technologies, such as Nanopore Sequencing, can operate at very high speeds and produce longer reads, which in turn results in a challenge for the computational analysis of such massive data. Nanopolish is a software package for signal-level analysis of Oxford Nanopore sequencing data. Call-methylation module of Nanopolish can detect methylation based on Hidden Markov Model (HMM). However, Nanopolish is limited by the long running time of some serial and computationally expensive processes. Among these, Adaptive Banded Event Alignment (ABEA) is the most time-consuming step, and the prior work, f5c, has already parallelized and optimized ABEA on GPU. As a result, the remaining methylation score calculation part, which uses HMM to identify if a given base is methylated or not, has become the new performance bottleneck. RESULTS: This article focuses on the call-methylation module that resides in the Nanopolish package. We propose Galaxy-methyl, which parallelizes and optimizes the methylation score calculation step on GPU and then pipelines the four steps of the call-methylation module. Galaxy-methyl increases the execution concurrency across CPUs and GPUs as well as hardware resource utilization for both. The experimental results collected indicate that Galaxy-methyl can achieve 3×–5× speedup compared with Nanopolish, and reduce the total execution time by 35% compared with f5c, on average. AVAILABILITY AND IMPLEMENTATION: The source code of Galaxy-methyl is available at https://github.com/fengyilin118/.
format Online
Article
Text
id pubmed-9757827
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-97578272023-01-24 GPU-accelerated and pipelined methylation calling Feng, Yilin Gudukbay Akbulut, Gulsum Tang, Xulong Gunasekaran, Jashwant Raj Rahman, Amatur Medvedev, Paul Kandemir, Mahmut Bioinform Adv Original Paper MOTIVATION: The third-generation DNA sequencing technologies, such as Nanopore Sequencing, can operate at very high speeds and produce longer reads, which in turn results in a challenge for the computational analysis of such massive data. Nanopolish is a software package for signal-level analysis of Oxford Nanopore sequencing data. Call-methylation module of Nanopolish can detect methylation based on Hidden Markov Model (HMM). However, Nanopolish is limited by the long running time of some serial and computationally expensive processes. Among these, Adaptive Banded Event Alignment (ABEA) is the most time-consuming step, and the prior work, f5c, has already parallelized and optimized ABEA on GPU. As a result, the remaining methylation score calculation part, which uses HMM to identify if a given base is methylated or not, has become the new performance bottleneck. RESULTS: This article focuses on the call-methylation module that resides in the Nanopolish package. We propose Galaxy-methyl, which parallelizes and optimizes the methylation score calculation step on GPU and then pipelines the four steps of the call-methylation module. Galaxy-methyl increases the execution concurrency across CPUs and GPUs as well as hardware resource utilization for both. The experimental results collected indicate that Galaxy-methyl can achieve 3×–5× speedup compared with Nanopolish, and reduce the total execution time by 35% compared with f5c, on average. AVAILABILITY AND IMPLEMENTATION: The source code of Galaxy-methyl is available at https://github.com/fengyilin118/. Oxford University Press 2022-11-30 /pmc/articles/PMC9757827/ /pubmed/36699365 http://dx.doi.org/10.1093/bioadv/vbac088 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Feng, Yilin
Gudukbay Akbulut, Gulsum
Tang, Xulong
Gunasekaran, Jashwant Raj
Rahman, Amatur
Medvedev, Paul
Kandemir, Mahmut
GPU-accelerated and pipelined methylation calling
title GPU-accelerated and pipelined methylation calling
title_full GPU-accelerated and pipelined methylation calling
title_fullStr GPU-accelerated and pipelined methylation calling
title_full_unstemmed GPU-accelerated and pipelined methylation calling
title_short GPU-accelerated and pipelined methylation calling
title_sort gpu-accelerated and pipelined methylation calling
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9757827/
https://www.ncbi.nlm.nih.gov/pubmed/36699365
http://dx.doi.org/10.1093/bioadv/vbac088
work_keys_str_mv AT fengyilin gpuacceleratedandpipelinedmethylationcalling
AT gudukbayakbulutgulsum gpuacceleratedandpipelinedmethylationcalling
AT tangxulong gpuacceleratedandpipelinedmethylationcalling
AT gunasekaranjashwantraj gpuacceleratedandpipelinedmethylationcalling
AT rahmanamatur gpuacceleratedandpipelinedmethylationcalling
AT medvedevpaul gpuacceleratedandpipelinedmethylationcalling
AT kandemirmahmut gpuacceleratedandpipelinedmethylationcalling