Cargando…
GPU-accelerated and pipelined methylation calling
MOTIVATION: The third-generation DNA sequencing technologies, such as Nanopore Sequencing, can operate at very high speeds and produce longer reads, which in turn results in a challenge for the computational analysis of such massive data. Nanopolish is a software package for signal-level analysis of...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9757827/ https://www.ncbi.nlm.nih.gov/pubmed/36699365 http://dx.doi.org/10.1093/bioadv/vbac088 |
_version_ | 1784851907143008256 |
---|---|
author | Feng, Yilin Gudukbay Akbulut, Gulsum Tang, Xulong Gunasekaran, Jashwant Raj Rahman, Amatur Medvedev, Paul Kandemir, Mahmut |
author_facet | Feng, Yilin Gudukbay Akbulut, Gulsum Tang, Xulong Gunasekaran, Jashwant Raj Rahman, Amatur Medvedev, Paul Kandemir, Mahmut |
author_sort | Feng, Yilin |
collection | PubMed |
description | MOTIVATION: The third-generation DNA sequencing technologies, such as Nanopore Sequencing, can operate at very high speeds and produce longer reads, which in turn results in a challenge for the computational analysis of such massive data. Nanopolish is a software package for signal-level analysis of Oxford Nanopore sequencing data. Call-methylation module of Nanopolish can detect methylation based on Hidden Markov Model (HMM). However, Nanopolish is limited by the long running time of some serial and computationally expensive processes. Among these, Adaptive Banded Event Alignment (ABEA) is the most time-consuming step, and the prior work, f5c, has already parallelized and optimized ABEA on GPU. As a result, the remaining methylation score calculation part, which uses HMM to identify if a given base is methylated or not, has become the new performance bottleneck. RESULTS: This article focuses on the call-methylation module that resides in the Nanopolish package. We propose Galaxy-methyl, which parallelizes and optimizes the methylation score calculation step on GPU and then pipelines the four steps of the call-methylation module. Galaxy-methyl increases the execution concurrency across CPUs and GPUs as well as hardware resource utilization for both. The experimental results collected indicate that Galaxy-methyl can achieve 3×–5× speedup compared with Nanopolish, and reduce the total execution time by 35% compared with f5c, on average. AVAILABILITY AND IMPLEMENTATION: The source code of Galaxy-methyl is available at https://github.com/fengyilin118/. |
format | Online Article Text |
id | pubmed-9757827 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-97578272023-01-24 GPU-accelerated and pipelined methylation calling Feng, Yilin Gudukbay Akbulut, Gulsum Tang, Xulong Gunasekaran, Jashwant Raj Rahman, Amatur Medvedev, Paul Kandemir, Mahmut Bioinform Adv Original Paper MOTIVATION: The third-generation DNA sequencing technologies, such as Nanopore Sequencing, can operate at very high speeds and produce longer reads, which in turn results in a challenge for the computational analysis of such massive data. Nanopolish is a software package for signal-level analysis of Oxford Nanopore sequencing data. Call-methylation module of Nanopolish can detect methylation based on Hidden Markov Model (HMM). However, Nanopolish is limited by the long running time of some serial and computationally expensive processes. Among these, Adaptive Banded Event Alignment (ABEA) is the most time-consuming step, and the prior work, f5c, has already parallelized and optimized ABEA on GPU. As a result, the remaining methylation score calculation part, which uses HMM to identify if a given base is methylated or not, has become the new performance bottleneck. RESULTS: This article focuses on the call-methylation module that resides in the Nanopolish package. We propose Galaxy-methyl, which parallelizes and optimizes the methylation score calculation step on GPU and then pipelines the four steps of the call-methylation module. Galaxy-methyl increases the execution concurrency across CPUs and GPUs as well as hardware resource utilization for both. The experimental results collected indicate that Galaxy-methyl can achieve 3×–5× speedup compared with Nanopolish, and reduce the total execution time by 35% compared with f5c, on average. AVAILABILITY AND IMPLEMENTATION: The source code of Galaxy-methyl is available at https://github.com/fengyilin118/. Oxford University Press 2022-11-30 /pmc/articles/PMC9757827/ /pubmed/36699365 http://dx.doi.org/10.1093/bioadv/vbac088 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Paper Feng, Yilin Gudukbay Akbulut, Gulsum Tang, Xulong Gunasekaran, Jashwant Raj Rahman, Amatur Medvedev, Paul Kandemir, Mahmut GPU-accelerated and pipelined methylation calling |
title | GPU-accelerated and pipelined methylation calling |
title_full | GPU-accelerated and pipelined methylation calling |
title_fullStr | GPU-accelerated and pipelined methylation calling |
title_full_unstemmed | GPU-accelerated and pipelined methylation calling |
title_short | GPU-accelerated and pipelined methylation calling |
title_sort | gpu-accelerated and pipelined methylation calling |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9757827/ https://www.ncbi.nlm.nih.gov/pubmed/36699365 http://dx.doi.org/10.1093/bioadv/vbac088 |
work_keys_str_mv | AT fengyilin gpuacceleratedandpipelinedmethylationcalling AT gudukbayakbulutgulsum gpuacceleratedandpipelinedmethylationcalling AT tangxulong gpuacceleratedandpipelinedmethylationcalling AT gunasekaranjashwantraj gpuacceleratedandpipelinedmethylationcalling AT rahmanamatur gpuacceleratedandpipelinedmethylationcalling AT medvedevpaul gpuacceleratedandpipelinedmethylationcalling AT kandemirmahmut gpuacceleratedandpipelinedmethylationcalling |