Cargando…

GPU-acceleration of the distributed-memory database peptide search of mass spectrometry data

Database peptide search is the primary computational technique for identifying peptides from the mass spectrometry (MS) data. Graphical Processing Units (GPU) computing is now ubiquitous in the current-generation of high-performance computing (HPC) systems, yet its application in the database peptid...

Descripción completa

Detalles Bibliográficos
Autores principales: Haseeb, Muhammad, Saeed, Fahad
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10618243/
https://www.ncbi.nlm.nih.gov/pubmed/37907498
http://dx.doi.org/10.1038/s41598-023-43033-w
_version_ 1785129732387373056
author Haseeb, Muhammad
Saeed, Fahad
author_facet Haseeb, Muhammad
Saeed, Fahad
author_sort Haseeb, Muhammad
collection PubMed
description Database peptide search is the primary computational technique for identifying peptides from the mass spectrometry (MS) data. Graphical Processing Units (GPU) computing is now ubiquitous in the current-generation of high-performance computing (HPC) systems, yet its application in the database peptide search domain remains limited. Part of the reason is the use of sub-optimal algorithms in the existing GPU-accelerated methods resulting in significantly inefficient hardware utilization. In this paper, we design and implement a new-age CPU-GPU HPC framework, called GiCOPS, for efficient and complete GPU-acceleration of the modern database peptide search algorithms on supercomputers. Our experimentation shows that the GiCOPS exhibits between 1.2 to 5[Formula: see text] speed improvement over its CPU-only predecessor, HiCOPS, and over 10[Formula: see text] improvement over several existing GPU-based database search algorithms for sufficiently large experiment sizes. We further assess and optimize the performance of our framework using the Roofline Model and report near-optimal results for several metrics including computations per second, occupancy rate, memory workload, branch efficiency and shared memory performance. Finally, the CPU-GPU methods and optimizations proposed in our work for complex integer- and memory-bounded algorithmic pipelines can also be extended to accelerate the existing and future peptide identification algorithms. GiCOPS is now integrated with our umbrella HPC framework HiCOPS and is available at: https://github.com/pcdslab/gicops.
format Online
Article
Text
id pubmed-10618243
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-106182432023-11-02 GPU-acceleration of the distributed-memory database peptide search of mass spectrometry data Haseeb, Muhammad Saeed, Fahad Sci Rep Article Database peptide search is the primary computational technique for identifying peptides from the mass spectrometry (MS) data. Graphical Processing Units (GPU) computing is now ubiquitous in the current-generation of high-performance computing (HPC) systems, yet its application in the database peptide search domain remains limited. Part of the reason is the use of sub-optimal algorithms in the existing GPU-accelerated methods resulting in significantly inefficient hardware utilization. In this paper, we design and implement a new-age CPU-GPU HPC framework, called GiCOPS, for efficient and complete GPU-acceleration of the modern database peptide search algorithms on supercomputers. Our experimentation shows that the GiCOPS exhibits between 1.2 to 5[Formula: see text] speed improvement over its CPU-only predecessor, HiCOPS, and over 10[Formula: see text] improvement over several existing GPU-based database search algorithms for sufficiently large experiment sizes. We further assess and optimize the performance of our framework using the Roofline Model and report near-optimal results for several metrics including computations per second, occupancy rate, memory workload, branch efficiency and shared memory performance. Finally, the CPU-GPU methods and optimizations proposed in our work for complex integer- and memory-bounded algorithmic pipelines can also be extended to accelerate the existing and future peptide identification algorithms. GiCOPS is now integrated with our umbrella HPC framework HiCOPS and is available at: https://github.com/pcdslab/gicops. Nature Publishing Group UK 2023-10-31 /pmc/articles/PMC10618243/ /pubmed/37907498 http://dx.doi.org/10.1038/s41598-023-43033-w Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Haseeb, Muhammad
Saeed, Fahad
GPU-acceleration of the distributed-memory database peptide search of mass spectrometry data
title GPU-acceleration of the distributed-memory database peptide search of mass spectrometry data
title_full GPU-acceleration of the distributed-memory database peptide search of mass spectrometry data
title_fullStr GPU-acceleration of the distributed-memory database peptide search of mass spectrometry data
title_full_unstemmed GPU-acceleration of the distributed-memory database peptide search of mass spectrometry data
title_short GPU-acceleration of the distributed-memory database peptide search of mass spectrometry data
title_sort gpu-acceleration of the distributed-memory database peptide search of mass spectrometry data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10618243/
https://www.ncbi.nlm.nih.gov/pubmed/37907498
http://dx.doi.org/10.1038/s41598-023-43033-w
work_keys_str_mv AT haseebmuhammad gpuaccelerationofthedistributedmemorydatabasepeptidesearchofmassspectrometrydata
AT saeedfahad gpuaccelerationofthedistributedmemorydatabasepeptidesearchofmassspectrometrydata