Cargando…

CUDASW++: optimizing Smith-Waterman sequence database searches for CUDA-enabled graphics processing units

BACKGROUND: The Smith-Waterman algorithm is one of the most widely used tools for searching biological sequence databases due to its high sensitivity. Unfortunately, the Smith-Waterman algorithm is computationally demanding, which is further compounded by the exponential growth of sequence databases...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Yongchao, Maskell, Douglas L, Schmidt, Bertil
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2694204/
https://www.ncbi.nlm.nih.gov/pubmed/19416548
http://dx.doi.org/10.1186/1756-0500-2-73
_version_ 1782168055224205312
author Liu, Yongchao
Maskell, Douglas L
Schmidt, Bertil
author_facet Liu, Yongchao
Maskell, Douglas L
Schmidt, Bertil
author_sort Liu, Yongchao
collection PubMed
description BACKGROUND: The Smith-Waterman algorithm is one of the most widely used tools for searching biological sequence databases due to its high sensitivity. Unfortunately, the Smith-Waterman algorithm is computationally demanding, which is further compounded by the exponential growth of sequence databases. The recent emergence of many-core architectures, and their associated programming interfaces, provides an opportunity to accelerate sequence database searches using commonly available and inexpensive hardware. FINDINGS: Our CUDASW++ implementation (benchmarked on a single-GPU NVIDIA GeForce GTX 280 graphics card and a dual-GPU GeForce GTX 295 graphics card) provides a significant performance improvement compared to other publicly available implementations, such as SWPS3, CBESW, SW-CUDA, and NCBI-BLAST. CUDASW++ supports query sequences of length up to 59K and for query sequences ranging in length from 144 to 5,478 in Swiss-Prot release 56.6, the single-GPU version achieves an average performance of 9.509 GCUPS with a lowest performance of 9.039 GCUPS and a highest performance of 9.660 GCUPS, and the dual-GPU version achieves an average performance of 14.484 GCUPS with a lowest performance of 10.660 GCUPS and a highest performance of 16.087 GCUPS. CONCLUSION: CUDASW++ is publicly available open-source software. It provides a significant performance improvement for Smith-Waterman-based protein sequence database searches by fully exploiting the compute capability of commonly used CUDA-enabled low-cost GPUs.
format Text
id pubmed-2694204
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26942042009-06-09 CUDASW++: optimizing Smith-Waterman sequence database searches for CUDA-enabled graphics processing units Liu, Yongchao Maskell, Douglas L Schmidt, Bertil BMC Res Notes Technical Note BACKGROUND: The Smith-Waterman algorithm is one of the most widely used tools for searching biological sequence databases due to its high sensitivity. Unfortunately, the Smith-Waterman algorithm is computationally demanding, which is further compounded by the exponential growth of sequence databases. The recent emergence of many-core architectures, and their associated programming interfaces, provides an opportunity to accelerate sequence database searches using commonly available and inexpensive hardware. FINDINGS: Our CUDASW++ implementation (benchmarked on a single-GPU NVIDIA GeForce GTX 280 graphics card and a dual-GPU GeForce GTX 295 graphics card) provides a significant performance improvement compared to other publicly available implementations, such as SWPS3, CBESW, SW-CUDA, and NCBI-BLAST. CUDASW++ supports query sequences of length up to 59K and for query sequences ranging in length from 144 to 5,478 in Swiss-Prot release 56.6, the single-GPU version achieves an average performance of 9.509 GCUPS with a lowest performance of 9.039 GCUPS and a highest performance of 9.660 GCUPS, and the dual-GPU version achieves an average performance of 14.484 GCUPS with a lowest performance of 10.660 GCUPS and a highest performance of 16.087 GCUPS. CONCLUSION: CUDASW++ is publicly available open-source software. It provides a significant performance improvement for Smith-Waterman-based protein sequence database searches by fully exploiting the compute capability of commonly used CUDA-enabled low-cost GPUs. BioMed Central 2009-05-06 /pmc/articles/PMC2694204/ /pubmed/19416548 http://dx.doi.org/10.1186/1756-0500-2-73 Text en Copyright © 2009 Liu et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Technical Note
Liu, Yongchao
Maskell, Douglas L
Schmidt, Bertil
CUDASW++: optimizing Smith-Waterman sequence database searches for CUDA-enabled graphics processing units
title CUDASW++: optimizing Smith-Waterman sequence database searches for CUDA-enabled graphics processing units
title_full CUDASW++: optimizing Smith-Waterman sequence database searches for CUDA-enabled graphics processing units
title_fullStr CUDASW++: optimizing Smith-Waterman sequence database searches for CUDA-enabled graphics processing units
title_full_unstemmed CUDASW++: optimizing Smith-Waterman sequence database searches for CUDA-enabled graphics processing units
title_short CUDASW++: optimizing Smith-Waterman sequence database searches for CUDA-enabled graphics processing units
title_sort cudasw++: optimizing smith-waterman sequence database searches for cuda-enabled graphics processing units
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2694204/
https://www.ncbi.nlm.nih.gov/pubmed/19416548
http://dx.doi.org/10.1186/1756-0500-2-73
work_keys_str_mv AT liuyongchao cudaswoptimizingsmithwatermansequencedatabasesearchesforcudaenabledgraphicsprocessingunits
AT maskelldouglasl cudaswoptimizingsmithwatermansequencedatabasesearchesforcudaenabledgraphicsprocessingunits
AT schmidtbertil cudaswoptimizingsmithwatermansequencedatabasesearchesforcudaenabledgraphicsprocessingunits