Cargando…

Faster Smith-Waterman database searches with inter-sequence SIMD parallelisation

BACKGROUND: The Smith-Waterman algorithm for local sequence alignment is more sensitive than heuristic methods for database searching, but also more time-consuming. The fastest approach to parallelisation with SIMD technology has previously been described by Farrar in 2007. The aim of this study was...

Descripción completa

Detalles Bibliográficos
Autor principal: Rognes, Torbjørn
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3120707/
https://www.ncbi.nlm.nih.gov/pubmed/21631914
http://dx.doi.org/10.1186/1471-2105-12-221
_version_ 1782206739107545088
author Rognes, Torbjørn
author_facet Rognes, Torbjørn
author_sort Rognes, Torbjørn
collection PubMed
description BACKGROUND: The Smith-Waterman algorithm for local sequence alignment is more sensitive than heuristic methods for database searching, but also more time-consuming. The fastest approach to parallelisation with SIMD technology has previously been described by Farrar in 2007. The aim of this study was to explore whether further speed could be gained by other approaches to parallelisation. RESULTS: A faster approach and implementation is described and benchmarked. In the new tool SWIPE, residues from sixteen different database sequences are compared in parallel to one query residue. Using a 375 residue query sequence a speed of 106 billion cell updates per second (GCUPS) was achieved on a dual Intel Xeon X5650 six-core processor system, which is over six times more rapid than software based on Farrar's 'striped' approach. SWIPE was about 2.5 times faster when the programs used only a single thread. For shorter queries, the increase in speed was larger. SWIPE was about twice as fast as BLAST when using the BLOSUM50 score matrix, while BLAST was about twice as fast as SWIPE for the BLOSUM62 matrix. The software is designed for 64 bit Linux on processors with SSSE3. Source code is available from http://dna.uio.no/swipe/ under the GNU Affero General Public License. CONCLUSIONS: Efficient parallelisation using SIMD on standard hardware makes it possible to run Smith-Waterman database searches more than six times faster than before. The approach described here could significantly widen the potential application of Smith-Waterman searches. Other applications that require optimal local alignment scores could also benefit from improved performance.
format Online
Article
Text
id pubmed-3120707
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31207072011-06-23 Faster Smith-Waterman database searches with inter-sequence SIMD parallelisation Rognes, Torbjørn BMC Bioinformatics Methodology Article BACKGROUND: The Smith-Waterman algorithm for local sequence alignment is more sensitive than heuristic methods for database searching, but also more time-consuming. The fastest approach to parallelisation with SIMD technology has previously been described by Farrar in 2007. The aim of this study was to explore whether further speed could be gained by other approaches to parallelisation. RESULTS: A faster approach and implementation is described and benchmarked. In the new tool SWIPE, residues from sixteen different database sequences are compared in parallel to one query residue. Using a 375 residue query sequence a speed of 106 billion cell updates per second (GCUPS) was achieved on a dual Intel Xeon X5650 six-core processor system, which is over six times more rapid than software based on Farrar's 'striped' approach. SWIPE was about 2.5 times faster when the programs used only a single thread. For shorter queries, the increase in speed was larger. SWIPE was about twice as fast as BLAST when using the BLOSUM50 score matrix, while BLAST was about twice as fast as SWIPE for the BLOSUM62 matrix. The software is designed for 64 bit Linux on processors with SSSE3. Source code is available from http://dna.uio.no/swipe/ under the GNU Affero General Public License. CONCLUSIONS: Efficient parallelisation using SIMD on standard hardware makes it possible to run Smith-Waterman database searches more than six times faster than before. The approach described here could significantly widen the potential application of Smith-Waterman searches. Other applications that require optimal local alignment scores could also benefit from improved performance. BioMed Central 2011-06-01 /pmc/articles/PMC3120707/ /pubmed/21631914 http://dx.doi.org/10.1186/1471-2105-12-221 Text en Copyright ©2011 Rognes; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Rognes, Torbjørn
Faster Smith-Waterman database searches with inter-sequence SIMD parallelisation
title Faster Smith-Waterman database searches with inter-sequence SIMD parallelisation
title_full Faster Smith-Waterman database searches with inter-sequence SIMD parallelisation
title_fullStr Faster Smith-Waterman database searches with inter-sequence SIMD parallelisation
title_full_unstemmed Faster Smith-Waterman database searches with inter-sequence SIMD parallelisation
title_short Faster Smith-Waterman database searches with inter-sequence SIMD parallelisation
title_sort faster smith-waterman database searches with inter-sequence simd parallelisation
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3120707/
https://www.ncbi.nlm.nih.gov/pubmed/21631914
http://dx.doi.org/10.1186/1471-2105-12-221
work_keys_str_mv AT rognestorbjørn fastersmithwatermandatabasesearcheswithintersequencesimdparallelisation