Cargando…

An Improved Distance Matrix Computation Algorithm for Multicore Clusters

Distance matrix has diverse usage in different research areas. Its computation is typically an essential task in most bioinformatics applications, especially in multiple sequence alignment. The gigantic explosion of biological sequence databases leads to an urgent need for accelerating these computa...

Descripción completa

Detalles Bibliográficos
Autores principales: Al-Neama, Mohammed W., Reda, Naglaa M., Ghaleb, Fayed F. M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4074972/
https://www.ncbi.nlm.nih.gov/pubmed/25013779
http://dx.doi.org/10.1155/2014/406178
_version_ 1782323274736205824
author Al-Neama, Mohammed W.
Reda, Naglaa M.
Ghaleb, Fayed F. M.
author_facet Al-Neama, Mohammed W.
Reda, Naglaa M.
Ghaleb, Fayed F. M.
author_sort Al-Neama, Mohammed W.
collection PubMed
description Distance matrix has diverse usage in different research areas. Its computation is typically an essential task in most bioinformatics applications, especially in multiple sequence alignment. The gigantic explosion of biological sequence databases leads to an urgent need for accelerating these computations. DistVect algorithm was introduced in the paper of Al-Neama et al. (in press) to present a recent approach for vectorizing distance matrix computing. It showed an efficient performance in both sequential and parallel computing. However, the multicore cluster systems, which are available now, with their scalability and performance/cost ratio, meet the need for more powerful and efficient performance. This paper proposes DistVect1 as highly efficient parallel vectorized algorithm with high performance for computing distance matrix, addressed to multicore clusters. It reformulates DistVect1 vectorized algorithm in terms of clusters primitives. It deduces an efficient approach of partitioning and scheduling computations, convenient to this type of architecture. Implementations employ potential of both MPI and OpenMP libraries. Experimental results show that the proposed method performs improvement of around 3-fold speedup upon SSE2. Further it also achieves speedups more than 9 orders of magnitude compared to the publicly available parallel implementation utilized in ClustalW-MPI.
format Online
Article
Text
id pubmed-4074972
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-40749722014-07-10 An Improved Distance Matrix Computation Algorithm for Multicore Clusters Al-Neama, Mohammed W. Reda, Naglaa M. Ghaleb, Fayed F. M. Biomed Res Int Research Article Distance matrix has diverse usage in different research areas. Its computation is typically an essential task in most bioinformatics applications, especially in multiple sequence alignment. The gigantic explosion of biological sequence databases leads to an urgent need for accelerating these computations. DistVect algorithm was introduced in the paper of Al-Neama et al. (in press) to present a recent approach for vectorizing distance matrix computing. It showed an efficient performance in both sequential and parallel computing. However, the multicore cluster systems, which are available now, with their scalability and performance/cost ratio, meet the need for more powerful and efficient performance. This paper proposes DistVect1 as highly efficient parallel vectorized algorithm with high performance for computing distance matrix, addressed to multicore clusters. It reformulates DistVect1 vectorized algorithm in terms of clusters primitives. It deduces an efficient approach of partitioning and scheduling computations, convenient to this type of architecture. Implementations employ potential of both MPI and OpenMP libraries. Experimental results show that the proposed method performs improvement of around 3-fold speedup upon SSE2. Further it also achieves speedups more than 9 orders of magnitude compared to the publicly available parallel implementation utilized in ClustalW-MPI. Hindawi Publishing Corporation 2014 2014-06-12 /pmc/articles/PMC4074972/ /pubmed/25013779 http://dx.doi.org/10.1155/2014/406178 Text en Copyright © 2014 Mohammed W. Al-Neama et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Al-Neama, Mohammed W.
Reda, Naglaa M.
Ghaleb, Fayed F. M.
An Improved Distance Matrix Computation Algorithm for Multicore Clusters
title An Improved Distance Matrix Computation Algorithm for Multicore Clusters
title_full An Improved Distance Matrix Computation Algorithm for Multicore Clusters
title_fullStr An Improved Distance Matrix Computation Algorithm for Multicore Clusters
title_full_unstemmed An Improved Distance Matrix Computation Algorithm for Multicore Clusters
title_short An Improved Distance Matrix Computation Algorithm for Multicore Clusters
title_sort improved distance matrix computation algorithm for multicore clusters
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4074972/
https://www.ncbi.nlm.nih.gov/pubmed/25013779
http://dx.doi.org/10.1155/2014/406178
work_keys_str_mv AT alneamamohammedw animproveddistancematrixcomputationalgorithmformulticoreclusters
AT redanaglaam animproveddistancematrixcomputationalgorithmformulticoreclusters
AT ghalebfayedfm animproveddistancematrixcomputationalgorithmformulticoreclusters
AT alneamamohammedw improveddistancematrixcomputationalgorithmformulticoreclusters
AT redanaglaam improveddistancematrixcomputationalgorithmformulticoreclusters
AT ghalebfayedfm improveddistancematrixcomputationalgorithmformulticoreclusters