Cargando…
A Practical and Scalable Tool to Find Overlaps between Sequences
The evolution of the next generation sequencing technology increases the demand for efficient solutions, in terms of space and time, for several bioinformatics problems. This paper presents a practical and easy-to-implement solution for one of these problems, namely, the all-pairs suffix-prefix prob...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4417569/ https://www.ncbi.nlm.nih.gov/pubmed/25961045 http://dx.doi.org/10.1155/2015/905261 |
_version_ | 1782369380668014592 |
---|---|
author | Haj Rachid, Maan Malluhi, Qutaibah |
author_facet | Haj Rachid, Maan Malluhi, Qutaibah |
author_sort | Haj Rachid, Maan |
collection | PubMed |
description | The evolution of the next generation sequencing technology increases the demand for efficient solutions, in terms of space and time, for several bioinformatics problems. This paper presents a practical and easy-to-implement solution for one of these problems, namely, the all-pairs suffix-prefix problem, using a compact prefix tree. The paper demonstrates an efficient construction of this time-efficient and space-economical tree data structure. The paper presents techniques for parallel implementations of the proposed solution. Experimental evaluation indicates superior results in terms of space and time over existing solutions. Results also show that the proposed technique is highly scalable in a parallel execution environment. |
format | Online Article Text |
id | pubmed-4417569 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-44175692015-05-10 A Practical and Scalable Tool to Find Overlaps between Sequences Haj Rachid, Maan Malluhi, Qutaibah Biomed Res Int Research Article The evolution of the next generation sequencing technology increases the demand for efficient solutions, in terms of space and time, for several bioinformatics problems. This paper presents a practical and easy-to-implement solution for one of these problems, namely, the all-pairs suffix-prefix problem, using a compact prefix tree. The paper demonstrates an efficient construction of this time-efficient and space-economical tree data structure. The paper presents techniques for parallel implementations of the proposed solution. Experimental evaluation indicates superior results in terms of space and time over existing solutions. Results also show that the proposed technique is highly scalable in a parallel execution environment. Hindawi Publishing Corporation 2015 2015-04-19 /pmc/articles/PMC4417569/ /pubmed/25961045 http://dx.doi.org/10.1155/2015/905261 Text en Copyright © 2015 M. Haj Rachid and Q. Malluhi. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Haj Rachid, Maan Malluhi, Qutaibah A Practical and Scalable Tool to Find Overlaps between Sequences |
title | A Practical and Scalable Tool to Find Overlaps between Sequences |
title_full | A Practical and Scalable Tool to Find Overlaps between Sequences |
title_fullStr | A Practical and Scalable Tool to Find Overlaps between Sequences |
title_full_unstemmed | A Practical and Scalable Tool to Find Overlaps between Sequences |
title_short | A Practical and Scalable Tool to Find Overlaps between Sequences |
title_sort | practical and scalable tool to find overlaps between sequences |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4417569/ https://www.ncbi.nlm.nih.gov/pubmed/25961045 http://dx.doi.org/10.1155/2015/905261 |
work_keys_str_mv | AT hajrachidmaan apracticalandscalabletooltofindoverlapsbetweensequences AT malluhiqutaibah apracticalandscalabletooltofindoverlapsbetweensequences AT hajrachidmaan practicalandscalabletooltofindoverlapsbetweensequences AT malluhiqutaibah practicalandscalabletooltofindoverlapsbetweensequences |