Cargando…

A Practical and Scalable Tool to Find Overlaps between Sequences

The evolution of the next generation sequencing technology increases the demand for efficient solutions, in terms of space and time, for several bioinformatics problems. This paper presents a practical and easy-to-implement solution for one of these problems, namely, the all-pairs suffix-prefix prob...

Descripción completa

Detalles Bibliográficos
Autores principales: Haj Rachid, Maan, Malluhi, Qutaibah
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4417569/
https://www.ncbi.nlm.nih.gov/pubmed/25961045
http://dx.doi.org/10.1155/2015/905261
_version_ 1782369380668014592
author Haj Rachid, Maan
Malluhi, Qutaibah
author_facet Haj Rachid, Maan
Malluhi, Qutaibah
author_sort Haj Rachid, Maan
collection PubMed
description The evolution of the next generation sequencing technology increases the demand for efficient solutions, in terms of space and time, for several bioinformatics problems. This paper presents a practical and easy-to-implement solution for one of these problems, namely, the all-pairs suffix-prefix problem, using a compact prefix tree. The paper demonstrates an efficient construction of this time-efficient and space-economical tree data structure. The paper presents techniques for parallel implementations of the proposed solution. Experimental evaluation indicates superior results in terms of space and time over existing solutions. Results also show that the proposed technique is highly scalable in a parallel execution environment.
format Online
Article
Text
id pubmed-4417569
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-44175692015-05-10 A Practical and Scalable Tool to Find Overlaps between Sequences Haj Rachid, Maan Malluhi, Qutaibah Biomed Res Int Research Article The evolution of the next generation sequencing technology increases the demand for efficient solutions, in terms of space and time, for several bioinformatics problems. This paper presents a practical and easy-to-implement solution for one of these problems, namely, the all-pairs suffix-prefix problem, using a compact prefix tree. The paper demonstrates an efficient construction of this time-efficient and space-economical tree data structure. The paper presents techniques for parallel implementations of the proposed solution. Experimental evaluation indicates superior results in terms of space and time over existing solutions. Results also show that the proposed technique is highly scalable in a parallel execution environment. Hindawi Publishing Corporation 2015 2015-04-19 /pmc/articles/PMC4417569/ /pubmed/25961045 http://dx.doi.org/10.1155/2015/905261 Text en Copyright © 2015 M. Haj Rachid and Q. Malluhi. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Haj Rachid, Maan
Malluhi, Qutaibah
A Practical and Scalable Tool to Find Overlaps between Sequences
title A Practical and Scalable Tool to Find Overlaps between Sequences
title_full A Practical and Scalable Tool to Find Overlaps between Sequences
title_fullStr A Practical and Scalable Tool to Find Overlaps between Sequences
title_full_unstemmed A Practical and Scalable Tool to Find Overlaps between Sequences
title_short A Practical and Scalable Tool to Find Overlaps between Sequences
title_sort practical and scalable tool to find overlaps between sequences
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4417569/
https://www.ncbi.nlm.nih.gov/pubmed/25961045
http://dx.doi.org/10.1155/2015/905261
work_keys_str_mv AT hajrachidmaan apracticalandscalabletooltofindoverlapsbetweensequences
AT malluhiqutaibah apracticalandscalabletooltofindoverlapsbetweensequences
AT hajrachidmaan practicalandscalabletooltofindoverlapsbetweensequences
AT malluhiqutaibah practicalandscalabletooltofindoverlapsbetweensequences