Cargando…

Efficient Boolean implementation of universal sequence maps (bUSM)

BACKGROUND: Recently, Almeida and Vinga offered a new approach for the representation of arbitrary discrete sequences, referred to as Universal Sequence Maps (USM), and discussed its applicability to genomic sequence analysis. Their work generalizes and extends Chaos Game Representation (CGR) of DNA...

Descripción completa

Detalles Bibliográficos
Autores principales: Schwacke, John, Almeida, Jonas S
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2002
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC137598/
https://www.ncbi.nlm.nih.gov/pubmed/12387731
http://dx.doi.org/10.1186/1471-2105-3-28
_version_ 1782120455742685184
author Schwacke, John
Almeida, Jonas S
author_facet Schwacke, John
Almeida, Jonas S
author_sort Schwacke, John
collection PubMed
description BACKGROUND: Recently, Almeida and Vinga offered a new approach for the representation of arbitrary discrete sequences, referred to as Universal Sequence Maps (USM), and discussed its applicability to genomic sequence analysis. Their work generalizes and extends Chaos Game Representation (CGR) of DNA for arbitrary discrete sequences. RESULTS: We have considered issues associated with the practical implementation of USMs and offer a variation on the algorithm that: 1) eliminates the overestimation of similar segment lengths, 2) permits the identification of arbitrarily long similar segments in the context of finite word length coordinate representations, 3) uses more computationally efficient operations, and 4) provides a simple conversion for recovering the USM coordinates. Computational performance comparisons and examples are provided. CONCLUSIONS: We have shown that the desirable properties of the USM encoding of nucleotide sequences can be retained in a practical implementation of the algorithm. In addition, the proposed implementation enables determination of local sequence identity at increased speed.
format Text
id pubmed-137598
institution National Center for Biotechnology Information
language English
publishDate 2002
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-1375982002-12-08 Efficient Boolean implementation of universal sequence maps (bUSM) Schwacke, John Almeida, Jonas S BMC Bioinformatics Methodology article BACKGROUND: Recently, Almeida and Vinga offered a new approach for the representation of arbitrary discrete sequences, referred to as Universal Sequence Maps (USM), and discussed its applicability to genomic sequence analysis. Their work generalizes and extends Chaos Game Representation (CGR) of DNA for arbitrary discrete sequences. RESULTS: We have considered issues associated with the practical implementation of USMs and offer a variation on the algorithm that: 1) eliminates the overestimation of similar segment lengths, 2) permits the identification of arbitrarily long similar segments in the context of finite word length coordinate representations, 3) uses more computationally efficient operations, and 4) provides a simple conversion for recovering the USM coordinates. Computational performance comparisons and examples are provided. CONCLUSIONS: We have shown that the desirable properties of the USM encoding of nucleotide sequences can be retained in a practical implementation of the algorithm. In addition, the proposed implementation enables determination of local sequence identity at increased speed. BioMed Central 2002-10-21 /pmc/articles/PMC137598/ /pubmed/12387731 http://dx.doi.org/10.1186/1471-2105-3-28 Text en Copyright ©2002 Schwacke and Almeida; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.
spellingShingle Methodology article
Schwacke, John
Almeida, Jonas S
Efficient Boolean implementation of universal sequence maps (bUSM)
title Efficient Boolean implementation of universal sequence maps (bUSM)
title_full Efficient Boolean implementation of universal sequence maps (bUSM)
title_fullStr Efficient Boolean implementation of universal sequence maps (bUSM)
title_full_unstemmed Efficient Boolean implementation of universal sequence maps (bUSM)
title_short Efficient Boolean implementation of universal sequence maps (bUSM)
title_sort efficient boolean implementation of universal sequence maps (busm)
topic Methodology article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC137598/
https://www.ncbi.nlm.nih.gov/pubmed/12387731
http://dx.doi.org/10.1186/1471-2105-3-28
work_keys_str_mv AT schwackejohn efficientbooleanimplementationofuniversalsequencemapsbusm
AT almeidajonass efficientbooleanimplementationofuniversalsequencemapsbusm