Cargando…

Biological sequences as pictures – a generic two dimensional solution for iterated maps

BACKGROUND: Representing symbolic sequences graphically using iterated maps has enjoyed an enduring popularity since it was first proposed in Jeffrey 1990 as chaos game representation (CGR). The usefulness of this representation goes beyond the convenience of a scale independent representation. It p...

Descripción completa

Detalles Bibliográficos
Autores principales: Almeida, Jonas S, Vinga, Susana
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2678093/
https://www.ncbi.nlm.nih.gov/pubmed/19335894
http://dx.doi.org/10.1186/1471-2105-10-100
_version_ 1782166815951028224
author Almeida, Jonas S
Vinga, Susana
author_facet Almeida, Jonas S
Vinga, Susana
author_sort Almeida, Jonas S
collection PubMed
description BACKGROUND: Representing symbolic sequences graphically using iterated maps has enjoyed an enduring popularity since it was first proposed in Jeffrey 1990 as chaos game representation (CGR). The usefulness of this representation goes beyond the convenience of a scale independent representation. It provides a variable memory length representation of transition. This includes the representation of succession with non-integer order, which comes with the promise of generalizing Markovian formalisms. The original proposal targeted genomic sequences only but since then several generalizations have been proposed, many specifically designed to handle protein data. RESULTS: The challenge of a general solution is that of deriving a bijective transformation of symbolic sequences into bi-dimensional planes. More specifically, it requires the regular fractal nesting of polygons. A first attempt at a general solution was proposed by Fiser 1994 by using non-overlapping circles that contain the polygons. This was used as a starting point to identify a more efficient solution where the encapsulating circles can overlap without the same happening for the sequence maps which are circumscribed to fractal polygon domains. CONCLUSION: We identified the optimal inscribed packing solution for iterated maps of any Biological sequence, indeed of any symbolic sequence. The new solution maintains the prized bijective mapping property and includes the Sierpinski triangle and the CGR square as particular solutions of the more encompassing formulation.
format Text
id pubmed-2678093
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26780932009-05-07 Biological sequences as pictures – a generic two dimensional solution for iterated maps Almeida, Jonas S Vinga, Susana BMC Bioinformatics Methodology Article BACKGROUND: Representing symbolic sequences graphically using iterated maps has enjoyed an enduring popularity since it was first proposed in Jeffrey 1990 as chaos game representation (CGR). The usefulness of this representation goes beyond the convenience of a scale independent representation. It provides a variable memory length representation of transition. This includes the representation of succession with non-integer order, which comes with the promise of generalizing Markovian formalisms. The original proposal targeted genomic sequences only but since then several generalizations have been proposed, many specifically designed to handle protein data. RESULTS: The challenge of a general solution is that of deriving a bijective transformation of symbolic sequences into bi-dimensional planes. More specifically, it requires the regular fractal nesting of polygons. A first attempt at a general solution was proposed by Fiser 1994 by using non-overlapping circles that contain the polygons. This was used as a starting point to identify a more efficient solution where the encapsulating circles can overlap without the same happening for the sequence maps which are circumscribed to fractal polygon domains. CONCLUSION: We identified the optimal inscribed packing solution for iterated maps of any Biological sequence, indeed of any symbolic sequence. The new solution maintains the prized bijective mapping property and includes the Sierpinski triangle and the CGR square as particular solutions of the more encompassing formulation. BioMed Central 2009-03-31 /pmc/articles/PMC2678093/ /pubmed/19335894 http://dx.doi.org/10.1186/1471-2105-10-100 Text en Copyright © 2009 Almeida and Vinga; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Almeida, Jonas S
Vinga, Susana
Biological sequences as pictures – a generic two dimensional solution for iterated maps
title Biological sequences as pictures – a generic two dimensional solution for iterated maps
title_full Biological sequences as pictures – a generic two dimensional solution for iterated maps
title_fullStr Biological sequences as pictures – a generic two dimensional solution for iterated maps
title_full_unstemmed Biological sequences as pictures – a generic two dimensional solution for iterated maps
title_short Biological sequences as pictures – a generic two dimensional solution for iterated maps
title_sort biological sequences as pictures – a generic two dimensional solution for iterated maps
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2678093/
https://www.ncbi.nlm.nih.gov/pubmed/19335894
http://dx.doi.org/10.1186/1471-2105-10-100
work_keys_str_mv AT almeidajonass biologicalsequencesaspicturesagenerictwodimensionalsolutionforiteratedmaps
AT vingasusana biologicalsequencesaspicturesagenerictwodimensionalsolutionforiteratedmaps