Cargando…

Chaos game representation and its applications in bioinformatics

Chaos game representation (CGR), a milestone in graphical bioinformatics, has become a powerful tool regarding alignment-free sequence comparison and feature encoding for machine learning. The algorithm maps a sequence to 2-dimensional space, while an extension of the CGR, the so-called frequency ma...

Descripción completa

Detalles Bibliográficos
Autores principales: Löchel, Hannah Franziska, Heider, Dominik
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8636998/
https://www.ncbi.nlm.nih.gov/pubmed/34900136
http://dx.doi.org/10.1016/j.csbj.2021.11.008
_version_ 1784608652479430656
author Löchel, Hannah Franziska
Heider, Dominik
author_facet Löchel, Hannah Franziska
Heider, Dominik
author_sort Löchel, Hannah Franziska
collection PubMed
description Chaos game representation (CGR), a milestone in graphical bioinformatics, has become a powerful tool regarding alignment-free sequence comparison and feature encoding for machine learning. The algorithm maps a sequence to 2-dimensional space, while an extension of the CGR, the so-called frequency matrix representation (FCGR), transforms sequences of different lengths into equal-sized images or matrices. The CGR is a generalized Markov chain and includes various properties, which allow a unique representation of a sequence. Therefore, it has a broad spectrum of applications in bioinformatics, such as sequence comparison and phylogenetic analysis and as an encoding of sequences for machine learning. This review introduces the construction of CGRs and FCGRs, their applications on DNA and proteins, and gives an overview of recent applications and progress in bioinformatics.
format Online
Article
Text
id pubmed-8636998
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-86369982021-12-09 Chaos game representation and its applications in bioinformatics Löchel, Hannah Franziska Heider, Dominik Comput Struct Biotechnol J Review Article Chaos game representation (CGR), a milestone in graphical bioinformatics, has become a powerful tool regarding alignment-free sequence comparison and feature encoding for machine learning. The algorithm maps a sequence to 2-dimensional space, while an extension of the CGR, the so-called frequency matrix representation (FCGR), transforms sequences of different lengths into equal-sized images or matrices. The CGR is a generalized Markov chain and includes various properties, which allow a unique representation of a sequence. Therefore, it has a broad spectrum of applications in bioinformatics, such as sequence comparison and phylogenetic analysis and as an encoding of sequences for machine learning. This review introduces the construction of CGRs and FCGRs, their applications on DNA and proteins, and gives an overview of recent applications and progress in bioinformatics. Research Network of Computational and Structural Biotechnology 2021-11-10 /pmc/articles/PMC8636998/ /pubmed/34900136 http://dx.doi.org/10.1016/j.csbj.2021.11.008 Text en © 2021 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Review Article
Löchel, Hannah Franziska
Heider, Dominik
Chaos game representation and its applications in bioinformatics
title Chaos game representation and its applications in bioinformatics
title_full Chaos game representation and its applications in bioinformatics
title_fullStr Chaos game representation and its applications in bioinformatics
title_full_unstemmed Chaos game representation and its applications in bioinformatics
title_short Chaos game representation and its applications in bioinformatics
title_sort chaos game representation and its applications in bioinformatics
topic Review Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8636998/
https://www.ncbi.nlm.nih.gov/pubmed/34900136
http://dx.doi.org/10.1016/j.csbj.2021.11.008
work_keys_str_mv AT lochelhannahfranziska chaosgamerepresentationanditsapplicationsinbioinformatics
AT heiderdominik chaosgamerepresentationanditsapplicationsinbioinformatics