Cargando…
Chaos game representation and its applications in bioinformatics
Chaos game representation (CGR), a milestone in graphical bioinformatics, has become a powerful tool regarding alignment-free sequence comparison and feature encoding for machine learning. The algorithm maps a sequence to 2-dimensional space, while an extension of the CGR, the so-called frequency ma...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Research Network of Computational and Structural Biotechnology
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8636998/ https://www.ncbi.nlm.nih.gov/pubmed/34900136 http://dx.doi.org/10.1016/j.csbj.2021.11.008 |
_version_ | 1784608652479430656 |
---|---|
author | Löchel, Hannah Franziska Heider, Dominik |
author_facet | Löchel, Hannah Franziska Heider, Dominik |
author_sort | Löchel, Hannah Franziska |
collection | PubMed |
description | Chaos game representation (CGR), a milestone in graphical bioinformatics, has become a powerful tool regarding alignment-free sequence comparison and feature encoding for machine learning. The algorithm maps a sequence to 2-dimensional space, while an extension of the CGR, the so-called frequency matrix representation (FCGR), transforms sequences of different lengths into equal-sized images or matrices. The CGR is a generalized Markov chain and includes various properties, which allow a unique representation of a sequence. Therefore, it has a broad spectrum of applications in bioinformatics, such as sequence comparison and phylogenetic analysis and as an encoding of sequences for machine learning. This review introduces the construction of CGRs and FCGRs, their applications on DNA and proteins, and gives an overview of recent applications and progress in bioinformatics. |
format | Online Article Text |
id | pubmed-8636998 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Research Network of Computational and Structural Biotechnology |
record_format | MEDLINE/PubMed |
spelling | pubmed-86369982021-12-09 Chaos game representation and its applications in bioinformatics Löchel, Hannah Franziska Heider, Dominik Comput Struct Biotechnol J Review Article Chaos game representation (CGR), a milestone in graphical bioinformatics, has become a powerful tool regarding alignment-free sequence comparison and feature encoding for machine learning. The algorithm maps a sequence to 2-dimensional space, while an extension of the CGR, the so-called frequency matrix representation (FCGR), transforms sequences of different lengths into equal-sized images or matrices. The CGR is a generalized Markov chain and includes various properties, which allow a unique representation of a sequence. Therefore, it has a broad spectrum of applications in bioinformatics, such as sequence comparison and phylogenetic analysis and as an encoding of sequences for machine learning. This review introduces the construction of CGRs and FCGRs, their applications on DNA and proteins, and gives an overview of recent applications and progress in bioinformatics. Research Network of Computational and Structural Biotechnology 2021-11-10 /pmc/articles/PMC8636998/ /pubmed/34900136 http://dx.doi.org/10.1016/j.csbj.2021.11.008 Text en © 2021 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Review Article Löchel, Hannah Franziska Heider, Dominik Chaos game representation and its applications in bioinformatics |
title | Chaos game representation and its applications in bioinformatics |
title_full | Chaos game representation and its applications in bioinformatics |
title_fullStr | Chaos game representation and its applications in bioinformatics |
title_full_unstemmed | Chaos game representation and its applications in bioinformatics |
title_short | Chaos game representation and its applications in bioinformatics |
title_sort | chaos game representation and its applications in bioinformatics |
topic | Review Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8636998/ https://www.ncbi.nlm.nih.gov/pubmed/34900136 http://dx.doi.org/10.1016/j.csbj.2021.11.008 |
work_keys_str_mv | AT lochelhannahfranziska chaosgamerepresentationanditsapplicationsinbioinformatics AT heiderdominik chaosgamerepresentationanditsapplicationsinbioinformatics |