Cargando…
Physics-Based Signal Analysis of Genome Sequences: An Overview of GenomeBits
A comprehensive overview of the recent physics-inspired genome analysis tool, GenomeBits, is presented. This is based on traditional signal processing methods such as discrete Fourier transform (DFT). GenomeBits can be used to extract underlying genomics features from the distribution of nucleotides...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10673239/ https://www.ncbi.nlm.nih.gov/pubmed/38004745 http://dx.doi.org/10.3390/microorganisms11112733 |
_version_ | 1785140575906824192 |
---|---|
author | Canessa, Enrique |
author_facet | Canessa, Enrique |
author_sort | Canessa, Enrique |
collection | PubMed |
description | A comprehensive overview of the recent physics-inspired genome analysis tool, GenomeBits, is presented. This is based on traditional signal processing methods such as discrete Fourier transform (DFT). GenomeBits can be used to extract underlying genomics features from the distribution of nucleotides, and can be further used to analyze the mutation patterns in viral genomes. Examples of the main GenomeBits findings outlining the intrinsic signal organization of genomics sequences for different SARS-CoV-2 variants along the pandemic years 2020–2022 and Monkeypox cases in 2021 are presented to show the usefulness of GenomeBits. GenomeBits results for DFT of SARS-CoV-2 genomes in different geographical regions are discussed, together with the GenomeBits analysis of complete genome sequences for the first coronavirus variants reported: Alpha, Beta, Gamma, Epsilon and Eta. Interesting features of the Delta and Omicron variants in the form of a unique ‘order–disorder’ transition are uncovered from these samples, as well as from their cumulative distribution function and scatter plots. This class of transitions might reveal the cumulative outcome of mutations on the spike protein. A salient feature of GenomeBits is the mapping of the nucleotide bases (A,T,C,G) into an alternating spin-like numerical sequence via a series having binary (0,1) indicators for each A,T,C,G. This leads to the derivation of a set of statistical distribution curves. Furthermore, the quantum-based extension of the GenomeBits model to an analogous probability measure is shown to identify properties of genome sequences as wavefunctions via a superposition of states. An association of the integral of the GenomeBits coding and a binding-like energy can, in principle, also be established. The relevance of these different results in bioinformatics is analyzed. |
format | Online Article Text |
id | pubmed-10673239 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-106732392023-11-09 Physics-Based Signal Analysis of Genome Sequences: An Overview of GenomeBits Canessa, Enrique Microorganisms Article A comprehensive overview of the recent physics-inspired genome analysis tool, GenomeBits, is presented. This is based on traditional signal processing methods such as discrete Fourier transform (DFT). GenomeBits can be used to extract underlying genomics features from the distribution of nucleotides, and can be further used to analyze the mutation patterns in viral genomes. Examples of the main GenomeBits findings outlining the intrinsic signal organization of genomics sequences for different SARS-CoV-2 variants along the pandemic years 2020–2022 and Monkeypox cases in 2021 are presented to show the usefulness of GenomeBits. GenomeBits results for DFT of SARS-CoV-2 genomes in different geographical regions are discussed, together with the GenomeBits analysis of complete genome sequences for the first coronavirus variants reported: Alpha, Beta, Gamma, Epsilon and Eta. Interesting features of the Delta and Omicron variants in the form of a unique ‘order–disorder’ transition are uncovered from these samples, as well as from their cumulative distribution function and scatter plots. This class of transitions might reveal the cumulative outcome of mutations on the spike protein. A salient feature of GenomeBits is the mapping of the nucleotide bases (A,T,C,G) into an alternating spin-like numerical sequence via a series having binary (0,1) indicators for each A,T,C,G. This leads to the derivation of a set of statistical distribution curves. Furthermore, the quantum-based extension of the GenomeBits model to an analogous probability measure is shown to identify properties of genome sequences as wavefunctions via a superposition of states. An association of the integral of the GenomeBits coding and a binding-like energy can, in principle, also be established. The relevance of these different results in bioinformatics is analyzed. MDPI 2023-11-09 /pmc/articles/PMC10673239/ /pubmed/38004745 http://dx.doi.org/10.3390/microorganisms11112733 Text en © 2023 by the author. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Canessa, Enrique Physics-Based Signal Analysis of Genome Sequences: An Overview of GenomeBits |
title | Physics-Based Signal Analysis of Genome Sequences: An Overview of GenomeBits |
title_full | Physics-Based Signal Analysis of Genome Sequences: An Overview of GenomeBits |
title_fullStr | Physics-Based Signal Analysis of Genome Sequences: An Overview of GenomeBits |
title_full_unstemmed | Physics-Based Signal Analysis of Genome Sequences: An Overview of GenomeBits |
title_short | Physics-Based Signal Analysis of Genome Sequences: An Overview of GenomeBits |
title_sort | physics-based signal analysis of genome sequences: an overview of genomebits |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10673239/ https://www.ncbi.nlm.nih.gov/pubmed/38004745 http://dx.doi.org/10.3390/microorganisms11112733 |
work_keys_str_mv | AT canessaenrique physicsbasedsignalanalysisofgenomesequencesanoverviewofgenomebits |