Cargando…

Physics-Based Signal Analysis of Genome Sequences: An Overview of GenomeBits

A comprehensive overview of the recent physics-inspired genome analysis tool, GenomeBits, is presented. This is based on traditional signal processing methods such as discrete Fourier transform (DFT). GenomeBits can be used to extract underlying genomics features from the distribution of nucleotides...

Descripción completa

Detalles Bibliográficos
Autor principal: Canessa, Enrique
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10673239/
https://www.ncbi.nlm.nih.gov/pubmed/38004745
http://dx.doi.org/10.3390/microorganisms11112733
_version_ 1785140575906824192
author Canessa, Enrique
author_facet Canessa, Enrique
author_sort Canessa, Enrique
collection PubMed
description A comprehensive overview of the recent physics-inspired genome analysis tool, GenomeBits, is presented. This is based on traditional signal processing methods such as discrete Fourier transform (DFT). GenomeBits can be used to extract underlying genomics features from the distribution of nucleotides, and can be further used to analyze the mutation patterns in viral genomes. Examples of the main GenomeBits findings outlining the intrinsic signal organization of genomics sequences for different SARS-CoV-2 variants along the pandemic years 2020–2022 and Monkeypox cases in 2021 are presented to show the usefulness of GenomeBits. GenomeBits results for DFT of SARS-CoV-2 genomes in different geographical regions are discussed, together with the GenomeBits analysis of complete genome sequences for the first coronavirus variants reported: Alpha, Beta, Gamma, Epsilon and Eta. Interesting features of the Delta and Omicron variants in the form of a unique ‘order–disorder’ transition are uncovered from these samples, as well as from their cumulative distribution function and scatter plots. This class of transitions might reveal the cumulative outcome of mutations on the spike protein. A salient feature of GenomeBits is the mapping of the nucleotide bases (A,T,C,G) into an alternating spin-like numerical sequence via a series having binary (0,1) indicators for each A,T,C,G. This leads to the derivation of a set of statistical distribution curves. Furthermore, the quantum-based extension of the GenomeBits model to an analogous probability measure is shown to identify properties of genome sequences as wavefunctions via a superposition of states. An association of the integral of the GenomeBits coding and a binding-like energy can, in principle, also be established. The relevance of these different results in bioinformatics is analyzed.
format Online
Article
Text
id pubmed-10673239
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-106732392023-11-09 Physics-Based Signal Analysis of Genome Sequences: An Overview of GenomeBits Canessa, Enrique Microorganisms Article A comprehensive overview of the recent physics-inspired genome analysis tool, GenomeBits, is presented. This is based on traditional signal processing methods such as discrete Fourier transform (DFT). GenomeBits can be used to extract underlying genomics features from the distribution of nucleotides, and can be further used to analyze the mutation patterns in viral genomes. Examples of the main GenomeBits findings outlining the intrinsic signal organization of genomics sequences for different SARS-CoV-2 variants along the pandemic years 2020–2022 and Monkeypox cases in 2021 are presented to show the usefulness of GenomeBits. GenomeBits results for DFT of SARS-CoV-2 genomes in different geographical regions are discussed, together with the GenomeBits analysis of complete genome sequences for the first coronavirus variants reported: Alpha, Beta, Gamma, Epsilon and Eta. Interesting features of the Delta and Omicron variants in the form of a unique ‘order–disorder’ transition are uncovered from these samples, as well as from their cumulative distribution function and scatter plots. This class of transitions might reveal the cumulative outcome of mutations on the spike protein. A salient feature of GenomeBits is the mapping of the nucleotide bases (A,T,C,G) into an alternating spin-like numerical sequence via a series having binary (0,1) indicators for each A,T,C,G. This leads to the derivation of a set of statistical distribution curves. Furthermore, the quantum-based extension of the GenomeBits model to an analogous probability measure is shown to identify properties of genome sequences as wavefunctions via a superposition of states. An association of the integral of the GenomeBits coding and a binding-like energy can, in principle, also be established. The relevance of these different results in bioinformatics is analyzed. MDPI 2023-11-09 /pmc/articles/PMC10673239/ /pubmed/38004745 http://dx.doi.org/10.3390/microorganisms11112733 Text en © 2023 by the author. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Canessa, Enrique
Physics-Based Signal Analysis of Genome Sequences: An Overview of GenomeBits
title Physics-Based Signal Analysis of Genome Sequences: An Overview of GenomeBits
title_full Physics-Based Signal Analysis of Genome Sequences: An Overview of GenomeBits
title_fullStr Physics-Based Signal Analysis of Genome Sequences: An Overview of GenomeBits
title_full_unstemmed Physics-Based Signal Analysis of Genome Sequences: An Overview of GenomeBits
title_short Physics-Based Signal Analysis of Genome Sequences: An Overview of GenomeBits
title_sort physics-based signal analysis of genome sequences: an overview of genomebits
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10673239/
https://www.ncbi.nlm.nih.gov/pubmed/38004745
http://dx.doi.org/10.3390/microorganisms11112733
work_keys_str_mv AT canessaenrique physicsbasedsignalanalysisofgenomesequencesanoverviewofgenomebits