Cargando…

Reading Frame Retrieval of Genes: A New Parameter of Codon Usage Based on the Circular Code Theory

Based on the circular code theory, we define a new function f that quantifies the property of reading frame retrieval (RFR) of genes from their codon usage. This RFR function f is computed on a massive scale in genes of genomes of bacteria, eukaryotes and archaea. By expressing f as a function of th...

Descripción completa

Detalles Bibliográficos
Autores principales: Michel, Christian J., Sereni, Jean-Sébastien
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9950712/
https://www.ncbi.nlm.nih.gov/pubmed/36826719
http://dx.doi.org/10.1007/s11538-023-01129-4
_version_ 1784893229947158528
author Michel, Christian J.
Sereni, Jean-Sébastien
author_facet Michel, Christian J.
Sereni, Jean-Sébastien
author_sort Michel, Christian J.
collection PubMed
description Based on the circular code theory, we define a new function f that quantifies the property of reading frame retrieval (RFR) of genes from their codon usage. This RFR function f is computed on a massive scale in genes of genomes of bacteria, eukaryotes and archaea. By expressing f as a function of the mean number [Formula: see text] of codons per gene, a “universal” property is identified, whatever the kingdom: the reading frame retrieval is enhanced in large genes. By investigating this property according to the theory developed, a Spearman’s rank correlation with a strong negative coefficient is observed between the codon usage dispersion d (from the uniform codon distribution [Formula: see text] ) and the RFR function f, whatever the kingdom (p-values [Formula: see text] in bacteria, [Formula: see text] in eukaryotes and [Formula: see text] in archaea). Thus, the reading frame retrieval is enhanced with the codon usage dispersion. Furthermore, this approach identifies a “genome centre” from which emerge two distinct “genome arms”: an upper arm and a lower arm, respectively, above and below the linear regression. The RFR function by itself or combined with classical methods (alignment, phylogeny) could also be a new approach to classify the genomes in the future.
format Online
Article
Text
id pubmed-9950712
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-99507122023-02-24 Reading Frame Retrieval of Genes: A New Parameter of Codon Usage Based on the Circular Code Theory Michel, Christian J. Sereni, Jean-Sébastien Bull Math Biol Original Article Based on the circular code theory, we define a new function f that quantifies the property of reading frame retrieval (RFR) of genes from their codon usage. This RFR function f is computed on a massive scale in genes of genomes of bacteria, eukaryotes and archaea. By expressing f as a function of the mean number [Formula: see text] of codons per gene, a “universal” property is identified, whatever the kingdom: the reading frame retrieval is enhanced in large genes. By investigating this property according to the theory developed, a Spearman’s rank correlation with a strong negative coefficient is observed between the codon usage dispersion d (from the uniform codon distribution [Formula: see text] ) and the RFR function f, whatever the kingdom (p-values [Formula: see text] in bacteria, [Formula: see text] in eukaryotes and [Formula: see text] in archaea). Thus, the reading frame retrieval is enhanced with the codon usage dispersion. Furthermore, this approach identifies a “genome centre” from which emerge two distinct “genome arms”: an upper arm and a lower arm, respectively, above and below the linear regression. The RFR function by itself or combined with classical methods (alignment, phylogeny) could also be a new approach to classify the genomes in the future. Springer US 2023-02-24 2023 /pmc/articles/PMC9950712/ /pubmed/36826719 http://dx.doi.org/10.1007/s11538-023-01129-4 Text en © The Author(s), under exclusive licence to Society for Mathematical Biology 2023, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Original Article
Michel, Christian J.
Sereni, Jean-Sébastien
Reading Frame Retrieval of Genes: A New Parameter of Codon Usage Based on the Circular Code Theory
title Reading Frame Retrieval of Genes: A New Parameter of Codon Usage Based on the Circular Code Theory
title_full Reading Frame Retrieval of Genes: A New Parameter of Codon Usage Based on the Circular Code Theory
title_fullStr Reading Frame Retrieval of Genes: A New Parameter of Codon Usage Based on the Circular Code Theory
title_full_unstemmed Reading Frame Retrieval of Genes: A New Parameter of Codon Usage Based on the Circular Code Theory
title_short Reading Frame Retrieval of Genes: A New Parameter of Codon Usage Based on the Circular Code Theory
title_sort reading frame retrieval of genes: a new parameter of codon usage based on the circular code theory
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9950712/
https://www.ncbi.nlm.nih.gov/pubmed/36826719
http://dx.doi.org/10.1007/s11538-023-01129-4
work_keys_str_mv AT michelchristianj readingframeretrievalofgenesanewparameterofcodonusagebasedonthecircularcodetheory
AT serenijeansebastien readingframeretrievalofgenesanewparameterofcodonusagebasedonthecircularcodetheory