Cargando…
Reading Frame Retrieval of Genes: A New Parameter of Codon Usage Based on the Circular Code Theory
Based on the circular code theory, we define a new function f that quantifies the property of reading frame retrieval (RFR) of genes from their codon usage. This RFR function f is computed on a massive scale in genes of genomes of bacteria, eukaryotes and archaea. By expressing f as a function of th...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer US
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9950712/ https://www.ncbi.nlm.nih.gov/pubmed/36826719 http://dx.doi.org/10.1007/s11538-023-01129-4 |
_version_ | 1784893229947158528 |
---|---|
author | Michel, Christian J. Sereni, Jean-Sébastien |
author_facet | Michel, Christian J. Sereni, Jean-Sébastien |
author_sort | Michel, Christian J. |
collection | PubMed |
description | Based on the circular code theory, we define a new function f that quantifies the property of reading frame retrieval (RFR) of genes from their codon usage. This RFR function f is computed on a massive scale in genes of genomes of bacteria, eukaryotes and archaea. By expressing f as a function of the mean number [Formula: see text] of codons per gene, a “universal” property is identified, whatever the kingdom: the reading frame retrieval is enhanced in large genes. By investigating this property according to the theory developed, a Spearman’s rank correlation with a strong negative coefficient is observed between the codon usage dispersion d (from the uniform codon distribution [Formula: see text] ) and the RFR function f, whatever the kingdom (p-values [Formula: see text] in bacteria, [Formula: see text] in eukaryotes and [Formula: see text] in archaea). Thus, the reading frame retrieval is enhanced with the codon usage dispersion. Furthermore, this approach identifies a “genome centre” from which emerge two distinct “genome arms”: an upper arm and a lower arm, respectively, above and below the linear regression. The RFR function by itself or combined with classical methods (alignment, phylogeny) could also be a new approach to classify the genomes in the future. |
format | Online Article Text |
id | pubmed-9950712 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Springer US |
record_format | MEDLINE/PubMed |
spelling | pubmed-99507122023-02-24 Reading Frame Retrieval of Genes: A New Parameter of Codon Usage Based on the Circular Code Theory Michel, Christian J. Sereni, Jean-Sébastien Bull Math Biol Original Article Based on the circular code theory, we define a new function f that quantifies the property of reading frame retrieval (RFR) of genes from their codon usage. This RFR function f is computed on a massive scale in genes of genomes of bacteria, eukaryotes and archaea. By expressing f as a function of the mean number [Formula: see text] of codons per gene, a “universal” property is identified, whatever the kingdom: the reading frame retrieval is enhanced in large genes. By investigating this property according to the theory developed, a Spearman’s rank correlation with a strong negative coefficient is observed between the codon usage dispersion d (from the uniform codon distribution [Formula: see text] ) and the RFR function f, whatever the kingdom (p-values [Formula: see text] in bacteria, [Formula: see text] in eukaryotes and [Formula: see text] in archaea). Thus, the reading frame retrieval is enhanced with the codon usage dispersion. Furthermore, this approach identifies a “genome centre” from which emerge two distinct “genome arms”: an upper arm and a lower arm, respectively, above and below the linear regression. The RFR function by itself or combined with classical methods (alignment, phylogeny) could also be a new approach to classify the genomes in the future. Springer US 2023-02-24 2023 /pmc/articles/PMC9950712/ /pubmed/36826719 http://dx.doi.org/10.1007/s11538-023-01129-4 Text en © The Author(s), under exclusive licence to Society for Mathematical Biology 2023, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Original Article Michel, Christian J. Sereni, Jean-Sébastien Reading Frame Retrieval of Genes: A New Parameter of Codon Usage Based on the Circular Code Theory |
title | Reading Frame Retrieval of Genes: A New Parameter of Codon Usage Based on the Circular Code Theory |
title_full | Reading Frame Retrieval of Genes: A New Parameter of Codon Usage Based on the Circular Code Theory |
title_fullStr | Reading Frame Retrieval of Genes: A New Parameter of Codon Usage Based on the Circular Code Theory |
title_full_unstemmed | Reading Frame Retrieval of Genes: A New Parameter of Codon Usage Based on the Circular Code Theory |
title_short | Reading Frame Retrieval of Genes: A New Parameter of Codon Usage Based on the Circular Code Theory |
title_sort | reading frame retrieval of genes: a new parameter of codon usage based on the circular code theory |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9950712/ https://www.ncbi.nlm.nih.gov/pubmed/36826719 http://dx.doi.org/10.1007/s11538-023-01129-4 |
work_keys_str_mv | AT michelchristianj readingframeretrievalofgenesanewparameterofcodonusagebasedonthecircularcodetheory AT serenijeansebastien readingframeretrievalofgenesanewparameterofcodonusagebasedonthecircularcodetheory |