Cargando…

The Maximal C(3) Self-Complementary Trinucleotide Circular Code X in Genes of Bacteria, Archaea, Eukaryotes, Plasmids and Viruses

In 1996, a set [Formula: see text] of 20 trinucleotides was identified in genes of both prokaryotes and eukaryotes which has on average the highest occurrence in reading frame compared to its two shifted frames. Furthermore, this set [Formula: see text] has an interesting mathematical property as [F...

Descripción completa

Detalles Bibliográficos
Autor principal: Michel, Christian J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5492142/
https://www.ncbi.nlm.nih.gov/pubmed/28420220
http://dx.doi.org/10.3390/life7020020
_version_ 1783247268549230592
author Michel, Christian J.
author_facet Michel, Christian J.
author_sort Michel, Christian J.
collection PubMed
description In 1996, a set [Formula: see text] of 20 trinucleotides was identified in genes of both prokaryotes and eukaryotes which has on average the highest occurrence in reading frame compared to its two shifted frames. Furthermore, this set [Formula: see text] has an interesting mathematical property as [Formula: see text] is a maximal [Formula: see text] self-complementary trinucleotide circular code. In 2015, by quantifying the inspection approach used in 1996, the circular code [Formula: see text] was confirmed in the genes of bacteria and eukaryotes and was also identified in the genes of plasmids and viruses. The method was based on the preferential occurrence of trinucleotides among the three frames at the gene population level. We extend here this definition at the gene level. This new statistical approach considers all the genes, i.e., of large and small lengths, with the same weight for searching the circular code [Formula: see text]. As a consequence, the concept of circular code, in particular the reading frame retrieval, is directly associated to each gene. At the gene level, the circular code [Formula: see text] is strengthened in the genes of bacteria, eukaryotes, plasmids, and viruses, and is now also identified in the genes of archaea. The genes of mitochondria and chloroplasts contain a subset of the circular code [Formula: see text]. Finally, by studying viral genes, the circular code [Formula: see text] was found in DNA genomes, RNA genomes, double-stranded genomes, and single-stranded genomes.
format Online
Article
Text
id pubmed-5492142
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-54921422017-07-03 The Maximal C(3) Self-Complementary Trinucleotide Circular Code X in Genes of Bacteria, Archaea, Eukaryotes, Plasmids and Viruses Michel, Christian J. Life (Basel) Article In 1996, a set [Formula: see text] of 20 trinucleotides was identified in genes of both prokaryotes and eukaryotes which has on average the highest occurrence in reading frame compared to its two shifted frames. Furthermore, this set [Formula: see text] has an interesting mathematical property as [Formula: see text] is a maximal [Formula: see text] self-complementary trinucleotide circular code. In 2015, by quantifying the inspection approach used in 1996, the circular code [Formula: see text] was confirmed in the genes of bacteria and eukaryotes and was also identified in the genes of plasmids and viruses. The method was based on the preferential occurrence of trinucleotides among the three frames at the gene population level. We extend here this definition at the gene level. This new statistical approach considers all the genes, i.e., of large and small lengths, with the same weight for searching the circular code [Formula: see text]. As a consequence, the concept of circular code, in particular the reading frame retrieval, is directly associated to each gene. At the gene level, the circular code [Formula: see text] is strengthened in the genes of bacteria, eukaryotes, plasmids, and viruses, and is now also identified in the genes of archaea. The genes of mitochondria and chloroplasts contain a subset of the circular code [Formula: see text]. Finally, by studying viral genes, the circular code [Formula: see text] was found in DNA genomes, RNA genomes, double-stranded genomes, and single-stranded genomes. MDPI 2017-04-18 /pmc/articles/PMC5492142/ /pubmed/28420220 http://dx.doi.org/10.3390/life7020020 Text en © 2017 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Michel, Christian J.
The Maximal C(3) Self-Complementary Trinucleotide Circular Code X in Genes of Bacteria, Archaea, Eukaryotes, Plasmids and Viruses
title The Maximal C(3) Self-Complementary Trinucleotide Circular Code X in Genes of Bacteria, Archaea, Eukaryotes, Plasmids and Viruses
title_full The Maximal C(3) Self-Complementary Trinucleotide Circular Code X in Genes of Bacteria, Archaea, Eukaryotes, Plasmids and Viruses
title_fullStr The Maximal C(3) Self-Complementary Trinucleotide Circular Code X in Genes of Bacteria, Archaea, Eukaryotes, Plasmids and Viruses
title_full_unstemmed The Maximal C(3) Self-Complementary Trinucleotide Circular Code X in Genes of Bacteria, Archaea, Eukaryotes, Plasmids and Viruses
title_short The Maximal C(3) Self-Complementary Trinucleotide Circular Code X in Genes of Bacteria, Archaea, Eukaryotes, Plasmids and Viruses
title_sort maximal c(3) self-complementary trinucleotide circular code x in genes of bacteria, archaea, eukaryotes, plasmids and viruses
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5492142/
https://www.ncbi.nlm.nih.gov/pubmed/28420220
http://dx.doi.org/10.3390/life7020020
work_keys_str_mv AT michelchristianj themaximalc3selfcomplementarytrinucleotidecircularcodexingenesofbacteriaarchaeaeukaryotesplasmidsandviruses
AT michelchristianj maximalc3selfcomplementarytrinucleotidecircularcodexingenesofbacteriaarchaeaeukaryotesplasmidsandviruses