Cargando…
The Maximal C(3) Self-Complementary Trinucleotide Circular Code X in Genes of Bacteria, Archaea, Eukaryotes, Plasmids and Viruses
In 1996, a set [Formula: see text] of 20 trinucleotides was identified in genes of both prokaryotes and eukaryotes which has on average the highest occurrence in reading frame compared to its two shifted frames. Furthermore, this set [Formula: see text] has an interesting mathematical property as [F...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5492142/ https://www.ncbi.nlm.nih.gov/pubmed/28420220 http://dx.doi.org/10.3390/life7020020 |
_version_ | 1783247268549230592 |
---|---|
author | Michel, Christian J. |
author_facet | Michel, Christian J. |
author_sort | Michel, Christian J. |
collection | PubMed |
description | In 1996, a set [Formula: see text] of 20 trinucleotides was identified in genes of both prokaryotes and eukaryotes which has on average the highest occurrence in reading frame compared to its two shifted frames. Furthermore, this set [Formula: see text] has an interesting mathematical property as [Formula: see text] is a maximal [Formula: see text] self-complementary trinucleotide circular code. In 2015, by quantifying the inspection approach used in 1996, the circular code [Formula: see text] was confirmed in the genes of bacteria and eukaryotes and was also identified in the genes of plasmids and viruses. The method was based on the preferential occurrence of trinucleotides among the three frames at the gene population level. We extend here this definition at the gene level. This new statistical approach considers all the genes, i.e., of large and small lengths, with the same weight for searching the circular code [Formula: see text]. As a consequence, the concept of circular code, in particular the reading frame retrieval, is directly associated to each gene. At the gene level, the circular code [Formula: see text] is strengthened in the genes of bacteria, eukaryotes, plasmids, and viruses, and is now also identified in the genes of archaea. The genes of mitochondria and chloroplasts contain a subset of the circular code [Formula: see text]. Finally, by studying viral genes, the circular code [Formula: see text] was found in DNA genomes, RNA genomes, double-stranded genomes, and single-stranded genomes. |
format | Online Article Text |
id | pubmed-5492142 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-54921422017-07-03 The Maximal C(3) Self-Complementary Trinucleotide Circular Code X in Genes of Bacteria, Archaea, Eukaryotes, Plasmids and Viruses Michel, Christian J. Life (Basel) Article In 1996, a set [Formula: see text] of 20 trinucleotides was identified in genes of both prokaryotes and eukaryotes which has on average the highest occurrence in reading frame compared to its two shifted frames. Furthermore, this set [Formula: see text] has an interesting mathematical property as [Formula: see text] is a maximal [Formula: see text] self-complementary trinucleotide circular code. In 2015, by quantifying the inspection approach used in 1996, the circular code [Formula: see text] was confirmed in the genes of bacteria and eukaryotes and was also identified in the genes of plasmids and viruses. The method was based on the preferential occurrence of trinucleotides among the three frames at the gene population level. We extend here this definition at the gene level. This new statistical approach considers all the genes, i.e., of large and small lengths, with the same weight for searching the circular code [Formula: see text]. As a consequence, the concept of circular code, in particular the reading frame retrieval, is directly associated to each gene. At the gene level, the circular code [Formula: see text] is strengthened in the genes of bacteria, eukaryotes, plasmids, and viruses, and is now also identified in the genes of archaea. The genes of mitochondria and chloroplasts contain a subset of the circular code [Formula: see text]. Finally, by studying viral genes, the circular code [Formula: see text] was found in DNA genomes, RNA genomes, double-stranded genomes, and single-stranded genomes. MDPI 2017-04-18 /pmc/articles/PMC5492142/ /pubmed/28420220 http://dx.doi.org/10.3390/life7020020 Text en © 2017 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Michel, Christian J. The Maximal C(3) Self-Complementary Trinucleotide Circular Code X in Genes of Bacteria, Archaea, Eukaryotes, Plasmids and Viruses |
title | The Maximal C(3) Self-Complementary Trinucleotide Circular Code X in Genes of Bacteria, Archaea, Eukaryotes, Plasmids and Viruses |
title_full | The Maximal C(3) Self-Complementary Trinucleotide Circular Code X in Genes of Bacteria, Archaea, Eukaryotes, Plasmids and Viruses |
title_fullStr | The Maximal C(3) Self-Complementary Trinucleotide Circular Code X in Genes of Bacteria, Archaea, Eukaryotes, Plasmids and Viruses |
title_full_unstemmed | The Maximal C(3) Self-Complementary Trinucleotide Circular Code X in Genes of Bacteria, Archaea, Eukaryotes, Plasmids and Viruses |
title_short | The Maximal C(3) Self-Complementary Trinucleotide Circular Code X in Genes of Bacteria, Archaea, Eukaryotes, Plasmids and Viruses |
title_sort | maximal c(3) self-complementary trinucleotide circular code x in genes of bacteria, archaea, eukaryotes, plasmids and viruses |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5492142/ https://www.ncbi.nlm.nih.gov/pubmed/28420220 http://dx.doi.org/10.3390/life7020020 |
work_keys_str_mv | AT michelchristianj themaximalc3selfcomplementarytrinucleotidecircularcodexingenesofbacteriaarchaeaeukaryotesplasmidsandviruses AT michelchristianj maximalc3selfcomplementarytrinucleotidecircularcodexingenesofbacteriaarchaeaeukaryotesplasmidsandviruses |