Cargando…
Gaussian-Distributed Codon Frequencies of Genomes
DNA encodes protein primary structure using 64 different codons to specify 20 different amino acids and a stop signal. Frequencies of codon occurrence when ordered in descending sequence provide a global characterization of a genome’s preference (bias) for using the different codons of the redundant...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Genetics Society of America
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6505138/ https://www.ncbi.nlm.nih.gov/pubmed/30808691 http://dx.doi.org/10.1534/g3.118.200939 |
_version_ | 1783416698090553344 |
---|---|
author | Khomtchouk, Bohdan B. Nonner, Wolfgang |
author_facet | Khomtchouk, Bohdan B. Nonner, Wolfgang |
author_sort | Khomtchouk, Bohdan B. |
collection | PubMed |
description | DNA encodes protein primary structure using 64 different codons to specify 20 different amino acids and a stop signal. Frequencies of codon occurrence when ordered in descending sequence provide a global characterization of a genome’s preference (bias) for using the different codons of the redundant genetic code. Whereas frequency/rank relations have been described by empirical expressions, here we propose a statistical model in which two different forms of codon usage co-exist in a genome. We investigate whether such a model can account for the range of codon usages observed in a large set of genomes from different taxa. The differences in frequency/rank relations across these genomes can be expressed in a single parameter, the proportion of the two codon compartments. One compartment uses different codons with weak bias according to a Gaussian distribution of frequency, the other uses different codons with strong bias. In prokaryotic genomes both compartments appear to be present in a wide range of proportions, whereas in eukaryotic genomes the compartment with Gaussian distribution tends to dominate. Codon frequencies that are Gaussian-distributed suggest that many evolutionary conditions are involved in shaping weakly-biased codon usage, whereas strong bias in codon usage suggests dominance of few evolutionary conditions. |
format | Online Article Text |
id | pubmed-6505138 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Genetics Society of America |
record_format | MEDLINE/PubMed |
spelling | pubmed-65051382019-05-21 Gaussian-Distributed Codon Frequencies of Genomes Khomtchouk, Bohdan B. Nonner, Wolfgang G3 (Bethesda) Investigations DNA encodes protein primary structure using 64 different codons to specify 20 different amino acids and a stop signal. Frequencies of codon occurrence when ordered in descending sequence provide a global characterization of a genome’s preference (bias) for using the different codons of the redundant genetic code. Whereas frequency/rank relations have been described by empirical expressions, here we propose a statistical model in which two different forms of codon usage co-exist in a genome. We investigate whether such a model can account for the range of codon usages observed in a large set of genomes from different taxa. The differences in frequency/rank relations across these genomes can be expressed in a single parameter, the proportion of the two codon compartments. One compartment uses different codons with weak bias according to a Gaussian distribution of frequency, the other uses different codons with strong bias. In prokaryotic genomes both compartments appear to be present in a wide range of proportions, whereas in eukaryotic genomes the compartment with Gaussian distribution tends to dominate. Codon frequencies that are Gaussian-distributed suggest that many evolutionary conditions are involved in shaping weakly-biased codon usage, whereas strong bias in codon usage suggests dominance of few evolutionary conditions. Genetics Society of America 2019-02-26 /pmc/articles/PMC6505138/ /pubmed/30808691 http://dx.doi.org/10.1534/g3.118.200939 Text en Copyright © 2019 Khomtchouk, Nonner http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Investigations Khomtchouk, Bohdan B. Nonner, Wolfgang Gaussian-Distributed Codon Frequencies of Genomes |
title | Gaussian-Distributed Codon Frequencies of Genomes |
title_full | Gaussian-Distributed Codon Frequencies of Genomes |
title_fullStr | Gaussian-Distributed Codon Frequencies of Genomes |
title_full_unstemmed | Gaussian-Distributed Codon Frequencies of Genomes |
title_short | Gaussian-Distributed Codon Frequencies of Genomes |
title_sort | gaussian-distributed codon frequencies of genomes |
topic | Investigations |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6505138/ https://www.ncbi.nlm.nih.gov/pubmed/30808691 http://dx.doi.org/10.1534/g3.118.200939 |
work_keys_str_mv | AT khomtchoukbohdanb gaussiandistributedcodonfrequenciesofgenomes AT nonnerwolfgang gaussiandistributedcodonfrequenciesofgenomes |