Cargando…

Gaussian-Distributed Codon Frequencies of Genomes

DNA encodes protein primary structure using 64 different codons to specify 20 different amino acids and a stop signal. Frequencies of codon occurrence when ordered in descending sequence provide a global characterization of a genome’s preference (bias) for using the different codons of the redundant...

Descripción completa

Detalles Bibliográficos
Autores principales: Khomtchouk, Bohdan B., Nonner, Wolfgang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6505138/
https://www.ncbi.nlm.nih.gov/pubmed/30808691
http://dx.doi.org/10.1534/g3.118.200939
_version_ 1783416698090553344
author Khomtchouk, Bohdan B.
Nonner, Wolfgang
author_facet Khomtchouk, Bohdan B.
Nonner, Wolfgang
author_sort Khomtchouk, Bohdan B.
collection PubMed
description DNA encodes protein primary structure using 64 different codons to specify 20 different amino acids and a stop signal. Frequencies of codon occurrence when ordered in descending sequence provide a global characterization of a genome’s preference (bias) for using the different codons of the redundant genetic code. Whereas frequency/rank relations have been described by empirical expressions, here we propose a statistical model in which two different forms of codon usage co-exist in a genome. We investigate whether such a model can account for the range of codon usages observed in a large set of genomes from different taxa. The differences in frequency/rank relations across these genomes can be expressed in a single parameter, the proportion of the two codon compartments. One compartment uses different codons with weak bias according to a Gaussian distribution of frequency, the other uses different codons with strong bias. In prokaryotic genomes both compartments appear to be present in a wide range of proportions, whereas in eukaryotic genomes the compartment with Gaussian distribution tends to dominate. Codon frequencies that are Gaussian-distributed suggest that many evolutionary conditions are involved in shaping weakly-biased codon usage, whereas strong bias in codon usage suggests dominance of few evolutionary conditions.
format Online
Article
Text
id pubmed-6505138
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-65051382019-05-21 Gaussian-Distributed Codon Frequencies of Genomes Khomtchouk, Bohdan B. Nonner, Wolfgang G3 (Bethesda) Investigations DNA encodes protein primary structure using 64 different codons to specify 20 different amino acids and a stop signal. Frequencies of codon occurrence when ordered in descending sequence provide a global characterization of a genome’s preference (bias) for using the different codons of the redundant genetic code. Whereas frequency/rank relations have been described by empirical expressions, here we propose a statistical model in which two different forms of codon usage co-exist in a genome. We investigate whether such a model can account for the range of codon usages observed in a large set of genomes from different taxa. The differences in frequency/rank relations across these genomes can be expressed in a single parameter, the proportion of the two codon compartments. One compartment uses different codons with weak bias according to a Gaussian distribution of frequency, the other uses different codons with strong bias. In prokaryotic genomes both compartments appear to be present in a wide range of proportions, whereas in eukaryotic genomes the compartment with Gaussian distribution tends to dominate. Codon frequencies that are Gaussian-distributed suggest that many evolutionary conditions are involved in shaping weakly-biased codon usage, whereas strong bias in codon usage suggests dominance of few evolutionary conditions. Genetics Society of America 2019-02-26 /pmc/articles/PMC6505138/ /pubmed/30808691 http://dx.doi.org/10.1534/g3.118.200939 Text en Copyright © 2019 Khomtchouk, Nonner http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Investigations
Khomtchouk, Bohdan B.
Nonner, Wolfgang
Gaussian-Distributed Codon Frequencies of Genomes
title Gaussian-Distributed Codon Frequencies of Genomes
title_full Gaussian-Distributed Codon Frequencies of Genomes
title_fullStr Gaussian-Distributed Codon Frequencies of Genomes
title_full_unstemmed Gaussian-Distributed Codon Frequencies of Genomes
title_short Gaussian-Distributed Codon Frequencies of Genomes
title_sort gaussian-distributed codon frequencies of genomes
topic Investigations
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6505138/
https://www.ncbi.nlm.nih.gov/pubmed/30808691
http://dx.doi.org/10.1534/g3.118.200939
work_keys_str_mv AT khomtchoukbohdanb gaussiandistributedcodonfrequenciesofgenomes
AT nonnerwolfgang gaussiandistributedcodonfrequenciesofgenomes