Cargando…

Characterizing the Native Codon Usages of a Genome: An Axis Projection Approach

Codon usage can provide insights into the nature of the genes in a genome. Genes that are “native” to a genome (have not been recently acquired by horizontal transfer) range in codon usage from a low-bias “typical” usage to a more biased “high-expression” usage characteristic of genes encoding abund...

Descripción completa

Detalles Bibliográficos
Autores principales: Davis, James J., Olsen, Gary J.
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3002238/
https://www.ncbi.nlm.nih.gov/pubmed/20679093
http://dx.doi.org/10.1093/molbev/msq185
_version_ 1782193714178818048
author Davis, James J.
Olsen, Gary J.
author_facet Davis, James J.
Olsen, Gary J.
author_sort Davis, James J.
collection PubMed
description Codon usage can provide insights into the nature of the genes in a genome. Genes that are “native” to a genome (have not been recently acquired by horizontal transfer) range in codon usage from a low-bias “typical” usage to a more biased “high-expression” usage characteristic of genes encoding abundant proteins. Genes that differ from these native codon usages are candidates for foreign genes that have been recently acquired by horizontal gene transfer. In this study, we present a method for characterizing the codon usages of native genes—both typical and highly expressed—within a genome. Each gene is evaluated relative to a half line (or axis) in a 59D space of codon usage. The axis begins at the modal codon usage, the usage that matches the largest number of genes in the genome, and it passes through a point representing the codon usage of a set of genes with expression-related bias. A gene whose codon usage matches (does not significantly differ from) a point on this axis is a candidate native gene, and the location of its projection onto the axis provides a general estimate of its expression level. A gene that differs significantly from all points on the axis is a candidate foreign gene. This automated approach offers significant improvements over existing methods. We illustrate this by analyzing the genomes of Pseudomonas aeruginosa PAO1 and Bacillus anthracis A0248, which can be difficult to analyze with commonly used methods due to their biased base compositions. Finally, we use this approach to measure the proportion of candidate foreign genes in 923 bacterial and archaeal genomes. The organisms with the most homogeneous genomes (containing the fewest candidate foreign genes) are mostly endosymbionts and parasites, though with exceptions that include Pelagibacter ubique and Beutenbergia cavernae. The organisms with the most heterogeneous genomes (containing the most candidate foreign genes) include members of the genera Bacteroides, Corynebacterium, Desulfotalea, Neisseria, Xylella, and Thermobaculum.
format Text
id pubmed-3002238
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-30022382010-12-15 Characterizing the Native Codon Usages of a Genome: An Axis Projection Approach Davis, James J. Olsen, Gary J. Mol Biol Evol Research Articles Codon usage can provide insights into the nature of the genes in a genome. Genes that are “native” to a genome (have not been recently acquired by horizontal transfer) range in codon usage from a low-bias “typical” usage to a more biased “high-expression” usage characteristic of genes encoding abundant proteins. Genes that differ from these native codon usages are candidates for foreign genes that have been recently acquired by horizontal gene transfer. In this study, we present a method for characterizing the codon usages of native genes—both typical and highly expressed—within a genome. Each gene is evaluated relative to a half line (or axis) in a 59D space of codon usage. The axis begins at the modal codon usage, the usage that matches the largest number of genes in the genome, and it passes through a point representing the codon usage of a set of genes with expression-related bias. A gene whose codon usage matches (does not significantly differ from) a point on this axis is a candidate native gene, and the location of its projection onto the axis provides a general estimate of its expression level. A gene that differs significantly from all points on the axis is a candidate foreign gene. This automated approach offers significant improvements over existing methods. We illustrate this by analyzing the genomes of Pseudomonas aeruginosa PAO1 and Bacillus anthracis A0248, which can be difficult to analyze with commonly used methods due to their biased base compositions. Finally, we use this approach to measure the proportion of candidate foreign genes in 923 bacterial and archaeal genomes. The organisms with the most homogeneous genomes (containing the fewest candidate foreign genes) are mostly endosymbionts and parasites, though with exceptions that include Pelagibacter ubique and Beutenbergia cavernae. The organisms with the most heterogeneous genomes (containing the most candidate foreign genes) include members of the genera Bacteroides, Corynebacterium, Desulfotalea, Neisseria, Xylella, and Thermobaculum. Oxford University Press 2011-01 2010-08-02 /pmc/articles/PMC3002238/ /pubmed/20679093 http://dx.doi.org/10.1093/molbev/msq185 Text en © The Author(s) 2010. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Articles
Davis, James J.
Olsen, Gary J.
Characterizing the Native Codon Usages of a Genome: An Axis Projection Approach
title Characterizing the Native Codon Usages of a Genome: An Axis Projection Approach
title_full Characterizing the Native Codon Usages of a Genome: An Axis Projection Approach
title_fullStr Characterizing the Native Codon Usages of a Genome: An Axis Projection Approach
title_full_unstemmed Characterizing the Native Codon Usages of a Genome: An Axis Projection Approach
title_short Characterizing the Native Codon Usages of a Genome: An Axis Projection Approach
title_sort characterizing the native codon usages of a genome: an axis projection approach
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3002238/
https://www.ncbi.nlm.nih.gov/pubmed/20679093
http://dx.doi.org/10.1093/molbev/msq185
work_keys_str_mv AT davisjamesj characterizingthenativecodonusagesofagenomeanaxisprojectionapproach
AT olsengaryj characterizingthenativecodonusagesofagenomeanaxisprojectionapproach