Cargando…

Quantification of codon selection for comparative bacterial genomics

BACKGROUND: Statistics measuring codon selection seek to compare genes by their sensitivity to selection for translational efficiency, but existing statistics lack a model for testing the significance of differences between genes. Here, we introduce a new statistic for measuring codon selection, the...

Descripción completa

Detalles Bibliográficos
Autores principales: Retchless, Adam C, Lawrence, Jeffrey G
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3162537/
https://www.ncbi.nlm.nih.gov/pubmed/21787402
http://dx.doi.org/10.1186/1471-2164-12-374
_version_ 1782210824239054848
author Retchless, Adam C
Lawrence, Jeffrey G
author_facet Retchless, Adam C
Lawrence, Jeffrey G
author_sort Retchless, Adam C
collection PubMed
description BACKGROUND: Statistics measuring codon selection seek to compare genes by their sensitivity to selection for translational efficiency, but existing statistics lack a model for testing the significance of differences between genes. Here, we introduce a new statistic for measuring codon selection, the Adaptive Codon Enrichment (ACE). RESULTS: This statistic represents codon usage bias in terms of a probabilistic distribution, quantifying the extent that preferred codons are over-represented in the gene of interest relative to the mean and variance that would result from stochastic sampling of codons. Expected codon frequencies are derived from the observed codon usage frequencies of a broad set of genes, such that they are likely to reflect nonselective, genome wide influences on codon usage (e.g. mutational biases). The relative adaptiveness of synonymous codons is deduced from the frequency of codon usage in a pre-selected set of genes relative to the expected frequency. The ACE can predict both transcript abundance during rapid growth and the rate of synonymous substitutions, with accuracy comparable to or greater than existing metrics. We further examine how the composition of reference gene sets affects the accuracy of the statistic, and suggest methods for selecting appropriate reference sets for any genome, including bacteriophages. Finally, we demonstrate that the ACE may naturally be extended to quantify the genome-wide influence of codon selection in a manner that is sensitive to a large fraction of codons in the genome. This reveals substantial variation among genomes, correlated with the tRNA gene number, even among groups of bacteria where previously proposed whole-genome measures show little variation. CONCLUSIONS: The statistical framework of the ACE allows rigorous comparison of the level of codon selection acting on genes, both within a genome and between genomes.
format Online
Article
Text
id pubmed-3162537
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31625372011-08-27 Quantification of codon selection for comparative bacterial genomics Retchless, Adam C Lawrence, Jeffrey G BMC Genomics Methodology Article BACKGROUND: Statistics measuring codon selection seek to compare genes by their sensitivity to selection for translational efficiency, but existing statistics lack a model for testing the significance of differences between genes. Here, we introduce a new statistic for measuring codon selection, the Adaptive Codon Enrichment (ACE). RESULTS: This statistic represents codon usage bias in terms of a probabilistic distribution, quantifying the extent that preferred codons are over-represented in the gene of interest relative to the mean and variance that would result from stochastic sampling of codons. Expected codon frequencies are derived from the observed codon usage frequencies of a broad set of genes, such that they are likely to reflect nonselective, genome wide influences on codon usage (e.g. mutational biases). The relative adaptiveness of synonymous codons is deduced from the frequency of codon usage in a pre-selected set of genes relative to the expected frequency. The ACE can predict both transcript abundance during rapid growth and the rate of synonymous substitutions, with accuracy comparable to or greater than existing metrics. We further examine how the composition of reference gene sets affects the accuracy of the statistic, and suggest methods for selecting appropriate reference sets for any genome, including bacteriophages. Finally, we demonstrate that the ACE may naturally be extended to quantify the genome-wide influence of codon selection in a manner that is sensitive to a large fraction of codons in the genome. This reveals substantial variation among genomes, correlated with the tRNA gene number, even among groups of bacteria where previously proposed whole-genome measures show little variation. CONCLUSIONS: The statistical framework of the ACE allows rigorous comparison of the level of codon selection acting on genes, both within a genome and between genomes. BioMed Central 2011-07-25 /pmc/articles/PMC3162537/ /pubmed/21787402 http://dx.doi.org/10.1186/1471-2164-12-374 Text en Copyright ©2011 Retchless and Lawrence; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Retchless, Adam C
Lawrence, Jeffrey G
Quantification of codon selection for comparative bacterial genomics
title Quantification of codon selection for comparative bacterial genomics
title_full Quantification of codon selection for comparative bacterial genomics
title_fullStr Quantification of codon selection for comparative bacterial genomics
title_full_unstemmed Quantification of codon selection for comparative bacterial genomics
title_short Quantification of codon selection for comparative bacterial genomics
title_sort quantification of codon selection for comparative bacterial genomics
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3162537/
https://www.ncbi.nlm.nih.gov/pubmed/21787402
http://dx.doi.org/10.1186/1471-2164-12-374
work_keys_str_mv AT retchlessadamc quantificationofcodonselectionforcomparativebacterialgenomics
AT lawrencejeffreyg quantificationofcodonselectionforcomparativebacterialgenomics