Cargando…

Natural selection retains overrepresented out-of-frame stop codons against frameshift peptides in prokaryotes

BACKGROUND: Out-of-frame stop codons (OSCs) occur naturally in coding sequences of all organisms, providing a mechanism of early termination of translation in incorrect reading frame so that the metabolic cost associated with frameshift events can be reduced. Given such a functional significance, we...

Descripción completa

Detalles Bibliográficos
Autores principales: Tse, Herman, Cai, James J, Tsoi, Hoi-Wah, Lam, Esther PT, Yuen, Kwok-Yung
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2996987/
https://www.ncbi.nlm.nih.gov/pubmed/20828396
http://dx.doi.org/10.1186/1471-2164-11-491
_version_ 1782193250877046784
author Tse, Herman
Cai, James J
Tsoi, Hoi-Wah
Lam, Esther PT
Yuen, Kwok-Yung
author_facet Tse, Herman
Cai, James J
Tsoi, Hoi-Wah
Lam, Esther PT
Yuen, Kwok-Yung
author_sort Tse, Herman
collection PubMed
description BACKGROUND: Out-of-frame stop codons (OSCs) occur naturally in coding sequences of all organisms, providing a mechanism of early termination of translation in incorrect reading frame so that the metabolic cost associated with frameshift events can be reduced. Given such a functional significance, we expect statistically overrepresented OSCs in coding sequences as a result of a widespread selection. Accordingly, we examined available prokaryotic genomes to look for evidence of this selection. RESULTS: The complete genome sequences of 990 prokaryotes were obtained from NCBI GenBank. We found that low G+C content coding sequences contain significantly more OSCs and G+C content at specific codon positions were the principal determinants of OSC usage bias in the different reading frames. To investigate if there is overrepresentation of OSCs, we modeled the trinucleotide and hexanucleotide biases of the coding sequences using Markov models, and calculated the expected OSC frequencies for each organism using a Monte Carlo approach. More than 93% of 342 phylogenetically representative prokaryotic genomes contain excess OSCs. Interestingly the degree of OSC overrepresentation correlates positively with G+C content, which may represent a compensatory mechanism for the negative correlation of OSC frequency with G+C content. We extended the analysis using additional compositional bias models and showed that lower-order bias like codon usage and dipeptide bias could not explain the OSC overrepresentation. The degree of OSC overrepresentation was found to correlate negatively with the optimal growth temperature of the organism after correcting for the G+C% and AT skew of the coding sequence. CONCLUSIONS: The present study uses approaches with statistical rigor to show that OSC overrepresentation is a widespread phenomenon among prokaryotes. Our results support the hypothesis that OSCs carry functional significance and have been selected in the course of genome evolution to act against unintended frameshift occurrences. Some results also hint that OSC overrepresentation being a compensatory mechanism to make up for the decrease in OSCs in high G+C organisms, thus revealing the interplay between two different determinants of OSC frequency.
format Text
id pubmed-2996987
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-29969872010-12-07 Natural selection retains overrepresented out-of-frame stop codons against frameshift peptides in prokaryotes Tse, Herman Cai, James J Tsoi, Hoi-Wah Lam, Esther PT Yuen, Kwok-Yung BMC Genomics Research Article BACKGROUND: Out-of-frame stop codons (OSCs) occur naturally in coding sequences of all organisms, providing a mechanism of early termination of translation in incorrect reading frame so that the metabolic cost associated with frameshift events can be reduced. Given such a functional significance, we expect statistically overrepresented OSCs in coding sequences as a result of a widespread selection. Accordingly, we examined available prokaryotic genomes to look for evidence of this selection. RESULTS: The complete genome sequences of 990 prokaryotes were obtained from NCBI GenBank. We found that low G+C content coding sequences contain significantly more OSCs and G+C content at specific codon positions were the principal determinants of OSC usage bias in the different reading frames. To investigate if there is overrepresentation of OSCs, we modeled the trinucleotide and hexanucleotide biases of the coding sequences using Markov models, and calculated the expected OSC frequencies for each organism using a Monte Carlo approach. More than 93% of 342 phylogenetically representative prokaryotic genomes contain excess OSCs. Interestingly the degree of OSC overrepresentation correlates positively with G+C content, which may represent a compensatory mechanism for the negative correlation of OSC frequency with G+C content. We extended the analysis using additional compositional bias models and showed that lower-order bias like codon usage and dipeptide bias could not explain the OSC overrepresentation. The degree of OSC overrepresentation was found to correlate negatively with the optimal growth temperature of the organism after correcting for the G+C% and AT skew of the coding sequence. CONCLUSIONS: The present study uses approaches with statistical rigor to show that OSC overrepresentation is a widespread phenomenon among prokaryotes. Our results support the hypothesis that OSCs carry functional significance and have been selected in the course of genome evolution to act against unintended frameshift occurrences. Some results also hint that OSC overrepresentation being a compensatory mechanism to make up for the decrease in OSCs in high G+C organisms, thus revealing the interplay between two different determinants of OSC frequency. BioMed Central 2010-09-09 /pmc/articles/PMC2996987/ /pubmed/20828396 http://dx.doi.org/10.1186/1471-2164-11-491 Text en Copyright ©2010 Tse et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Tse, Herman
Cai, James J
Tsoi, Hoi-Wah
Lam, Esther PT
Yuen, Kwok-Yung
Natural selection retains overrepresented out-of-frame stop codons against frameshift peptides in prokaryotes
title Natural selection retains overrepresented out-of-frame stop codons against frameshift peptides in prokaryotes
title_full Natural selection retains overrepresented out-of-frame stop codons against frameshift peptides in prokaryotes
title_fullStr Natural selection retains overrepresented out-of-frame stop codons against frameshift peptides in prokaryotes
title_full_unstemmed Natural selection retains overrepresented out-of-frame stop codons against frameshift peptides in prokaryotes
title_short Natural selection retains overrepresented out-of-frame stop codons against frameshift peptides in prokaryotes
title_sort natural selection retains overrepresented out-of-frame stop codons against frameshift peptides in prokaryotes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2996987/
https://www.ncbi.nlm.nih.gov/pubmed/20828396
http://dx.doi.org/10.1186/1471-2164-11-491
work_keys_str_mv AT tseherman naturalselectionretainsoverrepresentedoutofframestopcodonsagainstframeshiftpeptidesinprokaryotes
AT caijamesj naturalselectionretainsoverrepresentedoutofframestopcodonsagainstframeshiftpeptidesinprokaryotes
AT tsoihoiwah naturalselectionretainsoverrepresentedoutofframestopcodonsagainstframeshiftpeptidesinprokaryotes
AT lamestherpt naturalselectionretainsoverrepresentedoutofframestopcodonsagainstframeshiftpeptidesinprokaryotes
AT yuenkwokyung naturalselectionretainsoverrepresentedoutofframestopcodonsagainstframeshiftpeptidesinprokaryotes