Cargando…

Evolutionary pressures on simple sequence repeats in prokaryotic coding regions

Simple sequence repeats (SSRs) are indel mutational hotspots in genomes. In prokaryotes, SSR loci can cause phase variation, a microbial survival strategy that relies on stochastic, reversible on–off switching of gene activity. By analyzing multiple strains of 42 fully sequenced prokaryotic species,...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Wei-Hsiang, Kussell, Edo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3315296/
https://www.ncbi.nlm.nih.gov/pubmed/22123746
http://dx.doi.org/10.1093/nar/gkr1078
_version_ 1782228208543858688
author Lin, Wei-Hsiang
Kussell, Edo
author_facet Lin, Wei-Hsiang
Kussell, Edo
author_sort Lin, Wei-Hsiang
collection PubMed
description Simple sequence repeats (SSRs) are indel mutational hotspots in genomes. In prokaryotes, SSR loci can cause phase variation, a microbial survival strategy that relies on stochastic, reversible on–off switching of gene activity. By analyzing multiple strains of 42 fully sequenced prokaryotic species, we measure the relative variability and density distribution of SSRs in coding regions. We demonstrate that repeat type strongly influences indel mutation rates, and that the most mutable types are most strongly avoided across genomes. We thoroughly characterize SSR density and variability as a function of N→C position along protein sequences. Using codon-shuffling algorithms that preserve amino acid sequence, we assess evolutionary pressures on SSRs. We find that coding sequences suppress repeats in the middle of proteins, and enrich repeats near termini, yielding U-shaped SSR density curves. We show that for many species this characteristic shape can be attributed to purely biophysical constraints of protein structure. In multiple cases, however, particularly in certain pathogenic bacteria, we observe over enrichment of SSRs near protein N-termini significantly beyond expectation based on structural constraints. This increases the probability that frameshifts result in non-functional proteins, revealing that these species may evolutionarily tune SSR positions in coding regions to facilitate phase variation.
format Online
Article
Text
id pubmed-3315296
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-33152962012-03-30 Evolutionary pressures on simple sequence repeats in prokaryotic coding regions Lin, Wei-Hsiang Kussell, Edo Nucleic Acids Res Computational Biology Simple sequence repeats (SSRs) are indel mutational hotspots in genomes. In prokaryotes, SSR loci can cause phase variation, a microbial survival strategy that relies on stochastic, reversible on–off switching of gene activity. By analyzing multiple strains of 42 fully sequenced prokaryotic species, we measure the relative variability and density distribution of SSRs in coding regions. We demonstrate that repeat type strongly influences indel mutation rates, and that the most mutable types are most strongly avoided across genomes. We thoroughly characterize SSR density and variability as a function of N→C position along protein sequences. Using codon-shuffling algorithms that preserve amino acid sequence, we assess evolutionary pressures on SSRs. We find that coding sequences suppress repeats in the middle of proteins, and enrich repeats near termini, yielding U-shaped SSR density curves. We show that for many species this characteristic shape can be attributed to purely biophysical constraints of protein structure. In multiple cases, however, particularly in certain pathogenic bacteria, we observe over enrichment of SSRs near protein N-termini significantly beyond expectation based on structural constraints. This increases the probability that frameshifts result in non-functional proteins, revealing that these species may evolutionarily tune SSR positions in coding regions to facilitate phase variation. Oxford University Press 2012-03 2011-11-26 /pmc/articles/PMC3315296/ /pubmed/22123746 http://dx.doi.org/10.1093/nar/gkr1078 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Computational Biology
Lin, Wei-Hsiang
Kussell, Edo
Evolutionary pressures on simple sequence repeats in prokaryotic coding regions
title Evolutionary pressures on simple sequence repeats in prokaryotic coding regions
title_full Evolutionary pressures on simple sequence repeats in prokaryotic coding regions
title_fullStr Evolutionary pressures on simple sequence repeats in prokaryotic coding regions
title_full_unstemmed Evolutionary pressures on simple sequence repeats in prokaryotic coding regions
title_short Evolutionary pressures on simple sequence repeats in prokaryotic coding regions
title_sort evolutionary pressures on simple sequence repeats in prokaryotic coding regions
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3315296/
https://www.ncbi.nlm.nih.gov/pubmed/22123746
http://dx.doi.org/10.1093/nar/gkr1078
work_keys_str_mv AT linweihsiang evolutionarypressuresonsimplesequencerepeatsinprokaryoticcodingregions
AT kusselledo evolutionarypressuresonsimplesequencerepeatsinprokaryoticcodingregions