Cargando…

A statistical analysis of the three-fold evolution of genomic compression through frame overlaps in prokaryotes

BACKGROUND: Among microbial genomes, genetic information is frequently compressed, exploiting redundancies in the genetic code in order to store information in overlapping genes. We investigate the length, phase and orientation properties of overlap in 58 prokaryotic species evaluating neutral and s...

Descripción completa

Detalles Bibliográficos
Autores principales: Lillo, Fabrizio, Krakauer, David C
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2174442/
https://www.ncbi.nlm.nih.gov/pubmed/17877818
http://dx.doi.org/10.1186/1745-6150-2-22
_version_ 1782145340283027456
author Lillo, Fabrizio
Krakauer, David C
author_facet Lillo, Fabrizio
Krakauer, David C
author_sort Lillo, Fabrizio
collection PubMed
description BACKGROUND: Among microbial genomes, genetic information is frequently compressed, exploiting redundancies in the genetic code in order to store information in overlapping genes. We investigate the length, phase and orientation properties of overlap in 58 prokaryotic species evaluating neutral and selective mechanisms of evolution. RESULTS: Using a variety of statistical null models we find patterns of compressive coding that can not be explained purely in terms of the selective processes favoring genome minimization or translational coupling. The distribution of overlap lengths follows a fat-tailed distribution, in which a significant proportion of overlaps are in excess of 100 base pairs in length. The phase of overlap – pairing of codon positions in complementary reading frames – is strongly predicted by the translation orientation of each gene. We find that as overlapping genes become longer, they have a tendency to alternate among alternative overlap phases. Some phases seem to reflect codon pairings reducing the probability of non-synonymous substitution. We analyze the lineage-dependent features of overlapping genes by tracing a number of different continuous characters through the prokaryotic phylogeny using squared-change parsimony and observe both clade-specific and species-specific patterns. CONCLUSION: Overlapping reading frames preserve in their structure, features relating to mutational origination of new genes, but have undergone modification for both immediate benefits and for variational buffering and amplification. Genomes come under a variety of different mutational and selectional pressures, and the structure of redundancies in overlapping genes can be used to detect these pressures. No single mechanism is able to account for all the variability observed among the set of prokaryotic overlapping genes but a three-fold analysis of evolutionary events provides a more integrative framework. REVIEWERS: This article was reviewed by Eugene Koonin, Marten Huynem, and Han Liang.
format Text
id pubmed-2174442
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-21744422008-01-04 A statistical analysis of the three-fold evolution of genomic compression through frame overlaps in prokaryotes Lillo, Fabrizio Krakauer, David C Biol Direct Research BACKGROUND: Among microbial genomes, genetic information is frequently compressed, exploiting redundancies in the genetic code in order to store information in overlapping genes. We investigate the length, phase and orientation properties of overlap in 58 prokaryotic species evaluating neutral and selective mechanisms of evolution. RESULTS: Using a variety of statistical null models we find patterns of compressive coding that can not be explained purely in terms of the selective processes favoring genome minimization or translational coupling. The distribution of overlap lengths follows a fat-tailed distribution, in which a significant proportion of overlaps are in excess of 100 base pairs in length. The phase of overlap – pairing of codon positions in complementary reading frames – is strongly predicted by the translation orientation of each gene. We find that as overlapping genes become longer, they have a tendency to alternate among alternative overlap phases. Some phases seem to reflect codon pairings reducing the probability of non-synonymous substitution. We analyze the lineage-dependent features of overlapping genes by tracing a number of different continuous characters through the prokaryotic phylogeny using squared-change parsimony and observe both clade-specific and species-specific patterns. CONCLUSION: Overlapping reading frames preserve in their structure, features relating to mutational origination of new genes, but have undergone modification for both immediate benefits and for variational buffering and amplification. Genomes come under a variety of different mutational and selectional pressures, and the structure of redundancies in overlapping genes can be used to detect these pressures. No single mechanism is able to account for all the variability observed among the set of prokaryotic overlapping genes but a three-fold analysis of evolutionary events provides a more integrative framework. REVIEWERS: This article was reviewed by Eugene Koonin, Marten Huynem, and Han Liang. BioMed Central 2007-09-18 /pmc/articles/PMC2174442/ /pubmed/17877818 http://dx.doi.org/10.1186/1745-6150-2-22 Text en Copyright © 2007 Lillo and Krakauer; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Lillo, Fabrizio
Krakauer, David C
A statistical analysis of the three-fold evolution of genomic compression through frame overlaps in prokaryotes
title A statistical analysis of the three-fold evolution of genomic compression through frame overlaps in prokaryotes
title_full A statistical analysis of the three-fold evolution of genomic compression through frame overlaps in prokaryotes
title_fullStr A statistical analysis of the three-fold evolution of genomic compression through frame overlaps in prokaryotes
title_full_unstemmed A statistical analysis of the three-fold evolution of genomic compression through frame overlaps in prokaryotes
title_short A statistical analysis of the three-fold evolution of genomic compression through frame overlaps in prokaryotes
title_sort statistical analysis of the three-fold evolution of genomic compression through frame overlaps in prokaryotes
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2174442/
https://www.ncbi.nlm.nih.gov/pubmed/17877818
http://dx.doi.org/10.1186/1745-6150-2-22
work_keys_str_mv AT lillofabrizio astatisticalanalysisofthethreefoldevolutionofgenomiccompressionthroughframeoverlapsinprokaryotes
AT krakauerdavidc astatisticalanalysisofthethreefoldevolutionofgenomiccompressionthroughframeoverlapsinprokaryotes
AT lillofabrizio statisticalanalysisofthethreefoldevolutionofgenomiccompressionthroughframeoverlapsinprokaryotes
AT krakauerdavidc statisticalanalysisofthethreefoldevolutionofgenomiccompressionthroughframeoverlapsinprokaryotes