Cargando…

Patterns of intron gain and conservation in eukaryotic genes

BACKGROUND: The presence of introns in protein-coding genes is a universal feature of eukaryotic genome organization, and the genes of multicellular eukaryotes, typically, contain multiple introns, a substantial fraction of which share position in distant taxa, such as plants and animals. Depending...

Descripción completa

Detalles Bibliográficos
Autores principales: Carmel, Liran, Rogozin, Igor B, Wolf, Yuri I, Koonin, Eugene V
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2151770/
https://www.ncbi.nlm.nih.gov/pubmed/17935625
http://dx.doi.org/10.1186/1471-2148-7-192
_version_ 1782144770945056768
author Carmel, Liran
Rogozin, Igor B
Wolf, Yuri I
Koonin, Eugene V
author_facet Carmel, Liran
Rogozin, Igor B
Wolf, Yuri I
Koonin, Eugene V
author_sort Carmel, Liran
collection PubMed
description BACKGROUND: The presence of introns in protein-coding genes is a universal feature of eukaryotic genome organization, and the genes of multicellular eukaryotes, typically, contain multiple introns, a substantial fraction of which share position in distant taxa, such as plants and animals. Depending on the methods and data sets used, researchers have reached opposite conclusions on the causes of the high fraction of shared introns in orthologous genes from distant eukaryotes. Some studies conclude that shared intron positions reflect, almost entirely, a remarkable evolutionary conservation, whereas others attribute it to parallel gain of introns. To resolve these contradictions, it is crucial to analyze the evolution of introns by using a model that minimally relies on arbitrary assumptions. RESULTS: We developed a probabilistic model of evolution that allows for variability of intron gain and loss rates over branches of the phylogenetic tree, individual genes, and individual sites. Applying this model to an extended set of conserved eukaryotic genes, we find that parallel gain, on average, accounts for only ~8% of the shared intron positions. However, the distribution of parallel gains over the phylogenetic tree of eukaryotes is highly non-uniform. There are, practically, no parallel gains in closely related lineages, whereas for distant lineages, such as animals and plants, parallel gains appear to contribute up to 20% of the shared intron positions. In accord with these findings, we estimated that ancestral introns have a high probability to be retained in extant genomes, and conversely, that a substantial fraction of extant introns have retained their positions since the early stages of eukaryotic evolution. In addition, the density of sites that are available for intron insertion is estimated to be, approximately, one in seven basepairs. CONCLUSION: We obtained robust estimates of the contribution of parallel gain to the observed sharing of intron positions between eukaryotic species separated by different evolutionary distances. The results indicate that, although the contribution of parallel gains varies across the phylogenetic tree, the high level of intron position sharing is due, primarily, to evolutionary conservation. Accordingly, numerous introns appear to persist in the same position over hundreds of millions of years of evolution. This is compatible with recent observations of a negative correlation between the rate of intron gain and coding sequence evolution rate of a gene, suggesting that at least some of the introns are functionally relevant.
format Text
id pubmed-2151770
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-21517702007-12-24 Patterns of intron gain and conservation in eukaryotic genes Carmel, Liran Rogozin, Igor B Wolf, Yuri I Koonin, Eugene V BMC Evol Biol Research Article BACKGROUND: The presence of introns in protein-coding genes is a universal feature of eukaryotic genome organization, and the genes of multicellular eukaryotes, typically, contain multiple introns, a substantial fraction of which share position in distant taxa, such as plants and animals. Depending on the methods and data sets used, researchers have reached opposite conclusions on the causes of the high fraction of shared introns in orthologous genes from distant eukaryotes. Some studies conclude that shared intron positions reflect, almost entirely, a remarkable evolutionary conservation, whereas others attribute it to parallel gain of introns. To resolve these contradictions, it is crucial to analyze the evolution of introns by using a model that minimally relies on arbitrary assumptions. RESULTS: We developed a probabilistic model of evolution that allows for variability of intron gain and loss rates over branches of the phylogenetic tree, individual genes, and individual sites. Applying this model to an extended set of conserved eukaryotic genes, we find that parallel gain, on average, accounts for only ~8% of the shared intron positions. However, the distribution of parallel gains over the phylogenetic tree of eukaryotes is highly non-uniform. There are, practically, no parallel gains in closely related lineages, whereas for distant lineages, such as animals and plants, parallel gains appear to contribute up to 20% of the shared intron positions. In accord with these findings, we estimated that ancestral introns have a high probability to be retained in extant genomes, and conversely, that a substantial fraction of extant introns have retained their positions since the early stages of eukaryotic evolution. In addition, the density of sites that are available for intron insertion is estimated to be, approximately, one in seven basepairs. CONCLUSION: We obtained robust estimates of the contribution of parallel gain to the observed sharing of intron positions between eukaryotic species separated by different evolutionary distances. The results indicate that, although the contribution of parallel gains varies across the phylogenetic tree, the high level of intron position sharing is due, primarily, to evolutionary conservation. Accordingly, numerous introns appear to persist in the same position over hundreds of millions of years of evolution. This is compatible with recent observations of a negative correlation between the rate of intron gain and coding sequence evolution rate of a gene, suggesting that at least some of the introns are functionally relevant. BioMed Central 2007-10-12 /pmc/articles/PMC2151770/ /pubmed/17935625 http://dx.doi.org/10.1186/1471-2148-7-192 Text en Copyright © 2007 Carmel et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Carmel, Liran
Rogozin, Igor B
Wolf, Yuri I
Koonin, Eugene V
Patterns of intron gain and conservation in eukaryotic genes
title Patterns of intron gain and conservation in eukaryotic genes
title_full Patterns of intron gain and conservation in eukaryotic genes
title_fullStr Patterns of intron gain and conservation in eukaryotic genes
title_full_unstemmed Patterns of intron gain and conservation in eukaryotic genes
title_short Patterns of intron gain and conservation in eukaryotic genes
title_sort patterns of intron gain and conservation in eukaryotic genes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2151770/
https://www.ncbi.nlm.nih.gov/pubmed/17935625
http://dx.doi.org/10.1186/1471-2148-7-192
work_keys_str_mv AT carmelliran patternsofintrongainandconservationineukaryoticgenes
AT rogozinigorb patternsofintrongainandconservationineukaryoticgenes
AT wolfyurii patternsofintrongainandconservationineukaryoticgenes
AT koonineugenev patternsofintrongainandconservationineukaryoticgenes