Cargando…

Modeling the evolution dynamics of exon-intron structure with a general random fragmentation process

BACKGROUND: Most eukaryotic genes are interrupted by spliceosomal introns. The evolution of exon-intron structure remains mysterious despite rapid advance in genome sequencing technique. In this work, a novel approach is taken based on the assumptions that the evolution of exon-intron structure is a...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Liya, Stein, Lincoln D
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3732091/
https://www.ncbi.nlm.nih.gov/pubmed/23448166
http://dx.doi.org/10.1186/1471-2148-13-57
_version_ 1782279229733339136
author Wang, Liya
Stein, Lincoln D
author_facet Wang, Liya
Stein, Lincoln D
author_sort Wang, Liya
collection PubMed
description BACKGROUND: Most eukaryotic genes are interrupted by spliceosomal introns. The evolution of exon-intron structure remains mysterious despite rapid advance in genome sequencing technique. In this work, a novel approach is taken based on the assumptions that the evolution of exon-intron structure is a stochastic process, and that the characteristics of this process can be understood by examining its historical outcome, the present-day size distribution of internal translated exons (exon). Through the combination of simulation and modeling the size distribution of exons in different species, we propose a general random fragmentation process (GRFP) to characterize the evolution dynamics of exon-intron structure. This model accurately predicts the probability that an exon will be split by a new intron and the distribution of novel insertions along the length of the exon. RESULTS: As the first observation from this model, we show that the chance for an exon to obtain an intron is proportional to its size to the 3rd power. We also show that such size dependence is nearly constant across gene, with the exception of the exons adjacent to the 5(′) UTR. As the second conclusion from the model, we show that intron insertion loci follow a normal distribution with a mean of 0.5 (center of the exon) and a standard deviation of 0.11. Finally, we show that intron insertions within a gene are independent of each other for vertebrates, but are more negatively correlated for non-vertebrate. We use simulation to demonstrate that the negative correlation might result from significant intron loss during evolution, which could be explained by selection against multi-intron genes in these organisms. CONCLUSIONS: The GRFP model suggests that intron gain is dynamic with a higher chance for longer exons; introns are inserted into exons randomly with the highest probability at the center of the exon. GRFP estimates that there are 78 introns in every 10 kb coding sequences for vertebrate genomes, agreeing with empirical observations. GRFP also estimates that there are significant intron losses in the evolution of non-vertebrate genomes, with extreme cases of around 57% intron loss in Drosophila melanogaster, 28% in Caenorhabditis elegans, and 24% in Oryza sativa.
format Online
Article
Text
id pubmed-3732091
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-37320912013-08-05 Modeling the evolution dynamics of exon-intron structure with a general random fragmentation process Wang, Liya Stein, Lincoln D BMC Evol Biol Research Article BACKGROUND: Most eukaryotic genes are interrupted by spliceosomal introns. The evolution of exon-intron structure remains mysterious despite rapid advance in genome sequencing technique. In this work, a novel approach is taken based on the assumptions that the evolution of exon-intron structure is a stochastic process, and that the characteristics of this process can be understood by examining its historical outcome, the present-day size distribution of internal translated exons (exon). Through the combination of simulation and modeling the size distribution of exons in different species, we propose a general random fragmentation process (GRFP) to characterize the evolution dynamics of exon-intron structure. This model accurately predicts the probability that an exon will be split by a new intron and the distribution of novel insertions along the length of the exon. RESULTS: As the first observation from this model, we show that the chance for an exon to obtain an intron is proportional to its size to the 3rd power. We also show that such size dependence is nearly constant across gene, with the exception of the exons adjacent to the 5(′) UTR. As the second conclusion from the model, we show that intron insertion loci follow a normal distribution with a mean of 0.5 (center of the exon) and a standard deviation of 0.11. Finally, we show that intron insertions within a gene are independent of each other for vertebrates, but are more negatively correlated for non-vertebrate. We use simulation to demonstrate that the negative correlation might result from significant intron loss during evolution, which could be explained by selection against multi-intron genes in these organisms. CONCLUSIONS: The GRFP model suggests that intron gain is dynamic with a higher chance for longer exons; introns are inserted into exons randomly with the highest probability at the center of the exon. GRFP estimates that there are 78 introns in every 10 kb coding sequences for vertebrate genomes, agreeing with empirical observations. GRFP also estimates that there are significant intron losses in the evolution of non-vertebrate genomes, with extreme cases of around 57% intron loss in Drosophila melanogaster, 28% in Caenorhabditis elegans, and 24% in Oryza sativa. BioMed Central 2013-02-28 /pmc/articles/PMC3732091/ /pubmed/23448166 http://dx.doi.org/10.1186/1471-2148-13-57 Text en Copyright © 2013 Wang and Stein; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Wang, Liya
Stein, Lincoln D
Modeling the evolution dynamics of exon-intron structure with a general random fragmentation process
title Modeling the evolution dynamics of exon-intron structure with a general random fragmentation process
title_full Modeling the evolution dynamics of exon-intron structure with a general random fragmentation process
title_fullStr Modeling the evolution dynamics of exon-intron structure with a general random fragmentation process
title_full_unstemmed Modeling the evolution dynamics of exon-intron structure with a general random fragmentation process
title_short Modeling the evolution dynamics of exon-intron structure with a general random fragmentation process
title_sort modeling the evolution dynamics of exon-intron structure with a general random fragmentation process
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3732091/
https://www.ncbi.nlm.nih.gov/pubmed/23448166
http://dx.doi.org/10.1186/1471-2148-13-57
work_keys_str_mv AT wangliya modelingtheevolutiondynamicsofexonintronstructurewithageneralrandomfragmentationprocess
AT steinlincolnd modelingtheevolutiondynamicsofexonintronstructurewithageneralrandomfragmentationprocess