Cargando…
Evolutionary dynamics of selfish DNA explains the abundance distribution of genomic subsequences
Since the sequencing of large genomes, many statistical features of their sequences have been found. One intriguing feature is that certain subsequences are much more abundant than others. In fact, abundances of subsequences of a given length are distributed with a scale-free power-law tail, resembl...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4973250/ https://www.ncbi.nlm.nih.gov/pubmed/27488939 http://dx.doi.org/10.1038/srep30851 |
_version_ | 1782446373186043904 |
---|---|
author | Sheinman, Michael Ramisch, Anna Massip, Florian Arndt, Peter F. |
author_facet | Sheinman, Michael Ramisch, Anna Massip, Florian Arndt, Peter F. |
author_sort | Sheinman, Michael |
collection | PubMed |
description | Since the sequencing of large genomes, many statistical features of their sequences have been found. One intriguing feature is that certain subsequences are much more abundant than others. In fact, abundances of subsequences of a given length are distributed with a scale-free power-law tail, resembling properties of human texts, such as Zipf’s law. Despite recent efforts, the understanding of this phenomenon is still lacking. Here we find that selfish DNA elements, such as those belonging to the Alu family of repeats, dominate the power-law tail. Interestingly, for the Alu elements the power-law exponent increases with the length of the considered subsequences. Motivated by these observations, we develop a model of selfish DNA expansion. The predictions of this model qualitatively and quantitatively agree with the empirical observations. This allows us to estimate parameters for the process of selfish DNA spreading in a genome during its evolution. The obtained results shed light on how evolution of selfish DNA elements shapes non-trivial statistical properties of genomes. |
format | Online Article Text |
id | pubmed-4973250 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-49732502016-08-11 Evolutionary dynamics of selfish DNA explains the abundance distribution of genomic subsequences Sheinman, Michael Ramisch, Anna Massip, Florian Arndt, Peter F. Sci Rep Article Since the sequencing of large genomes, many statistical features of their sequences have been found. One intriguing feature is that certain subsequences are much more abundant than others. In fact, abundances of subsequences of a given length are distributed with a scale-free power-law tail, resembling properties of human texts, such as Zipf’s law. Despite recent efforts, the understanding of this phenomenon is still lacking. Here we find that selfish DNA elements, such as those belonging to the Alu family of repeats, dominate the power-law tail. Interestingly, for the Alu elements the power-law exponent increases with the length of the considered subsequences. Motivated by these observations, we develop a model of selfish DNA expansion. The predictions of this model qualitatively and quantitatively agree with the empirical observations. This allows us to estimate parameters for the process of selfish DNA spreading in a genome during its evolution. The obtained results shed light on how evolution of selfish DNA elements shapes non-trivial statistical properties of genomes. Nature Publishing Group 2016-08-04 /pmc/articles/PMC4973250/ /pubmed/27488939 http://dx.doi.org/10.1038/srep30851 Text en Copyright © 2016, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Sheinman, Michael Ramisch, Anna Massip, Florian Arndt, Peter F. Evolutionary dynamics of selfish DNA explains the abundance distribution of genomic subsequences |
title | Evolutionary dynamics of selfish DNA explains the abundance distribution of genomic subsequences |
title_full | Evolutionary dynamics of selfish DNA explains the abundance distribution of genomic subsequences |
title_fullStr | Evolutionary dynamics of selfish DNA explains the abundance distribution of genomic subsequences |
title_full_unstemmed | Evolutionary dynamics of selfish DNA explains the abundance distribution of genomic subsequences |
title_short | Evolutionary dynamics of selfish DNA explains the abundance distribution of genomic subsequences |
title_sort | evolutionary dynamics of selfish dna explains the abundance distribution of genomic subsequences |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4973250/ https://www.ncbi.nlm.nih.gov/pubmed/27488939 http://dx.doi.org/10.1038/srep30851 |
work_keys_str_mv | AT sheinmanmichael evolutionarydynamicsofselfishdnaexplainstheabundancedistributionofgenomicsubsequences AT ramischanna evolutionarydynamicsofselfishdnaexplainstheabundancedistributionofgenomicsubsequences AT massipflorian evolutionarydynamicsofselfishdnaexplainstheabundancedistributionofgenomicsubsequences AT arndtpeterf evolutionarydynamicsofselfishdnaexplainstheabundancedistributionofgenomicsubsequences |