Cargando…

Evolutionary dynamics of selfish DNA explains the abundance distribution of genomic subsequences

Since the sequencing of large genomes, many statistical features of their sequences have been found. One intriguing feature is that certain subsequences are much more abundant than others. In fact, abundances of subsequences of a given length are distributed with a scale-free power-law tail, resembl...

Descripción completa

Detalles Bibliográficos
Autores principales: Sheinman, Michael, Ramisch, Anna, Massip, Florian, Arndt, Peter F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4973250/
https://www.ncbi.nlm.nih.gov/pubmed/27488939
http://dx.doi.org/10.1038/srep30851
_version_ 1782446373186043904
author Sheinman, Michael
Ramisch, Anna
Massip, Florian
Arndt, Peter F.
author_facet Sheinman, Michael
Ramisch, Anna
Massip, Florian
Arndt, Peter F.
author_sort Sheinman, Michael
collection PubMed
description Since the sequencing of large genomes, many statistical features of their sequences have been found. One intriguing feature is that certain subsequences are much more abundant than others. In fact, abundances of subsequences of a given length are distributed with a scale-free power-law tail, resembling properties of human texts, such as Zipf’s law. Despite recent efforts, the understanding of this phenomenon is still lacking. Here we find that selfish DNA elements, such as those belonging to the Alu family of repeats, dominate the power-law tail. Interestingly, for the Alu elements the power-law exponent increases with the length of the considered subsequences. Motivated by these observations, we develop a model of selfish DNA expansion. The predictions of this model qualitatively and quantitatively agree with the empirical observations. This allows us to estimate parameters for the process of selfish DNA spreading in a genome during its evolution. The obtained results shed light on how evolution of selfish DNA elements shapes non-trivial statistical properties of genomes.
format Online
Article
Text
id pubmed-4973250
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-49732502016-08-11 Evolutionary dynamics of selfish DNA explains the abundance distribution of genomic subsequences Sheinman, Michael Ramisch, Anna Massip, Florian Arndt, Peter F. Sci Rep Article Since the sequencing of large genomes, many statistical features of their sequences have been found. One intriguing feature is that certain subsequences are much more abundant than others. In fact, abundances of subsequences of a given length are distributed with a scale-free power-law tail, resembling properties of human texts, such as Zipf’s law. Despite recent efforts, the understanding of this phenomenon is still lacking. Here we find that selfish DNA elements, such as those belonging to the Alu family of repeats, dominate the power-law tail. Interestingly, for the Alu elements the power-law exponent increases with the length of the considered subsequences. Motivated by these observations, we develop a model of selfish DNA expansion. The predictions of this model qualitatively and quantitatively agree with the empirical observations. This allows us to estimate parameters for the process of selfish DNA spreading in a genome during its evolution. The obtained results shed light on how evolution of selfish DNA elements shapes non-trivial statistical properties of genomes. Nature Publishing Group 2016-08-04 /pmc/articles/PMC4973250/ /pubmed/27488939 http://dx.doi.org/10.1038/srep30851 Text en Copyright © 2016, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Sheinman, Michael
Ramisch, Anna
Massip, Florian
Arndt, Peter F.
Evolutionary dynamics of selfish DNA explains the abundance distribution of genomic subsequences
title Evolutionary dynamics of selfish DNA explains the abundance distribution of genomic subsequences
title_full Evolutionary dynamics of selfish DNA explains the abundance distribution of genomic subsequences
title_fullStr Evolutionary dynamics of selfish DNA explains the abundance distribution of genomic subsequences
title_full_unstemmed Evolutionary dynamics of selfish DNA explains the abundance distribution of genomic subsequences
title_short Evolutionary dynamics of selfish DNA explains the abundance distribution of genomic subsequences
title_sort evolutionary dynamics of selfish dna explains the abundance distribution of genomic subsequences
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4973250/
https://www.ncbi.nlm.nih.gov/pubmed/27488939
http://dx.doi.org/10.1038/srep30851
work_keys_str_mv AT sheinmanmichael evolutionarydynamicsofselfishdnaexplainstheabundancedistributionofgenomicsubsequences
AT ramischanna evolutionarydynamicsofselfishdnaexplainstheabundancedistributionofgenomicsubsequences
AT massipflorian evolutionarydynamicsofselfishdnaexplainstheabundancedistributionofgenomicsubsequences
AT arndtpeterf evolutionarydynamicsofselfishdnaexplainstheabundancedistributionofgenomicsubsequences