Cargando…

How Evolution of Genomes Is Reflected in Exact DNA Sequence Match Statistics

Genome evolution is shaped by a multitude of mutational processes, including point mutations, insertions, and deletions of DNA sequences, as well as segmental duplications. These mutational processes can leave distinctive qualitative marks in the statistical features of genomic DNA sequences. One su...

Descripción completa

Detalles Bibliográficos
Autores principales: Massip, Florian, Sheinman, Michael, Schbath, Sophie, Arndt, Peter F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4298173/
https://www.ncbi.nlm.nih.gov/pubmed/25398628
http://dx.doi.org/10.1093/molbev/msu313
_version_ 1782353234696863744
author Massip, Florian
Sheinman, Michael
Schbath, Sophie
Arndt, Peter F.
author_facet Massip, Florian
Sheinman, Michael
Schbath, Sophie
Arndt, Peter F.
author_sort Massip, Florian
collection PubMed
description Genome evolution is shaped by a multitude of mutational processes, including point mutations, insertions, and deletions of DNA sequences, as well as segmental duplications. These mutational processes can leave distinctive qualitative marks in the statistical features of genomic DNA sequences. One such feature is the match length distribution (MLD) of exactly matching sequence segments within an individual genome or between the genomes of related species. These have been observed to exhibit characteristic power law decays in many species. Here, we show that simple dynamical models consisting solely of duplication and mutation processes can already explain the characteristic features of MLDs observed in genomic sequences. Surprisingly, we find that these features are largely insensitive to details of the underlying mutational processes and do not necessarily rely on the action of natural selection. Our results demonstrate how analyzing statistical features of DNA sequences can help us reveal and quantify the different mutational processes that underlie genome evolution.
format Online
Article
Text
id pubmed-4298173
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-42981732015-02-03 How Evolution of Genomes Is Reflected in Exact DNA Sequence Match Statistics Massip, Florian Sheinman, Michael Schbath, Sophie Arndt, Peter F. Mol Biol Evol Methods Genome evolution is shaped by a multitude of mutational processes, including point mutations, insertions, and deletions of DNA sequences, as well as segmental duplications. These mutational processes can leave distinctive qualitative marks in the statistical features of genomic DNA sequences. One such feature is the match length distribution (MLD) of exactly matching sequence segments within an individual genome or between the genomes of related species. These have been observed to exhibit characteristic power law decays in many species. Here, we show that simple dynamical models consisting solely of duplication and mutation processes can already explain the characteristic features of MLDs observed in genomic sequences. Surprisingly, we find that these features are largely insensitive to details of the underlying mutational processes and do not necessarily rely on the action of natural selection. Our results demonstrate how analyzing statistical features of DNA sequences can help us reveal and quantify the different mutational processes that underlie genome evolution. Oxford University Press 2015-02 2014-11-13 /pmc/articles/PMC4298173/ /pubmed/25398628 http://dx.doi.org/10.1093/molbev/msu313 Text en © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods
Massip, Florian
Sheinman, Michael
Schbath, Sophie
Arndt, Peter F.
How Evolution of Genomes Is Reflected in Exact DNA Sequence Match Statistics
title How Evolution of Genomes Is Reflected in Exact DNA Sequence Match Statistics
title_full How Evolution of Genomes Is Reflected in Exact DNA Sequence Match Statistics
title_fullStr How Evolution of Genomes Is Reflected in Exact DNA Sequence Match Statistics
title_full_unstemmed How Evolution of Genomes Is Reflected in Exact DNA Sequence Match Statistics
title_short How Evolution of Genomes Is Reflected in Exact DNA Sequence Match Statistics
title_sort how evolution of genomes is reflected in exact dna sequence match statistics
topic Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4298173/
https://www.ncbi.nlm.nih.gov/pubmed/25398628
http://dx.doi.org/10.1093/molbev/msu313
work_keys_str_mv AT massipflorian howevolutionofgenomesisreflectedinexactdnasequencematchstatistics
AT sheinmanmichael howevolutionofgenomesisreflectedinexactdnasequencematchstatistics
AT schbathsophie howevolutionofgenomesisreflectedinexactdnasequencematchstatistics
AT arndtpeterf howevolutionofgenomesisreflectedinexactdnasequencematchstatistics