Cargando…

Short template switch events explain mutation clusters in the human genome

Resequencing efforts are uncovering the extent of genetic variation in humans and provide data to study the evolutionary processes shaping our genome. One recurring puzzle in both intra- and inter-species studies is the high frequency of complex mutations comprising multiple nearby base substitution...

Descripción completa

Detalles Bibliográficos
Autores principales: Löytynoja, Ari, Goldman, Nick
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5453318/
https://www.ncbi.nlm.nih.gov/pubmed/28385709
http://dx.doi.org/10.1101/gr.214973.116
_version_ 1783240633631113216
author Löytynoja, Ari
Goldman, Nick
author_facet Löytynoja, Ari
Goldman, Nick
author_sort Löytynoja, Ari
collection PubMed
description Resequencing efforts are uncovering the extent of genetic variation in humans and provide data to study the evolutionary processes shaping our genome. One recurring puzzle in both intra- and inter-species studies is the high frequency of complex mutations comprising multiple nearby base substitutions or insertion-deletions. We devised a generalized mutation model of template switching during replication that extends existing models of genome rearrangement and used this to study the role of template switch events in the origin of short mutation clusters. Applied to the human genome, our model detects thousands of template switch events during the evolution of human and chimp from their common ancestor and hundreds of events between two independently sequenced human genomes. Although many of these are consistent with a template switch mechanism previously proposed for bacteria, our model also identifies new types of mutations that create short inversions, some flanked by paired inverted repeats. The local template switch process can create numerous complex mutation patterns, including hairpin loop structures, and explains multinucleotide mutations and compensatory substitutions without invoking positive selection, speculative mechanisms, or implausible coincidence. Clustered sequence differences are challenging for current mapping and variant calling methods, and we show that many erroneous variant annotations exist in human reference data. Local template switch events may have been neglected as an explanation for complex mutations because of biases in commonly used analyses. Incorporation of our model into reference-based analysis pipelines and comparisons of de novo assembled genomes will lead to improved understanding of genome variation and evolution.
format Online
Article
Text
id pubmed-5453318
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-54533182017-12-01 Short template switch events explain mutation clusters in the human genome Löytynoja, Ari Goldman, Nick Genome Res Method Resequencing efforts are uncovering the extent of genetic variation in humans and provide data to study the evolutionary processes shaping our genome. One recurring puzzle in both intra- and inter-species studies is the high frequency of complex mutations comprising multiple nearby base substitutions or insertion-deletions. We devised a generalized mutation model of template switching during replication that extends existing models of genome rearrangement and used this to study the role of template switch events in the origin of short mutation clusters. Applied to the human genome, our model detects thousands of template switch events during the evolution of human and chimp from their common ancestor and hundreds of events between two independently sequenced human genomes. Although many of these are consistent with a template switch mechanism previously proposed for bacteria, our model also identifies new types of mutations that create short inversions, some flanked by paired inverted repeats. The local template switch process can create numerous complex mutation patterns, including hairpin loop structures, and explains multinucleotide mutations and compensatory substitutions without invoking positive selection, speculative mechanisms, or implausible coincidence. Clustered sequence differences are challenging for current mapping and variant calling methods, and we show that many erroneous variant annotations exist in human reference data. Local template switch events may have been neglected as an explanation for complex mutations because of biases in commonly used analyses. Incorporation of our model into reference-based analysis pipelines and comparisons of de novo assembled genomes will lead to improved understanding of genome variation and evolution. Cold Spring Harbor Laboratory Press 2017-06 /pmc/articles/PMC5453318/ /pubmed/28385709 http://dx.doi.org/10.1101/gr.214973.116 Text en © 2017 Löytynoja and Goldman; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
spellingShingle Method
Löytynoja, Ari
Goldman, Nick
Short template switch events explain mutation clusters in the human genome
title Short template switch events explain mutation clusters in the human genome
title_full Short template switch events explain mutation clusters in the human genome
title_fullStr Short template switch events explain mutation clusters in the human genome
title_full_unstemmed Short template switch events explain mutation clusters in the human genome
title_short Short template switch events explain mutation clusters in the human genome
title_sort short template switch events explain mutation clusters in the human genome
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5453318/
https://www.ncbi.nlm.nih.gov/pubmed/28385709
http://dx.doi.org/10.1101/gr.214973.116
work_keys_str_mv AT loytynojaari shorttemplateswitcheventsexplainmutationclustersinthehumangenome
AT goldmannick shorttemplateswitcheventsexplainmutationclustersinthehumangenome