Cargando…
Short template switch events explain mutation clusters in the human genome
Resequencing efforts are uncovering the extent of genetic variation in humans and provide data to study the evolutionary processes shaping our genome. One recurring puzzle in both intra- and inter-species studies is the high frequency of complex mutations comprising multiple nearby base substitution...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory Press
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5453318/ https://www.ncbi.nlm.nih.gov/pubmed/28385709 http://dx.doi.org/10.1101/gr.214973.116 |
_version_ | 1783240633631113216 |
---|---|
author | Löytynoja, Ari Goldman, Nick |
author_facet | Löytynoja, Ari Goldman, Nick |
author_sort | Löytynoja, Ari |
collection | PubMed |
description | Resequencing efforts are uncovering the extent of genetic variation in humans and provide data to study the evolutionary processes shaping our genome. One recurring puzzle in both intra- and inter-species studies is the high frequency of complex mutations comprising multiple nearby base substitutions or insertion-deletions. We devised a generalized mutation model of template switching during replication that extends existing models of genome rearrangement and used this to study the role of template switch events in the origin of short mutation clusters. Applied to the human genome, our model detects thousands of template switch events during the evolution of human and chimp from their common ancestor and hundreds of events between two independently sequenced human genomes. Although many of these are consistent with a template switch mechanism previously proposed for bacteria, our model also identifies new types of mutations that create short inversions, some flanked by paired inverted repeats. The local template switch process can create numerous complex mutation patterns, including hairpin loop structures, and explains multinucleotide mutations and compensatory substitutions without invoking positive selection, speculative mechanisms, or implausible coincidence. Clustered sequence differences are challenging for current mapping and variant calling methods, and we show that many erroneous variant annotations exist in human reference data. Local template switch events may have been neglected as an explanation for complex mutations because of biases in commonly used analyses. Incorporation of our model into reference-based analysis pipelines and comparisons of de novo assembled genomes will lead to improved understanding of genome variation and evolution. |
format | Online Article Text |
id | pubmed-5453318 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Cold Spring Harbor Laboratory Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-54533182017-12-01 Short template switch events explain mutation clusters in the human genome Löytynoja, Ari Goldman, Nick Genome Res Method Resequencing efforts are uncovering the extent of genetic variation in humans and provide data to study the evolutionary processes shaping our genome. One recurring puzzle in both intra- and inter-species studies is the high frequency of complex mutations comprising multiple nearby base substitutions or insertion-deletions. We devised a generalized mutation model of template switching during replication that extends existing models of genome rearrangement and used this to study the role of template switch events in the origin of short mutation clusters. Applied to the human genome, our model detects thousands of template switch events during the evolution of human and chimp from their common ancestor and hundreds of events between two independently sequenced human genomes. Although many of these are consistent with a template switch mechanism previously proposed for bacteria, our model also identifies new types of mutations that create short inversions, some flanked by paired inverted repeats. The local template switch process can create numerous complex mutation patterns, including hairpin loop structures, and explains multinucleotide mutations and compensatory substitutions without invoking positive selection, speculative mechanisms, or implausible coincidence. Clustered sequence differences are challenging for current mapping and variant calling methods, and we show that many erroneous variant annotations exist in human reference data. Local template switch events may have been neglected as an explanation for complex mutations because of biases in commonly used analyses. Incorporation of our model into reference-based analysis pipelines and comparisons of de novo assembled genomes will lead to improved understanding of genome variation and evolution. Cold Spring Harbor Laboratory Press 2017-06 /pmc/articles/PMC5453318/ /pubmed/28385709 http://dx.doi.org/10.1101/gr.214973.116 Text en © 2017 Löytynoja and Goldman; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/. |
spellingShingle | Method Löytynoja, Ari Goldman, Nick Short template switch events explain mutation clusters in the human genome |
title | Short template switch events explain mutation clusters in the human genome |
title_full | Short template switch events explain mutation clusters in the human genome |
title_fullStr | Short template switch events explain mutation clusters in the human genome |
title_full_unstemmed | Short template switch events explain mutation clusters in the human genome |
title_short | Short template switch events explain mutation clusters in the human genome |
title_sort | short template switch events explain mutation clusters in the human genome |
topic | Method |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5453318/ https://www.ncbi.nlm.nih.gov/pubmed/28385709 http://dx.doi.org/10.1101/gr.214973.116 |
work_keys_str_mv | AT loytynojaari shorttemplateswitcheventsexplainmutationclustersinthehumangenome AT goldmannick shorttemplateswitcheventsexplainmutationclustersinthehumangenome |