Cargando…

Thousands of human mutation clusters are explained by short-range template switching

Variation within human genomes is unevenly distributed, and variants show spatial clustering. DNA replication–related template switching is a poorly known mutational mechanism capable of causing major chromosomal rearrangements as well as creating short inverted sequence copies that appear as local...

Descripción completa

Detalles Bibliográficos
Autor principal: Löytynoja, Ari
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9435742/
https://www.ncbi.nlm.nih.gov/pubmed/35760560
http://dx.doi.org/10.1101/gr.276478.121
_version_ 1784781216477609984
author Löytynoja, Ari
author_facet Löytynoja, Ari
author_sort Löytynoja, Ari
collection PubMed
description Variation within human genomes is unevenly distributed, and variants show spatial clustering. DNA replication–related template switching is a poorly known mutational mechanism capable of causing major chromosomal rearrangements as well as creating short inverted sequence copies that appear as local mutation clusters in sequence comparisons. In this study, haplotype-resolved genome assemblies representing 25 human populations and multinucleotide variants aggregated from 140,000 human sequencing experiments were reanalyzed. Local template switching could explain thousands of complex mutation clusters across the human genome, the loci segregating within and between populations. During the study, computational tools were developed for identification of template switch events using both short-read sequencing data and genotype data, and for genotyping candidate loci using short-read data. The characteristics of template-switch mutations complicate their detection, and widely used analysis pipelines for short-read sequencing data, normally capable of identifying single nucleotide changes, were found to miss template-switch mutations of tens of base pairs, potentially invalidating medical genetic studies searching for a causative allele behind genetic diseases. Combined with the massive sequencing data now available for humans, the novel tools described here enable building catalogs of affected loci and studying the cellular mechanisms behind template switching in both healthy organisms and disease.
format Online
Article
Text
id pubmed-9435742
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-94357422023-02-01 Thousands of human mutation clusters are explained by short-range template switching Löytynoja, Ari Genome Res Research Variation within human genomes is unevenly distributed, and variants show spatial clustering. DNA replication–related template switching is a poorly known mutational mechanism capable of causing major chromosomal rearrangements as well as creating short inverted sequence copies that appear as local mutation clusters in sequence comparisons. In this study, haplotype-resolved genome assemblies representing 25 human populations and multinucleotide variants aggregated from 140,000 human sequencing experiments were reanalyzed. Local template switching could explain thousands of complex mutation clusters across the human genome, the loci segregating within and between populations. During the study, computational tools were developed for identification of template switch events using both short-read sequencing data and genotype data, and for genotyping candidate loci using short-read data. The characteristics of template-switch mutations complicate their detection, and widely used analysis pipelines for short-read sequencing data, normally capable of identifying single nucleotide changes, were found to miss template-switch mutations of tens of base pairs, potentially invalidating medical genetic studies searching for a causative allele behind genetic diseases. Combined with the massive sequencing data now available for humans, the novel tools described here enable building catalogs of affected loci and studying the cellular mechanisms behind template switching in both healthy organisms and disease. Cold Spring Harbor Laboratory Press 2022-08 /pmc/articles/PMC9435742/ /pubmed/35760560 http://dx.doi.org/10.1101/gr.276478.121 Text en © 2022 Löytynoja; Published by Cold Spring Harbor Laboratory Press https://creativecommons.org/licenses/by-nc/4.0/This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see https://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) .
spellingShingle Research
Löytynoja, Ari
Thousands of human mutation clusters are explained by short-range template switching
title Thousands of human mutation clusters are explained by short-range template switching
title_full Thousands of human mutation clusters are explained by short-range template switching
title_fullStr Thousands of human mutation clusters are explained by short-range template switching
title_full_unstemmed Thousands of human mutation clusters are explained by short-range template switching
title_short Thousands of human mutation clusters are explained by short-range template switching
title_sort thousands of human mutation clusters are explained by short-range template switching
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9435742/
https://www.ncbi.nlm.nih.gov/pubmed/35760560
http://dx.doi.org/10.1101/gr.276478.121
work_keys_str_mv AT loytynojaari thousandsofhumanmutationclustersareexplainedbyshortrangetemplateswitching