Cargando…
Increased mutation and gene conversion within human segmental duplications
Single-nucleotide variants (SNVs) in segmental duplications (SDs) have not been systematically assessed because of the limitations of mapping short-read sequencing data(1,2). Here we constructed 1:1 unambiguous alignments spanning high-identity SDs across 102 human haplotypes and compared the patter...
Autores principales: | , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10172114/ https://www.ncbi.nlm.nih.gov/pubmed/37165237 http://dx.doi.org/10.1038/s41586-023-05895-y |
_version_ | 1785039555194257408 |
---|---|
author | Vollger, Mitchell R. Dishuck, Philip C. Harvey, William T. DeWitt, William S. Guitart, Xavi Goldberg, Michael E. Rozanski, Allison N. Lucas, Julian Asri, Mobin Munson, Katherine M. Lewis, Alexandra P. Hoekzema, Kendra Logsdon, Glennis A. Porubsky, David Paten, Benedict Harris, Kelley Hsieh, PingHsun Eichler, Evan E. |
author_facet | Vollger, Mitchell R. Dishuck, Philip C. Harvey, William T. DeWitt, William S. Guitart, Xavi Goldberg, Michael E. Rozanski, Allison N. Lucas, Julian Asri, Mobin Munson, Katherine M. Lewis, Alexandra P. Hoekzema, Kendra Logsdon, Glennis A. Porubsky, David Paten, Benedict Harris, Kelley Hsieh, PingHsun Eichler, Evan E. |
author_sort | Vollger, Mitchell R. |
collection | PubMed |
description | Single-nucleotide variants (SNVs) in segmental duplications (SDs) have not been systematically assessed because of the limitations of mapping short-read sequencing data(1,2). Here we constructed 1:1 unambiguous alignments spanning high-identity SDs across 102 human haplotypes and compared the pattern of SNVs between unique and duplicated regions(3,4). We find that human SNVs are elevated 60% in SDs compared to unique regions and estimate that at least 23% of this increase is due to interlocus gene conversion (IGC) with up to 4.3 megabase pairs of SD sequence converted on average per human haplotype. We develop a genome-wide map of IGC donors and acceptors, including 498 acceptor and 454 donor hotspots affecting the exons of about 800 protein-coding genes. These include 171 genes that have ‘relocated’ on average 1.61 megabase pairs in a subset of human haplotypes. Using a coalescent framework, we show that SD regions are slightly evolutionarily older when compared to unique sequences, probably owing to IGC. SNVs in SDs, however, show a distinct mutational spectrum: a 27.1% increase in transversions that convert cytosine to guanine or the reverse across all triplet contexts and a 7.6% reduction in the frequency of CpG-associated mutations when compared to unique DNA. We reason that these distinct mutational properties help to maintain an overall higher GC content of SD DNA compared to that of unique DNA, probably driven by GC-biased conversion between paralogous sequences(5,6). |
format | Online Article Text |
id | pubmed-10172114 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-101721142023-05-12 Increased mutation and gene conversion within human segmental duplications Vollger, Mitchell R. Dishuck, Philip C. Harvey, William T. DeWitt, William S. Guitart, Xavi Goldberg, Michael E. Rozanski, Allison N. Lucas, Julian Asri, Mobin Munson, Katherine M. Lewis, Alexandra P. Hoekzema, Kendra Logsdon, Glennis A. Porubsky, David Paten, Benedict Harris, Kelley Hsieh, PingHsun Eichler, Evan E. Nature Article Single-nucleotide variants (SNVs) in segmental duplications (SDs) have not been systematically assessed because of the limitations of mapping short-read sequencing data(1,2). Here we constructed 1:1 unambiguous alignments spanning high-identity SDs across 102 human haplotypes and compared the pattern of SNVs between unique and duplicated regions(3,4). We find that human SNVs are elevated 60% in SDs compared to unique regions and estimate that at least 23% of this increase is due to interlocus gene conversion (IGC) with up to 4.3 megabase pairs of SD sequence converted on average per human haplotype. We develop a genome-wide map of IGC donors and acceptors, including 498 acceptor and 454 donor hotspots affecting the exons of about 800 protein-coding genes. These include 171 genes that have ‘relocated’ on average 1.61 megabase pairs in a subset of human haplotypes. Using a coalescent framework, we show that SD regions are slightly evolutionarily older when compared to unique sequences, probably owing to IGC. SNVs in SDs, however, show a distinct mutational spectrum: a 27.1% increase in transversions that convert cytosine to guanine or the reverse across all triplet contexts and a 7.6% reduction in the frequency of CpG-associated mutations when compared to unique DNA. We reason that these distinct mutational properties help to maintain an overall higher GC content of SD DNA compared to that of unique DNA, probably driven by GC-biased conversion between paralogous sequences(5,6). Nature Publishing Group UK 2023-05-10 2023 /pmc/articles/PMC10172114/ /pubmed/37165237 http://dx.doi.org/10.1038/s41586-023-05895-y Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Vollger, Mitchell R. Dishuck, Philip C. Harvey, William T. DeWitt, William S. Guitart, Xavi Goldberg, Michael E. Rozanski, Allison N. Lucas, Julian Asri, Mobin Munson, Katherine M. Lewis, Alexandra P. Hoekzema, Kendra Logsdon, Glennis A. Porubsky, David Paten, Benedict Harris, Kelley Hsieh, PingHsun Eichler, Evan E. Increased mutation and gene conversion within human segmental duplications |
title | Increased mutation and gene conversion within human segmental duplications |
title_full | Increased mutation and gene conversion within human segmental duplications |
title_fullStr | Increased mutation and gene conversion within human segmental duplications |
title_full_unstemmed | Increased mutation and gene conversion within human segmental duplications |
title_short | Increased mutation and gene conversion within human segmental duplications |
title_sort | increased mutation and gene conversion within human segmental duplications |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10172114/ https://www.ncbi.nlm.nih.gov/pubmed/37165237 http://dx.doi.org/10.1038/s41586-023-05895-y |
work_keys_str_mv | AT vollgermitchellr increasedmutationandgeneconversionwithinhumansegmentalduplications AT dishuckphilipc increasedmutationandgeneconversionwithinhumansegmentalduplications AT harveywilliamt increasedmutationandgeneconversionwithinhumansegmentalduplications AT dewittwilliams increasedmutationandgeneconversionwithinhumansegmentalduplications AT guitartxavi increasedmutationandgeneconversionwithinhumansegmentalduplications AT goldbergmichaele increasedmutationandgeneconversionwithinhumansegmentalduplications AT rozanskiallisonn increasedmutationandgeneconversionwithinhumansegmentalduplications AT lucasjulian increasedmutationandgeneconversionwithinhumansegmentalduplications AT asrimobin increasedmutationandgeneconversionwithinhumansegmentalduplications AT increasedmutationandgeneconversionwithinhumansegmentalduplications AT munsonkatherinem increasedmutationandgeneconversionwithinhumansegmentalduplications AT lewisalexandrap increasedmutationandgeneconversionwithinhumansegmentalduplications AT hoekzemakendra increasedmutationandgeneconversionwithinhumansegmentalduplications AT logsdonglennisa increasedmutationandgeneconversionwithinhumansegmentalduplications AT porubskydavid increasedmutationandgeneconversionwithinhumansegmentalduplications AT patenbenedict increasedmutationandgeneconversionwithinhumansegmentalduplications AT harriskelley increasedmutationandgeneconversionwithinhumansegmentalduplications AT hsiehpinghsun increasedmutationandgeneconversionwithinhumansegmentalduplications AT eichlerevane increasedmutationandgeneconversionwithinhumansegmentalduplications |