Cargando…

Increased mutation and gene conversion within human segmental duplications

Single-nucleotide variants (SNVs) in segmental duplications (SDs) have not been systematically assessed because of the limitations of mapping short-read sequencing data(1,2). Here we constructed 1:1 unambiguous alignments spanning high-identity SDs across 102 human haplotypes and compared the patter...

Descripción completa

Detalles Bibliográficos
Autores principales: Vollger, Mitchell R., Dishuck, Philip C., Harvey, William T., DeWitt, William S., Guitart, Xavi, Goldberg, Michael E., Rozanski, Allison N., Lucas, Julian, Asri, Mobin, Munson, Katherine M., Lewis, Alexandra P., Hoekzema, Kendra, Logsdon, Glennis A., Porubsky, David, Paten, Benedict, Harris, Kelley, Hsieh, PingHsun, Eichler, Evan E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10172114/
https://www.ncbi.nlm.nih.gov/pubmed/37165237
http://dx.doi.org/10.1038/s41586-023-05895-y
_version_ 1785039555194257408
author Vollger, Mitchell R.
Dishuck, Philip C.
Harvey, William T.
DeWitt, William S.
Guitart, Xavi
Goldberg, Michael E.
Rozanski, Allison N.
Lucas, Julian
Asri, Mobin
Munson, Katherine M.
Lewis, Alexandra P.
Hoekzema, Kendra
Logsdon, Glennis A.
Porubsky, David
Paten, Benedict
Harris, Kelley
Hsieh, PingHsun
Eichler, Evan E.
author_facet Vollger, Mitchell R.
Dishuck, Philip C.
Harvey, William T.
DeWitt, William S.
Guitart, Xavi
Goldberg, Michael E.
Rozanski, Allison N.
Lucas, Julian
Asri, Mobin
Munson, Katherine M.
Lewis, Alexandra P.
Hoekzema, Kendra
Logsdon, Glennis A.
Porubsky, David
Paten, Benedict
Harris, Kelley
Hsieh, PingHsun
Eichler, Evan E.
author_sort Vollger, Mitchell R.
collection PubMed
description Single-nucleotide variants (SNVs) in segmental duplications (SDs) have not been systematically assessed because of the limitations of mapping short-read sequencing data(1,2). Here we constructed 1:1 unambiguous alignments spanning high-identity SDs across 102 human haplotypes and compared the pattern of SNVs between unique and duplicated regions(3,4). We find that human SNVs are elevated 60% in SDs compared to unique regions and estimate that at least 23% of this increase is due to interlocus gene conversion (IGC) with up to 4.3 megabase pairs of SD sequence converted on average per human haplotype. We develop a genome-wide map of IGC donors and acceptors, including 498 acceptor and 454 donor hotspots affecting the exons of about 800 protein-coding genes. These include 171 genes that have ‘relocated’ on average 1.61 megabase pairs in a subset of human haplotypes. Using a coalescent framework, we show that SD regions are slightly evolutionarily older when compared to unique sequences, probably owing to IGC. SNVs in SDs, however, show a distinct mutational spectrum: a 27.1% increase in transversions that convert cytosine to guanine or the reverse across all triplet contexts and a 7.6% reduction in the frequency of CpG-associated mutations when compared to unique DNA. We reason that these distinct mutational properties help to maintain an overall higher GC content of SD DNA compared to that of unique DNA, probably driven by GC-biased conversion between paralogous sequences(5,6).
format Online
Article
Text
id pubmed-10172114
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-101721142023-05-12 Increased mutation and gene conversion within human segmental duplications Vollger, Mitchell R. Dishuck, Philip C. Harvey, William T. DeWitt, William S. Guitart, Xavi Goldberg, Michael E. Rozanski, Allison N. Lucas, Julian Asri, Mobin Munson, Katherine M. Lewis, Alexandra P. Hoekzema, Kendra Logsdon, Glennis A. Porubsky, David Paten, Benedict Harris, Kelley Hsieh, PingHsun Eichler, Evan E. Nature Article Single-nucleotide variants (SNVs) in segmental duplications (SDs) have not been systematically assessed because of the limitations of mapping short-read sequencing data(1,2). Here we constructed 1:1 unambiguous alignments spanning high-identity SDs across 102 human haplotypes and compared the pattern of SNVs between unique and duplicated regions(3,4). We find that human SNVs are elevated 60% in SDs compared to unique regions and estimate that at least 23% of this increase is due to interlocus gene conversion (IGC) with up to 4.3 megabase pairs of SD sequence converted on average per human haplotype. We develop a genome-wide map of IGC donors and acceptors, including 498 acceptor and 454 donor hotspots affecting the exons of about 800 protein-coding genes. These include 171 genes that have ‘relocated’ on average 1.61 megabase pairs in a subset of human haplotypes. Using a coalescent framework, we show that SD regions are slightly evolutionarily older when compared to unique sequences, probably owing to IGC. SNVs in SDs, however, show a distinct mutational spectrum: a 27.1% increase in transversions that convert cytosine to guanine or the reverse across all triplet contexts and a 7.6% reduction in the frequency of CpG-associated mutations when compared to unique DNA. We reason that these distinct mutational properties help to maintain an overall higher GC content of SD DNA compared to that of unique DNA, probably driven by GC-biased conversion between paralogous sequences(5,6). Nature Publishing Group UK 2023-05-10 2023 /pmc/articles/PMC10172114/ /pubmed/37165237 http://dx.doi.org/10.1038/s41586-023-05895-y Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Vollger, Mitchell R.
Dishuck, Philip C.
Harvey, William T.
DeWitt, William S.
Guitart, Xavi
Goldberg, Michael E.
Rozanski, Allison N.
Lucas, Julian
Asri, Mobin
Munson, Katherine M.
Lewis, Alexandra P.
Hoekzema, Kendra
Logsdon, Glennis A.
Porubsky, David
Paten, Benedict
Harris, Kelley
Hsieh, PingHsun
Eichler, Evan E.
Increased mutation and gene conversion within human segmental duplications
title Increased mutation and gene conversion within human segmental duplications
title_full Increased mutation and gene conversion within human segmental duplications
title_fullStr Increased mutation and gene conversion within human segmental duplications
title_full_unstemmed Increased mutation and gene conversion within human segmental duplications
title_short Increased mutation and gene conversion within human segmental duplications
title_sort increased mutation and gene conversion within human segmental duplications
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10172114/
https://www.ncbi.nlm.nih.gov/pubmed/37165237
http://dx.doi.org/10.1038/s41586-023-05895-y
work_keys_str_mv AT vollgermitchellr increasedmutationandgeneconversionwithinhumansegmentalduplications
AT dishuckphilipc increasedmutationandgeneconversionwithinhumansegmentalduplications
AT harveywilliamt increasedmutationandgeneconversionwithinhumansegmentalduplications
AT dewittwilliams increasedmutationandgeneconversionwithinhumansegmentalduplications
AT guitartxavi increasedmutationandgeneconversionwithinhumansegmentalduplications
AT goldbergmichaele increasedmutationandgeneconversionwithinhumansegmentalduplications
AT rozanskiallisonn increasedmutationandgeneconversionwithinhumansegmentalduplications
AT lucasjulian increasedmutationandgeneconversionwithinhumansegmentalduplications
AT asrimobin increasedmutationandgeneconversionwithinhumansegmentalduplications
AT increasedmutationandgeneconversionwithinhumansegmentalduplications
AT munsonkatherinem increasedmutationandgeneconversionwithinhumansegmentalduplications
AT lewisalexandrap increasedmutationandgeneconversionwithinhumansegmentalduplications
AT hoekzemakendra increasedmutationandgeneconversionwithinhumansegmentalduplications
AT logsdonglennisa increasedmutationandgeneconversionwithinhumansegmentalduplications
AT porubskydavid increasedmutationandgeneconversionwithinhumansegmentalduplications
AT patenbenedict increasedmutationandgeneconversionwithinhumansegmentalduplications
AT harriskelley increasedmutationandgeneconversionwithinhumansegmentalduplications
AT hsiehpinghsun increasedmutationandgeneconversionwithinhumansegmentalduplications
AT eichlerevane increasedmutationandgeneconversionwithinhumansegmentalduplications