Cargando…

Interlocus gene conversion explains at least 2.7 % of single nucleotide variants in human segmental duplications

BACKGROUND: Interlocus gene conversion (IGC) is a recombination-based mechanism that results in the unidirectional transfer of short stretches of sequence between paralogous loci. Although IGC is a well-established mechanism of human disease, the extent to which this mutagenic process has shaped ove...

Descripción completa

Detalles Bibliográficos
Autor principal: Dumont, Beth L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4467073/
https://www.ncbi.nlm.nih.gov/pubmed/26077037
http://dx.doi.org/10.1186/s12864-015-1681-3
_version_ 1782376316063973376
author Dumont, Beth L.
author_facet Dumont, Beth L.
author_sort Dumont, Beth L.
collection PubMed
description BACKGROUND: Interlocus gene conversion (IGC) is a recombination-based mechanism that results in the unidirectional transfer of short stretches of sequence between paralogous loci. Although IGC is a well-established mechanism of human disease, the extent to which this mutagenic process has shaped overall patterns of segregating variation in multi-copy regions of the human genome remains unknown. One expected manifestation of IGC in population genomic data is the presence of one-to-one paralogous SNPs that segregate identical alleles. RESULTS: Here, I use SNP genotype calls from the low-coverage phase 3 release of the 1000 Genomes Project to identify 15,790 parallel, shared SNPs in duplicated regions of the human genome. My approach for identifying these sites accounts for the potential redundancy of short read mapping in multi-copy genomic regions, thereby effectively eliminating false positive SNP calls arising from paralogous sequence variation. I demonstrate that independent mutation events to identical nucleotides at paralogous sites are not a significant source of shared polymorphisms in the human genome, consistent with the interpretation that these sites are the outcome of historical IGC events. These putative signals of IGC are enriched in genomic contexts previously associated with non-allelic homologous recombination, including clear signals in gene families that form tandem intra-chromosomal clusters. CONCLUSIONS: Taken together, my analyses implicate IGC, not point mutation, as the mechanism generating at least 2.7 % of single nucleotide variants in duplicated regions of the human genome. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1681-3) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4467073
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-44670732015-06-16 Interlocus gene conversion explains at least 2.7 % of single nucleotide variants in human segmental duplications Dumont, Beth L. BMC Genomics Research Article BACKGROUND: Interlocus gene conversion (IGC) is a recombination-based mechanism that results in the unidirectional transfer of short stretches of sequence between paralogous loci. Although IGC is a well-established mechanism of human disease, the extent to which this mutagenic process has shaped overall patterns of segregating variation in multi-copy regions of the human genome remains unknown. One expected manifestation of IGC in population genomic data is the presence of one-to-one paralogous SNPs that segregate identical alleles. RESULTS: Here, I use SNP genotype calls from the low-coverage phase 3 release of the 1000 Genomes Project to identify 15,790 parallel, shared SNPs in duplicated regions of the human genome. My approach for identifying these sites accounts for the potential redundancy of short read mapping in multi-copy genomic regions, thereby effectively eliminating false positive SNP calls arising from paralogous sequence variation. I demonstrate that independent mutation events to identical nucleotides at paralogous sites are not a significant source of shared polymorphisms in the human genome, consistent with the interpretation that these sites are the outcome of historical IGC events. These putative signals of IGC are enriched in genomic contexts previously associated with non-allelic homologous recombination, including clear signals in gene families that form tandem intra-chromosomal clusters. CONCLUSIONS: Taken together, my analyses implicate IGC, not point mutation, as the mechanism generating at least 2.7 % of single nucleotide variants in duplicated regions of the human genome. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1681-3) contains supplementary material, which is available to authorized users. BioMed Central 2015-06-16 /pmc/articles/PMC4467073/ /pubmed/26077037 http://dx.doi.org/10.1186/s12864-015-1681-3 Text en © Dumont. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Dumont, Beth L.
Interlocus gene conversion explains at least 2.7 % of single nucleotide variants in human segmental duplications
title Interlocus gene conversion explains at least 2.7 % of single nucleotide variants in human segmental duplications
title_full Interlocus gene conversion explains at least 2.7 % of single nucleotide variants in human segmental duplications
title_fullStr Interlocus gene conversion explains at least 2.7 % of single nucleotide variants in human segmental duplications
title_full_unstemmed Interlocus gene conversion explains at least 2.7 % of single nucleotide variants in human segmental duplications
title_short Interlocus gene conversion explains at least 2.7 % of single nucleotide variants in human segmental duplications
title_sort interlocus gene conversion explains at least 2.7 % of single nucleotide variants in human segmental duplications
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4467073/
https://www.ncbi.nlm.nih.gov/pubmed/26077037
http://dx.doi.org/10.1186/s12864-015-1681-3
work_keys_str_mv AT dumontbethl interlocusgeneconversionexplainsatleast27ofsinglenucleotidevariantsinhumansegmentalduplications