Cargando…

Signals of Historical Interlocus Gene Conversion in Human Segmental Duplications

Standard methods of DNA sequence analysis assume that sequences evolve independently, yet this assumption may not be appropriate for segmental duplications that exchange variants via interlocus gene conversion (IGC). Here, we use high quality multiple sequence alignments from well-annotated segmenta...

Descripción completa

Detalles Bibliográficos
Autores principales: Dumont, Beth L., Eichler, Evan E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3790853/
https://www.ncbi.nlm.nih.gov/pubmed/24124524
http://dx.doi.org/10.1371/journal.pone.0075949
_version_ 1782286662161661952
author Dumont, Beth L.
Eichler, Evan E.
author_facet Dumont, Beth L.
Eichler, Evan E.
author_sort Dumont, Beth L.
collection PubMed
description Standard methods of DNA sequence analysis assume that sequences evolve independently, yet this assumption may not be appropriate for segmental duplications that exchange variants via interlocus gene conversion (IGC). Here, we use high quality multiple sequence alignments from well-annotated segmental duplications to systematically identify IGC signals in the human reference genome. Our analysis combines two complementary methods: (i) a paralog quartet method that uses DNA sequence simulations to identify a statistical excess of sites consistent with inter-paralog exchange, and (ii) the alignment-based method implemented in the GENECONV program. One-quarter (25.4%) of the paralog families in our analysis harbor clear IGC signals by the quartet approach. Using GENECONV, we identify 1477 gene conversion tracks that cumulatively span 1.54 Mb of the genome. Our analyses confirm the previously reported high rates of IGC in subtelomeric regions and Y-chromosome palindromes, and identify multiple novel IGC hotspots, including the pregnancy specific glycoproteins and the neuroblastoma breakpoint gene families. Although the duplication history of a paralog family is described by a single tree, we show that IGC has introduced incredible site-to-site variation in the evolutionary relationships among paralogs in the human genome. Our findings indicate that IGC has left significant footprints in patterns of sequence diversity across segmental duplications in the human genome, out-pacing the contributions of single base mutation by orders of magnitude. Collectively, the IGC signals we report comprise a catalog that will provide a critical reference for interpreting observed patterns of DNA sequence variation across duplicated genomic regions, including targets of recent adaptive evolution in humans.
format Online
Article
Text
id pubmed-3790853
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-37908532013-10-11 Signals of Historical Interlocus Gene Conversion in Human Segmental Duplications Dumont, Beth L. Eichler, Evan E. PLoS One Research Article Standard methods of DNA sequence analysis assume that sequences evolve independently, yet this assumption may not be appropriate for segmental duplications that exchange variants via interlocus gene conversion (IGC). Here, we use high quality multiple sequence alignments from well-annotated segmental duplications to systematically identify IGC signals in the human reference genome. Our analysis combines two complementary methods: (i) a paralog quartet method that uses DNA sequence simulations to identify a statistical excess of sites consistent with inter-paralog exchange, and (ii) the alignment-based method implemented in the GENECONV program. One-quarter (25.4%) of the paralog families in our analysis harbor clear IGC signals by the quartet approach. Using GENECONV, we identify 1477 gene conversion tracks that cumulatively span 1.54 Mb of the genome. Our analyses confirm the previously reported high rates of IGC in subtelomeric regions and Y-chromosome palindromes, and identify multiple novel IGC hotspots, including the pregnancy specific glycoproteins and the neuroblastoma breakpoint gene families. Although the duplication history of a paralog family is described by a single tree, we show that IGC has introduced incredible site-to-site variation in the evolutionary relationships among paralogs in the human genome. Our findings indicate that IGC has left significant footprints in patterns of sequence diversity across segmental duplications in the human genome, out-pacing the contributions of single base mutation by orders of magnitude. Collectively, the IGC signals we report comprise a catalog that will provide a critical reference for interpreting observed patterns of DNA sequence variation across duplicated genomic regions, including targets of recent adaptive evolution in humans. Public Library of Science 2013-10-04 /pmc/articles/PMC3790853/ /pubmed/24124524 http://dx.doi.org/10.1371/journal.pone.0075949 Text en © 2013 Dumont, Eichler http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Dumont, Beth L.
Eichler, Evan E.
Signals of Historical Interlocus Gene Conversion in Human Segmental Duplications
title Signals of Historical Interlocus Gene Conversion in Human Segmental Duplications
title_full Signals of Historical Interlocus Gene Conversion in Human Segmental Duplications
title_fullStr Signals of Historical Interlocus Gene Conversion in Human Segmental Duplications
title_full_unstemmed Signals of Historical Interlocus Gene Conversion in Human Segmental Duplications
title_short Signals of Historical Interlocus Gene Conversion in Human Segmental Duplications
title_sort signals of historical interlocus gene conversion in human segmental duplications
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3790853/
https://www.ncbi.nlm.nih.gov/pubmed/24124524
http://dx.doi.org/10.1371/journal.pone.0075949
work_keys_str_mv AT dumontbethl signalsofhistoricalinterlocusgeneconversioninhumansegmentalduplications
AT eichlerevane signalsofhistoricalinterlocusgeneconversioninhumansegmentalduplications