Cargando…

Efficient Inference of Recombination Hot Regions in Bacterial Genomes

In eukaryotes, detailed surveys of recombination rates have shown variation at multiple genomic scales and the presence of “hotspots” of highly elevated recombination. In bacteria, studies of recombination rate variation are less developed, in part because there are few analysis methods that take in...

Descripción completa

Detalles Bibliográficos
Autores principales: Yahara, Koji, Didelot, Xavier, Ansari, M. Azim, Sheppard, Samuel K., Falush, Daniel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4032127/
https://www.ncbi.nlm.nih.gov/pubmed/24586045
http://dx.doi.org/10.1093/molbev/msu082
_version_ 1782317597875765248
author Yahara, Koji
Didelot, Xavier
Ansari, M. Azim
Sheppard, Samuel K.
Falush, Daniel
author_facet Yahara, Koji
Didelot, Xavier
Ansari, M. Azim
Sheppard, Samuel K.
Falush, Daniel
author_sort Yahara, Koji
collection PubMed
description In eukaryotes, detailed surveys of recombination rates have shown variation at multiple genomic scales and the presence of “hotspots” of highly elevated recombination. In bacteria, studies of recombination rate variation are less developed, in part because there are few analysis methods that take into account the clonal context within which bacterial evolution occurs. Here, we focus in particular on identifying “hot regions” of the genome where DNA is transferred frequently between isolates. We present a computationally efficient algorithm based on the recently developed “chromosome painting” algorithm, which characterizes patterns of haplotype sharing across a genome. We compare the average genome wide painting, which principally reflects clonal descent, with the painting for each site which additionally reflects the specific deviations at the site due to recombination. Using simulated data, we show that hot regions have consistently higher deviations from the genome wide average than normal regions. We applied our approach to previously analyzed Escherichia coli genomes and revealed that the new method is highly correlated with the number of recombination events affecting each site inferred by ClonalOrigin, a method that is only applicable to small numbers of genomes. Furthermore, we analyzed recombination hot regions in Campylobacter jejuni by using 200 genomes. We identified three recombination hot regions, which are enriched for genes related to membrane proteins. Our approach and its implementation, which is downloadable from https://github.com/bioprojects/orderedPainting, will help to develop a new phase of population genomic studies of recombination in prokaryotes.
format Online
Article
Text
id pubmed-4032127
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-40321272014-06-18 Efficient Inference of Recombination Hot Regions in Bacterial Genomes Yahara, Koji Didelot, Xavier Ansari, M. Azim Sheppard, Samuel K. Falush, Daniel Mol Biol Evol Methods In eukaryotes, detailed surveys of recombination rates have shown variation at multiple genomic scales and the presence of “hotspots” of highly elevated recombination. In bacteria, studies of recombination rate variation are less developed, in part because there are few analysis methods that take into account the clonal context within which bacterial evolution occurs. Here, we focus in particular on identifying “hot regions” of the genome where DNA is transferred frequently between isolates. We present a computationally efficient algorithm based on the recently developed “chromosome painting” algorithm, which characterizes patterns of haplotype sharing across a genome. We compare the average genome wide painting, which principally reflects clonal descent, with the painting for each site which additionally reflects the specific deviations at the site due to recombination. Using simulated data, we show that hot regions have consistently higher deviations from the genome wide average than normal regions. We applied our approach to previously analyzed Escherichia coli genomes and revealed that the new method is highly correlated with the number of recombination events affecting each site inferred by ClonalOrigin, a method that is only applicable to small numbers of genomes. Furthermore, we analyzed recombination hot regions in Campylobacter jejuni by using 200 genomes. We identified three recombination hot regions, which are enriched for genes related to membrane proteins. Our approach and its implementation, which is downloadable from https://github.com/bioprojects/orderedPainting, will help to develop a new phase of population genomic studies of recombination in prokaryotes. Oxford University Press 2014-06 2014-02-27 /pmc/articles/PMC4032127/ /pubmed/24586045 http://dx.doi.org/10.1093/molbev/msu082 Text en © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods
Yahara, Koji
Didelot, Xavier
Ansari, M. Azim
Sheppard, Samuel K.
Falush, Daniel
Efficient Inference of Recombination Hot Regions in Bacterial Genomes
title Efficient Inference of Recombination Hot Regions in Bacterial Genomes
title_full Efficient Inference of Recombination Hot Regions in Bacterial Genomes
title_fullStr Efficient Inference of Recombination Hot Regions in Bacterial Genomes
title_full_unstemmed Efficient Inference of Recombination Hot Regions in Bacterial Genomes
title_short Efficient Inference of Recombination Hot Regions in Bacterial Genomes
title_sort efficient inference of recombination hot regions in bacterial genomes
topic Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4032127/
https://www.ncbi.nlm.nih.gov/pubmed/24586045
http://dx.doi.org/10.1093/molbev/msu082
work_keys_str_mv AT yaharakoji efficientinferenceofrecombinationhotregionsinbacterialgenomes
AT didelotxavier efficientinferenceofrecombinationhotregionsinbacterialgenomes
AT ansarimazim efficientinferenceofrecombinationhotregionsinbacterialgenomes
AT sheppardsamuelk efficientinferenceofrecombinationhotregionsinbacterialgenomes
AT falushdaniel efficientinferenceofrecombinationhotregionsinbacterialgenomes