Cargando…
Efficient Inference of Recombination Hot Regions in Bacterial Genomes
In eukaryotes, detailed surveys of recombination rates have shown variation at multiple genomic scales and the presence of “hotspots” of highly elevated recombination. In bacteria, studies of recombination rate variation are less developed, in part because there are few analysis methods that take in...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4032127/ https://www.ncbi.nlm.nih.gov/pubmed/24586045 http://dx.doi.org/10.1093/molbev/msu082 |
_version_ | 1782317597875765248 |
---|---|
author | Yahara, Koji Didelot, Xavier Ansari, M. Azim Sheppard, Samuel K. Falush, Daniel |
author_facet | Yahara, Koji Didelot, Xavier Ansari, M. Azim Sheppard, Samuel K. Falush, Daniel |
author_sort | Yahara, Koji |
collection | PubMed |
description | In eukaryotes, detailed surveys of recombination rates have shown variation at multiple genomic scales and the presence of “hotspots” of highly elevated recombination. In bacteria, studies of recombination rate variation are less developed, in part because there are few analysis methods that take into account the clonal context within which bacterial evolution occurs. Here, we focus in particular on identifying “hot regions” of the genome where DNA is transferred frequently between isolates. We present a computationally efficient algorithm based on the recently developed “chromosome painting” algorithm, which characterizes patterns of haplotype sharing across a genome. We compare the average genome wide painting, which principally reflects clonal descent, with the painting for each site which additionally reflects the specific deviations at the site due to recombination. Using simulated data, we show that hot regions have consistently higher deviations from the genome wide average than normal regions. We applied our approach to previously analyzed Escherichia coli genomes and revealed that the new method is highly correlated with the number of recombination events affecting each site inferred by ClonalOrigin, a method that is only applicable to small numbers of genomes. Furthermore, we analyzed recombination hot regions in Campylobacter jejuni by using 200 genomes. We identified three recombination hot regions, which are enriched for genes related to membrane proteins. Our approach and its implementation, which is downloadable from https://github.com/bioprojects/orderedPainting, will help to develop a new phase of population genomic studies of recombination in prokaryotes. |
format | Online Article Text |
id | pubmed-4032127 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-40321272014-06-18 Efficient Inference of Recombination Hot Regions in Bacterial Genomes Yahara, Koji Didelot, Xavier Ansari, M. Azim Sheppard, Samuel K. Falush, Daniel Mol Biol Evol Methods In eukaryotes, detailed surveys of recombination rates have shown variation at multiple genomic scales and the presence of “hotspots” of highly elevated recombination. In bacteria, studies of recombination rate variation are less developed, in part because there are few analysis methods that take into account the clonal context within which bacterial evolution occurs. Here, we focus in particular on identifying “hot regions” of the genome where DNA is transferred frequently between isolates. We present a computationally efficient algorithm based on the recently developed “chromosome painting” algorithm, which characterizes patterns of haplotype sharing across a genome. We compare the average genome wide painting, which principally reflects clonal descent, with the painting for each site which additionally reflects the specific deviations at the site due to recombination. Using simulated data, we show that hot regions have consistently higher deviations from the genome wide average than normal regions. We applied our approach to previously analyzed Escherichia coli genomes and revealed that the new method is highly correlated with the number of recombination events affecting each site inferred by ClonalOrigin, a method that is only applicable to small numbers of genomes. Furthermore, we analyzed recombination hot regions in Campylobacter jejuni by using 200 genomes. We identified three recombination hot regions, which are enriched for genes related to membrane proteins. Our approach and its implementation, which is downloadable from https://github.com/bioprojects/orderedPainting, will help to develop a new phase of population genomic studies of recombination in prokaryotes. Oxford University Press 2014-06 2014-02-27 /pmc/articles/PMC4032127/ /pubmed/24586045 http://dx.doi.org/10.1093/molbev/msu082 Text en © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Yahara, Koji Didelot, Xavier Ansari, M. Azim Sheppard, Samuel K. Falush, Daniel Efficient Inference of Recombination Hot Regions in Bacterial Genomes |
title | Efficient Inference of Recombination Hot Regions in Bacterial Genomes |
title_full | Efficient Inference of Recombination Hot Regions in Bacterial Genomes |
title_fullStr | Efficient Inference of Recombination Hot Regions in Bacterial Genomes |
title_full_unstemmed | Efficient Inference of Recombination Hot Regions in Bacterial Genomes |
title_short | Efficient Inference of Recombination Hot Regions in Bacterial Genomes |
title_sort | efficient inference of recombination hot regions in bacterial genomes |
topic | Methods |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4032127/ https://www.ncbi.nlm.nih.gov/pubmed/24586045 http://dx.doi.org/10.1093/molbev/msu082 |
work_keys_str_mv | AT yaharakoji efficientinferenceofrecombinationhotregionsinbacterialgenomes AT didelotxavier efficientinferenceofrecombinationhotregionsinbacterialgenomes AT ansarimazim efficientinferenceofrecombinationhotregionsinbacterialgenomes AT sheppardsamuelk efficientinferenceofrecombinationhotregionsinbacterialgenomes AT falushdaniel efficientinferenceofrecombinationhotregionsinbacterialgenomes |