Cargando…

Bayesian modeling of recombination events in bacterial populations

BACKGROUND: We consider the discovery of recombinant segments jointly with their origins within multilocus DNA sequences from bacteria representing heterogeneous populations of fairly closely related species. The currently available methods for recombination detection capable of probabilistic charac...

Descripción completa

Detalles Bibliográficos
Autores principales: Marttinen, Pekka, Baldwin, Adam, Hanage, William P, Dowson, Chris, Mahenthiralingam, Eshwar, Corander, Jukka
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2579306/
https://www.ncbi.nlm.nih.gov/pubmed/18840286
http://dx.doi.org/10.1186/1471-2105-9-421
_version_ 1782160568737595392
author Marttinen, Pekka
Baldwin, Adam
Hanage, William P
Dowson, Chris
Mahenthiralingam, Eshwar
Corander, Jukka
author_facet Marttinen, Pekka
Baldwin, Adam
Hanage, William P
Dowson, Chris
Mahenthiralingam, Eshwar
Corander, Jukka
author_sort Marttinen, Pekka
collection PubMed
description BACKGROUND: We consider the discovery of recombinant segments jointly with their origins within multilocus DNA sequences from bacteria representing heterogeneous populations of fairly closely related species. The currently available methods for recombination detection capable of probabilistic characterization of uncertainty have a limited applicability in practice as the number of strains in a data set increases. RESULTS: We introduce a Bayesian spatial structural model representing the continuum of origins over sites within the observed sequences, including a probabilistic characterization of uncertainty related to the origin of any particular site. To enable a statistically accurate and practically feasible approach to the analysis of large-scale data sets representing a single genus, we have developed a novel software tool (BRAT, Bayesian Recombination Tracker) implementing the model and the corresponding learning algorithm, which is capable of identifying the posterior optimal structure and to estimate the marginal posterior probabilities of putative origins over the sites. CONCLUSION: A multitude of challenging simulation scenarios and an analysis of real data from seven housekeeping genes of 120 strains of genus Burkholderia are used to illustrate the possibilities offered by our approach. The software is freely available for download at URL .
format Text
id pubmed-2579306
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-25793062008-11-05 Bayesian modeling of recombination events in bacterial populations Marttinen, Pekka Baldwin, Adam Hanage, William P Dowson, Chris Mahenthiralingam, Eshwar Corander, Jukka BMC Bioinformatics Methodology Article BACKGROUND: We consider the discovery of recombinant segments jointly with their origins within multilocus DNA sequences from bacteria representing heterogeneous populations of fairly closely related species. The currently available methods for recombination detection capable of probabilistic characterization of uncertainty have a limited applicability in practice as the number of strains in a data set increases. RESULTS: We introduce a Bayesian spatial structural model representing the continuum of origins over sites within the observed sequences, including a probabilistic characterization of uncertainty related to the origin of any particular site. To enable a statistically accurate and practically feasible approach to the analysis of large-scale data sets representing a single genus, we have developed a novel software tool (BRAT, Bayesian Recombination Tracker) implementing the model and the corresponding learning algorithm, which is capable of identifying the posterior optimal structure and to estimate the marginal posterior probabilities of putative origins over the sites. CONCLUSION: A multitude of challenging simulation scenarios and an analysis of real data from seven housekeeping genes of 120 strains of genus Burkholderia are used to illustrate the possibilities offered by our approach. The software is freely available for download at URL . BioMed Central 2008-10-07 /pmc/articles/PMC2579306/ /pubmed/18840286 http://dx.doi.org/10.1186/1471-2105-9-421 Text en Copyright © 2008 Marttinen et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Marttinen, Pekka
Baldwin, Adam
Hanage, William P
Dowson, Chris
Mahenthiralingam, Eshwar
Corander, Jukka
Bayesian modeling of recombination events in bacterial populations
title Bayesian modeling of recombination events in bacterial populations
title_full Bayesian modeling of recombination events in bacterial populations
title_fullStr Bayesian modeling of recombination events in bacterial populations
title_full_unstemmed Bayesian modeling of recombination events in bacterial populations
title_short Bayesian modeling of recombination events in bacterial populations
title_sort bayesian modeling of recombination events in bacterial populations
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2579306/
https://www.ncbi.nlm.nih.gov/pubmed/18840286
http://dx.doi.org/10.1186/1471-2105-9-421
work_keys_str_mv AT marttinenpekka bayesianmodelingofrecombinationeventsinbacterialpopulations
AT baldwinadam bayesianmodelingofrecombinationeventsinbacterialpopulations
AT hanagewilliamp bayesianmodelingofrecombinationeventsinbacterialpopulations
AT dowsonchris bayesianmodelingofrecombinationeventsinbacterialpopulations
AT mahenthiralingameshwar bayesianmodelingofrecombinationeventsinbacterialpopulations
AT coranderjukka bayesianmodelingofrecombinationeventsinbacterialpopulations