Cargando…

Rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using Gubbins

The emergence of new sequencing technologies has facilitated the use of bacterial whole genome alignments for evolutionary studies and outbreak analyses. These datasets, of increasing size, often include examples of multiple different mechanisms of horizontal sequence transfer resulting in substanti...

Descripción completa

Detalles Bibliográficos
Autores principales: Croucher, Nicholas J., Page, Andrew J., Connor, Thomas R., Delaney, Aidan J., Keane, Jacqueline A., Bentley, Stephen D., Parkhill, Julian, Harris, Simon R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4330336/
https://www.ncbi.nlm.nih.gov/pubmed/25414349
http://dx.doi.org/10.1093/nar/gku1196
_version_ 1782357566626463744
author Croucher, Nicholas J.
Page, Andrew J.
Connor, Thomas R.
Delaney, Aidan J.
Keane, Jacqueline A.
Bentley, Stephen D.
Parkhill, Julian
Harris, Simon R.
author_facet Croucher, Nicholas J.
Page, Andrew J.
Connor, Thomas R.
Delaney, Aidan J.
Keane, Jacqueline A.
Bentley, Stephen D.
Parkhill, Julian
Harris, Simon R.
author_sort Croucher, Nicholas J.
collection PubMed
description The emergence of new sequencing technologies has facilitated the use of bacterial whole genome alignments for evolutionary studies and outbreak analyses. These datasets, of increasing size, often include examples of multiple different mechanisms of horizontal sequence transfer resulting in substantial alterations to prokaryotic chromosomes. The impact of these processes demands rapid and flexible approaches able to account for recombination when reconstructing isolates’ recent diversification. Gubbins is an iterative algorithm that uses spatial scanning statistics to identify loci containing elevated densities of base substitutions suggestive of horizontal sequence transfer while concurrently constructing a maximum likelihood phylogeny based on the putative point mutations outside these regions of high sequence diversity. Simulations demonstrate the algorithm generates highly accurate reconstructions under realistically parameterized models of bacterial evolution, and achieves convergence in only a few hours on alignments of hundreds of bacterial genome sequences. Gubbins is appropriate for reconstructing the recent evolutionary history of a variety of haploid genotype alignments, as it makes no assumptions about the underlying mechanism of recombination. The software is freely available for download at github.com/sanger-pathogens/Gubbins, implemented in Python and C and supported on Linux and Mac OS X.
format Online
Article
Text
id pubmed-4330336
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-43303362015-03-18 Rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using Gubbins Croucher, Nicholas J. Page, Andrew J. Connor, Thomas R. Delaney, Aidan J. Keane, Jacqueline A. Bentley, Stephen D. Parkhill, Julian Harris, Simon R. Nucleic Acids Res Methods Online The emergence of new sequencing technologies has facilitated the use of bacterial whole genome alignments for evolutionary studies and outbreak analyses. These datasets, of increasing size, often include examples of multiple different mechanisms of horizontal sequence transfer resulting in substantial alterations to prokaryotic chromosomes. The impact of these processes demands rapid and flexible approaches able to account for recombination when reconstructing isolates’ recent diversification. Gubbins is an iterative algorithm that uses spatial scanning statistics to identify loci containing elevated densities of base substitutions suggestive of horizontal sequence transfer while concurrently constructing a maximum likelihood phylogeny based on the putative point mutations outside these regions of high sequence diversity. Simulations demonstrate the algorithm generates highly accurate reconstructions under realistically parameterized models of bacterial evolution, and achieves convergence in only a few hours on alignments of hundreds of bacterial genome sequences. Gubbins is appropriate for reconstructing the recent evolutionary history of a variety of haploid genotype alignments, as it makes no assumptions about the underlying mechanism of recombination. The software is freely available for download at github.com/sanger-pathogens/Gubbins, implemented in Python and C and supported on Linux and Mac OS X. Oxford University Press 2015-02-18 2014-11-20 /pmc/articles/PMC4330336/ /pubmed/25414349 http://dx.doi.org/10.1093/nar/gku1196 Text en © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Croucher, Nicholas J.
Page, Andrew J.
Connor, Thomas R.
Delaney, Aidan J.
Keane, Jacqueline A.
Bentley, Stephen D.
Parkhill, Julian
Harris, Simon R.
Rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using Gubbins
title Rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using Gubbins
title_full Rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using Gubbins
title_fullStr Rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using Gubbins
title_full_unstemmed Rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using Gubbins
title_short Rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using Gubbins
title_sort rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using gubbins
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4330336/
https://www.ncbi.nlm.nih.gov/pubmed/25414349
http://dx.doi.org/10.1093/nar/gku1196
work_keys_str_mv AT crouchernicholasj rapidphylogeneticanalysisoflargesamplesofrecombinantbacterialwholegenomesequencesusinggubbins
AT pageandrewj rapidphylogeneticanalysisoflargesamplesofrecombinantbacterialwholegenomesequencesusinggubbins
AT connorthomasr rapidphylogeneticanalysisoflargesamplesofrecombinantbacterialwholegenomesequencesusinggubbins
AT delaneyaidanj rapidphylogeneticanalysisoflargesamplesofrecombinantbacterialwholegenomesequencesusinggubbins
AT keanejacquelinea rapidphylogeneticanalysisoflargesamplesofrecombinantbacterialwholegenomesequencesusinggubbins
AT bentleystephend rapidphylogeneticanalysisoflargesamplesofrecombinantbacterialwholegenomesequencesusinggubbins
AT parkhilljulian rapidphylogeneticanalysisoflargesamplesofrecombinantbacterialwholegenomesequencesusinggubbins
AT harrissimonr rapidphylogeneticanalysisoflargesamplesofrecombinantbacterialwholegenomesequencesusinggubbins