Cargando…

Assessing the reliability of eBURST using simulated populations with known ancestry

BACKGROUND: The program eBURST uses multilocus sequence typing data to divide bacterial populations into groups of closely related strains (clonal complexes), predicts the founding genotype of each group, and displays the patterns of recent evolutionary descent of all other strains in the group from...

Descripción completa

Detalles Bibliográficos
Autores principales: Turner, Katherine ME, Hanage, William P, Fraser, Christophe, Connor, Thomas R, Spratt, Brian G
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1865383/
https://www.ncbi.nlm.nih.gov/pubmed/17430587
http://dx.doi.org/10.1186/1471-2180-7-30
_version_ 1782133224495906816
author Turner, Katherine ME
Hanage, William P
Fraser, Christophe
Connor, Thomas R
Spratt, Brian G
author_facet Turner, Katherine ME
Hanage, William P
Fraser, Christophe
Connor, Thomas R
Spratt, Brian G
author_sort Turner, Katherine ME
collection PubMed
description BACKGROUND: The program eBURST uses multilocus sequence typing data to divide bacterial populations into groups of closely related strains (clonal complexes), predicts the founding genotype of each group, and displays the patterns of recent evolutionary descent of all other strains in the group from the founder. The reliability of eBURST was evaluated using populations simulated with different levels of recombination in which the ancestry of all strains was known. RESULTS: For strictly clonal simulations, where all allelic change is due to point mutation, the groups of related strains identified by eBURST were very similar to those expected from the true ancestry and most of the true ancestor-descendant relationships (90–98%) were identified by eBURST. Populations simulated with low or moderate levels of recombination showed similarly high performance but the reliability of eBURST declined with increasing recombination to mutation ratio. Populations simulated under a high recombination to mutation ratio were dominated by a single large straggly eBURST group, which resulted from the incorrect linking of unrelated groups of strains into the same eBURST group. The reliability of the ancestor-descendant links in eBURST diagrams was related to the proportion of strains in the largest eBURST group, which provides a useful guide to when eBURST is likely to be unreliable. CONCLUSION: Examination of eBURST groups within populations of a range of bacterial species showed that most were within the range in which eBURST is reliable, and only a small number (e.g. Burkholderia pseudomallei and Enterococcus faecium) appeared to have such high rates of recombination that eBURST is likely to be unreliable. The study also demonstrates how three simple tests in eBURST v3 can be used to detect unreliable eBURST performance and recognise populations in which there appears to be a high rate of recombination relative to mutation.
format Text
id pubmed-1865383
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-18653832007-05-04 Assessing the reliability of eBURST using simulated populations with known ancestry Turner, Katherine ME Hanage, William P Fraser, Christophe Connor, Thomas R Spratt, Brian G BMC Microbiol Research Article BACKGROUND: The program eBURST uses multilocus sequence typing data to divide bacterial populations into groups of closely related strains (clonal complexes), predicts the founding genotype of each group, and displays the patterns of recent evolutionary descent of all other strains in the group from the founder. The reliability of eBURST was evaluated using populations simulated with different levels of recombination in which the ancestry of all strains was known. RESULTS: For strictly clonal simulations, where all allelic change is due to point mutation, the groups of related strains identified by eBURST were very similar to those expected from the true ancestry and most of the true ancestor-descendant relationships (90–98%) were identified by eBURST. Populations simulated with low or moderate levels of recombination showed similarly high performance but the reliability of eBURST declined with increasing recombination to mutation ratio. Populations simulated under a high recombination to mutation ratio were dominated by a single large straggly eBURST group, which resulted from the incorrect linking of unrelated groups of strains into the same eBURST group. The reliability of the ancestor-descendant links in eBURST diagrams was related to the proportion of strains in the largest eBURST group, which provides a useful guide to when eBURST is likely to be unreliable. CONCLUSION: Examination of eBURST groups within populations of a range of bacterial species showed that most were within the range in which eBURST is reliable, and only a small number (e.g. Burkholderia pseudomallei and Enterococcus faecium) appeared to have such high rates of recombination that eBURST is likely to be unreliable. The study also demonstrates how three simple tests in eBURST v3 can be used to detect unreliable eBURST performance and recognise populations in which there appears to be a high rate of recombination relative to mutation. BioMed Central 2007-04-12 /pmc/articles/PMC1865383/ /pubmed/17430587 http://dx.doi.org/10.1186/1471-2180-7-30 Text en Copyright © 2007 Turner et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Turner, Katherine ME
Hanage, William P
Fraser, Christophe
Connor, Thomas R
Spratt, Brian G
Assessing the reliability of eBURST using simulated populations with known ancestry
title Assessing the reliability of eBURST using simulated populations with known ancestry
title_full Assessing the reliability of eBURST using simulated populations with known ancestry
title_fullStr Assessing the reliability of eBURST using simulated populations with known ancestry
title_full_unstemmed Assessing the reliability of eBURST using simulated populations with known ancestry
title_short Assessing the reliability of eBURST using simulated populations with known ancestry
title_sort assessing the reliability of eburst using simulated populations with known ancestry
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1865383/
https://www.ncbi.nlm.nih.gov/pubmed/17430587
http://dx.doi.org/10.1186/1471-2180-7-30
work_keys_str_mv AT turnerkatherineme assessingthereliabilityofeburstusingsimulatedpopulationswithknownancestry
AT hanagewilliamp assessingthereliabilityofeburstusingsimulatedpopulationswithknownancestry
AT fraserchristophe assessingthereliabilityofeburstusingsimulatedpopulationswithknownancestry
AT connorthomasr assessingthereliabilityofeburstusingsimulatedpopulationswithknownancestry
AT sprattbriang assessingthereliabilityofeburstusingsimulatedpopulationswithknownancestry