Cargando…

Looking for trees in the forest: summary tree from posterior samples

BACKGROUND: Bayesian phylogenetic analysis generates a set of trees which are often condensed into a single tree representing the whole set. Many methods exist for selecting a representative topology for a set of unrooted trees, few exist for assigning branch lengths to a fixed topology, and even fe...

Descripción completa

Detalles Bibliográficos
Autores principales: Heled, Joseph, Bouckaert, Remco R
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3853548/
https://www.ncbi.nlm.nih.gov/pubmed/24093883
http://dx.doi.org/10.1186/1471-2148-13-221
_version_ 1782294644539785216
author Heled, Joseph
Bouckaert, Remco R
author_facet Heled, Joseph
Bouckaert, Remco R
author_sort Heled, Joseph
collection PubMed
description BACKGROUND: Bayesian phylogenetic analysis generates a set of trees which are often condensed into a single tree representing the whole set. Many methods exist for selecting a representative topology for a set of unrooted trees, few exist for assigning branch lengths to a fixed topology, and even fewer for simultaneously setting the topology and branch lengths. However, there is very little research into locating a good representative for a set of rooted time trees like the ones obtained from a BEAST analysis. RESULTS: We empirically compare new and known methods for generating a summary tree. Some new methods are motivated by mathematical constructions such as tree metrics, while the rest employ tree concepts which work well in practice. These use more of the posterior than existing methods, which discard information not directly mapped to the chosen topology. Using results from a large number of simulations we assess the quality of a summary tree, measuring (a) how well it explains the sequence data under the model and (b) how close it is to the “truth”, i.e to the tree used to generate the sequences. CONCLUSIONS: Our simulations indicate that no single method is “best”. Methods producing good divergence time estimates have poor branch lengths and lower model fit, and vice versa. Using the results presented here, a user can choose the appropriate method based on the purpose of the summary tree.
format Online
Article
Text
id pubmed-3853548
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-38535482013-12-16 Looking for trees in the forest: summary tree from posterior samples Heled, Joseph Bouckaert, Remco R BMC Evol Biol Research Article BACKGROUND: Bayesian phylogenetic analysis generates a set of trees which are often condensed into a single tree representing the whole set. Many methods exist for selecting a representative topology for a set of unrooted trees, few exist for assigning branch lengths to a fixed topology, and even fewer for simultaneously setting the topology and branch lengths. However, there is very little research into locating a good representative for a set of rooted time trees like the ones obtained from a BEAST analysis. RESULTS: We empirically compare new and known methods for generating a summary tree. Some new methods are motivated by mathematical constructions such as tree metrics, while the rest employ tree concepts which work well in practice. These use more of the posterior than existing methods, which discard information not directly mapped to the chosen topology. Using results from a large number of simulations we assess the quality of a summary tree, measuring (a) how well it explains the sequence data under the model and (b) how close it is to the “truth”, i.e to the tree used to generate the sequences. CONCLUSIONS: Our simulations indicate that no single method is “best”. Methods producing good divergence time estimates have poor branch lengths and lower model fit, and vice versa. Using the results presented here, a user can choose the appropriate method based on the purpose of the summary tree. BioMed Central 2013-10-04 /pmc/articles/PMC3853548/ /pubmed/24093883 http://dx.doi.org/10.1186/1471-2148-13-221 Text en Copyright © 2013 Heled and Bouckaert; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Heled, Joseph
Bouckaert, Remco R
Looking for trees in the forest: summary tree from posterior samples
title Looking for trees in the forest: summary tree from posterior samples
title_full Looking for trees in the forest: summary tree from posterior samples
title_fullStr Looking for trees in the forest: summary tree from posterior samples
title_full_unstemmed Looking for trees in the forest: summary tree from posterior samples
title_short Looking for trees in the forest: summary tree from posterior samples
title_sort looking for trees in the forest: summary tree from posterior samples
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3853548/
https://www.ncbi.nlm.nih.gov/pubmed/24093883
http://dx.doi.org/10.1186/1471-2148-13-221
work_keys_str_mv AT heledjoseph lookingfortreesintheforestsummarytreefromposteriorsamples
AT bouckaertremcor lookingfortreesintheforestsummarytreefromposteriorsamples