Cargando…

Looking for trees in the forest: summary tree from posterior samples

BACKGROUND: Bayesian phylogenetic analysis generates a set of trees which are often condensed into a single tree representing the whole set. Many methods exist for selecting a representative topology for a set of unrooted trees, few exist for assigning branch lengths to a fixed topology, and even fe...

Descripción completa

Detalles Bibliográficos
Autores principales:	Heled, Joseph, Bouckaert, Remco R
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2013
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3853548/ https://www.ncbi.nlm.nih.gov/pubmed/24093883 http://dx.doi.org/10.1186/1471-2148-13-221

_version_	1782294644539785216
author	Heled, Joseph Bouckaert, Remco R
author_facet	Heled, Joseph Bouckaert, Remco R
author_sort	Heled, Joseph
collection	PubMed
description	BACKGROUND: Bayesian phylogenetic analysis generates a set of trees which are often condensed into a single tree representing the whole set. Many methods exist for selecting a representative topology for a set of unrooted trees, few exist for assigning branch lengths to a fixed topology, and even fewer for simultaneously setting the topology and branch lengths. However, there is very little research into locating a good representative for a set of rooted time trees like the ones obtained from a BEAST analysis. RESULTS: We empirically compare new and known methods for generating a summary tree. Some new methods are motivated by mathematical constructions such as tree metrics, while the rest employ tree concepts which work well in practice. These use more of the posterior than existing methods, which discard information not directly mapped to the chosen topology. Using results from a large number of simulations we assess the quality of a summary tree, measuring (a) how well it explains the sequence data under the model and (b) how close it is to the “truth”, i.e to the tree used to generate the sequences. CONCLUSIONS: Our simulations indicate that no single method is “best”. Methods producing good divergence time estimates have poor branch lengths and lower model fit, and vice versa. Using the results presented here, a user can choose the appropriate method based on the purpose of the summary tree.
format	Online Article Text
id	pubmed-3853548
institution	National Center for Biotechnology Information
language	English
publishDate	2013
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-38535482013-12-16 Looking for trees in the forest: summary tree from posterior samples Heled, Joseph Bouckaert, Remco R BMC Evol Biol Research Article BACKGROUND: Bayesian phylogenetic analysis generates a set of trees which are often condensed into a single tree representing the whole set. Many methods exist for selecting a representative topology for a set of unrooted trees, few exist for assigning branch lengths to a fixed topology, and even fewer for simultaneously setting the topology and branch lengths. However, there is very little research into locating a good representative for a set of rooted time trees like the ones obtained from a BEAST analysis. RESULTS: We empirically compare new and known methods for generating a summary tree. Some new methods are motivated by mathematical constructions such as tree metrics, while the rest employ tree concepts which work well in practice. These use more of the posterior than existing methods, which discard information not directly mapped to the chosen topology. Using results from a large number of simulations we assess the quality of a summary tree, measuring (a) how well it explains the sequence data under the model and (b) how close it is to the “truth”, i.e to the tree used to generate the sequences. CONCLUSIONS: Our simulations indicate that no single method is “best”. Methods producing good divergence time estimates have poor branch lengths and lower model fit, and vice versa. Using the results presented here, a user can choose the appropriate method based on the purpose of the summary tree. BioMed Central 2013-10-04 /pmc/articles/PMC3853548/ /pubmed/24093883 http://dx.doi.org/10.1186/1471-2148-13-221 Text en Copyright © 2013 Heled and Bouckaert; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Heled, Joseph Bouckaert, Remco R Looking for trees in the forest: summary tree from posterior samples
title	Looking for trees in the forest: summary tree from posterior samples
title_full	Looking for trees in the forest: summary tree from posterior samples
title_fullStr	Looking for trees in the forest: summary tree from posterior samples
title_full_unstemmed	Looking for trees in the forest: summary tree from posterior samples
title_short	Looking for trees in the forest: summary tree from posterior samples
title_sort	looking for trees in the forest: summary tree from posterior samples
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3853548/ https://www.ncbi.nlm.nih.gov/pubmed/24093883 http://dx.doi.org/10.1186/1471-2148-13-221
work_keys_str_mv	AT heledjoseph lookingfortreesintheforestsummarytreefromposteriorsamples AT bouckaertremcor lookingfortreesintheforestsummarytreefromposteriorsamples

Looking for trees in the forest: summary tree from posterior samples

Ejemplares similares