Cargando…

An Efficient Independence Sampler for Updating Branches in Bayesian Markov chain Monte Carlo Sampling of Phylogenetic Trees

Sampling tree space is the most challenging aspect of Bayesian phylogenetic inference. The sheer number of alternative topologies is problematic by itself. In addition, the complex dependency between branch lengths and topology increases the difficulty of moving efficiently among topologies. Current...

Descripción completa

Detalles Bibliográficos
Autores principales: Aberer, Andre J., Stamatakis, Alexandros, Ronquist, Fredrik
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4678251/
https://www.ncbi.nlm.nih.gov/pubmed/26231183
http://dx.doi.org/10.1093/sysbio/syv051
_version_ 1782405424913317888
author Aberer, Andre J.
Stamatakis, Alexandros
Ronquist, Fredrik
author_facet Aberer, Andre J.
Stamatakis, Alexandros
Ronquist, Fredrik
author_sort Aberer, Andre J.
collection PubMed
description Sampling tree space is the most challenging aspect of Bayesian phylogenetic inference. The sheer number of alternative topologies is problematic by itself. In addition, the complex dependency between branch lengths and topology increases the difficulty of moving efficiently among topologies. Current tree proposals are fast but sample new trees using primitive transformations or re-mappings of old branch lengths. This reduces acceptance rates and presumably slows down convergence and mixing. Here, we explore branch proposals that do not rely on old branch lengths but instead are based on approximations of the conditional posterior. Using a diverse set of empirical data sets, we show that most conditional branch posteriors can be accurately approximated via a [Formula: see text] distribution. We empirically determine the relationship between the logarithmic conditional posterior density, its derivatives, and the characteristics of the branch posterior. We use these relationships to derive an independence sampler for proposing branches with an acceptance ratio of ∼90% on most data sets. This proposal samples branches between 2× and 3× more efficiently than traditional proposals with respect to the effective sample size per unit of runtime. We also compare the performance of standard topology proposals with hybrid proposals that use the new independence sampler to update those branches that are most affected by the topological change. Our results show that hybrid proposals can sometimes noticeably decrease the number of generations necessary for topological convergence. Inconsistent performance gains indicate that branch updates are not the limiting factor in improving topological convergence for the currently employed set of proposals. However, our independence sampler might be essential for the construction of novel tree proposals that apply more radical topology changes.
format Online
Article
Text
id pubmed-4678251
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-46782512015-12-15 An Efficient Independence Sampler for Updating Branches in Bayesian Markov chain Monte Carlo Sampling of Phylogenetic Trees Aberer, Andre J. Stamatakis, Alexandros Ronquist, Fredrik Syst Biol Regular Articles Sampling tree space is the most challenging aspect of Bayesian phylogenetic inference. The sheer number of alternative topologies is problematic by itself. In addition, the complex dependency between branch lengths and topology increases the difficulty of moving efficiently among topologies. Current tree proposals are fast but sample new trees using primitive transformations or re-mappings of old branch lengths. This reduces acceptance rates and presumably slows down convergence and mixing. Here, we explore branch proposals that do not rely on old branch lengths but instead are based on approximations of the conditional posterior. Using a diverse set of empirical data sets, we show that most conditional branch posteriors can be accurately approximated via a [Formula: see text] distribution. We empirically determine the relationship between the logarithmic conditional posterior density, its derivatives, and the characteristics of the branch posterior. We use these relationships to derive an independence sampler for proposing branches with an acceptance ratio of ∼90% on most data sets. This proposal samples branches between 2× and 3× more efficiently than traditional proposals with respect to the effective sample size per unit of runtime. We also compare the performance of standard topology proposals with hybrid proposals that use the new independence sampler to update those branches that are most affected by the topological change. Our results show that hybrid proposals can sometimes noticeably decrease the number of generations necessary for topological convergence. Inconsistent performance gains indicate that branch updates are not the limiting factor in improving topological convergence for the currently employed set of proposals. However, our independence sampler might be essential for the construction of novel tree proposals that apply more radical topology changes. Oxford University Press 2016-01 2015-07-30 /pmc/articles/PMC4678251/ /pubmed/26231183 http://dx.doi.org/10.1093/sysbio/syv051 Text en © The Author(s) 2015. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Regular Articles
Aberer, Andre J.
Stamatakis, Alexandros
Ronquist, Fredrik
An Efficient Independence Sampler for Updating Branches in Bayesian Markov chain Monte Carlo Sampling of Phylogenetic Trees
title An Efficient Independence Sampler for Updating Branches in Bayesian Markov chain Monte Carlo Sampling of Phylogenetic Trees
title_full An Efficient Independence Sampler for Updating Branches in Bayesian Markov chain Monte Carlo Sampling of Phylogenetic Trees
title_fullStr An Efficient Independence Sampler for Updating Branches in Bayesian Markov chain Monte Carlo Sampling of Phylogenetic Trees
title_full_unstemmed An Efficient Independence Sampler for Updating Branches in Bayesian Markov chain Monte Carlo Sampling of Phylogenetic Trees
title_short An Efficient Independence Sampler for Updating Branches in Bayesian Markov chain Monte Carlo Sampling of Phylogenetic Trees
title_sort efficient independence sampler for updating branches in bayesian markov chain monte carlo sampling of phylogenetic trees
topic Regular Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4678251/
https://www.ncbi.nlm.nih.gov/pubmed/26231183
http://dx.doi.org/10.1093/sysbio/syv051
work_keys_str_mv AT abererandrej anefficientindependencesamplerforupdatingbranchesinbayesianmarkovchainmontecarlosamplingofphylogenetictrees
AT stamatakisalexandros anefficientindependencesamplerforupdatingbranchesinbayesianmarkovchainmontecarlosamplingofphylogenetictrees
AT ronquistfredrik anefficientindependencesamplerforupdatingbranchesinbayesianmarkovchainmontecarlosamplingofphylogenetictrees
AT abererandrej efficientindependencesamplerforupdatingbranchesinbayesianmarkovchainmontecarlosamplingofphylogenetictrees
AT stamatakisalexandros efficientindependencesamplerforupdatingbranchesinbayesianmarkovchainmontecarlosamplingofphylogenetictrees
AT ronquistfredrik efficientindependencesamplerforupdatingbranchesinbayesianmarkovchainmontecarlosamplingofphylogenetictrees