Cargando…

Effective Online Bayesian Phylogenetics via Sequential Monte Carlo with Guided Proposals

Modern infectious disease outbreak surveillance produces continuous streams of sequence data which require phylogenetic analysis as data arrives. Current software packages for Bayesian phylogenetic inference are unable to quickly incorporate new sequences as they become available, making them less u...

Descripción completa

Detalles Bibliográficos
Autores principales: Fourment, Mathieu, Claywell, Brian C, Dinh, Vu, McCoy, Connor, Matsen IV, Frederick A, Darling, Aaron E
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5920299/
https://www.ncbi.nlm.nih.gov/pubmed/29186587
http://dx.doi.org/10.1093/sysbio/syx090
_version_ 1783317805271089152
author Fourment, Mathieu
Claywell, Brian C
Dinh, Vu
McCoy, Connor
Matsen IV, Frederick A
Darling, Aaron E
author_facet Fourment, Mathieu
Claywell, Brian C
Dinh, Vu
McCoy, Connor
Matsen IV, Frederick A
Darling, Aaron E
author_sort Fourment, Mathieu
collection PubMed
description Modern infectious disease outbreak surveillance produces continuous streams of sequence data which require phylogenetic analysis as data arrives. Current software packages for Bayesian phylogenetic inference are unable to quickly incorporate new sequences as they become available, making them less useful for dynamically unfolding evolutionary stories. This limitation can be addressed by applying a class of Bayesian statistical inference algorithms called sequential Monte Carlo (SMC) to conduct online inference, wherein new data can be continuously incorporated to update the estimate of the posterior probability distribution. In this article, we describe and evaluate several different online phylogenetic sequential Monte Carlo (OPSMC) algorithms. We show that proposing new phylogenies with a density similar to the Bayesian prior suffers from poor performance, and we develop “guided” proposals that better match the proposal density to the posterior. Furthermore, we show that the simplest guided proposals can exhibit pathological behavior in some situations, leading to poor results, and that the situation can be resolved by heating the proposal density. The results demonstrate that relative to the widely used MCMC-based algorithm implemented in MrBayes, the total time required to compute a series of phylogenetic posteriors as sequences arrive can be significantly reduced by the use of OPSMC, without incurring a significant loss in accuracy.
format Online
Article
Text
id pubmed-5920299
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-59202992018-05-04 Effective Online Bayesian Phylogenetics via Sequential Monte Carlo with Guided Proposals Fourment, Mathieu Claywell, Brian C Dinh, Vu McCoy, Connor Matsen IV, Frederick A Darling, Aaron E Syst Biol Regular Articles Modern infectious disease outbreak surveillance produces continuous streams of sequence data which require phylogenetic analysis as data arrives. Current software packages for Bayesian phylogenetic inference are unable to quickly incorporate new sequences as they become available, making them less useful for dynamically unfolding evolutionary stories. This limitation can be addressed by applying a class of Bayesian statistical inference algorithms called sequential Monte Carlo (SMC) to conduct online inference, wherein new data can be continuously incorporated to update the estimate of the posterior probability distribution. In this article, we describe and evaluate several different online phylogenetic sequential Monte Carlo (OPSMC) algorithms. We show that proposing new phylogenies with a density similar to the Bayesian prior suffers from poor performance, and we develop “guided” proposals that better match the proposal density to the posterior. Furthermore, we show that the simplest guided proposals can exhibit pathological behavior in some situations, leading to poor results, and that the situation can be resolved by heating the proposal density. The results demonstrate that relative to the widely used MCMC-based algorithm implemented in MrBayes, the total time required to compute a series of phylogenetic posteriors as sequences arrive can be significantly reduced by the use of OPSMC, without incurring a significant loss in accuracy. Oxford University Press 2018-05 2017-11-27 /pmc/articles/PMC5920299/ /pubmed/29186587 http://dx.doi.org/10.1093/sysbio/syx090 Text en © The Author(s) 2017. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Regular Articles
Fourment, Mathieu
Claywell, Brian C
Dinh, Vu
McCoy, Connor
Matsen IV, Frederick A
Darling, Aaron E
Effective Online Bayesian Phylogenetics via Sequential Monte Carlo with Guided Proposals
title Effective Online Bayesian Phylogenetics via Sequential Monte Carlo with Guided Proposals
title_full Effective Online Bayesian Phylogenetics via Sequential Monte Carlo with Guided Proposals
title_fullStr Effective Online Bayesian Phylogenetics via Sequential Monte Carlo with Guided Proposals
title_full_unstemmed Effective Online Bayesian Phylogenetics via Sequential Monte Carlo with Guided Proposals
title_short Effective Online Bayesian Phylogenetics via Sequential Monte Carlo with Guided Proposals
title_sort effective online bayesian phylogenetics via sequential monte carlo with guided proposals
topic Regular Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5920299/
https://www.ncbi.nlm.nih.gov/pubmed/29186587
http://dx.doi.org/10.1093/sysbio/syx090
work_keys_str_mv AT fourmentmathieu effectiveonlinebayesianphylogeneticsviasequentialmontecarlowithguidedproposals
AT claywellbrianc effectiveonlinebayesianphylogeneticsviasequentialmontecarlowithguidedproposals
AT dinhvu effectiveonlinebayesianphylogeneticsviasequentialmontecarlowithguidedproposals
AT mccoyconnor effectiveonlinebayesianphylogeneticsviasequentialmontecarlowithguidedproposals
AT matsenivfredericka effectiveonlinebayesianphylogeneticsviasequentialmontecarlowithguidedproposals
AT darlingaarone effectiveonlinebayesianphylogeneticsviasequentialmontecarlowithguidedproposals