Cargando…

Online Bayesian Phylodynamic Inference in BEAST with Application to Epidemic Reconstruction

Reconstructing pathogen dynamics from genetic data as they become available during an outbreak or epidemic represents an important statistical scenario in which observations arrive sequentially in time and one is interested in performing inference in an “online” fashion. Widely used Bayesian phyloge...

Descripción completa

Detalles Bibliográficos
Autores principales: Gill, Mandev S, Lemey, Philippe, Suchard, Marc A, Rambaut, Andrew, Baele, Guy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7253210/
https://www.ncbi.nlm.nih.gov/pubmed/32101295
http://dx.doi.org/10.1093/molbev/msaa047
_version_ 1783539298552774656
author Gill, Mandev S
Lemey, Philippe
Suchard, Marc A
Rambaut, Andrew
Baele, Guy
author_facet Gill, Mandev S
Lemey, Philippe
Suchard, Marc A
Rambaut, Andrew
Baele, Guy
author_sort Gill, Mandev S
collection PubMed
description Reconstructing pathogen dynamics from genetic data as they become available during an outbreak or epidemic represents an important statistical scenario in which observations arrive sequentially in time and one is interested in performing inference in an “online” fashion. Widely used Bayesian phylogenetic inference packages are not set up for this purpose, generally requiring one to recompute trees and evolutionary model parameters de novo when new data arrive. To accommodate increasing data flow in a Bayesian phylogenetic framework, we introduce a methodology to efficiently update the posterior distribution with newly available genetic data. Our procedure is implemented in the BEAST 1.10 software package, and relies on a distance-based measure to insert new taxa into the current estimate of the phylogeny and imputes plausible values for new model parameters to accommodate growing dimensionality. This augmentation creates informed starting values and re-uses optimally tuned transition kernels for posterior exploration of growing data sets, reducing the time necessary to converge to target posterior distributions. We apply our framework to data from the recent West African Ebola virus epidemic and demonstrate a considerable reduction in time required to obtain posterior estimates at different time points of the outbreak. Beyond epidemic monitoring, this framework easily finds other applications within the phylogenetics community, where changes in the data—in terms of alignment changes, sequence addition or removal—present common scenarios that can benefit from online inference.
format Online
Article
Text
id pubmed-7253210
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-72532102020-06-02 Online Bayesian Phylodynamic Inference in BEAST with Application to Epidemic Reconstruction Gill, Mandev S Lemey, Philippe Suchard, Marc A Rambaut, Andrew Baele, Guy Mol Biol Evol Resources Reconstructing pathogen dynamics from genetic data as they become available during an outbreak or epidemic represents an important statistical scenario in which observations arrive sequentially in time and one is interested in performing inference in an “online” fashion. Widely used Bayesian phylogenetic inference packages are not set up for this purpose, generally requiring one to recompute trees and evolutionary model parameters de novo when new data arrive. To accommodate increasing data flow in a Bayesian phylogenetic framework, we introduce a methodology to efficiently update the posterior distribution with newly available genetic data. Our procedure is implemented in the BEAST 1.10 software package, and relies on a distance-based measure to insert new taxa into the current estimate of the phylogeny and imputes plausible values for new model parameters to accommodate growing dimensionality. This augmentation creates informed starting values and re-uses optimally tuned transition kernels for posterior exploration of growing data sets, reducing the time necessary to converge to target posterior distributions. We apply our framework to data from the recent West African Ebola virus epidemic and demonstrate a considerable reduction in time required to obtain posterior estimates at different time points of the outbreak. Beyond epidemic monitoring, this framework easily finds other applications within the phylogenetics community, where changes in the data—in terms of alignment changes, sequence addition or removal—present common scenarios that can benefit from online inference. Oxford University Press 2020-06 2020-02-26 /pmc/articles/PMC7253210/ /pubmed/32101295 http://dx.doi.org/10.1093/molbev/msaa047 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Resources
Gill, Mandev S
Lemey, Philippe
Suchard, Marc A
Rambaut, Andrew
Baele, Guy
Online Bayesian Phylodynamic Inference in BEAST with Application to Epidemic Reconstruction
title Online Bayesian Phylodynamic Inference in BEAST with Application to Epidemic Reconstruction
title_full Online Bayesian Phylodynamic Inference in BEAST with Application to Epidemic Reconstruction
title_fullStr Online Bayesian Phylodynamic Inference in BEAST with Application to Epidemic Reconstruction
title_full_unstemmed Online Bayesian Phylodynamic Inference in BEAST with Application to Epidemic Reconstruction
title_short Online Bayesian Phylodynamic Inference in BEAST with Application to Epidemic Reconstruction
title_sort online bayesian phylodynamic inference in beast with application to epidemic reconstruction
topic Resources
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7253210/
https://www.ncbi.nlm.nih.gov/pubmed/32101295
http://dx.doi.org/10.1093/molbev/msaa047
work_keys_str_mv AT gillmandevs onlinebayesianphylodynamicinferenceinbeastwithapplicationtoepidemicreconstruction
AT lemeyphilippe onlinebayesianphylodynamicinferenceinbeastwithapplicationtoepidemicreconstruction
AT suchardmarca onlinebayesianphylodynamicinferenceinbeastwithapplicationtoepidemicreconstruction
AT rambautandrew onlinebayesianphylodynamicinferenceinbeastwithapplicationtoepidemicreconstruction
AT baeleguy onlinebayesianphylodynamicinferenceinbeastwithapplicationtoepidemicreconstruction