Cargando…
Online Bayesian Phylodynamic Inference in BEAST with Application to Epidemic Reconstruction
Reconstructing pathogen dynamics from genetic data as they become available during an outbreak or epidemic represents an important statistical scenario in which observations arrive sequentially in time and one is interested in performing inference in an “online” fashion. Widely used Bayesian phyloge...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7253210/ https://www.ncbi.nlm.nih.gov/pubmed/32101295 http://dx.doi.org/10.1093/molbev/msaa047 |
_version_ | 1783539298552774656 |
---|---|
author | Gill, Mandev S Lemey, Philippe Suchard, Marc A Rambaut, Andrew Baele, Guy |
author_facet | Gill, Mandev S Lemey, Philippe Suchard, Marc A Rambaut, Andrew Baele, Guy |
author_sort | Gill, Mandev S |
collection | PubMed |
description | Reconstructing pathogen dynamics from genetic data as they become available during an outbreak or epidemic represents an important statistical scenario in which observations arrive sequentially in time and one is interested in performing inference in an “online” fashion. Widely used Bayesian phylogenetic inference packages are not set up for this purpose, generally requiring one to recompute trees and evolutionary model parameters de novo when new data arrive. To accommodate increasing data flow in a Bayesian phylogenetic framework, we introduce a methodology to efficiently update the posterior distribution with newly available genetic data. Our procedure is implemented in the BEAST 1.10 software package, and relies on a distance-based measure to insert new taxa into the current estimate of the phylogeny and imputes plausible values for new model parameters to accommodate growing dimensionality. This augmentation creates informed starting values and re-uses optimally tuned transition kernels for posterior exploration of growing data sets, reducing the time necessary to converge to target posterior distributions. We apply our framework to data from the recent West African Ebola virus epidemic and demonstrate a considerable reduction in time required to obtain posterior estimates at different time points of the outbreak. Beyond epidemic monitoring, this framework easily finds other applications within the phylogenetics community, where changes in the data—in terms of alignment changes, sequence addition or removal—present common scenarios that can benefit from online inference. |
format | Online Article Text |
id | pubmed-7253210 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-72532102020-06-02 Online Bayesian Phylodynamic Inference in BEAST with Application to Epidemic Reconstruction Gill, Mandev S Lemey, Philippe Suchard, Marc A Rambaut, Andrew Baele, Guy Mol Biol Evol Resources Reconstructing pathogen dynamics from genetic data as they become available during an outbreak or epidemic represents an important statistical scenario in which observations arrive sequentially in time and one is interested in performing inference in an “online” fashion. Widely used Bayesian phylogenetic inference packages are not set up for this purpose, generally requiring one to recompute trees and evolutionary model parameters de novo when new data arrive. To accommodate increasing data flow in a Bayesian phylogenetic framework, we introduce a methodology to efficiently update the posterior distribution with newly available genetic data. Our procedure is implemented in the BEAST 1.10 software package, and relies on a distance-based measure to insert new taxa into the current estimate of the phylogeny and imputes plausible values for new model parameters to accommodate growing dimensionality. This augmentation creates informed starting values and re-uses optimally tuned transition kernels for posterior exploration of growing data sets, reducing the time necessary to converge to target posterior distributions. We apply our framework to data from the recent West African Ebola virus epidemic and demonstrate a considerable reduction in time required to obtain posterior estimates at different time points of the outbreak. Beyond epidemic monitoring, this framework easily finds other applications within the phylogenetics community, where changes in the data—in terms of alignment changes, sequence addition or removal—present common scenarios that can benefit from online inference. Oxford University Press 2020-06 2020-02-26 /pmc/articles/PMC7253210/ /pubmed/32101295 http://dx.doi.org/10.1093/molbev/msaa047 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Resources Gill, Mandev S Lemey, Philippe Suchard, Marc A Rambaut, Andrew Baele, Guy Online Bayesian Phylodynamic Inference in BEAST with Application to Epidemic Reconstruction |
title | Online Bayesian Phylodynamic Inference in BEAST with Application to Epidemic Reconstruction |
title_full | Online Bayesian Phylodynamic Inference in BEAST with Application to Epidemic Reconstruction |
title_fullStr | Online Bayesian Phylodynamic Inference in BEAST with Application to Epidemic Reconstruction |
title_full_unstemmed | Online Bayesian Phylodynamic Inference in BEAST with Application to Epidemic Reconstruction |
title_short | Online Bayesian Phylodynamic Inference in BEAST with Application to Epidemic Reconstruction |
title_sort | online bayesian phylodynamic inference in beast with application to epidemic reconstruction |
topic | Resources |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7253210/ https://www.ncbi.nlm.nih.gov/pubmed/32101295 http://dx.doi.org/10.1093/molbev/msaa047 |
work_keys_str_mv | AT gillmandevs onlinebayesianphylodynamicinferenceinbeastwithapplicationtoepidemicreconstruction AT lemeyphilippe onlinebayesianphylodynamicinferenceinbeastwithapplicationtoepidemicreconstruction AT suchardmarca onlinebayesianphylodynamicinferenceinbeastwithapplicationtoepidemicreconstruction AT rambautandrew onlinebayesianphylodynamicinferenceinbeastwithapplicationtoepidemicreconstruction AT baeleguy onlinebayesianphylodynamicinferenceinbeastwithapplicationtoepidemicreconstruction |