Cargando…

Bayesian Inference of Sampled Ancestor Trees for Epidemiology and Fossil Calibration

Phylogenetic analyses which include fossils or molecular sequences that are sampled through time require models that allow one sample to be a direct ancestor of another sample. As previously available phylogenetic inference tools assume that all samples are tips, they do not allow for this possibili...

Descripción completa

Detalles Bibliográficos
Autores principales: Gavryushkina, Alexandra, Welch, David, Stadler, Tanja, Drummond, Alexei J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4263412/
https://www.ncbi.nlm.nih.gov/pubmed/25474353
http://dx.doi.org/10.1371/journal.pcbi.1003919
_version_ 1782348565498036224
author Gavryushkina, Alexandra
Welch, David
Stadler, Tanja
Drummond, Alexei J.
author_facet Gavryushkina, Alexandra
Welch, David
Stadler, Tanja
Drummond, Alexei J.
author_sort Gavryushkina, Alexandra
collection PubMed
description Phylogenetic analyses which include fossils or molecular sequences that are sampled through time require models that allow one sample to be a direct ancestor of another sample. As previously available phylogenetic inference tools assume that all samples are tips, they do not allow for this possibility. We have developed and implemented a Bayesian Markov Chain Monte Carlo (MCMC) algorithm to infer what we call sampled ancestor trees, that is, trees in which sampled individuals can be direct ancestors of other sampled individuals. We use a family of birth-death models where individuals may remain in the tree process after sampling, in particular we extend the birth-death skyline model [Stadler et al., 2013] to sampled ancestor trees. This method allows the detection of sampled ancestors as well as estimation of the probability that an individual will be removed from the process when it is sampled. We show that even if sampled ancestors are not of specific interest in an analysis, failing to account for them leads to significant bias in parameter estimates. We also show that sampled ancestor birth-death models where every sample comes from a different time point are non-identifiable and thus require one parameter to be known in order to infer other parameters. We apply our phylogenetic inference accounting for sampled ancestors to epidemiological data, where the possibility of sampled ancestors enables us to identify individuals that infected other individuals after being sampled and to infer fundamental epidemiological parameters. We also apply the method to infer divergence times and diversification rates when fossils are included along with extant species samples, so that fossilisation events are modelled as a part of the tree branching process. Such modelling has many advantages as argued in the literature. The sampler is available as an open-source BEAST2 package (https://github.com/CompEvol/sampled-ancestors).
format Online
Article
Text
id pubmed-4263412
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-42634122014-12-19 Bayesian Inference of Sampled Ancestor Trees for Epidemiology and Fossil Calibration Gavryushkina, Alexandra Welch, David Stadler, Tanja Drummond, Alexei J. PLoS Comput Biol Research Article Phylogenetic analyses which include fossils or molecular sequences that are sampled through time require models that allow one sample to be a direct ancestor of another sample. As previously available phylogenetic inference tools assume that all samples are tips, they do not allow for this possibility. We have developed and implemented a Bayesian Markov Chain Monte Carlo (MCMC) algorithm to infer what we call sampled ancestor trees, that is, trees in which sampled individuals can be direct ancestors of other sampled individuals. We use a family of birth-death models where individuals may remain in the tree process after sampling, in particular we extend the birth-death skyline model [Stadler et al., 2013] to sampled ancestor trees. This method allows the detection of sampled ancestors as well as estimation of the probability that an individual will be removed from the process when it is sampled. We show that even if sampled ancestors are not of specific interest in an analysis, failing to account for them leads to significant bias in parameter estimates. We also show that sampled ancestor birth-death models where every sample comes from a different time point are non-identifiable and thus require one parameter to be known in order to infer other parameters. We apply our phylogenetic inference accounting for sampled ancestors to epidemiological data, where the possibility of sampled ancestors enables us to identify individuals that infected other individuals after being sampled and to infer fundamental epidemiological parameters. We also apply the method to infer divergence times and diversification rates when fossils are included along with extant species samples, so that fossilisation events are modelled as a part of the tree branching process. Such modelling has many advantages as argued in the literature. The sampler is available as an open-source BEAST2 package (https://github.com/CompEvol/sampled-ancestors). Public Library of Science 2014-12-04 /pmc/articles/PMC4263412/ /pubmed/25474353 http://dx.doi.org/10.1371/journal.pcbi.1003919 Text en © 2014 Gavryushkina et al This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Gavryushkina, Alexandra
Welch, David
Stadler, Tanja
Drummond, Alexei J.
Bayesian Inference of Sampled Ancestor Trees for Epidemiology and Fossil Calibration
title Bayesian Inference of Sampled Ancestor Trees for Epidemiology and Fossil Calibration
title_full Bayesian Inference of Sampled Ancestor Trees for Epidemiology and Fossil Calibration
title_fullStr Bayesian Inference of Sampled Ancestor Trees for Epidemiology and Fossil Calibration
title_full_unstemmed Bayesian Inference of Sampled Ancestor Trees for Epidemiology and Fossil Calibration
title_short Bayesian Inference of Sampled Ancestor Trees for Epidemiology and Fossil Calibration
title_sort bayesian inference of sampled ancestor trees for epidemiology and fossil calibration
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4263412/
https://www.ncbi.nlm.nih.gov/pubmed/25474353
http://dx.doi.org/10.1371/journal.pcbi.1003919
work_keys_str_mv AT gavryushkinaalexandra bayesianinferenceofsampledancestortreesforepidemiologyandfossilcalibration
AT welchdavid bayesianinferenceofsampledancestortreesforepidemiologyandfossilcalibration
AT stadlertanja bayesianinferenceofsampledancestortreesforepidemiologyandfossilcalibration
AT drummondalexeij bayesianinferenceofsampledancestortreesforepidemiologyandfossilcalibration