Cargando…

Universal probabilistic programming offers a powerful approach to statistical phylogenetics

Statistical phylogenetic analysis currently relies on complex, dedicated software packages, making it difficult for evolutionary biologists to explore new models and inference strategies. Recent years have seen more generic solutions based on probabilistic graphical models, but this formalism can on...

Descripción completa

Detalles Bibliográficos
Autores principales: Ronquist, Fredrik, Kudlicka, Jan, Senderov, Viktor, Borgström, Johannes, Lartillot, Nicolas, Lundén, Daniel, Murray, Lawrence, Schön, Thomas B., Broman, David
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7904853/
https://www.ncbi.nlm.nih.gov/pubmed/33627766
http://dx.doi.org/10.1038/s42003-021-01753-7
_version_ 1783655003598094336
author Ronquist, Fredrik
Kudlicka, Jan
Senderov, Viktor
Borgström, Johannes
Lartillot, Nicolas
Lundén, Daniel
Murray, Lawrence
Schön, Thomas B.
Broman, David
author_facet Ronquist, Fredrik
Kudlicka, Jan
Senderov, Viktor
Borgström, Johannes
Lartillot, Nicolas
Lundén, Daniel
Murray, Lawrence
Schön, Thomas B.
Broman, David
author_sort Ronquist, Fredrik
collection PubMed
description Statistical phylogenetic analysis currently relies on complex, dedicated software packages, making it difficult for evolutionary biologists to explore new models and inference strategies. Recent years have seen more generic solutions based on probabilistic graphical models, but this formalism can only partly express phylogenetic problems. Here, we show that universal probabilistic programming languages (PPLs) solve the expressivity problem, while still supporting automated generation of efficient inference algorithms. To prove the latter point, we develop automated generation of sequential Monte Carlo (SMC) algorithms for PPL descriptions of arbitrary biological diversification (birth-death) models. SMC is a new inference strategy for these problems, supporting both parameter inference and efficient estimation of Bayes factors that are used in model testing. We take advantage of this in automatically generating SMC algorithms for several recent diversification models that have been difficult or impossible to tackle previously. Finally, applying these algorithms to 40 bird phylogenies, we show that models with slowing diversification, constant turnover and many small shifts generally explain the data best. Our work opens up several related problem domains to PPL approaches, and shows that few hurdles remain before these techniques can be effectively applied to the full range of phylogenetic models.
format Online
Article
Text
id pubmed-7904853
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-79048532021-03-11 Universal probabilistic programming offers a powerful approach to statistical phylogenetics Ronquist, Fredrik Kudlicka, Jan Senderov, Viktor Borgström, Johannes Lartillot, Nicolas Lundén, Daniel Murray, Lawrence Schön, Thomas B. Broman, David Commun Biol Article Statistical phylogenetic analysis currently relies on complex, dedicated software packages, making it difficult for evolutionary biologists to explore new models and inference strategies. Recent years have seen more generic solutions based on probabilistic graphical models, but this formalism can only partly express phylogenetic problems. Here, we show that universal probabilistic programming languages (PPLs) solve the expressivity problem, while still supporting automated generation of efficient inference algorithms. To prove the latter point, we develop automated generation of sequential Monte Carlo (SMC) algorithms for PPL descriptions of arbitrary biological diversification (birth-death) models. SMC is a new inference strategy for these problems, supporting both parameter inference and efficient estimation of Bayes factors that are used in model testing. We take advantage of this in automatically generating SMC algorithms for several recent diversification models that have been difficult or impossible to tackle previously. Finally, applying these algorithms to 40 bird phylogenies, we show that models with slowing diversification, constant turnover and many small shifts generally explain the data best. Our work opens up several related problem domains to PPL approaches, and shows that few hurdles remain before these techniques can be effectively applied to the full range of phylogenetic models. Nature Publishing Group UK 2021-02-24 /pmc/articles/PMC7904853/ /pubmed/33627766 http://dx.doi.org/10.1038/s42003-021-01753-7 Text en © The Author(s) 2021, corrected publication 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Ronquist, Fredrik
Kudlicka, Jan
Senderov, Viktor
Borgström, Johannes
Lartillot, Nicolas
Lundén, Daniel
Murray, Lawrence
Schön, Thomas B.
Broman, David
Universal probabilistic programming offers a powerful approach to statistical phylogenetics
title Universal probabilistic programming offers a powerful approach to statistical phylogenetics
title_full Universal probabilistic programming offers a powerful approach to statistical phylogenetics
title_fullStr Universal probabilistic programming offers a powerful approach to statistical phylogenetics
title_full_unstemmed Universal probabilistic programming offers a powerful approach to statistical phylogenetics
title_short Universal probabilistic programming offers a powerful approach to statistical phylogenetics
title_sort universal probabilistic programming offers a powerful approach to statistical phylogenetics
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7904853/
https://www.ncbi.nlm.nih.gov/pubmed/33627766
http://dx.doi.org/10.1038/s42003-021-01753-7
work_keys_str_mv AT ronquistfredrik universalprobabilisticprogrammingoffersapowerfulapproachtostatisticalphylogenetics
AT kudlickajan universalprobabilisticprogrammingoffersapowerfulapproachtostatisticalphylogenetics
AT senderovviktor universalprobabilisticprogrammingoffersapowerfulapproachtostatisticalphylogenetics
AT borgstromjohannes universalprobabilisticprogrammingoffersapowerfulapproachtostatisticalphylogenetics
AT lartillotnicolas universalprobabilisticprogrammingoffersapowerfulapproachtostatisticalphylogenetics
AT lundendaniel universalprobabilisticprogrammingoffersapowerfulapproachtostatisticalphylogenetics
AT murraylawrence universalprobabilisticprogrammingoffersapowerfulapproachtostatisticalphylogenetics
AT schonthomasb universalprobabilisticprogrammingoffersapowerfulapproachtostatisticalphylogenetics
AT bromandavid universalprobabilisticprogrammingoffersapowerfulapproachtostatisticalphylogenetics