Cargando…

Markov-Modulated Continuous-Time Markov Chains to Identify Site- and Branch-Specific Evolutionary Variation in BEAST

Markov models of character substitution on phylogenies form the foundation of phylogenetic inference frameworks. Early models made the simplifying assumption that the substitution process is homogeneous over time and across sites in the molecular sequence alignment. While standard practice adopts ex...

Descripción completa

Detalles Bibliográficos
Autores principales: Baele, Guy, Gill, Mandev S, Bastide, Paul, Lemey, Philippe, Suchard, Marc A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7744037/
https://www.ncbi.nlm.nih.gov/pubmed/32415977
http://dx.doi.org/10.1093/sysbio/syaa037
_version_ 1783624354215493632
author Baele, Guy
Gill, Mandev S
Bastide, Paul
Lemey, Philippe
Suchard, Marc A
author_facet Baele, Guy
Gill, Mandev S
Bastide, Paul
Lemey, Philippe
Suchard, Marc A
author_sort Baele, Guy
collection PubMed
description Markov models of character substitution on phylogenies form the foundation of phylogenetic inference frameworks. Early models made the simplifying assumption that the substitution process is homogeneous over time and across sites in the molecular sequence alignment. While standard practice adopts extensions that accommodate heterogeneity of substitution rates across sites, heterogeneity in the process over time in a site-specific manner remains frequently overlooked. This is problematic, as evolutionary processes that act at the molecular level are highly variable, subjecting different sites to different selective constraints over time, impacting their substitution behavior. We propose incorporating time variability through Markov-modulated models (MMMs), which extend covarion-like models and allow the substitution process (including relative character exchange rates as well as the overall substitution rate) at individual sites to vary across lineages. We implement a general MMM framework in BEAST, a popular Bayesian phylogenetic inference software package, allowing researchers to compose a wide range of MMMs through flexible XML specification. Using examples from bacterial, viral, and plastid genome evolution, we show that MMMs impact phylogenetic tree estimation and can substantially improve model fit compared to standard substitution models. Through simulations, we show that marginal likelihood estimation accurately identifies the generative model and does not systematically prefer the more parameter-rich MMMs. To mitigate the increased computational demands associated with MMMs, our implementation exploits recent developments in BEAGLE, a high-performance computational library for phylogenetic inference. [Bayesian inference; BEAGLE; BEAST; covarion, heterotachy; Markov-modulated models; phylogenetics.]
format Online
Article
Text
id pubmed-7744037
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-77440372020-12-22 Markov-Modulated Continuous-Time Markov Chains to Identify Site- and Branch-Specific Evolutionary Variation in BEAST Baele, Guy Gill, Mandev S Bastide, Paul Lemey, Philippe Suchard, Marc A Syst Biol Software for Systematics and Evolution Markov models of character substitution on phylogenies form the foundation of phylogenetic inference frameworks. Early models made the simplifying assumption that the substitution process is homogeneous over time and across sites in the molecular sequence alignment. While standard practice adopts extensions that accommodate heterogeneity of substitution rates across sites, heterogeneity in the process over time in a site-specific manner remains frequently overlooked. This is problematic, as evolutionary processes that act at the molecular level are highly variable, subjecting different sites to different selective constraints over time, impacting their substitution behavior. We propose incorporating time variability through Markov-modulated models (MMMs), which extend covarion-like models and allow the substitution process (including relative character exchange rates as well as the overall substitution rate) at individual sites to vary across lineages. We implement a general MMM framework in BEAST, a popular Bayesian phylogenetic inference software package, allowing researchers to compose a wide range of MMMs through flexible XML specification. Using examples from bacterial, viral, and plastid genome evolution, we show that MMMs impact phylogenetic tree estimation and can substantially improve model fit compared to standard substitution models. Through simulations, we show that marginal likelihood estimation accurately identifies the generative model and does not systematically prefer the more parameter-rich MMMs. To mitigate the increased computational demands associated with MMMs, our implementation exploits recent developments in BEAGLE, a high-performance computational library for phylogenetic inference. [Bayesian inference; BEAGLE; BEAST; covarion, heterotachy; Markov-modulated models; phylogenetics.] Oxford University Press 2020-05-16 /pmc/articles/PMC7744037/ /pubmed/32415977 http://dx.doi.org/10.1093/sysbio/syaa037 Text en © The Author(s) 2020. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software for Systematics and Evolution
Baele, Guy
Gill, Mandev S
Bastide, Paul
Lemey, Philippe
Suchard, Marc A
Markov-Modulated Continuous-Time Markov Chains to Identify Site- and Branch-Specific Evolutionary Variation in BEAST
title Markov-Modulated Continuous-Time Markov Chains to Identify Site- and Branch-Specific Evolutionary Variation in BEAST
title_full Markov-Modulated Continuous-Time Markov Chains to Identify Site- and Branch-Specific Evolutionary Variation in BEAST
title_fullStr Markov-Modulated Continuous-Time Markov Chains to Identify Site- and Branch-Specific Evolutionary Variation in BEAST
title_full_unstemmed Markov-Modulated Continuous-Time Markov Chains to Identify Site- and Branch-Specific Evolutionary Variation in BEAST
title_short Markov-Modulated Continuous-Time Markov Chains to Identify Site- and Branch-Specific Evolutionary Variation in BEAST
title_sort markov-modulated continuous-time markov chains to identify site- and branch-specific evolutionary variation in beast
topic Software for Systematics and Evolution
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7744037/
https://www.ncbi.nlm.nih.gov/pubmed/32415977
http://dx.doi.org/10.1093/sysbio/syaa037
work_keys_str_mv AT baeleguy markovmodulatedcontinuoustimemarkovchainstoidentifysiteandbranchspecificevolutionaryvariationinbeast
AT gillmandevs markovmodulatedcontinuoustimemarkovchainstoidentifysiteandbranchspecificevolutionaryvariationinbeast
AT bastidepaul markovmodulatedcontinuoustimemarkovchainstoidentifysiteandbranchspecificevolutionaryvariationinbeast
AT lemeyphilippe markovmodulatedcontinuoustimemarkovchainstoidentifysiteandbranchspecificevolutionaryvariationinbeast
AT suchardmarca markovmodulatedcontinuoustimemarkovchainstoidentifysiteandbranchspecificevolutionaryvariationinbeast