Cargando…

Bayesian Phylogenetic Inference using Relaxed-clocks and the Multispecies Coalescent

The multispecies coalescent (MSC) model accommodates both species divergences and within-species coalescent and provides a natural framework for phylogenetic analysis of genomic data when the gene trees vary across the genome. The MSC model implemented in the program bpp assumes a molecular clock an...

Descripción completa

Detalles Bibliográficos
Autores principales: Flouri, Tomáš, Huang, Jun, Jiao, Xiyun, Kapli, Paschalia, Rannala, Bruce, Yang, Ziheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9366188/
https://www.ncbi.nlm.nih.gov/pubmed/35907248
http://dx.doi.org/10.1093/molbev/msac161
_version_ 1784765505755676672
author Flouri, Tomáš
Huang, Jun
Jiao, Xiyun
Kapli, Paschalia
Rannala, Bruce
Yang, Ziheng
author_facet Flouri, Tomáš
Huang, Jun
Jiao, Xiyun
Kapli, Paschalia
Rannala, Bruce
Yang, Ziheng
author_sort Flouri, Tomáš
collection PubMed
description The multispecies coalescent (MSC) model accommodates both species divergences and within-species coalescent and provides a natural framework for phylogenetic analysis of genomic data when the gene trees vary across the genome. The MSC model implemented in the program bpp assumes a molecular clock and the Jukes–Cantor model, and is suitable for analyzing genomic data from closely related species. Here we extend our implementation to more general substitution models and relaxed clocks to allow the rate to vary among species. The MSC-with-relaxed-clock model allows the estimation of species divergence times and ancestral population sizes using genomic sequences sampled from contemporary species when the strict clock assumption is violated, and provides a simulation framework for evaluating species tree estimation methods. We conducted simulations and analyzed two real datasets to evaluate the utility of the new models. We confirm that the clock-JC model is adequate for inference of shallow trees with closely related species, but it is important to account for clock violation for distant species. Our simulation suggests that there is valuable phylogenetic information in the gene-tree branch lengths even if the molecular clock assumption is seriously violated, and the relaxed-clock models implemented in bpp are able to extract such information. Our Markov chain Monte Carlo algorithms suffer from mixing problems when used for species tree estimation under the relaxed clock and we discuss possible improvements. We conclude that the new models are currently most effective for estimating population parameters such as species divergence times when the species tree is fixed.
format Online
Article
Text
id pubmed-9366188
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-93661882022-08-11 Bayesian Phylogenetic Inference using Relaxed-clocks and the Multispecies Coalescent Flouri, Tomáš Huang, Jun Jiao, Xiyun Kapli, Paschalia Rannala, Bruce Yang, Ziheng Mol Biol Evol Methods The multispecies coalescent (MSC) model accommodates both species divergences and within-species coalescent and provides a natural framework for phylogenetic analysis of genomic data when the gene trees vary across the genome. The MSC model implemented in the program bpp assumes a molecular clock and the Jukes–Cantor model, and is suitable for analyzing genomic data from closely related species. Here we extend our implementation to more general substitution models and relaxed clocks to allow the rate to vary among species. The MSC-with-relaxed-clock model allows the estimation of species divergence times and ancestral population sizes using genomic sequences sampled from contemporary species when the strict clock assumption is violated, and provides a simulation framework for evaluating species tree estimation methods. We conducted simulations and analyzed two real datasets to evaluate the utility of the new models. We confirm that the clock-JC model is adequate for inference of shallow trees with closely related species, but it is important to account for clock violation for distant species. Our simulation suggests that there is valuable phylogenetic information in the gene-tree branch lengths even if the molecular clock assumption is seriously violated, and the relaxed-clock models implemented in bpp are able to extract such information. Our Markov chain Monte Carlo algorithms suffer from mixing problems when used for species tree estimation under the relaxed clock and we discuss possible improvements. We conclude that the new models are currently most effective for estimating population parameters such as species divergence times when the species tree is fixed. Oxford University Press 2022-07-30 /pmc/articles/PMC9366188/ /pubmed/35907248 http://dx.doi.org/10.1093/molbev/msac161 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods
Flouri, Tomáš
Huang, Jun
Jiao, Xiyun
Kapli, Paschalia
Rannala, Bruce
Yang, Ziheng
Bayesian Phylogenetic Inference using Relaxed-clocks and the Multispecies Coalescent
title Bayesian Phylogenetic Inference using Relaxed-clocks and the Multispecies Coalescent
title_full Bayesian Phylogenetic Inference using Relaxed-clocks and the Multispecies Coalescent
title_fullStr Bayesian Phylogenetic Inference using Relaxed-clocks and the Multispecies Coalescent
title_full_unstemmed Bayesian Phylogenetic Inference using Relaxed-clocks and the Multispecies Coalescent
title_short Bayesian Phylogenetic Inference using Relaxed-clocks and the Multispecies Coalescent
title_sort bayesian phylogenetic inference using relaxed-clocks and the multispecies coalescent
topic Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9366188/
https://www.ncbi.nlm.nih.gov/pubmed/35907248
http://dx.doi.org/10.1093/molbev/msac161
work_keys_str_mv AT flouritomas bayesianphylogeneticinferenceusingrelaxedclocksandthemultispeciescoalescent
AT huangjun bayesianphylogeneticinferenceusingrelaxedclocksandthemultispeciescoalescent
AT jiaoxiyun bayesianphylogeneticinferenceusingrelaxedclocksandthemultispeciescoalescent
AT kaplipaschalia bayesianphylogeneticinferenceusingrelaxedclocksandthemultispeciescoalescent
AT rannalabruce bayesianphylogeneticinferenceusingrelaxedclocksandthemultispeciescoalescent
AT yangziheng bayesianphylogeneticinferenceusingrelaxedclocksandthemultispeciescoalescent