Cargando…

PhyloAcc-GT: A Bayesian Method for Inferring Patterns of Substitution Rate Shifts on Targeted Lineages Accounting for Gene Tree Discordance

An important goal of evolutionary genomics is to identify genomic regions whose substitution rates differ among lineages. For example, genomic regions experiencing accelerated molecular evolution in some lineages may provide insight into links between genotype and phenotype. Several comparative geno...

Descripción completa

Detalles Bibliográficos
Autores principales: Yan, Han, Hu, Zhirui, Thomas, Gregg W C, Edwards, Scott V, Sackton, Timothy B, Liu, Jun S
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10540510/
https://www.ncbi.nlm.nih.gov/pubmed/37665177
http://dx.doi.org/10.1093/molbev/msad195
_version_ 1785113735237468160
author Yan, Han
Hu, Zhirui
Thomas, Gregg W C
Edwards, Scott V
Sackton, Timothy B
Liu, Jun S
author_facet Yan, Han
Hu, Zhirui
Thomas, Gregg W C
Edwards, Scott V
Sackton, Timothy B
Liu, Jun S
author_sort Yan, Han
collection PubMed
description An important goal of evolutionary genomics is to identify genomic regions whose substitution rates differ among lineages. For example, genomic regions experiencing accelerated molecular evolution in some lineages may provide insight into links between genotype and phenotype. Several comparative genomics methods have been developed to identify genomic accelerations between species, including a Bayesian method called PhyloAcc, which models shifts in substitution rate in multiple target lineages on a phylogeny. However, few methods consider the possibility of discordance between the trees of individual loci and the species tree due to incomplete lineage sorting, which might cause false positives. Here, we present PhyloAcc-GT, which extends PhyloAcc by modeling gene tree heterogeneity. Given a species tree, we adopt the multispecies coalescent model as the prior distribution of gene trees, use Markov chain Monte Carlo (MCMC) for inference, and design novel MCMC moves to sample gene trees efficiently. Through extensive simulations, we show that PhyloAcc-GT outperforms PhyloAcc and other methods in identifying target lineage-specific accelerations and detecting complex patterns of rate shifts, and is robust to specification of population size parameters. PhyloAcc-GT is usually more conservative than PhyloAcc in calling convergent rate shifts because it identifies more accelerations on ancestral than on terminal branches. We apply PhyloAcc-GT to two examples of convergent evolution: flightlessness in ratites and marine mammal adaptations, and show that PhyloAcc-GT is a robust tool to identify shifts in substitution rate associated with specific target lineages while accounting for incomplete lineage sorting.
format Online
Article
Text
id pubmed-10540510
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-105405102023-09-30 PhyloAcc-GT: A Bayesian Method for Inferring Patterns of Substitution Rate Shifts on Targeted Lineages Accounting for Gene Tree Discordance Yan, Han Hu, Zhirui Thomas, Gregg W C Edwards, Scott V Sackton, Timothy B Liu, Jun S Mol Biol Evol Methods An important goal of evolutionary genomics is to identify genomic regions whose substitution rates differ among lineages. For example, genomic regions experiencing accelerated molecular evolution in some lineages may provide insight into links between genotype and phenotype. Several comparative genomics methods have been developed to identify genomic accelerations between species, including a Bayesian method called PhyloAcc, which models shifts in substitution rate in multiple target lineages on a phylogeny. However, few methods consider the possibility of discordance between the trees of individual loci and the species tree due to incomplete lineage sorting, which might cause false positives. Here, we present PhyloAcc-GT, which extends PhyloAcc by modeling gene tree heterogeneity. Given a species tree, we adopt the multispecies coalescent model as the prior distribution of gene trees, use Markov chain Monte Carlo (MCMC) for inference, and design novel MCMC moves to sample gene trees efficiently. Through extensive simulations, we show that PhyloAcc-GT outperforms PhyloAcc and other methods in identifying target lineage-specific accelerations and detecting complex patterns of rate shifts, and is robust to specification of population size parameters. PhyloAcc-GT is usually more conservative than PhyloAcc in calling convergent rate shifts because it identifies more accelerations on ancestral than on terminal branches. We apply PhyloAcc-GT to two examples of convergent evolution: flightlessness in ratites and marine mammal adaptations, and show that PhyloAcc-GT is a robust tool to identify shifts in substitution rate associated with specific target lineages while accounting for incomplete lineage sorting. Oxford University Press 2023-09-04 /pmc/articles/PMC10540510/ /pubmed/37665177 http://dx.doi.org/10.1093/molbev/msad195 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods
Yan, Han
Hu, Zhirui
Thomas, Gregg W C
Edwards, Scott V
Sackton, Timothy B
Liu, Jun S
PhyloAcc-GT: A Bayesian Method for Inferring Patterns of Substitution Rate Shifts on Targeted Lineages Accounting for Gene Tree Discordance
title PhyloAcc-GT: A Bayesian Method for Inferring Patterns of Substitution Rate Shifts on Targeted Lineages Accounting for Gene Tree Discordance
title_full PhyloAcc-GT: A Bayesian Method for Inferring Patterns of Substitution Rate Shifts on Targeted Lineages Accounting for Gene Tree Discordance
title_fullStr PhyloAcc-GT: A Bayesian Method for Inferring Patterns of Substitution Rate Shifts on Targeted Lineages Accounting for Gene Tree Discordance
title_full_unstemmed PhyloAcc-GT: A Bayesian Method for Inferring Patterns of Substitution Rate Shifts on Targeted Lineages Accounting for Gene Tree Discordance
title_short PhyloAcc-GT: A Bayesian Method for Inferring Patterns of Substitution Rate Shifts on Targeted Lineages Accounting for Gene Tree Discordance
title_sort phyloacc-gt: a bayesian method for inferring patterns of substitution rate shifts on targeted lineages accounting for gene tree discordance
topic Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10540510/
https://www.ncbi.nlm.nih.gov/pubmed/37665177
http://dx.doi.org/10.1093/molbev/msad195
work_keys_str_mv AT yanhan phyloaccgtabayesianmethodforinferringpatternsofsubstitutionrateshiftsontargetedlineagesaccountingforgenetreediscordance
AT huzhirui phyloaccgtabayesianmethodforinferringpatternsofsubstitutionrateshiftsontargetedlineagesaccountingforgenetreediscordance
AT thomasgreggwc phyloaccgtabayesianmethodforinferringpatternsofsubstitutionrateshiftsontargetedlineagesaccountingforgenetreediscordance
AT edwardsscottv phyloaccgtabayesianmethodforinferringpatternsofsubstitutionrateshiftsontargetedlineagesaccountingforgenetreediscordance
AT sacktontimothyb phyloaccgtabayesianmethodforinferringpatternsofsubstitutionrateshiftsontargetedlineagesaccountingforgenetreediscordance
AT liujuns phyloaccgtabayesianmethodforinferringpatternsofsubstitutionrateshiftsontargetedlineagesaccountingforgenetreediscordance