Cargando…

Modeling site-specific amino-acid preferences deepens phylogenetic estimates of viral sequence divergence

Molecular phylogenetics is often used to estimate the time since the divergence of modern gene sequences. For highly diverged sequences, such phylogenetic techniques sometimes estimate surprisingly recent divergence times. In the case of viruses, independent evidence indicates that the estimates of...

Descripción completa

Detalles Bibliográficos
Autores principales: Hilton, Sarah K, Bloom, Jesse D
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6220371/
https://www.ncbi.nlm.nih.gov/pubmed/30425841
http://dx.doi.org/10.1093/ve/vey033
_version_ 1783368814551891968
author Hilton, Sarah K
Bloom, Jesse D
author_facet Hilton, Sarah K
Bloom, Jesse D
author_sort Hilton, Sarah K
collection PubMed
description Molecular phylogenetics is often used to estimate the time since the divergence of modern gene sequences. For highly diverged sequences, such phylogenetic techniques sometimes estimate surprisingly recent divergence times. In the case of viruses, independent evidence indicates that the estimates of deep divergence times from molecular phylogenetics are sometimes too recent. This discrepancy is caused in part by inadequate models of purifying selection leading to branch-length underestimation. Here we examine the effect on branch-length estimation of using models that incorporate experimental measurements of purifying selection. We find that models informed by experimentally measured site-specific amino-acid preferences estimate longer deep branches on phylogenies of influenza virus hemagglutinin. This lengthening of branches is due to more realistic stationary states of the models, and is mostly independent of the branch-length extension from modeling site-to-site variation in amino-acid substitution rate. The branch-length extension from experimentally informed site-specific models is similar to that achieved by other approaches that allow the stationary state to vary across sites. However, the improvements from all of these site-specific but time homogeneous and site independent models are limited by the fact that a protein’s amino-acid preferences gradually shift as it evolves. Overall, our work underscores the importance of modeling site-specific amino-acid preferences when estimating deep divergence times—but also shows the inherent limitations of approaches that fail to account for how these preferences shift over time.
format Online
Article
Text
id pubmed-6220371
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-62203712018-11-13 Modeling site-specific amino-acid preferences deepens phylogenetic estimates of viral sequence divergence Hilton, Sarah K Bloom, Jesse D Virus Evol Research Article Molecular phylogenetics is often used to estimate the time since the divergence of modern gene sequences. For highly diverged sequences, such phylogenetic techniques sometimes estimate surprisingly recent divergence times. In the case of viruses, independent evidence indicates that the estimates of deep divergence times from molecular phylogenetics are sometimes too recent. This discrepancy is caused in part by inadequate models of purifying selection leading to branch-length underestimation. Here we examine the effect on branch-length estimation of using models that incorporate experimental measurements of purifying selection. We find that models informed by experimentally measured site-specific amino-acid preferences estimate longer deep branches on phylogenies of influenza virus hemagglutinin. This lengthening of branches is due to more realistic stationary states of the models, and is mostly independent of the branch-length extension from modeling site-to-site variation in amino-acid substitution rate. The branch-length extension from experimentally informed site-specific models is similar to that achieved by other approaches that allow the stationary state to vary across sites. However, the improvements from all of these site-specific but time homogeneous and site independent models are limited by the fact that a protein’s amino-acid preferences gradually shift as it evolves. Overall, our work underscores the importance of modeling site-specific amino-acid preferences when estimating deep divergence times—but also shows the inherent limitations of approaches that fail to account for how these preferences shift over time. Oxford University Press 2018-11-06 /pmc/articles/PMC6220371/ /pubmed/30425841 http://dx.doi.org/10.1093/ve/vey033 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Hilton, Sarah K
Bloom, Jesse D
Modeling site-specific amino-acid preferences deepens phylogenetic estimates of viral sequence divergence
title Modeling site-specific amino-acid preferences deepens phylogenetic estimates of viral sequence divergence
title_full Modeling site-specific amino-acid preferences deepens phylogenetic estimates of viral sequence divergence
title_fullStr Modeling site-specific amino-acid preferences deepens phylogenetic estimates of viral sequence divergence
title_full_unstemmed Modeling site-specific amino-acid preferences deepens phylogenetic estimates of viral sequence divergence
title_short Modeling site-specific amino-acid preferences deepens phylogenetic estimates of viral sequence divergence
title_sort modeling site-specific amino-acid preferences deepens phylogenetic estimates of viral sequence divergence
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6220371/
https://www.ncbi.nlm.nih.gov/pubmed/30425841
http://dx.doi.org/10.1093/ve/vey033
work_keys_str_mv AT hiltonsarahk modelingsitespecificaminoacidpreferencesdeepensphylogeneticestimatesofviralsequencedivergence
AT bloomjessed modelingsitespecificaminoacidpreferencesdeepensphylogeneticestimatesofviralsequencedivergence