Cargando…

Relative Evolutionary Rates in Proteins Are Largely Insensitive to the Substitution Model

The relative evolutionary rates at individual sites in proteins are informative measures of conservation or adaptation. Often used as evolutionarily aware conservation scores, relative rates reveal key functional or strongly selected residues. Estimating rates in a phylogenetic context requires spec...

Descripción completa

Detalles Bibliográficos
Autores principales: Spielman, Stephanie J, Kosakovsky Pond, Sergei L
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6107055/
https://www.ncbi.nlm.nih.gov/pubmed/29924340
http://dx.doi.org/10.1093/molbev/msy127
_version_ 1783349904018505728
author Spielman, Stephanie J
Kosakovsky Pond, Sergei L
author_facet Spielman, Stephanie J
Kosakovsky Pond, Sergei L
author_sort Spielman, Stephanie J
collection PubMed
description The relative evolutionary rates at individual sites in proteins are informative measures of conservation or adaptation. Often used as evolutionarily aware conservation scores, relative rates reveal key functional or strongly selected residues. Estimating rates in a phylogenetic context requires specifying a protein substitution model, which is typically a phenomenological model trained on a large empirical data set. A strong emphasis has traditionally been placed on selecting the “best-fit” model, with the implicit understanding that suboptimal or otherwise ill-fitting models might bias inferences. However, the pervasiveness and degree of such bias has not been systematically examined. We investigated how model choice impacts site-wise relative rates in a large set of empirical protein alignments. We compared models designed for use on any general protein, models designed for specific domains of life, and the simple equal-rates Jukes Cantor-style model (JC). As expected, information theoretic measures showed overwhelming evidence that some models fit the data decidedly better than others. By contrast, estimates of site-specific evolutionary rates were impressively insensitive to the substitution model used, revealing an unexpected degree of robustness to potential model misspecification. A deeper examination of the fewer than 5% of sites for which model inferences differed in a meaningful way showed that the JC model could uniquely identify rapidly evolving sites that models with empirically derived exchangeabilities failed to detect. We conclude that relative protein rates appear robust to the applied substitution model, and any sensible model of protein evolution, regardless of its fit to the data, should produce broadly consistent evolutionary rates.
format Online
Article
Text
id pubmed-6107055
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-61070552018-08-27 Relative Evolutionary Rates in Proteins Are Largely Insensitive to the Substitution Model Spielman, Stephanie J Kosakovsky Pond, Sergei L Mol Biol Evol Methods The relative evolutionary rates at individual sites in proteins are informative measures of conservation or adaptation. Often used as evolutionarily aware conservation scores, relative rates reveal key functional or strongly selected residues. Estimating rates in a phylogenetic context requires specifying a protein substitution model, which is typically a phenomenological model trained on a large empirical data set. A strong emphasis has traditionally been placed on selecting the “best-fit” model, with the implicit understanding that suboptimal or otherwise ill-fitting models might bias inferences. However, the pervasiveness and degree of such bias has not been systematically examined. We investigated how model choice impacts site-wise relative rates in a large set of empirical protein alignments. We compared models designed for use on any general protein, models designed for specific domains of life, and the simple equal-rates Jukes Cantor-style model (JC). As expected, information theoretic measures showed overwhelming evidence that some models fit the data decidedly better than others. By contrast, estimates of site-specific evolutionary rates were impressively insensitive to the substitution model used, revealing an unexpected degree of robustness to potential model misspecification. A deeper examination of the fewer than 5% of sites for which model inferences differed in a meaningful way showed that the JC model could uniquely identify rapidly evolving sites that models with empirically derived exchangeabilities failed to detect. We conclude that relative protein rates appear robust to the applied substitution model, and any sensible model of protein evolution, regardless of its fit to the data, should produce broadly consistent evolutionary rates. Oxford University Press 2018-09 2018-06-19 /pmc/articles/PMC6107055/ /pubmed/29924340 http://dx.doi.org/10.1093/molbev/msy127 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods
Spielman, Stephanie J
Kosakovsky Pond, Sergei L
Relative Evolutionary Rates in Proteins Are Largely Insensitive to the Substitution Model
title Relative Evolutionary Rates in Proteins Are Largely Insensitive to the Substitution Model
title_full Relative Evolutionary Rates in Proteins Are Largely Insensitive to the Substitution Model
title_fullStr Relative Evolutionary Rates in Proteins Are Largely Insensitive to the Substitution Model
title_full_unstemmed Relative Evolutionary Rates in Proteins Are Largely Insensitive to the Substitution Model
title_short Relative Evolutionary Rates in Proteins Are Largely Insensitive to the Substitution Model
title_sort relative evolutionary rates in proteins are largely insensitive to the substitution model
topic Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6107055/
https://www.ncbi.nlm.nih.gov/pubmed/29924340
http://dx.doi.org/10.1093/molbev/msy127
work_keys_str_mv AT spielmanstephaniej relativeevolutionaryratesinproteinsarelargelyinsensitivetothesubstitutionmodel
AT kosakovskypondsergeil relativeevolutionaryratesinproteinsarelargelyinsensitivetothesubstitutionmodel