Cargando…
Relative Evolutionary Rates in Proteins Are Largely Insensitive to the Substitution Model
The relative evolutionary rates at individual sites in proteins are informative measures of conservation or adaptation. Often used as evolutionarily aware conservation scores, relative rates reveal key functional or strongly selected residues. Estimating rates in a phylogenetic context requires spec...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6107055/ https://www.ncbi.nlm.nih.gov/pubmed/29924340 http://dx.doi.org/10.1093/molbev/msy127 |
_version_ | 1783349904018505728 |
---|---|
author | Spielman, Stephanie J Kosakovsky Pond, Sergei L |
author_facet | Spielman, Stephanie J Kosakovsky Pond, Sergei L |
author_sort | Spielman, Stephanie J |
collection | PubMed |
description | The relative evolutionary rates at individual sites in proteins are informative measures of conservation or adaptation. Often used as evolutionarily aware conservation scores, relative rates reveal key functional or strongly selected residues. Estimating rates in a phylogenetic context requires specifying a protein substitution model, which is typically a phenomenological model trained on a large empirical data set. A strong emphasis has traditionally been placed on selecting the “best-fit” model, with the implicit understanding that suboptimal or otherwise ill-fitting models might bias inferences. However, the pervasiveness and degree of such bias has not been systematically examined. We investigated how model choice impacts site-wise relative rates in a large set of empirical protein alignments. We compared models designed for use on any general protein, models designed for specific domains of life, and the simple equal-rates Jukes Cantor-style model (JC). As expected, information theoretic measures showed overwhelming evidence that some models fit the data decidedly better than others. By contrast, estimates of site-specific evolutionary rates were impressively insensitive to the substitution model used, revealing an unexpected degree of robustness to potential model misspecification. A deeper examination of the fewer than 5% of sites for which model inferences differed in a meaningful way showed that the JC model could uniquely identify rapidly evolving sites that models with empirically derived exchangeabilities failed to detect. We conclude that relative protein rates appear robust to the applied substitution model, and any sensible model of protein evolution, regardless of its fit to the data, should produce broadly consistent evolutionary rates. |
format | Online Article Text |
id | pubmed-6107055 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-61070552018-08-27 Relative Evolutionary Rates in Proteins Are Largely Insensitive to the Substitution Model Spielman, Stephanie J Kosakovsky Pond, Sergei L Mol Biol Evol Methods The relative evolutionary rates at individual sites in proteins are informative measures of conservation or adaptation. Often used as evolutionarily aware conservation scores, relative rates reveal key functional or strongly selected residues. Estimating rates in a phylogenetic context requires specifying a protein substitution model, which is typically a phenomenological model trained on a large empirical data set. A strong emphasis has traditionally been placed on selecting the “best-fit” model, with the implicit understanding that suboptimal or otherwise ill-fitting models might bias inferences. However, the pervasiveness and degree of such bias has not been systematically examined. We investigated how model choice impacts site-wise relative rates in a large set of empirical protein alignments. We compared models designed for use on any general protein, models designed for specific domains of life, and the simple equal-rates Jukes Cantor-style model (JC). As expected, information theoretic measures showed overwhelming evidence that some models fit the data decidedly better than others. By contrast, estimates of site-specific evolutionary rates were impressively insensitive to the substitution model used, revealing an unexpected degree of robustness to potential model misspecification. A deeper examination of the fewer than 5% of sites for which model inferences differed in a meaningful way showed that the JC model could uniquely identify rapidly evolving sites that models with empirically derived exchangeabilities failed to detect. We conclude that relative protein rates appear robust to the applied substitution model, and any sensible model of protein evolution, regardless of its fit to the data, should produce broadly consistent evolutionary rates. Oxford University Press 2018-09 2018-06-19 /pmc/articles/PMC6107055/ /pubmed/29924340 http://dx.doi.org/10.1093/molbev/msy127 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Methods Spielman, Stephanie J Kosakovsky Pond, Sergei L Relative Evolutionary Rates in Proteins Are Largely Insensitive to the Substitution Model |
title | Relative Evolutionary Rates in Proteins Are Largely Insensitive to the Substitution Model |
title_full | Relative Evolutionary Rates in Proteins Are Largely Insensitive to the Substitution Model |
title_fullStr | Relative Evolutionary Rates in Proteins Are Largely Insensitive to the Substitution Model |
title_full_unstemmed | Relative Evolutionary Rates in Proteins Are Largely Insensitive to the Substitution Model |
title_short | Relative Evolutionary Rates in Proteins Are Largely Insensitive to the Substitution Model |
title_sort | relative evolutionary rates in proteins are largely insensitive to the substitution model |
topic | Methods |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6107055/ https://www.ncbi.nlm.nih.gov/pubmed/29924340 http://dx.doi.org/10.1093/molbev/msy127 |
work_keys_str_mv | AT spielmanstephaniej relativeevolutionaryratesinproteinsarelargelyinsensitivetothesubstitutionmodel AT kosakovskypondsergeil relativeevolutionaryratesinproteinsarelargelyinsensitivetothesubstitutionmodel |