Cargando…

The Site-Specific Amino Acid Preferences of Homologous Proteins Depend on Sequence Divergence

The propensity of protein sites to be occupied by any of the 20 amino acids is known as site-specific amino acid preferences (SSAP). Under the assumption that SSAP are conserved among homologs, they can be used to parameterize evolutionary models for the reconstruction of accurate phylogenetic trees...

Descripción completa

Detalles Bibliográficos
Autor principal: Ferrada, Evandro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6326188/
https://www.ncbi.nlm.nih.gov/pubmed/30496400
http://dx.doi.org/10.1093/gbe/evy261
_version_ 1783386257287544832
author Ferrada, Evandro
author_facet Ferrada, Evandro
author_sort Ferrada, Evandro
collection PubMed
description The propensity of protein sites to be occupied by any of the 20 amino acids is known as site-specific amino acid preferences (SSAP). Under the assumption that SSAP are conserved among homologs, they can be used to parameterize evolutionary models for the reconstruction of accurate phylogenetic trees. However, simulations and experimental studies have not been able to fully assess the relative conservation of SSAP as a function of sequence divergence between protein homologs. Here, we implement a computational procedure to predict the SSAP of proteins based on the effect of changes in thermodynamic stability upon mutation. An advantage of this computational approach is that it allows us to interrogate a large and unbiased sample of homologous proteins, over the entire spectrum of sequence divergence, and under selection for the same molecular trait. We show that computational predictions have reproducibilities that resemble those obtained in experimental replicates, and can largely recapitulate the SSAP observed in a large-scale mutagenesis experiment. Our results support recent experimental reports on the conservation of SSAP of related homologs, with a slowly increasing fraction of up to 15% of different sites at sequence distances lower than 40%. However, even under the sole contribution of thermodynamic stability, our conservative approach identifies up to 30% of significant different sites between divergent homologs. We show that this relation holds for homologs of diverse sizes and structural classes. Analyses of residue contact networks suggest that an important determinant of these differences is the increasing accumulation of structural deviations that results from sequence divergence.
format Online
Article
Text
id pubmed-6326188
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-63261882019-01-15 The Site-Specific Amino Acid Preferences of Homologous Proteins Depend on Sequence Divergence Ferrada, Evandro Genome Biol Evol Research Article The propensity of protein sites to be occupied by any of the 20 amino acids is known as site-specific amino acid preferences (SSAP). Under the assumption that SSAP are conserved among homologs, they can be used to parameterize evolutionary models for the reconstruction of accurate phylogenetic trees. However, simulations and experimental studies have not been able to fully assess the relative conservation of SSAP as a function of sequence divergence between protein homologs. Here, we implement a computational procedure to predict the SSAP of proteins based on the effect of changes in thermodynamic stability upon mutation. An advantage of this computational approach is that it allows us to interrogate a large and unbiased sample of homologous proteins, over the entire spectrum of sequence divergence, and under selection for the same molecular trait. We show that computational predictions have reproducibilities that resemble those obtained in experimental replicates, and can largely recapitulate the SSAP observed in a large-scale mutagenesis experiment. Our results support recent experimental reports on the conservation of SSAP of related homologs, with a slowly increasing fraction of up to 15% of different sites at sequence distances lower than 40%. However, even under the sole contribution of thermodynamic stability, our conservative approach identifies up to 30% of significant different sites between divergent homologs. We show that this relation holds for homologs of diverse sizes and structural classes. Analyses of residue contact networks suggest that an important determinant of these differences is the increasing accumulation of structural deviations that results from sequence divergence. Oxford University Press 2018-11-29 /pmc/articles/PMC6326188/ /pubmed/30496400 http://dx.doi.org/10.1093/gbe/evy261 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Ferrada, Evandro
The Site-Specific Amino Acid Preferences of Homologous Proteins Depend on Sequence Divergence
title The Site-Specific Amino Acid Preferences of Homologous Proteins Depend on Sequence Divergence
title_full The Site-Specific Amino Acid Preferences of Homologous Proteins Depend on Sequence Divergence
title_fullStr The Site-Specific Amino Acid Preferences of Homologous Proteins Depend on Sequence Divergence
title_full_unstemmed The Site-Specific Amino Acid Preferences of Homologous Proteins Depend on Sequence Divergence
title_short The Site-Specific Amino Acid Preferences of Homologous Proteins Depend on Sequence Divergence
title_sort site-specific amino acid preferences of homologous proteins depend on sequence divergence
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6326188/
https://www.ncbi.nlm.nih.gov/pubmed/30496400
http://dx.doi.org/10.1093/gbe/evy261
work_keys_str_mv AT ferradaevandro thesitespecificaminoacidpreferencesofhomologousproteinsdependonsequencedivergence
AT ferradaevandro sitespecificaminoacidpreferencesofhomologousproteinsdependonsequencedivergence