Cargando…
A thermodynamic model of protein structure evolution explains empirical amino acid substitution matrices
Proteins evolve under a myriad of biophysical selection pressures that collectively control the patterns of amino acid substitutions. These evolutionary pressures are sufficiently consistent over time and across protein families to produce substitution patterns, summarized in global amino acid subst...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley & Sons, Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8442976/ https://www.ncbi.nlm.nih.gov/pubmed/34218472 http://dx.doi.org/10.1002/pro.4155 |
_version_ | 1783753099832197120 |
---|---|
author | Norn, Christoffer André, Ingemar Theobald, Douglas L. |
author_facet | Norn, Christoffer André, Ingemar Theobald, Douglas L. |
author_sort | Norn, Christoffer |
collection | PubMed |
description | Proteins evolve under a myriad of biophysical selection pressures that collectively control the patterns of amino acid substitutions. These evolutionary pressures are sufficiently consistent over time and across protein families to produce substitution patterns, summarized in global amino acid substitution matrices such as BLOSUM, JTT, WAG, and LG, which can be used to successfully detect homologs, infer phylogenies, and reconstruct ancestral sequences. Although the factors that govern the variation of amino acid substitution rates have received much attention, the influence of thermodynamic stability constraints remains unresolved. Here we develop a simple model to calculate amino acid substitution matrices from evolutionary dynamics controlled by a fitness function that reports on the thermodynamic effects of amino acid mutations in protein structures. This hybrid biophysical and evolutionary model accounts for nucleotide transition/transversion rate bias, multi‐nucleotide codon changes, the number of codons per amino acid, and thermodynamic protein stability. We find that our theoretical model accurately recapitulates the complex yet universal pattern observed in common global amino acid substitution matrices used in phylogenetics. These results suggest that selection for thermodynamically stable proteins, coupled with nucleotide mutation bias filtered by the structure of the genetic code, is the primary driver behind the global amino acid substitution patterns observed in proteins throughout the tree of life. |
format | Online Article Text |
id | pubmed-8442976 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | John Wiley & Sons, Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-84429762021-09-21 A thermodynamic model of protein structure evolution explains empirical amino acid substitution matrices Norn, Christoffer André, Ingemar Theobald, Douglas L. Protein Sci Full‐Length Papers Proteins evolve under a myriad of biophysical selection pressures that collectively control the patterns of amino acid substitutions. These evolutionary pressures are sufficiently consistent over time and across protein families to produce substitution patterns, summarized in global amino acid substitution matrices such as BLOSUM, JTT, WAG, and LG, which can be used to successfully detect homologs, infer phylogenies, and reconstruct ancestral sequences. Although the factors that govern the variation of amino acid substitution rates have received much attention, the influence of thermodynamic stability constraints remains unresolved. Here we develop a simple model to calculate amino acid substitution matrices from evolutionary dynamics controlled by a fitness function that reports on the thermodynamic effects of amino acid mutations in protein structures. This hybrid biophysical and evolutionary model accounts for nucleotide transition/transversion rate bias, multi‐nucleotide codon changes, the number of codons per amino acid, and thermodynamic protein stability. We find that our theoretical model accurately recapitulates the complex yet universal pattern observed in common global amino acid substitution matrices used in phylogenetics. These results suggest that selection for thermodynamically stable proteins, coupled with nucleotide mutation bias filtered by the structure of the genetic code, is the primary driver behind the global amino acid substitution patterns observed in proteins throughout the tree of life. John Wiley & Sons, Inc. 2021-07-30 2021-10 /pmc/articles/PMC8442976/ /pubmed/34218472 http://dx.doi.org/10.1002/pro.4155 Text en © 2021 The Authors. Protein Science published by Wiley Periodicals LLC on behalf of The Protein Society. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made. |
spellingShingle | Full‐Length Papers Norn, Christoffer André, Ingemar Theobald, Douglas L. A thermodynamic model of protein structure evolution explains empirical amino acid substitution matrices |
title | A thermodynamic model of protein structure evolution explains empirical amino acid substitution matrices |
title_full | A thermodynamic model of protein structure evolution explains empirical amino acid substitution matrices |
title_fullStr | A thermodynamic model of protein structure evolution explains empirical amino acid substitution matrices |
title_full_unstemmed | A thermodynamic model of protein structure evolution explains empirical amino acid substitution matrices |
title_short | A thermodynamic model of protein structure evolution explains empirical amino acid substitution matrices |
title_sort | thermodynamic model of protein structure evolution explains empirical amino acid substitution matrices |
topic | Full‐Length Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8442976/ https://www.ncbi.nlm.nih.gov/pubmed/34218472 http://dx.doi.org/10.1002/pro.4155 |
work_keys_str_mv | AT nornchristoffer athermodynamicmodelofproteinstructureevolutionexplainsempiricalaminoacidsubstitutionmatrices AT andreingemar athermodynamicmodelofproteinstructureevolutionexplainsempiricalaminoacidsubstitutionmatrices AT theobalddouglasl athermodynamicmodelofproteinstructureevolutionexplainsempiricalaminoacidsubstitutionmatrices AT nornchristoffer thermodynamicmodelofproteinstructureevolutionexplainsempiricalaminoacidsubstitutionmatrices AT andreingemar thermodynamicmodelofproteinstructureevolutionexplainsempiricalaminoacidsubstitutionmatrices AT theobalddouglasl thermodynamicmodelofproteinstructureevolutionexplainsempiricalaminoacidsubstitutionmatrices |