Cargando…

A thermodynamic model of protein structure evolution explains empirical amino acid substitution matrices

Proteins evolve under a myriad of biophysical selection pressures that collectively control the patterns of amino acid substitutions. These evolutionary pressures are sufficiently consistent over time and across protein families to produce substitution patterns, summarized in global amino acid subst...

Descripción completa

Detalles Bibliográficos
Autores principales: Norn, Christoffer, André, Ingemar, Theobald, Douglas L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley & Sons, Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8442976/
https://www.ncbi.nlm.nih.gov/pubmed/34218472
http://dx.doi.org/10.1002/pro.4155
_version_ 1783753099832197120
author Norn, Christoffer
André, Ingemar
Theobald, Douglas L.
author_facet Norn, Christoffer
André, Ingemar
Theobald, Douglas L.
author_sort Norn, Christoffer
collection PubMed
description Proteins evolve under a myriad of biophysical selection pressures that collectively control the patterns of amino acid substitutions. These evolutionary pressures are sufficiently consistent over time and across protein families to produce substitution patterns, summarized in global amino acid substitution matrices such as BLOSUM, JTT, WAG, and LG, which can be used to successfully detect homologs, infer phylogenies, and reconstruct ancestral sequences. Although the factors that govern the variation of amino acid substitution rates have received much attention, the influence of thermodynamic stability constraints remains unresolved. Here we develop a simple model to calculate amino acid substitution matrices from evolutionary dynamics controlled by a fitness function that reports on the thermodynamic effects of amino acid mutations in protein structures. This hybrid biophysical and evolutionary model accounts for nucleotide transition/transversion rate bias, multi‐nucleotide codon changes, the number of codons per amino acid, and thermodynamic protein stability. We find that our theoretical model accurately recapitulates the complex yet universal pattern observed in common global amino acid substitution matrices used in phylogenetics. These results suggest that selection for thermodynamically stable proteins, coupled with nucleotide mutation bias filtered by the structure of the genetic code, is the primary driver behind the global amino acid substitution patterns observed in proteins throughout the tree of life.
format Online
Article
Text
id pubmed-8442976
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley & Sons, Inc.
record_format MEDLINE/PubMed
spelling pubmed-84429762021-09-21 A thermodynamic model of protein structure evolution explains empirical amino acid substitution matrices Norn, Christoffer André, Ingemar Theobald, Douglas L. Protein Sci Full‐Length Papers Proteins evolve under a myriad of biophysical selection pressures that collectively control the patterns of amino acid substitutions. These evolutionary pressures are sufficiently consistent over time and across protein families to produce substitution patterns, summarized in global amino acid substitution matrices such as BLOSUM, JTT, WAG, and LG, which can be used to successfully detect homologs, infer phylogenies, and reconstruct ancestral sequences. Although the factors that govern the variation of amino acid substitution rates have received much attention, the influence of thermodynamic stability constraints remains unresolved. Here we develop a simple model to calculate amino acid substitution matrices from evolutionary dynamics controlled by a fitness function that reports on the thermodynamic effects of amino acid mutations in protein structures. This hybrid biophysical and evolutionary model accounts for nucleotide transition/transversion rate bias, multi‐nucleotide codon changes, the number of codons per amino acid, and thermodynamic protein stability. We find that our theoretical model accurately recapitulates the complex yet universal pattern observed in common global amino acid substitution matrices used in phylogenetics. These results suggest that selection for thermodynamically stable proteins, coupled with nucleotide mutation bias filtered by the structure of the genetic code, is the primary driver behind the global amino acid substitution patterns observed in proteins throughout the tree of life. John Wiley & Sons, Inc. 2021-07-30 2021-10 /pmc/articles/PMC8442976/ /pubmed/34218472 http://dx.doi.org/10.1002/pro.4155 Text en © 2021 The Authors. Protein Science published by Wiley Periodicals LLC on behalf of The Protein Society. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made.
spellingShingle Full‐Length Papers
Norn, Christoffer
André, Ingemar
Theobald, Douglas L.
A thermodynamic model of protein structure evolution explains empirical amino acid substitution matrices
title A thermodynamic model of protein structure evolution explains empirical amino acid substitution matrices
title_full A thermodynamic model of protein structure evolution explains empirical amino acid substitution matrices
title_fullStr A thermodynamic model of protein structure evolution explains empirical amino acid substitution matrices
title_full_unstemmed A thermodynamic model of protein structure evolution explains empirical amino acid substitution matrices
title_short A thermodynamic model of protein structure evolution explains empirical amino acid substitution matrices
title_sort thermodynamic model of protein structure evolution explains empirical amino acid substitution matrices
topic Full‐Length Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8442976/
https://www.ncbi.nlm.nih.gov/pubmed/34218472
http://dx.doi.org/10.1002/pro.4155
work_keys_str_mv AT nornchristoffer athermodynamicmodelofproteinstructureevolutionexplainsempiricalaminoacidsubstitutionmatrices
AT andreingemar athermodynamicmodelofproteinstructureevolutionexplainsempiricalaminoacidsubstitutionmatrices
AT theobalddouglasl athermodynamicmodelofproteinstructureevolutionexplainsempiricalaminoacidsubstitutionmatrices
AT nornchristoffer thermodynamicmodelofproteinstructureevolutionexplainsempiricalaminoacidsubstitutionmatrices
AT andreingemar thermodynamicmodelofproteinstructureevolutionexplainsempiricalaminoacidsubstitutionmatrices
AT theobalddouglasl thermodynamicmodelofproteinstructureevolutionexplainsempiricalaminoacidsubstitutionmatrices