Cargando…
Modeling Sequence-Space Exploration and Emergence of Epistatic Signals in Protein Evolution
During their evolution, proteins explore sequence space via an interplay between random mutations and phenotypic selection. Here, we build upon recent progress in reconstructing data-driven fitness landscapes for families of homologous proteins, to propose stochastic models of experimental protein e...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8789065/ https://www.ncbi.nlm.nih.gov/pubmed/34751386 http://dx.doi.org/10.1093/molbev/msab321 |
_version_ | 1784639683888676864 |
---|---|
author | Bisardi, Matteo Rodriguez-Rivas, Juan Zamponi, Francesco Weigt, Martin |
author_facet | Bisardi, Matteo Rodriguez-Rivas, Juan Zamponi, Francesco Weigt, Martin |
author_sort | Bisardi, Matteo |
collection | PubMed |
description | During their evolution, proteins explore sequence space via an interplay between random mutations and phenotypic selection. Here, we build upon recent progress in reconstructing data-driven fitness landscapes for families of homologous proteins, to propose stochastic models of experimental protein evolution. These models predict quantitatively important features of experimentally evolved sequence libraries, like fitness distributions and position-specific mutational spectra. They also allow us to efficiently simulate sequence libraries for a vast array of combinations of experimental parameters like sequence divergence, selection strength, and library size. We showcase the potential of the approach in reanalyzing two recent experiments to determine protein structure from signals of epistasis emerging in experimental sequence libraries. To be detectable, these signals require sufficiently large and sufficiently diverged libraries. Our modeling framework offers a quantitative explanation for different outcomes of recently published experiments. Furthermore, we can forecast the outcome of time- and resource-intensive evolution experiments, opening thereby a way to computationally optimize experimental protocols. |
format | Online Article Text |
id | pubmed-8789065 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-87890652022-01-26 Modeling Sequence-Space Exploration and Emergence of Epistatic Signals in Protein Evolution Bisardi, Matteo Rodriguez-Rivas, Juan Zamponi, Francesco Weigt, Martin Mol Biol Evol Methods During their evolution, proteins explore sequence space via an interplay between random mutations and phenotypic selection. Here, we build upon recent progress in reconstructing data-driven fitness landscapes for families of homologous proteins, to propose stochastic models of experimental protein evolution. These models predict quantitatively important features of experimentally evolved sequence libraries, like fitness distributions and position-specific mutational spectra. They also allow us to efficiently simulate sequence libraries for a vast array of combinations of experimental parameters like sequence divergence, selection strength, and library size. We showcase the potential of the approach in reanalyzing two recent experiments to determine protein structure from signals of epistasis emerging in experimental sequence libraries. To be detectable, these signals require sufficiently large and sufficiently diverged libraries. Our modeling framework offers a quantitative explanation for different outcomes of recently published experiments. Furthermore, we can forecast the outcome of time- and resource-intensive evolution experiments, opening thereby a way to computationally optimize experimental protocols. Oxford University Press 2021-11-09 /pmc/articles/PMC8789065/ /pubmed/34751386 http://dx.doi.org/10.1093/molbev/msab321 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Bisardi, Matteo Rodriguez-Rivas, Juan Zamponi, Francesco Weigt, Martin Modeling Sequence-Space Exploration and Emergence of Epistatic Signals in Protein Evolution |
title | Modeling Sequence-Space Exploration and Emergence of Epistatic Signals in Protein Evolution |
title_full | Modeling Sequence-Space Exploration and Emergence of Epistatic Signals in Protein Evolution |
title_fullStr | Modeling Sequence-Space Exploration and Emergence of Epistatic Signals in Protein Evolution |
title_full_unstemmed | Modeling Sequence-Space Exploration and Emergence of Epistatic Signals in Protein Evolution |
title_short | Modeling Sequence-Space Exploration and Emergence of Epistatic Signals in Protein Evolution |
title_sort | modeling sequence-space exploration and emergence of epistatic signals in protein evolution |
topic | Methods |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8789065/ https://www.ncbi.nlm.nih.gov/pubmed/34751386 http://dx.doi.org/10.1093/molbev/msab321 |
work_keys_str_mv | AT bisardimatteo modelingsequencespaceexplorationandemergenceofepistaticsignalsinproteinevolution AT rodriguezrivasjuan modelingsequencespaceexplorationandemergenceofepistaticsignalsinproteinevolution AT zamponifrancesco modelingsequencespaceexplorationandemergenceofepistaticsignalsinproteinevolution AT weigtmartin modelingsequencespaceexplorationandemergenceofepistaticsignalsinproteinevolution |