Cargando…
Deciphering polymorphism in 61,157 Escherichia coli genomes via epistatic sequence landscapes
Characterizing the effect of mutations is key to understand the evolution of protein sequences and to separate neutral amino-acid changes from deleterious ones. Epistatic interactions between residues can lead to a context dependence of mutation effects. Context dependence constrains the amino-acid...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9276797/ https://www.ncbi.nlm.nih.gov/pubmed/35821377 http://dx.doi.org/10.1038/s41467-022-31643-3 |
_version_ | 1784745807045459968 |
---|---|
author | Vigué, Lucile Croce, Giancarlo Petitjean, Marie Ruppé, Etienne Tenaillon, Olivier Weigt, Martin |
author_facet | Vigué, Lucile Croce, Giancarlo Petitjean, Marie Ruppé, Etienne Tenaillon, Olivier Weigt, Martin |
author_sort | Vigué, Lucile |
collection | PubMed |
description | Characterizing the effect of mutations is key to understand the evolution of protein sequences and to separate neutral amino-acid changes from deleterious ones. Epistatic interactions between residues can lead to a context dependence of mutation effects. Context dependence constrains the amino-acid changes that can contribute to polymorphism in the short term, and the ones that can accumulate between species in the long term. We use computational approaches to accurately predict the polymorphisms segregating in a panel of 61,157 Escherichia coli genomes from the analysis of distant homologues. By comparing a context-aware Direct-Coupling Analysis modelling to a non-epistatic approach, we show that the genetic context strongly constrains the tolerable amino acids in 30% to 50% of amino-acid sites. The study of more distant species suggests the gradual build-up of genetic context over long evolutionary timescales by the accumulation of small epistatic contributions. |
format | Online Article Text |
id | pubmed-9276797 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-92767972022-07-14 Deciphering polymorphism in 61,157 Escherichia coli genomes via epistatic sequence landscapes Vigué, Lucile Croce, Giancarlo Petitjean, Marie Ruppé, Etienne Tenaillon, Olivier Weigt, Martin Nat Commun Article Characterizing the effect of mutations is key to understand the evolution of protein sequences and to separate neutral amino-acid changes from deleterious ones. Epistatic interactions between residues can lead to a context dependence of mutation effects. Context dependence constrains the amino-acid changes that can contribute to polymorphism in the short term, and the ones that can accumulate between species in the long term. We use computational approaches to accurately predict the polymorphisms segregating in a panel of 61,157 Escherichia coli genomes from the analysis of distant homologues. By comparing a context-aware Direct-Coupling Analysis modelling to a non-epistatic approach, we show that the genetic context strongly constrains the tolerable amino acids in 30% to 50% of amino-acid sites. The study of more distant species suggests the gradual build-up of genetic context over long evolutionary timescales by the accumulation of small epistatic contributions. Nature Publishing Group UK 2022-07-12 /pmc/articles/PMC9276797/ /pubmed/35821377 http://dx.doi.org/10.1038/s41467-022-31643-3 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Vigué, Lucile Croce, Giancarlo Petitjean, Marie Ruppé, Etienne Tenaillon, Olivier Weigt, Martin Deciphering polymorphism in 61,157 Escherichia coli genomes via epistatic sequence landscapes |
title | Deciphering polymorphism in 61,157 Escherichia coli genomes via epistatic sequence landscapes |
title_full | Deciphering polymorphism in 61,157 Escherichia coli genomes via epistatic sequence landscapes |
title_fullStr | Deciphering polymorphism in 61,157 Escherichia coli genomes via epistatic sequence landscapes |
title_full_unstemmed | Deciphering polymorphism in 61,157 Escherichia coli genomes via epistatic sequence landscapes |
title_short | Deciphering polymorphism in 61,157 Escherichia coli genomes via epistatic sequence landscapes |
title_sort | deciphering polymorphism in 61,157 escherichia coli genomes via epistatic sequence landscapes |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9276797/ https://www.ncbi.nlm.nih.gov/pubmed/35821377 http://dx.doi.org/10.1038/s41467-022-31643-3 |
work_keys_str_mv | AT viguelucile decipheringpolymorphismin61157escherichiacoligenomesviaepistaticsequencelandscapes AT crocegiancarlo decipheringpolymorphismin61157escherichiacoligenomesviaepistaticsequencelandscapes AT petitjeanmarie decipheringpolymorphismin61157escherichiacoligenomesviaepistaticsequencelandscapes AT ruppeetienne decipheringpolymorphismin61157escherichiacoligenomesviaepistaticsequencelandscapes AT tenaillonolivier decipheringpolymorphismin61157escherichiacoligenomesviaepistaticsequencelandscapes AT weigtmartin decipheringpolymorphismin61157escherichiacoligenomesviaepistaticsequencelandscapes |