Cargando…

Deciphering polymorphism in 61,157 Escherichia coli genomes via epistatic sequence landscapes

Characterizing the effect of mutations is key to understand the evolution of protein sequences and to separate neutral amino-acid changes from deleterious ones. Epistatic interactions between residues can lead to a context dependence of mutation effects. Context dependence constrains the amino-acid...

Descripción completa

Detalles Bibliográficos
Autores principales: Vigué, Lucile, Croce, Giancarlo, Petitjean, Marie, Ruppé, Etienne, Tenaillon, Olivier, Weigt, Martin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9276797/
https://www.ncbi.nlm.nih.gov/pubmed/35821377
http://dx.doi.org/10.1038/s41467-022-31643-3
_version_ 1784745807045459968
author Vigué, Lucile
Croce, Giancarlo
Petitjean, Marie
Ruppé, Etienne
Tenaillon, Olivier
Weigt, Martin
author_facet Vigué, Lucile
Croce, Giancarlo
Petitjean, Marie
Ruppé, Etienne
Tenaillon, Olivier
Weigt, Martin
author_sort Vigué, Lucile
collection PubMed
description Characterizing the effect of mutations is key to understand the evolution of protein sequences and to separate neutral amino-acid changes from deleterious ones. Epistatic interactions between residues can lead to a context dependence of mutation effects. Context dependence constrains the amino-acid changes that can contribute to polymorphism in the short term, and the ones that can accumulate between species in the long term. We use computational approaches to accurately predict the polymorphisms segregating in a panel of 61,157 Escherichia coli genomes from the analysis of distant homologues. By comparing a context-aware Direct-Coupling Analysis modelling to a non-epistatic approach, we show that the genetic context strongly constrains the tolerable amino acids in 30% to 50% of amino-acid sites. The study of more distant species suggests the gradual build-up of genetic context over long evolutionary timescales by the accumulation of small epistatic contributions.
format Online
Article
Text
id pubmed-9276797
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-92767972022-07-14 Deciphering polymorphism in 61,157 Escherichia coli genomes via epistatic sequence landscapes Vigué, Lucile Croce, Giancarlo Petitjean, Marie Ruppé, Etienne Tenaillon, Olivier Weigt, Martin Nat Commun Article Characterizing the effect of mutations is key to understand the evolution of protein sequences and to separate neutral amino-acid changes from deleterious ones. Epistatic interactions between residues can lead to a context dependence of mutation effects. Context dependence constrains the amino-acid changes that can contribute to polymorphism in the short term, and the ones that can accumulate between species in the long term. We use computational approaches to accurately predict the polymorphisms segregating in a panel of 61,157 Escherichia coli genomes from the analysis of distant homologues. By comparing a context-aware Direct-Coupling Analysis modelling to a non-epistatic approach, we show that the genetic context strongly constrains the tolerable amino acids in 30% to 50% of amino-acid sites. The study of more distant species suggests the gradual build-up of genetic context over long evolutionary timescales by the accumulation of small epistatic contributions. Nature Publishing Group UK 2022-07-12 /pmc/articles/PMC9276797/ /pubmed/35821377 http://dx.doi.org/10.1038/s41467-022-31643-3 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Vigué, Lucile
Croce, Giancarlo
Petitjean, Marie
Ruppé, Etienne
Tenaillon, Olivier
Weigt, Martin
Deciphering polymorphism in 61,157 Escherichia coli genomes via epistatic sequence landscapes
title Deciphering polymorphism in 61,157 Escherichia coli genomes via epistatic sequence landscapes
title_full Deciphering polymorphism in 61,157 Escherichia coli genomes via epistatic sequence landscapes
title_fullStr Deciphering polymorphism in 61,157 Escherichia coli genomes via epistatic sequence landscapes
title_full_unstemmed Deciphering polymorphism in 61,157 Escherichia coli genomes via epistatic sequence landscapes
title_short Deciphering polymorphism in 61,157 Escherichia coli genomes via epistatic sequence landscapes
title_sort deciphering polymorphism in 61,157 escherichia coli genomes via epistatic sequence landscapes
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9276797/
https://www.ncbi.nlm.nih.gov/pubmed/35821377
http://dx.doi.org/10.1038/s41467-022-31643-3
work_keys_str_mv AT viguelucile decipheringpolymorphismin61157escherichiacoligenomesviaepistaticsequencelandscapes
AT crocegiancarlo decipheringpolymorphismin61157escherichiacoligenomesviaepistaticsequencelandscapes
AT petitjeanmarie decipheringpolymorphismin61157escherichiacoligenomesviaepistaticsequencelandscapes
AT ruppeetienne decipheringpolymorphismin61157escherichiacoligenomesviaepistaticsequencelandscapes
AT tenaillonolivier decipheringpolymorphismin61157escherichiacoligenomesviaepistaticsequencelandscapes
AT weigtmartin decipheringpolymorphismin61157escherichiacoligenomesviaepistaticsequencelandscapes