Cargando…

Large-scale in silico mutagenesis experiments reveal optimization of genetic code and codon usage for protein mutational robustness

BACKGROUND: How, and the extent to which, evolution acts on DNA and protein sequences to ensure mutational robustness and evolvability is a long-standing open question in the field of molecular evolution. We addressed this issue through the first structurome-scale computational investigation, in whi...

Descripción completa

Detalles Bibliográficos
Autores principales: Schwersensky, Martin, Rooman, Marianne, Pucci, Fabrizio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7576759/
https://www.ncbi.nlm.nih.gov/pubmed/33081759
http://dx.doi.org/10.1186/s12915-020-00870-9
_version_ 1783598078181244928
author Schwersensky, Martin
Rooman, Marianne
Pucci, Fabrizio
author_facet Schwersensky, Martin
Rooman, Marianne
Pucci, Fabrizio
author_sort Schwersensky, Martin
collection PubMed
description BACKGROUND: How, and the extent to which, evolution acts on DNA and protein sequences to ensure mutational robustness and evolvability is a long-standing open question in the field of molecular evolution. We addressed this issue through the first structurome-scale computational investigation, in which we estimated the change in folding free energy upon all possible single-site mutations introduced in more than 20,000 protein structures, as well as through available experimental stability and fitness data. RESULTS: At the amino acid level, we found the protein surface to be more robust against random mutations than the core, this difference being stronger for small proteins. The destabilizing and neutral mutations are more numerous in the core and on the surface, respectively, whereas the stabilizing mutations are about 4% in both regions. At the genetic code level, we observed smallest destabilization for mutations that are due to substitutions of base III in the codon, followed by base I, bases I+III, base II, and other multiple base substitutions. This ranking highly anticorrelates with the codon-anticodon mispairing frequency in the translation process. This suggests that the standard genetic code is optimized to limit the impact of random mutations, but even more so to limit translation errors. At the codon level, both the codon usage and the usage bias appear to optimize mutational robustness and translation accuracy, especially for surface residues. CONCLUSION: Our results highlight the non-universality of mutational robustness and its multiscale dependence on protein features, the structure of the genetic code, and the codon usage. Our analyses and approach are strongly supported by available experimental mutagenesis data.
format Online
Article
Text
id pubmed-7576759
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-75767592020-10-21 Large-scale in silico mutagenesis experiments reveal optimization of genetic code and codon usage for protein mutational robustness Schwersensky, Martin Rooman, Marianne Pucci, Fabrizio BMC Biol Research Article BACKGROUND: How, and the extent to which, evolution acts on DNA and protein sequences to ensure mutational robustness and evolvability is a long-standing open question in the field of molecular evolution. We addressed this issue through the first structurome-scale computational investigation, in which we estimated the change in folding free energy upon all possible single-site mutations introduced in more than 20,000 protein structures, as well as through available experimental stability and fitness data. RESULTS: At the amino acid level, we found the protein surface to be more robust against random mutations than the core, this difference being stronger for small proteins. The destabilizing and neutral mutations are more numerous in the core and on the surface, respectively, whereas the stabilizing mutations are about 4% in both regions. At the genetic code level, we observed smallest destabilization for mutations that are due to substitutions of base III in the codon, followed by base I, bases I+III, base II, and other multiple base substitutions. This ranking highly anticorrelates with the codon-anticodon mispairing frequency in the translation process. This suggests that the standard genetic code is optimized to limit the impact of random mutations, but even more so to limit translation errors. At the codon level, both the codon usage and the usage bias appear to optimize mutational robustness and translation accuracy, especially for surface residues. CONCLUSION: Our results highlight the non-universality of mutational robustness and its multiscale dependence on protein features, the structure of the genetic code, and the codon usage. Our analyses and approach are strongly supported by available experimental mutagenesis data. BioMed Central 2020-10-20 /pmc/articles/PMC7576759/ /pubmed/33081759 http://dx.doi.org/10.1186/s12915-020-00870-9 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Schwersensky, Martin
Rooman, Marianne
Pucci, Fabrizio
Large-scale in silico mutagenesis experiments reveal optimization of genetic code and codon usage for protein mutational robustness
title Large-scale in silico mutagenesis experiments reveal optimization of genetic code and codon usage for protein mutational robustness
title_full Large-scale in silico mutagenesis experiments reveal optimization of genetic code and codon usage for protein mutational robustness
title_fullStr Large-scale in silico mutagenesis experiments reveal optimization of genetic code and codon usage for protein mutational robustness
title_full_unstemmed Large-scale in silico mutagenesis experiments reveal optimization of genetic code and codon usage for protein mutational robustness
title_short Large-scale in silico mutagenesis experiments reveal optimization of genetic code and codon usage for protein mutational robustness
title_sort large-scale in silico mutagenesis experiments reveal optimization of genetic code and codon usage for protein mutational robustness
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7576759/
https://www.ncbi.nlm.nih.gov/pubmed/33081759
http://dx.doi.org/10.1186/s12915-020-00870-9
work_keys_str_mv AT schwersenskymartin largescaleinsilicomutagenesisexperimentsrevealoptimizationofgeneticcodeandcodonusageforproteinmutationalrobustness
AT roomanmarianne largescaleinsilicomutagenesisexperimentsrevealoptimizationofgeneticcodeandcodonusageforproteinmutationalrobustness
AT puccifabrizio largescaleinsilicomutagenesisexperimentsrevealoptimizationofgeneticcodeandcodonusageforproteinmutationalrobustness