Cargando…

Evolution is not Uniform Along Coding Sequences

Amino acids evolve at different speeds within protein sequences, because their functional and structural roles are different. Notably, amino acids located at the surface of proteins are known to evolve more rapidly than those in the core. In particular, amino acids at the N- and C-termini of protein...

Descripción completa

Detalles Bibliográficos
Autores principales: Bricout, Raphaël, Weil, Dominique, Stroebel, David, Genovesio, Auguste, Roest Crollius, Hugues
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10025431/
https://www.ncbi.nlm.nih.gov/pubmed/36857092
http://dx.doi.org/10.1093/molbev/msad042
_version_ 1784909329895260160
author Bricout, Raphaël
Weil, Dominique
Stroebel, David
Genovesio, Auguste
Roest Crollius, Hugues
author_facet Bricout, Raphaël
Weil, Dominique
Stroebel, David
Genovesio, Auguste
Roest Crollius, Hugues
author_sort Bricout, Raphaël
collection PubMed
description Amino acids evolve at different speeds within protein sequences, because their functional and structural roles are different. Notably, amino acids located at the surface of proteins are known to evolve more rapidly than those in the core. In particular, amino acids at the N- and C-termini of protein sequences are likely to be more exposed than those at the core of the folded protein due to their location in the peptidic chain, and they are known to be less structured. Because of these reasons, we would expect that amino acids located at protein termini would evolve faster than residues located inside the chain. Here we test this hypothesis and found that amino acids evolve almost twice as fast at protein termini compared with those in the center, hinting at a strong topological bias along the sequence length. We further show that the distribution of solvent-accessible residues and functional domains in proteins readily explain how structural and functional constraints are weaker at their termini, leading to the observed excess of amino acid substitutions. Finally, we show that the specific evolutionary rates at protein termini may have direct consequences, notably misleading in silico methods used to infer sites under positive selection within genes. These results suggest that accounting for positional information should improve evolutionary models.
format Online
Article
Text
id pubmed-10025431
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-100254312023-03-21 Evolution is not Uniform Along Coding Sequences Bricout, Raphaël Weil, Dominique Stroebel, David Genovesio, Auguste Roest Crollius, Hugues Mol Biol Evol Discoveries Amino acids evolve at different speeds within protein sequences, because their functional and structural roles are different. Notably, amino acids located at the surface of proteins are known to evolve more rapidly than those in the core. In particular, amino acids at the N- and C-termini of protein sequences are likely to be more exposed than those at the core of the folded protein due to their location in the peptidic chain, and they are known to be less structured. Because of these reasons, we would expect that amino acids located at protein termini would evolve faster than residues located inside the chain. Here we test this hypothesis and found that amino acids evolve almost twice as fast at protein termini compared with those in the center, hinting at a strong topological bias along the sequence length. We further show that the distribution of solvent-accessible residues and functional domains in proteins readily explain how structural and functional constraints are weaker at their termini, leading to the observed excess of amino acid substitutions. Finally, we show that the specific evolutionary rates at protein termini may have direct consequences, notably misleading in silico methods used to infer sites under positive selection within genes. These results suggest that accounting for positional information should improve evolutionary models. Oxford University Press 2023-02-28 /pmc/articles/PMC10025431/ /pubmed/36857092 http://dx.doi.org/10.1093/molbev/msad042 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Discoveries
Bricout, Raphaël
Weil, Dominique
Stroebel, David
Genovesio, Auguste
Roest Crollius, Hugues
Evolution is not Uniform Along Coding Sequences
title Evolution is not Uniform Along Coding Sequences
title_full Evolution is not Uniform Along Coding Sequences
title_fullStr Evolution is not Uniform Along Coding Sequences
title_full_unstemmed Evolution is not Uniform Along Coding Sequences
title_short Evolution is not Uniform Along Coding Sequences
title_sort evolution is not uniform along coding sequences
topic Discoveries
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10025431/
https://www.ncbi.nlm.nih.gov/pubmed/36857092
http://dx.doi.org/10.1093/molbev/msad042
work_keys_str_mv AT bricoutraphael evolutionisnotuniformalongcodingsequences
AT weildominique evolutionisnotuniformalongcodingsequences
AT stroebeldavid evolutionisnotuniformalongcodingsequences
AT genovesioauguste evolutionisnotuniformalongcodingsequences
AT roestcrolliushugues evolutionisnotuniformalongcodingsequences