Cargando…
Evolution is not Uniform Along Coding Sequences
Amino acids evolve at different speeds within protein sequences, because their functional and structural roles are different. Notably, amino acids located at the surface of proteins are known to evolve more rapidly than those in the core. In particular, amino acids at the N- and C-termini of protein...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10025431/ https://www.ncbi.nlm.nih.gov/pubmed/36857092 http://dx.doi.org/10.1093/molbev/msad042 |
_version_ | 1784909329895260160 |
---|---|
author | Bricout, Raphaël Weil, Dominique Stroebel, David Genovesio, Auguste Roest Crollius, Hugues |
author_facet | Bricout, Raphaël Weil, Dominique Stroebel, David Genovesio, Auguste Roest Crollius, Hugues |
author_sort | Bricout, Raphaël |
collection | PubMed |
description | Amino acids evolve at different speeds within protein sequences, because their functional and structural roles are different. Notably, amino acids located at the surface of proteins are known to evolve more rapidly than those in the core. In particular, amino acids at the N- and C-termini of protein sequences are likely to be more exposed than those at the core of the folded protein due to their location in the peptidic chain, and they are known to be less structured. Because of these reasons, we would expect that amino acids located at protein termini would evolve faster than residues located inside the chain. Here we test this hypothesis and found that amino acids evolve almost twice as fast at protein termini compared with those in the center, hinting at a strong topological bias along the sequence length. We further show that the distribution of solvent-accessible residues and functional domains in proteins readily explain how structural and functional constraints are weaker at their termini, leading to the observed excess of amino acid substitutions. Finally, we show that the specific evolutionary rates at protein termini may have direct consequences, notably misleading in silico methods used to infer sites under positive selection within genes. These results suggest that accounting for positional information should improve evolutionary models. |
format | Online Article Text |
id | pubmed-10025431 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-100254312023-03-21 Evolution is not Uniform Along Coding Sequences Bricout, Raphaël Weil, Dominique Stroebel, David Genovesio, Auguste Roest Crollius, Hugues Mol Biol Evol Discoveries Amino acids evolve at different speeds within protein sequences, because their functional and structural roles are different. Notably, amino acids located at the surface of proteins are known to evolve more rapidly than those in the core. In particular, amino acids at the N- and C-termini of protein sequences are likely to be more exposed than those at the core of the folded protein due to their location in the peptidic chain, and they are known to be less structured. Because of these reasons, we would expect that amino acids located at protein termini would evolve faster than residues located inside the chain. Here we test this hypothesis and found that amino acids evolve almost twice as fast at protein termini compared with those in the center, hinting at a strong topological bias along the sequence length. We further show that the distribution of solvent-accessible residues and functional domains in proteins readily explain how structural and functional constraints are weaker at their termini, leading to the observed excess of amino acid substitutions. Finally, we show that the specific evolutionary rates at protein termini may have direct consequences, notably misleading in silico methods used to infer sites under positive selection within genes. These results suggest that accounting for positional information should improve evolutionary models. Oxford University Press 2023-02-28 /pmc/articles/PMC10025431/ /pubmed/36857092 http://dx.doi.org/10.1093/molbev/msad042 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Discoveries Bricout, Raphaël Weil, Dominique Stroebel, David Genovesio, Auguste Roest Crollius, Hugues Evolution is not Uniform Along Coding Sequences |
title | Evolution is not Uniform Along Coding Sequences |
title_full | Evolution is not Uniform Along Coding Sequences |
title_fullStr | Evolution is not Uniform Along Coding Sequences |
title_full_unstemmed | Evolution is not Uniform Along Coding Sequences |
title_short | Evolution is not Uniform Along Coding Sequences |
title_sort | evolution is not uniform along coding sequences |
topic | Discoveries |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10025431/ https://www.ncbi.nlm.nih.gov/pubmed/36857092 http://dx.doi.org/10.1093/molbev/msad042 |
work_keys_str_mv | AT bricoutraphael evolutionisnotuniformalongcodingsequences AT weildominique evolutionisnotuniformalongcodingsequences AT stroebeldavid evolutionisnotuniformalongcodingsequences AT genovesioauguste evolutionisnotuniformalongcodingsequences AT roestcrolliushugues evolutionisnotuniformalongcodingsequences |