Cargando…

The sequence context in poly-alanine regions: structure, function and conservation

MOTIVATION: Poly-alanine (polyA) regions are protein stretches mostly composed of alanines. Despite their abundance in eukaryotic proteomes and their association to nine inherited human diseases, the structural and functional roles exerted by polyA stretches remain poorly understood. In this work we...

Descripción completa

Detalles Bibliográficos
Autores principales: Mier, Pablo, Elena-Real, Carlos A, Cortés, Juan, Bernadó, Pau, Andrade-Navarro, Miguel A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9620824/
https://www.ncbi.nlm.nih.gov/pubmed/36106994
http://dx.doi.org/10.1093/bioinformatics/btac610
_version_ 1784821400970723328
author Mier, Pablo
Elena-Real, Carlos A
Cortés, Juan
Bernadó, Pau
Andrade-Navarro, Miguel A
author_facet Mier, Pablo
Elena-Real, Carlos A
Cortés, Juan
Bernadó, Pau
Andrade-Navarro, Miguel A
author_sort Mier, Pablo
collection PubMed
description MOTIVATION: Poly-alanine (polyA) regions are protein stretches mostly composed of alanines. Despite their abundance in eukaryotic proteomes and their association to nine inherited human diseases, the structural and functional roles exerted by polyA stretches remain poorly understood. In this work we study how the amino acid context in which polyA regions are settled in proteins influences their structure and function. RESULTS: We identified glycine and proline as the most abundant amino acids within polyA and in the flanking regions of polyA tracts, in human proteins as well as in 17 additional eukaryotic species. Our analyses indicate that the non-structuring nature of these two amino acids influences the α-helical conformations predicted for polyA, suggesting a relevant role in reducing the inherent aggregation propensity of long polyA. Then, we show how polyA position in protein N-termini relates with their function as transit peptides. PolyA placed just after the initial methionine is often predicted as part of mitochondrial transit peptides, whereas when placed in downstream positions, polyA are part of signal peptides. A few examples from known structures suggest that short polyA can emerge by alanine substitutions in α-helices; but evolution by insertion is observed for longer polyA. Our results showcase the importance of studying the sequence context of homorepeats as a mechanism to shape their structure–function relationships. AVAILABILITY AND IMPLEMENTATION: The datasets used and/or analyzed during the current study are available from the corresponding author onreasonable request. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-9620824
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-96208242022-11-01 The sequence context in poly-alanine regions: structure, function and conservation Mier, Pablo Elena-Real, Carlos A Cortés, Juan Bernadó, Pau Andrade-Navarro, Miguel A Bioinformatics Original Papers MOTIVATION: Poly-alanine (polyA) regions are protein stretches mostly composed of alanines. Despite their abundance in eukaryotic proteomes and their association to nine inherited human diseases, the structural and functional roles exerted by polyA stretches remain poorly understood. In this work we study how the amino acid context in which polyA regions are settled in proteins influences their structure and function. RESULTS: We identified glycine and proline as the most abundant amino acids within polyA and in the flanking regions of polyA tracts, in human proteins as well as in 17 additional eukaryotic species. Our analyses indicate that the non-structuring nature of these two amino acids influences the α-helical conformations predicted for polyA, suggesting a relevant role in reducing the inherent aggregation propensity of long polyA. Then, we show how polyA position in protein N-termini relates with their function as transit peptides. PolyA placed just after the initial methionine is often predicted as part of mitochondrial transit peptides, whereas when placed in downstream positions, polyA are part of signal peptides. A few examples from known structures suggest that short polyA can emerge by alanine substitutions in α-helices; but evolution by insertion is observed for longer polyA. Our results showcase the importance of studying the sequence context of homorepeats as a mechanism to shape their structure–function relationships. AVAILABILITY AND IMPLEMENTATION: The datasets used and/or analyzed during the current study are available from the corresponding author onreasonable request. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2022-09-15 /pmc/articles/PMC9620824/ /pubmed/36106994 http://dx.doi.org/10.1093/bioinformatics/btac610 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Mier, Pablo
Elena-Real, Carlos A
Cortés, Juan
Bernadó, Pau
Andrade-Navarro, Miguel A
The sequence context in poly-alanine regions: structure, function and conservation
title The sequence context in poly-alanine regions: structure, function and conservation
title_full The sequence context in poly-alanine regions: structure, function and conservation
title_fullStr The sequence context in poly-alanine regions: structure, function and conservation
title_full_unstemmed The sequence context in poly-alanine regions: structure, function and conservation
title_short The sequence context in poly-alanine regions: structure, function and conservation
title_sort sequence context in poly-alanine regions: structure, function and conservation
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9620824/
https://www.ncbi.nlm.nih.gov/pubmed/36106994
http://dx.doi.org/10.1093/bioinformatics/btac610
work_keys_str_mv AT mierpablo thesequencecontextinpolyalanineregionsstructurefunctionandconservation
AT elenarealcarlosa thesequencecontextinpolyalanineregionsstructurefunctionandconservation
AT cortesjuan thesequencecontextinpolyalanineregionsstructurefunctionandconservation
AT bernadopau thesequencecontextinpolyalanineregionsstructurefunctionandconservation
AT andradenavarromiguela thesequencecontextinpolyalanineregionsstructurefunctionandconservation
AT mierpablo sequencecontextinpolyalanineregionsstructurefunctionandconservation
AT elenarealcarlosa sequencecontextinpolyalanineregionsstructurefunctionandconservation
AT cortesjuan sequencecontextinpolyalanineregionsstructurefunctionandconservation
AT bernadopau sequencecontextinpolyalanineregionsstructurefunctionandconservation
AT andradenavarromiguela sequencecontextinpolyalanineregionsstructurefunctionandconservation