Cargando…

Pervasive, conserved secondary structure in highly charged protein regions

Understanding how protein sequences confer function remains a defining challenge in molecular biology. Two approaches have yielded enormous insight yet are often pursued separately: structure-based, where sequence-encoded structures mediate function, and disorder-based, where sequences dictate physi...

Descripción completa

Detalles Bibliográficos
Autores principales: Triandafillou, Catherine G., Pan, Rosalind Wenshan, Dinner, Aaron R., Drummond, D. Allan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9949069/
https://www.ncbi.nlm.nih.gov/pubmed/36824805
http://dx.doi.org/10.1101/2023.02.15.528637
_version_ 1784892903292665856
author Triandafillou, Catherine G.
Pan, Rosalind Wenshan
Dinner, Aaron R.
Drummond, D. Allan
author_facet Triandafillou, Catherine G.
Pan, Rosalind Wenshan
Dinner, Aaron R.
Drummond, D. Allan
author_sort Triandafillou, Catherine G.
collection PubMed
description Understanding how protein sequences confer function remains a defining challenge in molecular biology. Two approaches have yielded enormous insight yet are often pursued separately: structure-based, where sequence-encoded structures mediate function, and disorder-based, where sequences dictate physicochemical and dynamical properties which determine function in the absence of stable structure. Here we study highly charged protein regions (>40% charged residues), which are routinely presumed to be disordered. Using recent advances in structure prediction and experimental structures, we show that roughly 40% of these regions form well-structured helices. Features often used to predict disorder—high charge density, low hydrophobicity, low sequence complexity, and evolutionarily varying length—are also compatible with solvated, variable-length helices. We show that a simple composition classifier predicts the existence of structure far better than well-established heuristics based on charge and hydropathy. We show that helical structure is more prevalent than previously appreciated in highly charged regions of diverse proteomes and characterize the conservation of highly charged regions. Our results underscore the importance of integrating, rather than choosing between, structure- and disorder-based approaches.
format Online
Article
Text
id pubmed-9949069
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-99490692023-02-24 Pervasive, conserved secondary structure in highly charged protein regions Triandafillou, Catherine G. Pan, Rosalind Wenshan Dinner, Aaron R. Drummond, D. Allan bioRxiv Article Understanding how protein sequences confer function remains a defining challenge in molecular biology. Two approaches have yielded enormous insight yet are often pursued separately: structure-based, where sequence-encoded structures mediate function, and disorder-based, where sequences dictate physicochemical and dynamical properties which determine function in the absence of stable structure. Here we study highly charged protein regions (>40% charged residues), which are routinely presumed to be disordered. Using recent advances in structure prediction and experimental structures, we show that roughly 40% of these regions form well-structured helices. Features often used to predict disorder—high charge density, low hydrophobicity, low sequence complexity, and evolutionarily varying length—are also compatible with solvated, variable-length helices. We show that a simple composition classifier predicts the existence of structure far better than well-established heuristics based on charge and hydropathy. We show that helical structure is more prevalent than previously appreciated in highly charged regions of diverse proteomes and characterize the conservation of highly charged regions. Our results underscore the importance of integrating, rather than choosing between, structure- and disorder-based approaches. Cold Spring Harbor Laboratory 2023-02-15 /pmc/articles/PMC9949069/ /pubmed/36824805 http://dx.doi.org/10.1101/2023.02.15.528637 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Triandafillou, Catherine G.
Pan, Rosalind Wenshan
Dinner, Aaron R.
Drummond, D. Allan
Pervasive, conserved secondary structure in highly charged protein regions
title Pervasive, conserved secondary structure in highly charged protein regions
title_full Pervasive, conserved secondary structure in highly charged protein regions
title_fullStr Pervasive, conserved secondary structure in highly charged protein regions
title_full_unstemmed Pervasive, conserved secondary structure in highly charged protein regions
title_short Pervasive, conserved secondary structure in highly charged protein regions
title_sort pervasive, conserved secondary structure in highly charged protein regions
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9949069/
https://www.ncbi.nlm.nih.gov/pubmed/36824805
http://dx.doi.org/10.1101/2023.02.15.528637
work_keys_str_mv AT triandafilloucatherineg pervasiveconservedsecondarystructureinhighlychargedproteinregions
AT panrosalindwenshan pervasiveconservedsecondarystructureinhighlychargedproteinregions
AT dinneraaronr pervasiveconservedsecondarystructureinhighlychargedproteinregions
AT drummonddallan pervasiveconservedsecondarystructureinhighlychargedproteinregions