Cargando…

Pervasive, conserved secondary structure in highly charged protein regions

Understanding how protein sequences confer function remains a defining challenge in molecular biology. Two approaches have yielded enormous insight yet are often pursued separately: structure-based, where sequence-encoded structures mediate function, and disorder-based, where sequences dictate physi...

Descripción completa

Detalles Bibliográficos
Autores principales: Triandafillou, Catherine G., Pan, Rosalind Wenshan, Dinner, Aaron R., Drummond, D. Allan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10602382/
https://www.ncbi.nlm.nih.gov/pubmed/37844070
http://dx.doi.org/10.1371/journal.pcbi.1011565
_version_ 1785126390637527040
author Triandafillou, Catherine G.
Pan, Rosalind Wenshan
Dinner, Aaron R.
Drummond, D. Allan
author_facet Triandafillou, Catherine G.
Pan, Rosalind Wenshan
Dinner, Aaron R.
Drummond, D. Allan
author_sort Triandafillou, Catherine G.
collection PubMed
description Understanding how protein sequences confer function remains a defining challenge in molecular biology. Two approaches have yielded enormous insight yet are often pursued separately: structure-based, where sequence-encoded structures mediate function, and disorder-based, where sequences dictate physicochemical and dynamical properties which determine function in the absence of stable structure. Here we study highly charged protein regions (>40% charged residues), which are routinely presumed to be disordered. Using recent advances in structure prediction and experimental structures, we show that roughly 40% of these regions form well-structured helices. Features often used to predict disorder—high charge density, low hydrophobicity, low sequence complexity, and evolutionarily varying length—are also compatible with solvated, variable-length helices. We show that a simple composition classifier predicts the existence of structure far better than well-established heuristics based on charge and hydropathy. We show that helical structure is more prevalent than previously appreciated in highly charged regions of diverse proteomes and characterize the conservation of highly charged regions. Our results underscore the importance of integrating, rather than choosing between, structure- and disorder-based approaches.
format Online
Article
Text
id pubmed-10602382
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-106023822023-10-27 Pervasive, conserved secondary structure in highly charged protein regions Triandafillou, Catherine G. Pan, Rosalind Wenshan Dinner, Aaron R. Drummond, D. Allan PLoS Comput Biol Research Article Understanding how protein sequences confer function remains a defining challenge in molecular biology. Two approaches have yielded enormous insight yet are often pursued separately: structure-based, where sequence-encoded structures mediate function, and disorder-based, where sequences dictate physicochemical and dynamical properties which determine function in the absence of stable structure. Here we study highly charged protein regions (>40% charged residues), which are routinely presumed to be disordered. Using recent advances in structure prediction and experimental structures, we show that roughly 40% of these regions form well-structured helices. Features often used to predict disorder—high charge density, low hydrophobicity, low sequence complexity, and evolutionarily varying length—are also compatible with solvated, variable-length helices. We show that a simple composition classifier predicts the existence of structure far better than well-established heuristics based on charge and hydropathy. We show that helical structure is more prevalent than previously appreciated in highly charged regions of diverse proteomes and characterize the conservation of highly charged regions. Our results underscore the importance of integrating, rather than choosing between, structure- and disorder-based approaches. Public Library of Science 2023-10-16 /pmc/articles/PMC10602382/ /pubmed/37844070 http://dx.doi.org/10.1371/journal.pcbi.1011565 Text en © 2023 Triandafillou et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Triandafillou, Catherine G.
Pan, Rosalind Wenshan
Dinner, Aaron R.
Drummond, D. Allan
Pervasive, conserved secondary structure in highly charged protein regions
title Pervasive, conserved secondary structure in highly charged protein regions
title_full Pervasive, conserved secondary structure in highly charged protein regions
title_fullStr Pervasive, conserved secondary structure in highly charged protein regions
title_full_unstemmed Pervasive, conserved secondary structure in highly charged protein regions
title_short Pervasive, conserved secondary structure in highly charged protein regions
title_sort pervasive, conserved secondary structure in highly charged protein regions
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10602382/
https://www.ncbi.nlm.nih.gov/pubmed/37844070
http://dx.doi.org/10.1371/journal.pcbi.1011565
work_keys_str_mv AT triandafilloucatherineg pervasiveconservedsecondarystructureinhighlychargedproteinregions
AT panrosalindwenshan pervasiveconservedsecondarystructureinhighlychargedproteinregions
AT dinneraaronr pervasiveconservedsecondarystructureinhighlychargedproteinregions
AT drummonddallan pervasiveconservedsecondarystructureinhighlychargedproteinregions