Cargando…

Expansion and functional analysis of the SR-related protein family across the domains of life

Serine/arginine-rich (SR) proteins comprise a family of proteins that is predominantly found in eukaryotes and plays a prominent role in RNA splicing. A characteristic feature of SR proteins is the presence of an S/R-rich low-complexity domain (RS domain), often in conjunction with spatially distinc...

Descripción completa

Detalles Bibliográficos
Autores principales: Cascarina, Sean M., Ross, Eric D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9479744/
https://www.ncbi.nlm.nih.gov/pubmed/35863866
http://dx.doi.org/10.1261/rna.079170.122
_version_ 1784790860113641472
author Cascarina, Sean M.
Ross, Eric D.
author_facet Cascarina, Sean M.
Ross, Eric D.
author_sort Cascarina, Sean M.
collection PubMed
description Serine/arginine-rich (SR) proteins comprise a family of proteins that is predominantly found in eukaryotes and plays a prominent role in RNA splicing. A characteristic feature of SR proteins is the presence of an S/R-rich low-complexity domain (RS domain), often in conjunction with spatially distinct RNA recognition motifs (RRMs). To date, 52 human proteins have been classified as SR or SR-related proteins. Here, using an unbiased series of composition criteria together with enrichment for known RNA binding activity, we identified >100 putative SR-related proteins in the human proteome. This method recovers known SR and SR-related proteins with high sensitivity (∼94%), yet identifies a number of additional proteins with many of the hallmark features of true SR-related proteins. Newly identified SR-related proteins display slightly different amino acid compositions yet similar levels of post-translational modification, suggesting that these new SR-related candidates are regulated in vivo and functionally important. Furthermore, candidate SR-related proteins with known RNA-binding activity (but not currently recognized as SR-related proteins) are nevertheless strongly associated with a variety of functions related to mRNA splicing and nuclear speckles. Finally, we applied our SR search method to all available reference proteomes, and provide maps of RS domains and Pfam annotations for all putative SR-related proteins as a resource. Together, these results expand the set of SR-related proteins in humans, and identify the most common functions associated with SR-related proteins across all domains of life.
format Online
Article
Text
id pubmed-9479744
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-94797442022-10-01 Expansion and functional analysis of the SR-related protein family across the domains of life Cascarina, Sean M. Ross, Eric D. RNA Bioinformatics Serine/arginine-rich (SR) proteins comprise a family of proteins that is predominantly found in eukaryotes and plays a prominent role in RNA splicing. A characteristic feature of SR proteins is the presence of an S/R-rich low-complexity domain (RS domain), often in conjunction with spatially distinct RNA recognition motifs (RRMs). To date, 52 human proteins have been classified as SR or SR-related proteins. Here, using an unbiased series of composition criteria together with enrichment for known RNA binding activity, we identified >100 putative SR-related proteins in the human proteome. This method recovers known SR and SR-related proteins with high sensitivity (∼94%), yet identifies a number of additional proteins with many of the hallmark features of true SR-related proteins. Newly identified SR-related proteins display slightly different amino acid compositions yet similar levels of post-translational modification, suggesting that these new SR-related candidates are regulated in vivo and functionally important. Furthermore, candidate SR-related proteins with known RNA-binding activity (but not currently recognized as SR-related proteins) are nevertheless strongly associated with a variety of functions related to mRNA splicing and nuclear speckles. Finally, we applied our SR search method to all available reference proteomes, and provide maps of RS domains and Pfam annotations for all putative SR-related proteins as a resource. Together, these results expand the set of SR-related proteins in humans, and identify the most common functions associated with SR-related proteins across all domains of life. Cold Spring Harbor Laboratory Press 2022-10 /pmc/articles/PMC9479744/ /pubmed/35863866 http://dx.doi.org/10.1261/rna.079170.122 Text en © 2022 Cascarina and Ross; Published by Cold Spring Harbor Laboratory Press for the RNA Society https://creativecommons.org/licenses/by/4.0/This article, published in RNA, is available under a Creative Commons License (Attribution 4.0 International), as described at http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Bioinformatics
Cascarina, Sean M.
Ross, Eric D.
Expansion and functional analysis of the SR-related protein family across the domains of life
title Expansion and functional analysis of the SR-related protein family across the domains of life
title_full Expansion and functional analysis of the SR-related protein family across the domains of life
title_fullStr Expansion and functional analysis of the SR-related protein family across the domains of life
title_full_unstemmed Expansion and functional analysis of the SR-related protein family across the domains of life
title_short Expansion and functional analysis of the SR-related protein family across the domains of life
title_sort expansion and functional analysis of the sr-related protein family across the domains of life
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9479744/
https://www.ncbi.nlm.nih.gov/pubmed/35863866
http://dx.doi.org/10.1261/rna.079170.122
work_keys_str_mv AT cascarinaseanm expansionandfunctionalanalysisofthesrrelatedproteinfamilyacrossthedomainsoflife
AT rossericd expansionandfunctionalanalysisofthesrrelatedproteinfamilyacrossthedomainsoflife