Cargando…

Identification and Characterization of Glycine- and Arginine-Rich Motifs in Proteins by a Novel GAR Motif Finder Program

Glycine- and arginine-rich (GAR) motifs with different combinations of RG/RGG repeats are present in many proteins. The nucleolar rRNA 2′-O-methyltransferase fibrillarin (FBL) contains a conserved long N-terminal GAR domain with more than 10 RGG plus RG repeats separated by specific amino acids, mos...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Yi-Chun, Huang, Shang-Hsuan, Chang, Chien-Ping, Li, Chuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9957100/
https://www.ncbi.nlm.nih.gov/pubmed/36833257
http://dx.doi.org/10.3390/genes14020330
_version_ 1784894742154182656
author Wang, Yi-Chun
Huang, Shang-Hsuan
Chang, Chien-Ping
Li, Chuan
author_facet Wang, Yi-Chun
Huang, Shang-Hsuan
Chang, Chien-Ping
Li, Chuan
author_sort Wang, Yi-Chun
collection PubMed
description Glycine- and arginine-rich (GAR) motifs with different combinations of RG/RGG repeats are present in many proteins. The nucleolar rRNA 2′-O-methyltransferase fibrillarin (FBL) contains a conserved long N-terminal GAR domain with more than 10 RGG plus RG repeats separated by specific amino acids, mostly phenylanalines. We developed a GAR motif finder (GMF) program based on the features of the GAR domain of FBL. The G(0,3)-X(0,1)-R-G(1,2)-X(0,5)-G(0,2)-X(0,1)-R-G(1,2) pattern allows the accommodation of extra-long GAR motifs with continuous RG/RGG interrupted by polyglycine or other amino acids. The program has a graphic interface and can easily output the results as .csv and .txt files. We used GMF to show the characteristics of the long GAR domains in FBL and two other nucleolar proteins, nucleolin and GAR1. GMF analyses can illustrate the similarities and also differences between the long GAR domains in the three nucleolar proteins and motifs in other typical RG/RGG-repeat-containing proteins, specifically the FET family members FUS, EWS, and TAF15 in position, motif length, RG/RGG number, and amino acid composition. We also used GMF to analyze the human proteome and focused on the ones with at least 10 RGG plus RG repeats. We showed the classification of the long GAR motifs and their putative correlation with protein/RNA interactions and liquid–liquid phase separation. The GMF algorithm can facilitate further systematic analyses of the GAR motifs in proteins and proteomes.
format Online
Article
Text
id pubmed-9957100
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-99571002023-02-25 Identification and Characterization of Glycine- and Arginine-Rich Motifs in Proteins by a Novel GAR Motif Finder Program Wang, Yi-Chun Huang, Shang-Hsuan Chang, Chien-Ping Li, Chuan Genes (Basel) Article Glycine- and arginine-rich (GAR) motifs with different combinations of RG/RGG repeats are present in many proteins. The nucleolar rRNA 2′-O-methyltransferase fibrillarin (FBL) contains a conserved long N-terminal GAR domain with more than 10 RGG plus RG repeats separated by specific amino acids, mostly phenylanalines. We developed a GAR motif finder (GMF) program based on the features of the GAR domain of FBL. The G(0,3)-X(0,1)-R-G(1,2)-X(0,5)-G(0,2)-X(0,1)-R-G(1,2) pattern allows the accommodation of extra-long GAR motifs with continuous RG/RGG interrupted by polyglycine or other amino acids. The program has a graphic interface and can easily output the results as .csv and .txt files. We used GMF to show the characteristics of the long GAR domains in FBL and two other nucleolar proteins, nucleolin and GAR1. GMF analyses can illustrate the similarities and also differences between the long GAR domains in the three nucleolar proteins and motifs in other typical RG/RGG-repeat-containing proteins, specifically the FET family members FUS, EWS, and TAF15 in position, motif length, RG/RGG number, and amino acid composition. We also used GMF to analyze the human proteome and focused on the ones with at least 10 RGG plus RG repeats. We showed the classification of the long GAR motifs and their putative correlation with protein/RNA interactions and liquid–liquid phase separation. The GMF algorithm can facilitate further systematic analyses of the GAR motifs in proteins and proteomes. MDPI 2023-01-27 /pmc/articles/PMC9957100/ /pubmed/36833257 http://dx.doi.org/10.3390/genes14020330 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wang, Yi-Chun
Huang, Shang-Hsuan
Chang, Chien-Ping
Li, Chuan
Identification and Characterization of Glycine- and Arginine-Rich Motifs in Proteins by a Novel GAR Motif Finder Program
title Identification and Characterization of Glycine- and Arginine-Rich Motifs in Proteins by a Novel GAR Motif Finder Program
title_full Identification and Characterization of Glycine- and Arginine-Rich Motifs in Proteins by a Novel GAR Motif Finder Program
title_fullStr Identification and Characterization of Glycine- and Arginine-Rich Motifs in Proteins by a Novel GAR Motif Finder Program
title_full_unstemmed Identification and Characterization of Glycine- and Arginine-Rich Motifs in Proteins by a Novel GAR Motif Finder Program
title_short Identification and Characterization of Glycine- and Arginine-Rich Motifs in Proteins by a Novel GAR Motif Finder Program
title_sort identification and characterization of glycine- and arginine-rich motifs in proteins by a novel gar motif finder program
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9957100/
https://www.ncbi.nlm.nih.gov/pubmed/36833257
http://dx.doi.org/10.3390/genes14020330
work_keys_str_mv AT wangyichun identificationandcharacterizationofglycineandargininerichmotifsinproteinsbyanovelgarmotiffinderprogram
AT huangshanghsuan identificationandcharacterizationofglycineandargininerichmotifsinproteinsbyanovelgarmotiffinderprogram
AT changchienping identificationandcharacterizationofglycineandargininerichmotifsinproteinsbyanovelgarmotiffinderprogram
AT lichuan identificationandcharacterizationofglycineandargininerichmotifsinproteinsbyanovelgarmotiffinderprogram