Cargando…

Identification and Analysis of Long Repeats of Proteins at the Domain Level

Amino acid repeats play an important role in the structure and function of proteins. Analysis of long repeats in protein sequences enables one to understand their abundance, structure and function in the protein universe. In the present study, amino acid repeats of length >50 (long repeats) were...

Descripción completa

Detalles Bibliográficos
Autores principales: Rajathei, David Mary, Parthasarathy, Subbiah, Selvaraj, Samuel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6795024/
https://www.ncbi.nlm.nih.gov/pubmed/31649924
http://dx.doi.org/10.3389/fbioe.2019.00250
_version_ 1783459407552577536
author Rajathei, David Mary
Parthasarathy, Subbiah
Selvaraj, Samuel
author_facet Rajathei, David Mary
Parthasarathy, Subbiah
Selvaraj, Samuel
author_sort Rajathei, David Mary
collection PubMed
description Amino acid repeats play an important role in the structure and function of proteins. Analysis of long repeats in protein sequences enables one to understand their abundance, structure and function in the protein universe. In the present study, amino acid repeats of length >50 (long repeats) were identified in a non-redundant set of UniProt sequences using the RADAR program. The underlying structures and functions of these long repeats were carried out using the Gene3D for structural domains, Pfam for functional domains and enzyme and non-enzyme functional classification for catalytic and binding of the proteins. From a structural perspective, these long repeats seem to predominantly occur in certain architectures such as sandwich, bundle, barrel, and roll and within these architectures abundant in the superfolds. The lengths of the repeats within each fold are not uniform exhibiting different structures for different functions. We also observed that long repeats are in the domain regions of the family and are involved in the function of the proteins. After grouping based on enzyme and non-enzyme classes, we observed the abundant occurrence of long repeats in specific catalytic and binding of the proteins. In this study, we have analyzed the occurrence of long repeats in the protein sequence universe apart from well-characterized short tandem repeats in sequences and their structures and functions of the proteins at the domain level. The present study suggests that long repeats may play an important role in the structure and function of domains of the proteins.
format Online
Article
Text
id pubmed-6795024
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-67950242019-10-24 Identification and Analysis of Long Repeats of Proteins at the Domain Level Rajathei, David Mary Parthasarathy, Subbiah Selvaraj, Samuel Front Bioeng Biotechnol Bioengineering and Biotechnology Amino acid repeats play an important role in the structure and function of proteins. Analysis of long repeats in protein sequences enables one to understand their abundance, structure and function in the protein universe. In the present study, amino acid repeats of length >50 (long repeats) were identified in a non-redundant set of UniProt sequences using the RADAR program. The underlying structures and functions of these long repeats were carried out using the Gene3D for structural domains, Pfam for functional domains and enzyme and non-enzyme functional classification for catalytic and binding of the proteins. From a structural perspective, these long repeats seem to predominantly occur in certain architectures such as sandwich, bundle, barrel, and roll and within these architectures abundant in the superfolds. The lengths of the repeats within each fold are not uniform exhibiting different structures for different functions. We also observed that long repeats are in the domain regions of the family and are involved in the function of the proteins. After grouping based on enzyme and non-enzyme classes, we observed the abundant occurrence of long repeats in specific catalytic and binding of the proteins. In this study, we have analyzed the occurrence of long repeats in the protein sequence universe apart from well-characterized short tandem repeats in sequences and their structures and functions of the proteins at the domain level. The present study suggests that long repeats may play an important role in the structure and function of domains of the proteins. Frontiers Media S.A. 2019-10-08 /pmc/articles/PMC6795024/ /pubmed/31649924 http://dx.doi.org/10.3389/fbioe.2019.00250 Text en Copyright © 2019 Rajathei, Parthasarathy and Selvaraj. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Bioengineering and Biotechnology
Rajathei, David Mary
Parthasarathy, Subbiah
Selvaraj, Samuel
Identification and Analysis of Long Repeats of Proteins at the Domain Level
title Identification and Analysis of Long Repeats of Proteins at the Domain Level
title_full Identification and Analysis of Long Repeats of Proteins at the Domain Level
title_fullStr Identification and Analysis of Long Repeats of Proteins at the Domain Level
title_full_unstemmed Identification and Analysis of Long Repeats of Proteins at the Domain Level
title_short Identification and Analysis of Long Repeats of Proteins at the Domain Level
title_sort identification and analysis of long repeats of proteins at the domain level
topic Bioengineering and Biotechnology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6795024/
https://www.ncbi.nlm.nih.gov/pubmed/31649924
http://dx.doi.org/10.3389/fbioe.2019.00250
work_keys_str_mv AT rajatheidavidmary identificationandanalysisoflongrepeatsofproteinsatthedomainlevel
AT parthasarathysubbiah identificationandanalysisoflongrepeatsofproteinsatthedomainlevel
AT selvarajsamuel identificationandanalysisoflongrepeatsofproteinsatthedomainlevel