Cargando…

Understanding and identifying amino acid repeats

Amino acid repeats (AARs) are abundant in protein sequences. They have particular roles in protein function and evolution. Simple repeat patterns generated by DNA slippage tend to introduce length variations and point mutations in repeat regions. Loss of normal and gain of abnormal function owing to...

Descripción completa

Detalles Bibliográficos
Autores principales: Luo, Hong, Nijveen, Harm
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4103538/
https://www.ncbi.nlm.nih.gov/pubmed/23418055
http://dx.doi.org/10.1093/bib/bbt003
_version_ 1782327164997206016
author Luo, Hong
Nijveen, Harm
author_facet Luo, Hong
Nijveen, Harm
author_sort Luo, Hong
collection PubMed
description Amino acid repeats (AARs) are abundant in protein sequences. They have particular roles in protein function and evolution. Simple repeat patterns generated by DNA slippage tend to introduce length variations and point mutations in repeat regions. Loss of normal and gain of abnormal function owing to their variable length are potential risks leading to diseases. Repeats with complex patterns mostly refer to the functional domain repeats, such as the well-known leucine-rich repeat and WD repeat, which are frequently involved in protein–protein interaction. They are mainly derived from internal gene duplication events and stabilized by ‘gate-keeper’ residues, which play crucial roles in preventing inter-domain aggregation. AARs are widely distributed in different proteomes across a variety of taxonomic ranges, and especially abundant in eukaryotic proteins. However, their specific evolutionary and functional scenarios are still poorly understood. Identifying AARs in protein sequences is the first step for the further investigation of their biological function and evolutionary mechanism. In principle, this is an NP-hard problem, as most of the repeat fragments are shaped by a series of sophisticated evolutionary events and become latent periodical patterns. It is not possible to define a uniform criterion for detecting and verifying various repeat patterns. Instead, different algorithms based on different strategies have been developed to cope with different repeat patterns. In this review, we attempt to describe the amino acid repeat-detection algorithms currently available and compare their strategies based on an in-depth analysis of the biological significance of protein repeats.
format Online
Article
Text
id pubmed-4103538
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-41035382014-07-18 Understanding and identifying amino acid repeats Luo, Hong Nijveen, Harm Brief Bioinform Papers Amino acid repeats (AARs) are abundant in protein sequences. They have particular roles in protein function and evolution. Simple repeat patterns generated by DNA slippage tend to introduce length variations and point mutations in repeat regions. Loss of normal and gain of abnormal function owing to their variable length are potential risks leading to diseases. Repeats with complex patterns mostly refer to the functional domain repeats, such as the well-known leucine-rich repeat and WD repeat, which are frequently involved in protein–protein interaction. They are mainly derived from internal gene duplication events and stabilized by ‘gate-keeper’ residues, which play crucial roles in preventing inter-domain aggregation. AARs are widely distributed in different proteomes across a variety of taxonomic ranges, and especially abundant in eukaryotic proteins. However, their specific evolutionary and functional scenarios are still poorly understood. Identifying AARs in protein sequences is the first step for the further investigation of their biological function and evolutionary mechanism. In principle, this is an NP-hard problem, as most of the repeat fragments are shaped by a series of sophisticated evolutionary events and become latent periodical patterns. It is not possible to define a uniform criterion for detecting and verifying various repeat patterns. Instead, different algorithms based on different strategies have been developed to cope with different repeat patterns. In this review, we attempt to describe the amino acid repeat-detection algorithms currently available and compare their strategies based on an in-depth analysis of the biological significance of protein repeats. Oxford University Press 2014-07 2014-07-16 /pmc/articles/PMC4103538/ /pubmed/23418055 http://dx.doi.org/10.1093/bib/bbt003 Text en © The Author 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Papers
Luo, Hong
Nijveen, Harm
Understanding and identifying amino acid repeats
title Understanding and identifying amino acid repeats
title_full Understanding and identifying amino acid repeats
title_fullStr Understanding and identifying amino acid repeats
title_full_unstemmed Understanding and identifying amino acid repeats
title_short Understanding and identifying amino acid repeats
title_sort understanding and identifying amino acid repeats
topic Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4103538/
https://www.ncbi.nlm.nih.gov/pubmed/23418055
http://dx.doi.org/10.1093/bib/bbt003
work_keys_str_mv AT luohong understandingandidentifyingaminoacidrepeats
AT nijveenharm understandingandidentifyingaminoacidrepeats