Cargando…

Understanding and identifying amino acid repeats

Amino acid repeats (AARs) are abundant in protein sequences. They have particular roles in protein function and evolution. Simple repeat patterns generated by DNA slippage tend to introduce length variations and point mutations in repeat regions. Loss of normal and gain of abnormal function owing to...

Descripción completa

Detalles Bibliográficos
Autores principales:	Luo, Hong, Nijveen, Harm
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2014
Materias:	Papers
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4103538/ https://www.ncbi.nlm.nih.gov/pubmed/23418055 http://dx.doi.org/10.1093/bib/bbt003

_version_	1782327164997206016
author	Luo, Hong Nijveen, Harm
author_facet	Luo, Hong Nijveen, Harm
author_sort	Luo, Hong
collection	PubMed
description	Amino acid repeats (AARs) are abundant in protein sequences. They have particular roles in protein function and evolution. Simple repeat patterns generated by DNA slippage tend to introduce length variations and point mutations in repeat regions. Loss of normal and gain of abnormal function owing to their variable length are potential risks leading to diseases. Repeats with complex patterns mostly refer to the functional domain repeats, such as the well-known leucine-rich repeat and WD repeat, which are frequently involved in protein–protein interaction. They are mainly derived from internal gene duplication events and stabilized by ‘gate-keeper’ residues, which play crucial roles in preventing inter-domain aggregation. AARs are widely distributed in different proteomes across a variety of taxonomic ranges, and especially abundant in eukaryotic proteins. However, their specific evolutionary and functional scenarios are still poorly understood. Identifying AARs in protein sequences is the first step for the further investigation of their biological function and evolutionary mechanism. In principle, this is an NP-hard problem, as most of the repeat fragments are shaped by a series of sophisticated evolutionary events and become latent periodical patterns. It is not possible to define a uniform criterion for detecting and verifying various repeat patterns. Instead, different algorithms based on different strategies have been developed to cope with different repeat patterns. In this review, we attempt to describe the amino acid repeat-detection algorithms currently available and compare their strategies based on an in-depth analysis of the biological significance of protein repeats.
format	Online Article Text
id	pubmed-4103538
institution	National Center for Biotechnology Information
language	English
publishDate	2014
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-41035382014-07-18 Understanding and identifying amino acid repeats Luo, Hong Nijveen, Harm Brief Bioinform Papers Amino acid repeats (AARs) are abundant in protein sequences. They have particular roles in protein function and evolution. Simple repeat patterns generated by DNA slippage tend to introduce length variations and point mutations in repeat regions. Loss of normal and gain of abnormal function owing to their variable length are potential risks leading to diseases. Repeats with complex patterns mostly refer to the functional domain repeats, such as the well-known leucine-rich repeat and WD repeat, which are frequently involved in protein–protein interaction. They are mainly derived from internal gene duplication events and stabilized by ‘gate-keeper’ residues, which play crucial roles in preventing inter-domain aggregation. AARs are widely distributed in different proteomes across a variety of taxonomic ranges, and especially abundant in eukaryotic proteins. However, their specific evolutionary and functional scenarios are still poorly understood. Identifying AARs in protein sequences is the first step for the further investigation of their biological function and evolutionary mechanism. In principle, this is an NP-hard problem, as most of the repeat fragments are shaped by a series of sophisticated evolutionary events and become latent periodical patterns. It is not possible to define a uniform criterion for detecting and verifying various repeat patterns. Instead, different algorithms based on different strategies have been developed to cope with different repeat patterns. In this review, we attempt to describe the amino acid repeat-detection algorithms currently available and compare their strategies based on an in-depth analysis of the biological significance of protein repeats. Oxford University Press 2014-07 2014-07-16 /pmc/articles/PMC4103538/ /pubmed/23418055 http://dx.doi.org/10.1093/bib/bbt003 Text en © The Author 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Papers Luo, Hong Nijveen, Harm Understanding and identifying amino acid repeats
title	Understanding and identifying amino acid repeats
title_full	Understanding and identifying amino acid repeats
title_fullStr	Understanding and identifying amino acid repeats
title_full_unstemmed	Understanding and identifying amino acid repeats
title_short	Understanding and identifying amino acid repeats
title_sort	understanding and identifying amino acid repeats
topic	Papers
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4103538/ https://www.ncbi.nlm.nih.gov/pubmed/23418055 http://dx.doi.org/10.1093/bib/bbt003
work_keys_str_mv	AT luohong understandingandidentifyingaminoacidrepeats AT nijveenharm understandingandidentifyingaminoacidrepeats

Understanding and identifying amino acid repeats

Ejemplares similares