Cargando…
Understanding and identifying amino acid repeats
Amino acid repeats (AARs) are abundant in protein sequences. They have particular roles in protein function and evolution. Simple repeat patterns generated by DNA slippage tend to introduce length variations and point mutations in repeat regions. Loss of normal and gain of abnormal function owing to...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4103538/ https://www.ncbi.nlm.nih.gov/pubmed/23418055 http://dx.doi.org/10.1093/bib/bbt003 |
_version_ | 1782327164997206016 |
---|---|
author | Luo, Hong Nijveen, Harm |
author_facet | Luo, Hong Nijveen, Harm |
author_sort | Luo, Hong |
collection | PubMed |
description | Amino acid repeats (AARs) are abundant in protein sequences. They have particular roles in protein function and evolution. Simple repeat patterns generated by DNA slippage tend to introduce length variations and point mutations in repeat regions. Loss of normal and gain of abnormal function owing to their variable length are potential risks leading to diseases. Repeats with complex patterns mostly refer to the functional domain repeats, such as the well-known leucine-rich repeat and WD repeat, which are frequently involved in protein–protein interaction. They are mainly derived from internal gene duplication events and stabilized by ‘gate-keeper’ residues, which play crucial roles in preventing inter-domain aggregation. AARs are widely distributed in different proteomes across a variety of taxonomic ranges, and especially abundant in eukaryotic proteins. However, their specific evolutionary and functional scenarios are still poorly understood. Identifying AARs in protein sequences is the first step for the further investigation of their biological function and evolutionary mechanism. In principle, this is an NP-hard problem, as most of the repeat fragments are shaped by a series of sophisticated evolutionary events and become latent periodical patterns. It is not possible to define a uniform criterion for detecting and verifying various repeat patterns. Instead, different algorithms based on different strategies have been developed to cope with different repeat patterns. In this review, we attempt to describe the amino acid repeat-detection algorithms currently available and compare their strategies based on an in-depth analysis of the biological significance of protein repeats. |
format | Online Article Text |
id | pubmed-4103538 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-41035382014-07-18 Understanding and identifying amino acid repeats Luo, Hong Nijveen, Harm Brief Bioinform Papers Amino acid repeats (AARs) are abundant in protein sequences. They have particular roles in protein function and evolution. Simple repeat patterns generated by DNA slippage tend to introduce length variations and point mutations in repeat regions. Loss of normal and gain of abnormal function owing to their variable length are potential risks leading to diseases. Repeats with complex patterns mostly refer to the functional domain repeats, such as the well-known leucine-rich repeat and WD repeat, which are frequently involved in protein–protein interaction. They are mainly derived from internal gene duplication events and stabilized by ‘gate-keeper’ residues, which play crucial roles in preventing inter-domain aggregation. AARs are widely distributed in different proteomes across a variety of taxonomic ranges, and especially abundant in eukaryotic proteins. However, their specific evolutionary and functional scenarios are still poorly understood. Identifying AARs in protein sequences is the first step for the further investigation of their biological function and evolutionary mechanism. In principle, this is an NP-hard problem, as most of the repeat fragments are shaped by a series of sophisticated evolutionary events and become latent periodical patterns. It is not possible to define a uniform criterion for detecting and verifying various repeat patterns. Instead, different algorithms based on different strategies have been developed to cope with different repeat patterns. In this review, we attempt to describe the amino acid repeat-detection algorithms currently available and compare their strategies based on an in-depth analysis of the biological significance of protein repeats. Oxford University Press 2014-07 2014-07-16 /pmc/articles/PMC4103538/ /pubmed/23418055 http://dx.doi.org/10.1093/bib/bbt003 Text en © The Author 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Papers Luo, Hong Nijveen, Harm Understanding and identifying amino acid repeats |
title | Understanding and identifying amino acid repeats |
title_full | Understanding and identifying amino acid repeats |
title_fullStr | Understanding and identifying amino acid repeats |
title_full_unstemmed | Understanding and identifying amino acid repeats |
title_short | Understanding and identifying amino acid repeats |
title_sort | understanding and identifying amino acid repeats |
topic | Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4103538/ https://www.ncbi.nlm.nih.gov/pubmed/23418055 http://dx.doi.org/10.1093/bib/bbt003 |
work_keys_str_mv | AT luohong understandingandidentifyingaminoacidrepeats AT nijveenharm understandingandidentifyingaminoacidrepeats |