Cargando…

Tandem and cryptic amino acid repeats accumulate in disordered regions of proteins

BACKGROUND: Amino acid repeats (AARs) are common features of protein sequences. They often evolve rapidly and are involved in a number of human diseases. They also show significant associations with particular Gene Ontology (GO) functional categories, particularly transcription, suggesting they play...

Descripción completa

Detalles Bibliográficos
Autores principales: Simon, Michelle, Hancock, John M
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2718493/
https://www.ncbi.nlm.nih.gov/pubmed/19486509
http://dx.doi.org/10.1186/gb-2009-10-6-r59
_version_ 1782169990907035648
author Simon, Michelle
Hancock, John M
author_facet Simon, Michelle
Hancock, John M
author_sort Simon, Michelle
collection PubMed
description BACKGROUND: Amino acid repeats (AARs) are common features of protein sequences. They often evolve rapidly and are involved in a number of human diseases. They also show significant associations with particular Gene Ontology (GO) functional categories, particularly transcription, suggesting they play some role in protein function. It has been suggested recently that AARs play a significant role in the evolution of intrinsically unstructured regions (IURs) of proteins. We investigate the relationship between AAR frequency and evolution and their localization within proteins based on a set of 5,815 orthologous proteins from four mammalian (human, chimpanzee, mouse and rat) and a bird (chicken) genome. We consider two classes of AAR (tandem repeats and cryptic repeats: regions of proteins containing overrepresentations of short amino acid repeats). RESULTS: Mammals show very similar repeat frequencies but chicken shows lower frequencies of many of the cryptic repeats common in mammals. Regions flanking tandem AARs evolve more rapidly than the rest of the protein containing the repeat and this phenomenon is more pronounced for non-conserved repeats than for conserved ones. GO associations are similar to those previously described for the mammals, but chicken cryptic repeats show fewer significant associations. Comparing the overlaps of AARs with IURs and protein domains showed that up to 96% of some AAR types are associated preferentially with IURs. However, no more than 15% of IURs contained an AAR. CONCLUSIONS: Their location within IURs explains many of the evolutionary properties of AARs. Further study is needed on the types of IURs containing AARs.
format Text
id pubmed-2718493
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27184932009-07-30 Tandem and cryptic amino acid repeats accumulate in disordered regions of proteins Simon, Michelle Hancock, John M Genome Biol Research BACKGROUND: Amino acid repeats (AARs) are common features of protein sequences. They often evolve rapidly and are involved in a number of human diseases. They also show significant associations with particular Gene Ontology (GO) functional categories, particularly transcription, suggesting they play some role in protein function. It has been suggested recently that AARs play a significant role in the evolution of intrinsically unstructured regions (IURs) of proteins. We investigate the relationship between AAR frequency and evolution and their localization within proteins based on a set of 5,815 orthologous proteins from four mammalian (human, chimpanzee, mouse and rat) and a bird (chicken) genome. We consider two classes of AAR (tandem repeats and cryptic repeats: regions of proteins containing overrepresentations of short amino acid repeats). RESULTS: Mammals show very similar repeat frequencies but chicken shows lower frequencies of many of the cryptic repeats common in mammals. Regions flanking tandem AARs evolve more rapidly than the rest of the protein containing the repeat and this phenomenon is more pronounced for non-conserved repeats than for conserved ones. GO associations are similar to those previously described for the mammals, but chicken cryptic repeats show fewer significant associations. Comparing the overlaps of AARs with IURs and protein domains showed that up to 96% of some AAR types are associated preferentially with IURs. However, no more than 15% of IURs contained an AAR. CONCLUSIONS: Their location within IURs explains many of the evolutionary properties of AARs. Further study is needed on the types of IURs containing AARs. BioMed Central 2009 2009-06-01 /pmc/articles/PMC2718493/ /pubmed/19486509 http://dx.doi.org/10.1186/gb-2009-10-6-r59 Text en Copyright © 2009 Simon and Hancock; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Simon, Michelle
Hancock, John M
Tandem and cryptic amino acid repeats accumulate in disordered regions of proteins
title Tandem and cryptic amino acid repeats accumulate in disordered regions of proteins
title_full Tandem and cryptic amino acid repeats accumulate in disordered regions of proteins
title_fullStr Tandem and cryptic amino acid repeats accumulate in disordered regions of proteins
title_full_unstemmed Tandem and cryptic amino acid repeats accumulate in disordered regions of proteins
title_short Tandem and cryptic amino acid repeats accumulate in disordered regions of proteins
title_sort tandem and cryptic amino acid repeats accumulate in disordered regions of proteins
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2718493/
https://www.ncbi.nlm.nih.gov/pubmed/19486509
http://dx.doi.org/10.1186/gb-2009-10-6-r59
work_keys_str_mv AT simonmichelle tandemandcrypticaminoacidrepeatsaccumulateindisorderedregionsofproteins
AT hancockjohnm tandemandcrypticaminoacidrepeatsaccumulateindisorderedregionsofproteins