Cargando…

Intrinsically disordered domains deviate significantly from random sequences in mammalian proteins

BACKGROUND: In order to characterize mammalian intrinsically disordered domains (IDDs) we examined the patterns in their amino acid abundance as well as overrepresented local sequence motifs. We considered IDDs from mouse proteins associated with innate immune responses as well as a set of generic h...

Descripción completa

Detalles Bibliográficos
Autores principales: Teraguchi, Shunsuke, Patil, Ashwini, Standley, Daron M
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2957690/
https://www.ncbi.nlm.nih.gov/pubmed/21106129
http://dx.doi.org/10.1186/1471-2105-11-S7-S7
_version_ 1782188250441449472
author Teraguchi, Shunsuke
Patil, Ashwini
Standley, Daron M
author_facet Teraguchi, Shunsuke
Patil, Ashwini
Standley, Daron M
author_sort Teraguchi, Shunsuke
collection PubMed
description BACKGROUND: In order to characterize mammalian intrinsically disordered domains (IDDs) we examined the patterns in their amino acid abundance as well as overrepresented local sequence motifs. We considered IDDs from mouse proteins associated with innate immune responses as well as a set of generic human genes. These sets were compared with artificially generated random sequences with the same overall amino acid abundance and length distributions. IDDs were then clustered by amino acid abundance, and further analyzed in terms of co-occurrence of clusters with functionally characterized Pfam domains. RESULTS: Overall, IDDs were very different from randomly generated sequences. The deviation from random distributions was at least as great as that for ordered domains, for which the deviation can be rationalized in terms of strong evolutionary pressure for structure and function. The co-occurrence of certain Pfam domains with specific IDD clusters was found to be significant (p-value < 0.01). Local sequence motifs that were over-represented in the innate immune set consisted mostly of low complexity fragments, primarily characterized by amino acid repeats, and could not be assigned an obvious functional role. CONCLUSIONS: Our results suggest that IDDs are constrained within a narrow subset of possible sequences. This is most likely a result of biophysical restraints that have yet to be elucidated. More detailed examination of the functional relationship between the IDDs and associated Pfam domains is one possible avenue of investigation.
format Text
id pubmed-2957690
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-29576902010-10-21 Intrinsically disordered domains deviate significantly from random sequences in mammalian proteins Teraguchi, Shunsuke Patil, Ashwini Standley, Daron M BMC Bioinformatics Proceedings BACKGROUND: In order to characterize mammalian intrinsically disordered domains (IDDs) we examined the patterns in their amino acid abundance as well as overrepresented local sequence motifs. We considered IDDs from mouse proteins associated with innate immune responses as well as a set of generic human genes. These sets were compared with artificially generated random sequences with the same overall amino acid abundance and length distributions. IDDs were then clustered by amino acid abundance, and further analyzed in terms of co-occurrence of clusters with functionally characterized Pfam domains. RESULTS: Overall, IDDs were very different from randomly generated sequences. The deviation from random distributions was at least as great as that for ordered domains, for which the deviation can be rationalized in terms of strong evolutionary pressure for structure and function. The co-occurrence of certain Pfam domains with specific IDD clusters was found to be significant (p-value < 0.01). Local sequence motifs that were over-represented in the innate immune set consisted mostly of low complexity fragments, primarily characterized by amino acid repeats, and could not be assigned an obvious functional role. CONCLUSIONS: Our results suggest that IDDs are constrained within a narrow subset of possible sequences. This is most likely a result of biophysical restraints that have yet to be elucidated. More detailed examination of the functional relationship between the IDDs and associated Pfam domains is one possible avenue of investigation. BioMed Central 2010-10-15 /pmc/articles/PMC2957690/ /pubmed/21106129 http://dx.doi.org/10.1186/1471-2105-11-S7-S7 Text en Copyright ©2010 Teraguchi et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Teraguchi, Shunsuke
Patil, Ashwini
Standley, Daron M
Intrinsically disordered domains deviate significantly from random sequences in mammalian proteins
title Intrinsically disordered domains deviate significantly from random sequences in mammalian proteins
title_full Intrinsically disordered domains deviate significantly from random sequences in mammalian proteins
title_fullStr Intrinsically disordered domains deviate significantly from random sequences in mammalian proteins
title_full_unstemmed Intrinsically disordered domains deviate significantly from random sequences in mammalian proteins
title_short Intrinsically disordered domains deviate significantly from random sequences in mammalian proteins
title_sort intrinsically disordered domains deviate significantly from random sequences in mammalian proteins
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2957690/
https://www.ncbi.nlm.nih.gov/pubmed/21106129
http://dx.doi.org/10.1186/1471-2105-11-S7-S7
work_keys_str_mv AT teraguchishunsuke intrinsicallydisordereddomainsdeviatesignificantlyfromrandomsequencesinmammalianproteins
AT patilashwini intrinsicallydisordereddomainsdeviatesignificantlyfromrandomsequencesinmammalianproteins
AT standleydaronm intrinsicallydisordereddomainsdeviatesignificantlyfromrandomsequencesinmammalianproteins