Cargando…

Inferring Function Using Patterns of Native Disorder in Proteins

Natively unstructured regions are a common feature of eukaryotic proteomes. Between 30% and 60% of proteins are predicted to contain long stretches of disordered residues, and not only have many of these regions been confirmed experimentally, but they have also been found to be essential for protein...

Descripción completa

Detalles Bibliográficos
Autores principales: Lobley, Anna, Swindells, Mark B, Orengo, Christine A, Jones, David T
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1950950/
https://www.ncbi.nlm.nih.gov/pubmed/17722973
http://dx.doi.org/10.1371/journal.pcbi.0030162
_version_ 1782134583780704256
author Lobley, Anna
Swindells, Mark B
Orengo, Christine A
Jones, David T
author_facet Lobley, Anna
Swindells, Mark B
Orengo, Christine A
Jones, David T
author_sort Lobley, Anna
collection PubMed
description Natively unstructured regions are a common feature of eukaryotic proteomes. Between 30% and 60% of proteins are predicted to contain long stretches of disordered residues, and not only have many of these regions been confirmed experimentally, but they have also been found to be essential for protein function. In this study, we directly address the potential contribution of protein disorder in predicting protein function using standard Gene Ontology (GO) categories. Initially we analyse the occurrence of protein disorder in the human proteome and report ontology categories that are enriched in disordered proteins. Pattern analysis of the distributions of disordered regions in human sequences demonstrated that the functions of intrinsically disordered proteins are both length- and position-dependent. These dependencies were then encoded in feature vectors to quantify the contribution of disorder in human protein function prediction using Support Vector Machine classifiers. The prediction accuracies of 26 GO categories relating to signalling and molecular recognition are improved using the disorder features. The most significant improvements were observed for kinase, phosphorylation, growth factor, and helicase categories. Furthermore, we provide predicted GO term assignments using these classifiers for a set of unannotated and orphan human proteins. In this study, the importance of capturing protein disorder information and its value in function prediction is demonstrated. The GO category classifiers generated can be used to provide more reliable predictions and further insights into the behaviour of orphan and unannotated proteins.
format Text
id pubmed-1950950
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-19509502007-09-07 Inferring Function Using Patterns of Native Disorder in Proteins Lobley, Anna Swindells, Mark B Orengo, Christine A Jones, David T PLoS Comput Biol Research Article Natively unstructured regions are a common feature of eukaryotic proteomes. Between 30% and 60% of proteins are predicted to contain long stretches of disordered residues, and not only have many of these regions been confirmed experimentally, but they have also been found to be essential for protein function. In this study, we directly address the potential contribution of protein disorder in predicting protein function using standard Gene Ontology (GO) categories. Initially we analyse the occurrence of protein disorder in the human proteome and report ontology categories that are enriched in disordered proteins. Pattern analysis of the distributions of disordered regions in human sequences demonstrated that the functions of intrinsically disordered proteins are both length- and position-dependent. These dependencies were then encoded in feature vectors to quantify the contribution of disorder in human protein function prediction using Support Vector Machine classifiers. The prediction accuracies of 26 GO categories relating to signalling and molecular recognition are improved using the disorder features. The most significant improvements were observed for kinase, phosphorylation, growth factor, and helicase categories. Furthermore, we provide predicted GO term assignments using these classifiers for a set of unannotated and orphan human proteins. In this study, the importance of capturing protein disorder information and its value in function prediction is demonstrated. The GO category classifiers generated can be used to provide more reliable predictions and further insights into the behaviour of orphan and unannotated proteins. Public Library of Science 2007-08 2007-08-24 /pmc/articles/PMC1950950/ /pubmed/17722973 http://dx.doi.org/10.1371/journal.pcbi.0030162 Text en © 2007 Lobley et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Lobley, Anna
Swindells, Mark B
Orengo, Christine A
Jones, David T
Inferring Function Using Patterns of Native Disorder in Proteins
title Inferring Function Using Patterns of Native Disorder in Proteins
title_full Inferring Function Using Patterns of Native Disorder in Proteins
title_fullStr Inferring Function Using Patterns of Native Disorder in Proteins
title_full_unstemmed Inferring Function Using Patterns of Native Disorder in Proteins
title_short Inferring Function Using Patterns of Native Disorder in Proteins
title_sort inferring function using patterns of native disorder in proteins
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1950950/
https://www.ncbi.nlm.nih.gov/pubmed/17722973
http://dx.doi.org/10.1371/journal.pcbi.0030162
work_keys_str_mv AT lobleyanna inferringfunctionusingpatternsofnativedisorderinproteins
AT swindellsmarkb inferringfunctionusingpatternsofnativedisorderinproteins
AT orengochristinea inferringfunctionusingpatternsofnativedisorderinproteins
AT jonesdavidt inferringfunctionusingpatternsofnativedisorderinproteins