Cargando…

The use of Gene Ontology terms for predicting highly-connected 'hub' nodes in protein-protein interaction networks

BACKGROUND: Protein-protein interactions mediate a wide range of cellular functions and responses and have been studied rigorously through recent large-scale proteomics experiments and bioinformatics analyses. One of the most important findings of those endeavours was the observation that 'hub&...

Descripción completa

Detalles Bibliográficos
Autores principales: Hsing, Michael, Byler, Kendall Grant, Cherkasov, Artem
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2553323/
https://www.ncbi.nlm.nih.gov/pubmed/18796161
http://dx.doi.org/10.1186/1752-0509-2-80
_version_ 1782159494625624064
author Hsing, Michael
Byler, Kendall Grant
Cherkasov, Artem
author_facet Hsing, Michael
Byler, Kendall Grant
Cherkasov, Artem
author_sort Hsing, Michael
collection PubMed
description BACKGROUND: Protein-protein interactions mediate a wide range of cellular functions and responses and have been studied rigorously through recent large-scale proteomics experiments and bioinformatics analyses. One of the most important findings of those endeavours was the observation that 'hub' proteins participate in significant numbers of protein interactions and play critical roles in the organization and function of cellular protein interaction networks (PINs) [1,2]. It has also been demonstrated that such hub proteins may constitute an important pool of attractive drug targets. Thus, it is crucial to be able to identify hub proteins based not only on experimental data but also by means of bioinformatics predictions. RESULTS: A hub protein classifier has been developed based on the available interaction data and Gene Ontology (GO) annotations for proteins in the Escherichia coli, Saccharomyces cerevisiae, Drosophila melanogaster and Homo sapiens genomes. In particular, by utilizing the machine learning method of boosting trees we were able to create a predictive bioinformatics tool for the identification of proteins that are likely to play the role of a hub in protein interaction networks. Testing the developed hub classifier on external sets of experimental protein interaction data in Methicillin-resistant Staphylococcus aureus (MRSA) 252 and Caenorhabditis elegans demonstrated that our approach can predict hub proteins with a high degree of accuracy. A practical application of the developed bioinformatics method has been illustrated by the effective protein bait selection for large-scale pull-down experiments that aim to map complete protein-protein interaction networks for several species. CONCLUSION: The successful development of an accurate hub classifier demonstrated that highly-connected proteins tend to share certain relevant functional properties reflected in their Gene Ontology annotations. It is anticipated that the developed bioinformatics hub classifier will represent a useful tool for the theoretical prediction of highly-interacting proteins, the study of cellular network organizations, and the identification of prospective drug targets – even in those organisms that currently lack large-scale protein interaction data.
format Text
id pubmed-2553323
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-25533232008-09-26 The use of Gene Ontology terms for predicting highly-connected 'hub' nodes in protein-protein interaction networks Hsing, Michael Byler, Kendall Grant Cherkasov, Artem BMC Syst Biol Research Article BACKGROUND: Protein-protein interactions mediate a wide range of cellular functions and responses and have been studied rigorously through recent large-scale proteomics experiments and bioinformatics analyses. One of the most important findings of those endeavours was the observation that 'hub' proteins participate in significant numbers of protein interactions and play critical roles in the organization and function of cellular protein interaction networks (PINs) [1,2]. It has also been demonstrated that such hub proteins may constitute an important pool of attractive drug targets. Thus, it is crucial to be able to identify hub proteins based not only on experimental data but also by means of bioinformatics predictions. RESULTS: A hub protein classifier has been developed based on the available interaction data and Gene Ontology (GO) annotations for proteins in the Escherichia coli, Saccharomyces cerevisiae, Drosophila melanogaster and Homo sapiens genomes. In particular, by utilizing the machine learning method of boosting trees we were able to create a predictive bioinformatics tool for the identification of proteins that are likely to play the role of a hub in protein interaction networks. Testing the developed hub classifier on external sets of experimental protein interaction data in Methicillin-resistant Staphylococcus aureus (MRSA) 252 and Caenorhabditis elegans demonstrated that our approach can predict hub proteins with a high degree of accuracy. A practical application of the developed bioinformatics method has been illustrated by the effective protein bait selection for large-scale pull-down experiments that aim to map complete protein-protein interaction networks for several species. CONCLUSION: The successful development of an accurate hub classifier demonstrated that highly-connected proteins tend to share certain relevant functional properties reflected in their Gene Ontology annotations. It is anticipated that the developed bioinformatics hub classifier will represent a useful tool for the theoretical prediction of highly-interacting proteins, the study of cellular network organizations, and the identification of prospective drug targets – even in those organisms that currently lack large-scale protein interaction data. BioMed Central 2008-09-16 /pmc/articles/PMC2553323/ /pubmed/18796161 http://dx.doi.org/10.1186/1752-0509-2-80 Text en Copyright © 2008 Hsing et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Hsing, Michael
Byler, Kendall Grant
Cherkasov, Artem
The use of Gene Ontology terms for predicting highly-connected 'hub' nodes in protein-protein interaction networks
title The use of Gene Ontology terms for predicting highly-connected 'hub' nodes in protein-protein interaction networks
title_full The use of Gene Ontology terms for predicting highly-connected 'hub' nodes in protein-protein interaction networks
title_fullStr The use of Gene Ontology terms for predicting highly-connected 'hub' nodes in protein-protein interaction networks
title_full_unstemmed The use of Gene Ontology terms for predicting highly-connected 'hub' nodes in protein-protein interaction networks
title_short The use of Gene Ontology terms for predicting highly-connected 'hub' nodes in protein-protein interaction networks
title_sort use of gene ontology terms for predicting highly-connected 'hub' nodes in protein-protein interaction networks
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2553323/
https://www.ncbi.nlm.nih.gov/pubmed/18796161
http://dx.doi.org/10.1186/1752-0509-2-80
work_keys_str_mv AT hsingmichael theuseofgeneontologytermsforpredictinghighlyconnectedhubnodesinproteinproteininteractionnetworks
AT bylerkendallgrant theuseofgeneontologytermsforpredictinghighlyconnectedhubnodesinproteinproteininteractionnetworks
AT cherkasovartem theuseofgeneontologytermsforpredictinghighlyconnectedhubnodesinproteinproteininteractionnetworks
AT hsingmichael useofgeneontologytermsforpredictinghighlyconnectedhubnodesinproteinproteininteractionnetworks
AT bylerkendallgrant useofgeneontologytermsforpredictinghighlyconnectedhubnodesinproteinproteininteractionnetworks
AT cherkasovartem useofgeneontologytermsforpredictinghighlyconnectedhubnodesinproteinproteininteractionnetworks