Cargando…

Collective prediction of protein functions from protein-protein interaction networks

BACKGROUND: Automated assignment of functions to unknown proteins is one of the most important task in computational biology. The development of experimental methods for genome scale analysis of molecular interaction networks offers new ways to infer protein function from protein-protein interaction...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Qingyao, Ye, Yunming, Ng, Michael K, Ho, Shen-Shyang, Shi, Ruichao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4015526/
https://www.ncbi.nlm.nih.gov/pubmed/24564855
http://dx.doi.org/10.1186/1471-2105-15-S2-S9
_version_ 1782315349266399232
author Wu, Qingyao
Ye, Yunming
Ng, Michael K
Ho, Shen-Shyang
Shi, Ruichao
author_facet Wu, Qingyao
Ye, Yunming
Ng, Michael K
Ho, Shen-Shyang
Shi, Ruichao
author_sort Wu, Qingyao
collection PubMed
description BACKGROUND: Automated assignment of functions to unknown proteins is one of the most important task in computational biology. The development of experimental methods for genome scale analysis of molecular interaction networks offers new ways to infer protein function from protein-protein interaction (PPI) network data. Existing techniques for collective classification (CC) usually increase accuracy for network data, wherein instances are interlinked with each other, using a large amount of labeled data for training. However, the labeled data are time-consuming and expensive to obtain. On the other hand, one can easily obtain large amount of unlabeled data. Thus, more sophisticated methods are needed to exploit the unlabeled data to increase prediction accuracy for protein function prediction. RESULTS: In this paper, we propose an effective Markov chain based CC algorithm (ICAM) to tackle the label deficiency problem in CC for interrelated proteins from PPI networks. Our idea is to model the problem using two distinct Markov chain classifiers to make separate predictions with regard to attribute features from protein data and relational features from relational information. The ICAM learning algorithm combines the results of the two classifiers to compute the ranks of labels to indicate the importance of a set of labels to an instance, and uses an ICA framework to iteratively refine the learning models for improving performance of protein function prediction from PPI networks in the paucity of labeled data. CONCLUSION: Experimental results on the real-world Yeast protein-protein interaction datasets show that our proposed ICAM method is better than the other ICA-type methods given limited labeled training data. This approach can serve as a valuable tool for the study of protein function prediction from PPI networks.
format Online
Article
Text
id pubmed-4015526
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40155262014-05-23 Collective prediction of protein functions from protein-protein interaction networks Wu, Qingyao Ye, Yunming Ng, Michael K Ho, Shen-Shyang Shi, Ruichao BMC Bioinformatics Proceedings BACKGROUND: Automated assignment of functions to unknown proteins is one of the most important task in computational biology. The development of experimental methods for genome scale analysis of molecular interaction networks offers new ways to infer protein function from protein-protein interaction (PPI) network data. Existing techniques for collective classification (CC) usually increase accuracy for network data, wherein instances are interlinked with each other, using a large amount of labeled data for training. However, the labeled data are time-consuming and expensive to obtain. On the other hand, one can easily obtain large amount of unlabeled data. Thus, more sophisticated methods are needed to exploit the unlabeled data to increase prediction accuracy for protein function prediction. RESULTS: In this paper, we propose an effective Markov chain based CC algorithm (ICAM) to tackle the label deficiency problem in CC for interrelated proteins from PPI networks. Our idea is to model the problem using two distinct Markov chain classifiers to make separate predictions with regard to attribute features from protein data and relational features from relational information. The ICAM learning algorithm combines the results of the two classifiers to compute the ranks of labels to indicate the importance of a set of labels to an instance, and uses an ICA framework to iteratively refine the learning models for improving performance of protein function prediction from PPI networks in the paucity of labeled data. CONCLUSION: Experimental results on the real-world Yeast protein-protein interaction datasets show that our proposed ICAM method is better than the other ICA-type methods given limited labeled training data. This approach can serve as a valuable tool for the study of protein function prediction from PPI networks. BioMed Central 2014-01-24 /pmc/articles/PMC4015526/ /pubmed/24564855 http://dx.doi.org/10.1186/1471-2105-15-S2-S9 Text en Copyright © 2014 Wu et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Proceedings
Wu, Qingyao
Ye, Yunming
Ng, Michael K
Ho, Shen-Shyang
Shi, Ruichao
Collective prediction of protein functions from protein-protein interaction networks
title Collective prediction of protein functions from protein-protein interaction networks
title_full Collective prediction of protein functions from protein-protein interaction networks
title_fullStr Collective prediction of protein functions from protein-protein interaction networks
title_full_unstemmed Collective prediction of protein functions from protein-protein interaction networks
title_short Collective prediction of protein functions from protein-protein interaction networks
title_sort collective prediction of protein functions from protein-protein interaction networks
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4015526/
https://www.ncbi.nlm.nih.gov/pubmed/24564855
http://dx.doi.org/10.1186/1471-2105-15-S2-S9
work_keys_str_mv AT wuqingyao collectivepredictionofproteinfunctionsfromproteinproteininteractionnetworks
AT yeyunming collectivepredictionofproteinfunctionsfromproteinproteininteractionnetworks
AT ngmichaelk collectivepredictionofproteinfunctionsfromproteinproteininteractionnetworks
AT hoshenshyang collectivepredictionofproteinfunctionsfromproteinproteininteractionnetworks
AT shiruichao collectivepredictionofproteinfunctionsfromproteinproteininteractionnetworks