Cargando…

Predicting protein complex in protein interaction network - a supervised learning based method

BACKGROUND: Protein complexes are important for understanding principles of cellular organization and function. High-throughput experimental techniques have produced a large amount of protein interactions, making it possible to predict protein complexes from protein -protein interaction networks. Ho...

Descripción completa

Detalles Bibliográficos
Autores principales:	Yu, Feng Ying, Yang, Zhi Hao, Tang, Nan, Lin, Hong Fei, Wang, Jian, Yang, Zhi Wei
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2014
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4243764/ https://www.ncbi.nlm.nih.gov/pubmed/25349902 http://dx.doi.org/10.1186/1752-0509-8-S3-S4

_version_	1782346143037915136
author	Yu, Feng Ying Yang, Zhi Hao Tang, Nan Lin, Hong Fei Wang, Jian Yang, Zhi Wei
author_facet	Yu, Feng Ying Yang, Zhi Hao Tang, Nan Lin, Hong Fei Wang, Jian Yang, Zhi Wei
author_sort	Yu, Feng Ying
collection	PubMed
description	BACKGROUND: Protein complexes are important for understanding principles of cellular organization and function. High-throughput experimental techniques have produced a large amount of protein interactions, making it possible to predict protein complexes from protein -protein interaction networks. However, most of current methods are unsupervised learning based methods which can't utilize the information of the large amount of available known complexes. METHODS: We present a supervised learning-based method for predicting protein complexes in protein - protein interaction networks. The method extracts rich features from both the unweighted and weighted networks to train a Regression model, which is then used for the cliques filtering, growth, and candidate complex filtering. The model utilizes additional "uncertainty" samples and, therefore, is more discriminative when used in the complex detection algorithm. In addition, our method uses the maximal cliques found by the Cliques algorithm as the initial cliques, which has been proven to be more effective than the method of expanding from the seeding proteins used in other methods. RESULTS: The experimental results on several PIN datasets show that in most cases the performance of our method are superior to comparable state-of-the-art protein complex detection techniques. CONCLUSIONS: The results demonstrate the several advantages of our method over other state-of-the-art techniques. Firstly, our method is a supervised learning-based method that can make full use of the information of the available known complexes instead of being only based on the topological structure of the PIN. That also means, if more training samples are provided, our method can achieve better performance than those unsupervised methods. Secondly, we design the rich feature set to describe the properties of the known complexes, which includes not only the features from the unweighted network, but also those from the weighted network built based on the Gene Ontology information. Thirdly, our Regression model utilizes additional "uncertainty" samples and, therefore, becomes more discriminative, whose effectiveness for the complex detection is indicated by our experimental results.
format	Online Article Text
id	pubmed-4243764
institution	National Center for Biotechnology Information
language	English
publishDate	2014
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-42437642014-11-26 Predicting protein complex in protein interaction network - a supervised learning based method Yu, Feng Ying Yang, Zhi Hao Tang, Nan Lin, Hong Fei Wang, Jian Yang, Zhi Wei BMC Syst Biol Research BACKGROUND: Protein complexes are important for understanding principles of cellular organization and function. High-throughput experimental techniques have produced a large amount of protein interactions, making it possible to predict protein complexes from protein -protein interaction networks. However, most of current methods are unsupervised learning based methods which can't utilize the information of the large amount of available known complexes. METHODS: We present a supervised learning-based method for predicting protein complexes in protein - protein interaction networks. The method extracts rich features from both the unweighted and weighted networks to train a Regression model, which is then used for the cliques filtering, growth, and candidate complex filtering. The model utilizes additional "uncertainty" samples and, therefore, is more discriminative when used in the complex detection algorithm. In addition, our method uses the maximal cliques found by the Cliques algorithm as the initial cliques, which has been proven to be more effective than the method of expanding from the seeding proteins used in other methods. RESULTS: The experimental results on several PIN datasets show that in most cases the performance of our method are superior to comparable state-of-the-art protein complex detection techniques. CONCLUSIONS: The results demonstrate the several advantages of our method over other state-of-the-art techniques. Firstly, our method is a supervised learning-based method that can make full use of the information of the available known complexes instead of being only based on the topological structure of the PIN. That also means, if more training samples are provided, our method can achieve better performance than those unsupervised methods. Secondly, we design the rich feature set to describe the properties of the known complexes, which includes not only the features from the unweighted network, but also those from the weighted network built based on the Gene Ontology information. Thirdly, our Regression model utilizes additional "uncertainty" samples and, therefore, becomes more discriminative, whose effectiveness for the complex detection is indicated by our experimental results. BioMed Central 2014-10-22 /pmc/articles/PMC4243764/ /pubmed/25349902 http://dx.doi.org/10.1186/1752-0509-8-S3-S4 Text en Copyright © 2014 Yu et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Yu, Feng Ying Yang, Zhi Hao Tang, Nan Lin, Hong Fei Wang, Jian Yang, Zhi Wei Predicting protein complex in protein interaction network - a supervised learning based method
title	Predicting protein complex in protein interaction network - a supervised learning based method
title_full	Predicting protein complex in protein interaction network - a supervised learning based method
title_fullStr	Predicting protein complex in protein interaction network - a supervised learning based method
title_full_unstemmed	Predicting protein complex in protein interaction network - a supervised learning based method
title_short	Predicting protein complex in protein interaction network - a supervised learning based method
title_sort	predicting protein complex in protein interaction network - a supervised learning based method
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4243764/ https://www.ncbi.nlm.nih.gov/pubmed/25349902 http://dx.doi.org/10.1186/1752-0509-8-S3-S4
work_keys_str_mv	AT yufengying predictingproteincomplexinproteininteractionnetworkasupervisedlearningbasedmethod AT yangzhihao predictingproteincomplexinproteininteractionnetworkasupervisedlearningbasedmethod AT tangnan predictingproteincomplexinproteininteractionnetworkasupervisedlearningbasedmethod AT linhongfei predictingproteincomplexinproteininteractionnetworkasupervisedlearningbasedmethod AT wangjian predictingproteincomplexinproteininteractionnetworkasupervisedlearningbasedmethod AT yangzhiwei predictingproteincomplexinproteininteractionnetworkasupervisedlearningbasedmethod

Predicting protein complex in protein interaction network - a supervised learning based method

Ejemplares similares