Cargando…

PPCM: Combing Multiple Classifiers to Improve Protein-Protein Interaction Prediction

Determining protein-protein interaction (PPI) in biological systems is of considerable importance, and prediction of PPI has become a popular research area. Although different classifiers have been developed for PPI prediction, no single classifier seems to be able to predict PPI with high confidenc...

Descripción completa

Detalles Bibliográficos
Autores principales: Yao, Jianzhuang, Guo, Hong, Yang, Xiaohan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4619929/
https://www.ncbi.nlm.nih.gov/pubmed/26539460
http://dx.doi.org/10.1155/2015/608042
_version_ 1782397215153586176
author Yao, Jianzhuang
Guo, Hong
Yang, Xiaohan
author_facet Yao, Jianzhuang
Guo, Hong
Yang, Xiaohan
author_sort Yao, Jianzhuang
collection PubMed
description Determining protein-protein interaction (PPI) in biological systems is of considerable importance, and prediction of PPI has become a popular research area. Although different classifiers have been developed for PPI prediction, no single classifier seems to be able to predict PPI with high confidence. We postulated that by combining individual classifiers the accuracy of PPI prediction could be improved. We developed a method called protein-protein interaction prediction classifiers merger (PPCM), and this method combines output from two PPI prediction tools, GO2PPI and Phyloprof, using Random Forests algorithm. The performance of PPCM was tested by area under the curve (AUC) using an assembled Gold Standard database that contains both positive and negative PPI pairs. Our AUC test showed that PPCM significantly improved the PPI prediction accuracy over the corresponding individual classifiers. We found that additional classifiers incorporated into PPCM could lead to further improvement in the PPI prediction accuracy. Furthermore, cross species PPCM could achieve competitive and even better prediction accuracy compared to the single species PPCM. This study established a robust pipeline for PPI prediction by integrating multiple classifiers using Random Forests algorithm. This pipeline will be useful for predicting PPI in nonmodel species.
format Online
Article
Text
id pubmed-4619929
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-46199292015-11-04 PPCM: Combing Multiple Classifiers to Improve Protein-Protein Interaction Prediction Yao, Jianzhuang Guo, Hong Yang, Xiaohan Int J Genomics Research Article Determining protein-protein interaction (PPI) in biological systems is of considerable importance, and prediction of PPI has become a popular research area. Although different classifiers have been developed for PPI prediction, no single classifier seems to be able to predict PPI with high confidence. We postulated that by combining individual classifiers the accuracy of PPI prediction could be improved. We developed a method called protein-protein interaction prediction classifiers merger (PPCM), and this method combines output from two PPI prediction tools, GO2PPI and Phyloprof, using Random Forests algorithm. The performance of PPCM was tested by area under the curve (AUC) using an assembled Gold Standard database that contains both positive and negative PPI pairs. Our AUC test showed that PPCM significantly improved the PPI prediction accuracy over the corresponding individual classifiers. We found that additional classifiers incorporated into PPCM could lead to further improvement in the PPI prediction accuracy. Furthermore, cross species PPCM could achieve competitive and even better prediction accuracy compared to the single species PPCM. This study established a robust pipeline for PPI prediction by integrating multiple classifiers using Random Forests algorithm. This pipeline will be useful for predicting PPI in nonmodel species. Hindawi Publishing Corporation 2015 2015-10-11 /pmc/articles/PMC4619929/ /pubmed/26539460 http://dx.doi.org/10.1155/2015/608042 Text en Copyright © 2015 Jianzhuang Yao et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Yao, Jianzhuang
Guo, Hong
Yang, Xiaohan
PPCM: Combing Multiple Classifiers to Improve Protein-Protein Interaction Prediction
title PPCM: Combing Multiple Classifiers to Improve Protein-Protein Interaction Prediction
title_full PPCM: Combing Multiple Classifiers to Improve Protein-Protein Interaction Prediction
title_fullStr PPCM: Combing Multiple Classifiers to Improve Protein-Protein Interaction Prediction
title_full_unstemmed PPCM: Combing Multiple Classifiers to Improve Protein-Protein Interaction Prediction
title_short PPCM: Combing Multiple Classifiers to Improve Protein-Protein Interaction Prediction
title_sort ppcm: combing multiple classifiers to improve protein-protein interaction prediction
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4619929/
https://www.ncbi.nlm.nih.gov/pubmed/26539460
http://dx.doi.org/10.1155/2015/608042
work_keys_str_mv AT yaojianzhuang ppcmcombingmultipleclassifierstoimproveproteinproteininteractionprediction
AT guohong ppcmcombingmultipleclassifierstoimproveproteinproteininteractionprediction
AT yangxiaohan ppcmcombingmultipleclassifierstoimproveproteinproteininteractionprediction