Cargando…
PPCM: Combing Multiple Classifiers to Improve Protein-Protein Interaction Prediction
Determining protein-protein interaction (PPI) in biological systems is of considerable importance, and prediction of PPI has become a popular research area. Although different classifiers have been developed for PPI prediction, no single classifier seems to be able to predict PPI with high confidenc...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4619929/ https://www.ncbi.nlm.nih.gov/pubmed/26539460 http://dx.doi.org/10.1155/2015/608042 |
_version_ | 1782397215153586176 |
---|---|
author | Yao, Jianzhuang Guo, Hong Yang, Xiaohan |
author_facet | Yao, Jianzhuang Guo, Hong Yang, Xiaohan |
author_sort | Yao, Jianzhuang |
collection | PubMed |
description | Determining protein-protein interaction (PPI) in biological systems is of considerable importance, and prediction of PPI has become a popular research area. Although different classifiers have been developed for PPI prediction, no single classifier seems to be able to predict PPI with high confidence. We postulated that by combining individual classifiers the accuracy of PPI prediction could be improved. We developed a method called protein-protein interaction prediction classifiers merger (PPCM), and this method combines output from two PPI prediction tools, GO2PPI and Phyloprof, using Random Forests algorithm. The performance of PPCM was tested by area under the curve (AUC) using an assembled Gold Standard database that contains both positive and negative PPI pairs. Our AUC test showed that PPCM significantly improved the PPI prediction accuracy over the corresponding individual classifiers. We found that additional classifiers incorporated into PPCM could lead to further improvement in the PPI prediction accuracy. Furthermore, cross species PPCM could achieve competitive and even better prediction accuracy compared to the single species PPCM. This study established a robust pipeline for PPI prediction by integrating multiple classifiers using Random Forests algorithm. This pipeline will be useful for predicting PPI in nonmodel species. |
format | Online Article Text |
id | pubmed-4619929 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-46199292015-11-04 PPCM: Combing Multiple Classifiers to Improve Protein-Protein Interaction Prediction Yao, Jianzhuang Guo, Hong Yang, Xiaohan Int J Genomics Research Article Determining protein-protein interaction (PPI) in biological systems is of considerable importance, and prediction of PPI has become a popular research area. Although different classifiers have been developed for PPI prediction, no single classifier seems to be able to predict PPI with high confidence. We postulated that by combining individual classifiers the accuracy of PPI prediction could be improved. We developed a method called protein-protein interaction prediction classifiers merger (PPCM), and this method combines output from two PPI prediction tools, GO2PPI and Phyloprof, using Random Forests algorithm. The performance of PPCM was tested by area under the curve (AUC) using an assembled Gold Standard database that contains both positive and negative PPI pairs. Our AUC test showed that PPCM significantly improved the PPI prediction accuracy over the corresponding individual classifiers. We found that additional classifiers incorporated into PPCM could lead to further improvement in the PPI prediction accuracy. Furthermore, cross species PPCM could achieve competitive and even better prediction accuracy compared to the single species PPCM. This study established a robust pipeline for PPI prediction by integrating multiple classifiers using Random Forests algorithm. This pipeline will be useful for predicting PPI in nonmodel species. Hindawi Publishing Corporation 2015 2015-10-11 /pmc/articles/PMC4619929/ /pubmed/26539460 http://dx.doi.org/10.1155/2015/608042 Text en Copyright © 2015 Jianzhuang Yao et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Yao, Jianzhuang Guo, Hong Yang, Xiaohan PPCM: Combing Multiple Classifiers to Improve Protein-Protein Interaction Prediction |
title | PPCM: Combing Multiple Classifiers to Improve Protein-Protein Interaction Prediction |
title_full | PPCM: Combing Multiple Classifiers to Improve Protein-Protein Interaction Prediction |
title_fullStr | PPCM: Combing Multiple Classifiers to Improve Protein-Protein Interaction Prediction |
title_full_unstemmed | PPCM: Combing Multiple Classifiers to Improve Protein-Protein Interaction Prediction |
title_short | PPCM: Combing Multiple Classifiers to Improve Protein-Protein Interaction Prediction |
title_sort | ppcm: combing multiple classifiers to improve protein-protein interaction prediction |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4619929/ https://www.ncbi.nlm.nih.gov/pubmed/26539460 http://dx.doi.org/10.1155/2015/608042 |
work_keys_str_mv | AT yaojianzhuang ppcmcombingmultipleclassifierstoimproveproteinproteininteractionprediction AT guohong ppcmcombingmultipleclassifierstoimproveproteinproteininteractionprediction AT yangxiaohan ppcmcombingmultipleclassifierstoimproveproteinproteininteractionprediction |