Cargando…

Homology-based prediction of interactions between proteins using Averaged One-Dependence Estimators

BACKGROUND: Identification of protein-protein interactions (PPIs) is essential for a better understanding of biological processes, pathways and functions. However, experimental identification of the complete set of PPIs in a cell/organism (“an interactome”) is still a difficult task. To circumvent l...

Descripción completa

Detalles Bibliográficos
Autores principales: Murakami, Yoichi, Mizuguchi, Kenji
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4229973/
https://www.ncbi.nlm.nih.gov/pubmed/24953126
http://dx.doi.org/10.1186/1471-2105-15-213
_version_ 1782344201901441024
author Murakami, Yoichi
Mizuguchi, Kenji
author_facet Murakami, Yoichi
Mizuguchi, Kenji
author_sort Murakami, Yoichi
collection PubMed
description BACKGROUND: Identification of protein-protein interactions (PPIs) is essential for a better understanding of biological processes, pathways and functions. However, experimental identification of the complete set of PPIs in a cell/organism (“an interactome”) is still a difficult task. To circumvent limitations of current high-throughput experimental techniques, it is necessary to develop high-performance computational methods for predicting PPIs. RESULTS: In this article, we propose a new computational method to predict interaction between a given pair of protein sequences using features derived from known homologous PPIs. The proposed method is capable of predicting interaction between two proteins (of unknown structure) using Averaged One-Dependence Estimators (AODE) and three features calculated for the protein pair: (a) sequence similarities to a known interacting protein pair (F(Seq)), (b) statistical propensities of domain pairs observed in interacting proteins (F(Dom)) and (c) a sum of edge weights along the shortest path between homologous proteins in a PPI network (F(Net)). Feature vectors were defined to lie in a half-space of the symmetrical high-dimensional feature space to make them independent of the protein order. The predictability of the method was assessed by a 10-fold cross validation on a recently created human PPI dataset with randomly sampled negative data, and the best model achieved an Area Under the Curve of 0.79 (pAUC(0.5%) = 0.16). In addition, the AODE trained on all three features (named PSOPIA) showed better prediction performance on a separate independent data set than a recently reported homology-based method. CONCLUSIONS: Our results suggest that F(Net), a feature representing proximity in a known PPI network between two proteins that are homologous to a target protein pair, contributes to the prediction of whether the target proteins interact or not. PSOPIA will help identify novel PPIs and estimate complete PPI networks. The method proposed in this article is freely available on the web at http://mizuguchilab.org/PSOPIA.
format Online
Article
Text
id pubmed-4229973
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42299732014-11-14 Homology-based prediction of interactions between proteins using Averaged One-Dependence Estimators Murakami, Yoichi Mizuguchi, Kenji BMC Bioinformatics Research Article BACKGROUND: Identification of protein-protein interactions (PPIs) is essential for a better understanding of biological processes, pathways and functions. However, experimental identification of the complete set of PPIs in a cell/organism (“an interactome”) is still a difficult task. To circumvent limitations of current high-throughput experimental techniques, it is necessary to develop high-performance computational methods for predicting PPIs. RESULTS: In this article, we propose a new computational method to predict interaction between a given pair of protein sequences using features derived from known homologous PPIs. The proposed method is capable of predicting interaction between two proteins (of unknown structure) using Averaged One-Dependence Estimators (AODE) and three features calculated for the protein pair: (a) sequence similarities to a known interacting protein pair (F(Seq)), (b) statistical propensities of domain pairs observed in interacting proteins (F(Dom)) and (c) a sum of edge weights along the shortest path between homologous proteins in a PPI network (F(Net)). Feature vectors were defined to lie in a half-space of the symmetrical high-dimensional feature space to make them independent of the protein order. The predictability of the method was assessed by a 10-fold cross validation on a recently created human PPI dataset with randomly sampled negative data, and the best model achieved an Area Under the Curve of 0.79 (pAUC(0.5%) = 0.16). In addition, the AODE trained on all three features (named PSOPIA) showed better prediction performance on a separate independent data set than a recently reported homology-based method. CONCLUSIONS: Our results suggest that F(Net), a feature representing proximity in a known PPI network between two proteins that are homologous to a target protein pair, contributes to the prediction of whether the target proteins interact or not. PSOPIA will help identify novel PPIs and estimate complete PPI networks. The method proposed in this article is freely available on the web at http://mizuguchilab.org/PSOPIA. BioMed Central 2014-06-23 /pmc/articles/PMC4229973/ /pubmed/24953126 http://dx.doi.org/10.1186/1471-2105-15-213 Text en Copyright © 2014 Murakami and Mizuguchi; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Murakami, Yoichi
Mizuguchi, Kenji
Homology-based prediction of interactions between proteins using Averaged One-Dependence Estimators
title Homology-based prediction of interactions between proteins using Averaged One-Dependence Estimators
title_full Homology-based prediction of interactions between proteins using Averaged One-Dependence Estimators
title_fullStr Homology-based prediction of interactions between proteins using Averaged One-Dependence Estimators
title_full_unstemmed Homology-based prediction of interactions between proteins using Averaged One-Dependence Estimators
title_short Homology-based prediction of interactions between proteins using Averaged One-Dependence Estimators
title_sort homology-based prediction of interactions between proteins using averaged one-dependence estimators
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4229973/
https://www.ncbi.nlm.nih.gov/pubmed/24953126
http://dx.doi.org/10.1186/1471-2105-15-213
work_keys_str_mv AT murakamiyoichi homologybasedpredictionofinteractionsbetweenproteinsusingaveragedonedependenceestimators
AT mizuguchikenji homologybasedpredictionofinteractionsbetweenproteinsusingaveragedonedependenceestimators