Cargando…

PPI_SVM: Prediction of protein-protein interactions using machine learning, domain-domain affinities and frequency tables

Protein-protein interactions (PPI) control most of the biological processes in a living cell. In order to fully understand protein functions, a knowledge of protein-protein interactions is necessary. Prediction of PPI is challenging, especially when the three-dimensional structure of interacting par...

Descripción completa

Detalles Bibliográficos
Autores principales: Chatterjee, Piyali, Basu, Subhadip, Kundu, Mahantapas, Nasipuri, Mita, Plewczynski, Dariusz
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SP Versita 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6275787/
https://www.ncbi.nlm.nih.gov/pubmed/21442443
http://dx.doi.org/10.2478/s11658-011-0008-x
_version_ 1783377877934276608
author Chatterjee, Piyali
Basu, Subhadip
Kundu, Mahantapas
Nasipuri, Mita
Plewczynski, Dariusz
author_facet Chatterjee, Piyali
Basu, Subhadip
Kundu, Mahantapas
Nasipuri, Mita
Plewczynski, Dariusz
author_sort Chatterjee, Piyali
collection PubMed
description Protein-protein interactions (PPI) control most of the biological processes in a living cell. In order to fully understand protein functions, a knowledge of protein-protein interactions is necessary. Prediction of PPI is challenging, especially when the three-dimensional structure of interacting partners is not known. Recently, a novel prediction method was proposed by exploiting physical interactions of constituent domains. We propose here a novel knowledge-based prediction method, namely PPI_SVM, which predicts interactions between two protein sequences by exploiting their domain information. We trained a two-class support vector machine on the benchmarking set of pairs of interacting proteins extracted from the Database of Interacting Proteins (DIP). The method considers all possible combinations of constituent domains between two protein sequences, unlike most of the existing approaches. Moreover, it deals with both single-domain proteins and multi domain proteins; therefore it can be applied to the whole proteome in high-throughput studies. Our machine learning classifier, following a brainstorming approach, achieves accuracy of 86%, with specificity of 95%, and sensitivity of 75%, which are better results than most previous methods that sacrifice recall values in order to boost the overall precision. Our method has on average better sensitivity combined with good selectivity on the benchmarking dataset. The PPI_SVM source code, train/test datasets and supplementary files are available freely in the public domain at: http://code.google.com/p/cmater-bioinfo/. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi: 10.2478/s11658-011-0008-x contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6275787
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher SP Versita
record_format MEDLINE/PubMed
spelling pubmed-62757872018-12-10 PPI_SVM: Prediction of protein-protein interactions using machine learning, domain-domain affinities and frequency tables Chatterjee, Piyali Basu, Subhadip Kundu, Mahantapas Nasipuri, Mita Plewczynski, Dariusz Cell Mol Biol Lett Short Communication Protein-protein interactions (PPI) control most of the biological processes in a living cell. In order to fully understand protein functions, a knowledge of protein-protein interactions is necessary. Prediction of PPI is challenging, especially when the three-dimensional structure of interacting partners is not known. Recently, a novel prediction method was proposed by exploiting physical interactions of constituent domains. We propose here a novel knowledge-based prediction method, namely PPI_SVM, which predicts interactions between two protein sequences by exploiting their domain information. We trained a two-class support vector machine on the benchmarking set of pairs of interacting proteins extracted from the Database of Interacting Proteins (DIP). The method considers all possible combinations of constituent domains between two protein sequences, unlike most of the existing approaches. Moreover, it deals with both single-domain proteins and multi domain proteins; therefore it can be applied to the whole proteome in high-throughput studies. Our machine learning classifier, following a brainstorming approach, achieves accuracy of 86%, with specificity of 95%, and sensitivity of 75%, which are better results than most previous methods that sacrifice recall values in order to boost the overall precision. Our method has on average better sensitivity combined with good selectivity on the benchmarking dataset. The PPI_SVM source code, train/test datasets and supplementary files are available freely in the public domain at: http://code.google.com/p/cmater-bioinfo/. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi: 10.2478/s11658-011-0008-x contains supplementary material, which is available to authorized users. SP Versita 2011-03-20 /pmc/articles/PMC6275787/ /pubmed/21442443 http://dx.doi.org/10.2478/s11658-011-0008-x Text en © © Versita Warsaw and Springer-Verlag Wien 2011
spellingShingle Short Communication
Chatterjee, Piyali
Basu, Subhadip
Kundu, Mahantapas
Nasipuri, Mita
Plewczynski, Dariusz
PPI_SVM: Prediction of protein-protein interactions using machine learning, domain-domain affinities and frequency tables
title PPI_SVM: Prediction of protein-protein interactions using machine learning, domain-domain affinities and frequency tables
title_full PPI_SVM: Prediction of protein-protein interactions using machine learning, domain-domain affinities and frequency tables
title_fullStr PPI_SVM: Prediction of protein-protein interactions using machine learning, domain-domain affinities and frequency tables
title_full_unstemmed PPI_SVM: Prediction of protein-protein interactions using machine learning, domain-domain affinities and frequency tables
title_short PPI_SVM: Prediction of protein-protein interactions using machine learning, domain-domain affinities and frequency tables
title_sort ppi_svm: prediction of protein-protein interactions using machine learning, domain-domain affinities and frequency tables
topic Short Communication
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6275787/
https://www.ncbi.nlm.nih.gov/pubmed/21442443
http://dx.doi.org/10.2478/s11658-011-0008-x
work_keys_str_mv AT chatterjeepiyali ppisvmpredictionofproteinproteininteractionsusingmachinelearningdomaindomainaffinitiesandfrequencytables
AT basusubhadip ppisvmpredictionofproteinproteininteractionsusingmachinelearningdomaindomainaffinitiesandfrequencytables
AT kundumahantapas ppisvmpredictionofproteinproteininteractionsusingmachinelearningdomaindomainaffinitiesandfrequencytables
AT nasipurimita ppisvmpredictionofproteinproteininteractionsusingmachinelearningdomaindomainaffinitiesandfrequencytables
AT plewczynskidariusz ppisvmpredictionofproteinproteininteractionsusingmachinelearningdomaindomainaffinitiesandfrequencytables