Cargando…

The Development of a Universal In Silico Predictor of Protein-Protein Interactions

Protein-protein interactions (PPIs) are essential for understanding the function of biological systems and have been characterized using a vast array of experimental techniques. These techniques detect only a small proportion of all PPIs and are labor intensive and time consuming. Therefore, the dev...

Descripción completa

Detalles Bibliográficos
Autores principales: Valente, Guilherme T., Acencio, Marcio L., Martins, Cesar, Lemke, Ney
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3669264/
https://www.ncbi.nlm.nih.gov/pubmed/23741499
http://dx.doi.org/10.1371/journal.pone.0065587
_version_ 1782271719428325376
author Valente, Guilherme T.
Acencio, Marcio L.
Martins, Cesar
Lemke, Ney
author_facet Valente, Guilherme T.
Acencio, Marcio L.
Martins, Cesar
Lemke, Ney
author_sort Valente, Guilherme T.
collection PubMed
description Protein-protein interactions (PPIs) are essential for understanding the function of biological systems and have been characterized using a vast array of experimental techniques. These techniques detect only a small proportion of all PPIs and are labor intensive and time consuming. Therefore, the development of computational methods capable of predicting PPIs accelerates the pace of discovery of new interactions. This paper reports a machine learning-based prediction model, the Universal In Silico Predictor of Protein-Protein Interactions (UNISPPI), which is a decision tree model that can reliably predict PPIs for all species (including proteins from parasite-host associations) using only 20 combinations of amino acids frequencies from interacting and non-interacting proteins as learning features. UNISPPI was able to correctly classify 79.4% and 72.6% of experimentally supported interactions and non-interacting protein pairs, respectively, from an independent test set. Moreover, UNISPPI suggests that the frequencies of the amino acids asparagine, cysteine and isoleucine are important features for distinguishing between interacting and non-interacting protein pairs. We envisage that UNISPPI can be a useful tool for prioritizing interactions for experimental validation.
format Online
Article
Text
id pubmed-3669264
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-36692642013-06-05 The Development of a Universal In Silico Predictor of Protein-Protein Interactions Valente, Guilherme T. Acencio, Marcio L. Martins, Cesar Lemke, Ney PLoS One Research Article Protein-protein interactions (PPIs) are essential for understanding the function of biological systems and have been characterized using a vast array of experimental techniques. These techniques detect only a small proportion of all PPIs and are labor intensive and time consuming. Therefore, the development of computational methods capable of predicting PPIs accelerates the pace of discovery of new interactions. This paper reports a machine learning-based prediction model, the Universal In Silico Predictor of Protein-Protein Interactions (UNISPPI), which is a decision tree model that can reliably predict PPIs for all species (including proteins from parasite-host associations) using only 20 combinations of amino acids frequencies from interacting and non-interacting proteins as learning features. UNISPPI was able to correctly classify 79.4% and 72.6% of experimentally supported interactions and non-interacting protein pairs, respectively, from an independent test set. Moreover, UNISPPI suggests that the frequencies of the amino acids asparagine, cysteine and isoleucine are important features for distinguishing between interacting and non-interacting protein pairs. We envisage that UNISPPI can be a useful tool for prioritizing interactions for experimental validation. Public Library of Science 2013-05-31 /pmc/articles/PMC3669264/ /pubmed/23741499 http://dx.doi.org/10.1371/journal.pone.0065587 Text en © 2013 Valente et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Valente, Guilherme T.
Acencio, Marcio L.
Martins, Cesar
Lemke, Ney
The Development of a Universal In Silico Predictor of Protein-Protein Interactions
title The Development of a Universal In Silico Predictor of Protein-Protein Interactions
title_full The Development of a Universal In Silico Predictor of Protein-Protein Interactions
title_fullStr The Development of a Universal In Silico Predictor of Protein-Protein Interactions
title_full_unstemmed The Development of a Universal In Silico Predictor of Protein-Protein Interactions
title_short The Development of a Universal In Silico Predictor of Protein-Protein Interactions
title_sort development of a universal in silico predictor of protein-protein interactions
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3669264/
https://www.ncbi.nlm.nih.gov/pubmed/23741499
http://dx.doi.org/10.1371/journal.pone.0065587
work_keys_str_mv AT valenteguilhermet thedevelopmentofauniversalinsilicopredictorofproteinproteininteractions
AT acenciomarciol thedevelopmentofauniversalinsilicopredictorofproteinproteininteractions
AT martinscesar thedevelopmentofauniversalinsilicopredictorofproteinproteininteractions
AT lemkeney thedevelopmentofauniversalinsilicopredictorofproteinproteininteractions
AT valenteguilhermet developmentofauniversalinsilicopredictorofproteinproteininteractions
AT acenciomarciol developmentofauniversalinsilicopredictorofproteinproteininteractions
AT martinscesar developmentofauniversalinsilicopredictorofproteinproteininteractions
AT lemkeney developmentofauniversalinsilicopredictorofproteinproteininteractions