Cargando…

Integrating diverse biological and computational sources for reliable protein-protein interactions

BACKGROUND: Protein-protein interactions (PPIs) play important roles in various cellular processes. However, the low quality of current PPI data detected from high-throughput screening techniques has diminished the potential usefulness of the data. We need to develop a method to address the high dat...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Min, Li, Xiaoli, Chua, Hon Nian, Kwoh, Chee-Keong, Ng, See-Kiong
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2957691/
https://www.ncbi.nlm.nih.gov/pubmed/21106130
http://dx.doi.org/10.1186/1471-2105-11-S7-S8
_version_ 1782188250671087616
author Wu, Min
Li, Xiaoli
Chua, Hon Nian
Kwoh, Chee-Keong
Ng, See-Kiong
author_facet Wu, Min
Li, Xiaoli
Chua, Hon Nian
Kwoh, Chee-Keong
Ng, See-Kiong
author_sort Wu, Min
collection PubMed
description BACKGROUND: Protein-protein interactions (PPIs) play important roles in various cellular processes. However, the low quality of current PPI data detected from high-throughput screening techniques has diminished the potential usefulness of the data. We need to develop a method to address the high data noise and incompleteness of PPI data, namely, to filter out inaccurate protein interactions (false positives) and predict putative protein interactions (false negatives). RESULTS: In this paper, we proposed a novel two-step method to integrate diverse biological and computational sources of supporting evidence for reliable PPIs. The first step, interaction binning or InterBIN, groups PPIs together to more accurately estimate the likelihood (Bin-Confidence score) that the protein pairs interact for each biological or computational evidence source. The second step, interaction classification or InterCLASS, integrates the collected Bin-Confidence scores to build classifiers and identify reliable interactions. CONCLUSIONS: We performed comprehensive experiments on two benchmark yeast PPI datasets. The experimental results showed that our proposed method can effectively eliminate false positives in detected PPIs and identify false negatives by predicting novel yet reliable PPIs. Our proposed method also performed significantly better than merely using each of individual evidence sources, illustrating the importance of integrating various biological and computational sources of data and evidence.
format Text
id pubmed-2957691
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-29576912010-10-21 Integrating diverse biological and computational sources for reliable protein-protein interactions Wu, Min Li, Xiaoli Chua, Hon Nian Kwoh, Chee-Keong Ng, See-Kiong BMC Bioinformatics Proceedings BACKGROUND: Protein-protein interactions (PPIs) play important roles in various cellular processes. However, the low quality of current PPI data detected from high-throughput screening techniques has diminished the potential usefulness of the data. We need to develop a method to address the high data noise and incompleteness of PPI data, namely, to filter out inaccurate protein interactions (false positives) and predict putative protein interactions (false negatives). RESULTS: In this paper, we proposed a novel two-step method to integrate diverse biological and computational sources of supporting evidence for reliable PPIs. The first step, interaction binning or InterBIN, groups PPIs together to more accurately estimate the likelihood (Bin-Confidence score) that the protein pairs interact for each biological or computational evidence source. The second step, interaction classification or InterCLASS, integrates the collected Bin-Confidence scores to build classifiers and identify reliable interactions. CONCLUSIONS: We performed comprehensive experiments on two benchmark yeast PPI datasets. The experimental results showed that our proposed method can effectively eliminate false positives in detected PPIs and identify false negatives by predicting novel yet reliable PPIs. Our proposed method also performed significantly better than merely using each of individual evidence sources, illustrating the importance of integrating various biological and computational sources of data and evidence. BioMed Central 2010-10-15 /pmc/articles/PMC2957691/ /pubmed/21106130 http://dx.doi.org/10.1186/1471-2105-11-S7-S8 Text en Copyright ©2010 Wu et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Wu, Min
Li, Xiaoli
Chua, Hon Nian
Kwoh, Chee-Keong
Ng, See-Kiong
Integrating diverse biological and computational sources for reliable protein-protein interactions
title Integrating diverse biological and computational sources for reliable protein-protein interactions
title_full Integrating diverse biological and computational sources for reliable protein-protein interactions
title_fullStr Integrating diverse biological and computational sources for reliable protein-protein interactions
title_full_unstemmed Integrating diverse biological and computational sources for reliable protein-protein interactions
title_short Integrating diverse biological and computational sources for reliable protein-protein interactions
title_sort integrating diverse biological and computational sources for reliable protein-protein interactions
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2957691/
https://www.ncbi.nlm.nih.gov/pubmed/21106130
http://dx.doi.org/10.1186/1471-2105-11-S7-S8
work_keys_str_mv AT wumin integratingdiversebiologicalandcomputationalsourcesforreliableproteinproteininteractions
AT lixiaoli integratingdiversebiologicalandcomputationalsourcesforreliableproteinproteininteractions
AT chuahonnian integratingdiversebiologicalandcomputationalsourcesforreliableproteinproteininteractions
AT kwohcheekeong integratingdiversebiologicalandcomputationalsourcesforreliableproteinproteininteractions
AT ngseekiong integratingdiversebiologicalandcomputationalsourcesforreliableproteinproteininteractions