Cargando…

Inferring high-confidence human protein-protein interactions

BACKGROUND: As numerous experimental factors drive the acquisition, identification, and interpretation of protein-protein interactions (PPIs), aggregated assemblies of human PPI data invariably contain experiment-dependent noise. Ascertaining the reliability of PPIs collected from these diverse stud...

Descripción completa

Detalles Bibliográficos
Autores principales:	Yu, Xueping, Wallqvist, Anders, Reifman, Jaques
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2012
Materias:	Methodology Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3416704/ https://www.ncbi.nlm.nih.gov/pubmed/22558947 http://dx.doi.org/10.1186/1471-2105-13-79

_version_	1782240427115544576
author	Yu, Xueping Wallqvist, Anders Reifman, Jaques
author_facet	Yu, Xueping Wallqvist, Anders Reifman, Jaques
author_sort	Yu, Xueping
collection	PubMed
description	BACKGROUND: As numerous experimental factors drive the acquisition, identification, and interpretation of protein-protein interactions (PPIs), aggregated assemblies of human PPI data invariably contain experiment-dependent noise. Ascertaining the reliability of PPIs collected from these diverse studies and scoring them to infer high-confidence networks is a non-trivial task. Moreover, a large number of PPIs share the same number of reported occurrences, making it impossible to distinguish the reliability of these PPIs and rank-order them. For example, for the data analyzed here, we found that the majority (>83%) of currently available human PPIs have been reported only once. RESULTS: In this work, we proposed an unsupervised statistical approach to score a set of diverse, experimentally identified PPIs from nine primary databases to create subsets of high-confidence human PPI networks. We evaluated this ranking method by comparing it with other methods and assessing their ability to retrieve protein associations from a number of diverse and independent reference sets. These reference sets contain known biological data that are either directly or indirectly linked to interactions between proteins. We quantified the average effect of using ranked protein interaction data to retrieve this information and showed that, when compared to randomly ranked interaction data sets, the proposed method created a larger enrichment (~134%) than either ranking based on the hypergeometric test (~109%) or occurrence ranking (~46%). CONCLUSIONS: From our evaluations, it was clear that ranked interactions were always of value because higher-ranked PPIs had a higher likelihood of retrieving high-confidence experimental data. Reducing the noise inherent in aggregated experimental PPIs via our ranking scheme further increased the accuracy and enrichment of PPIs derived from a number of biologically relevant data sets. These results suggest that using our high-confidence protein interactions at different levels of confidence will help clarify the topological and biological properties associated with human protein networks.
format	Online Article Text
id	pubmed-3416704
institution	National Center for Biotechnology Information
language	English
publishDate	2012
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-34167042012-08-13 Inferring high-confidence human protein-protein interactions Yu, Xueping Wallqvist, Anders Reifman, Jaques BMC Bioinformatics Methodology Article BACKGROUND: As numerous experimental factors drive the acquisition, identification, and interpretation of protein-protein interactions (PPIs), aggregated assemblies of human PPI data invariably contain experiment-dependent noise. Ascertaining the reliability of PPIs collected from these diverse studies and scoring them to infer high-confidence networks is a non-trivial task. Moreover, a large number of PPIs share the same number of reported occurrences, making it impossible to distinguish the reliability of these PPIs and rank-order them. For example, for the data analyzed here, we found that the majority (>83%) of currently available human PPIs have been reported only once. RESULTS: In this work, we proposed an unsupervised statistical approach to score a set of diverse, experimentally identified PPIs from nine primary databases to create subsets of high-confidence human PPI networks. We evaluated this ranking method by comparing it with other methods and assessing their ability to retrieve protein associations from a number of diverse and independent reference sets. These reference sets contain known biological data that are either directly or indirectly linked to interactions between proteins. We quantified the average effect of using ranked protein interaction data to retrieve this information and showed that, when compared to randomly ranked interaction data sets, the proposed method created a larger enrichment (~134%) than either ranking based on the hypergeometric test (~109%) or occurrence ranking (~46%). CONCLUSIONS: From our evaluations, it was clear that ranked interactions were always of value because higher-ranked PPIs had a higher likelihood of retrieving high-confidence experimental data. Reducing the noise inherent in aggregated experimental PPIs via our ranking scheme further increased the accuracy and enrichment of PPIs derived from a number of biologically relevant data sets. These results suggest that using our high-confidence protein interactions at different levels of confidence will help clarify the topological and biological properties associated with human protein networks. BioMed Central 2012-05-04 /pmc/articles/PMC3416704/ /pubmed/22558947 http://dx.doi.org/10.1186/1471-2105-13-79 Text en Copyright ©2012 Yu et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Methodology Article Yu, Xueping Wallqvist, Anders Reifman, Jaques Inferring high-confidence human protein-protein interactions
title	Inferring high-confidence human protein-protein interactions
title_full	Inferring high-confidence human protein-protein interactions
title_fullStr	Inferring high-confidence human protein-protein interactions
title_full_unstemmed	Inferring high-confidence human protein-protein interactions
title_short	Inferring high-confidence human protein-protein interactions
title_sort	inferring high-confidence human protein-protein interactions
topic	Methodology Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3416704/ https://www.ncbi.nlm.nih.gov/pubmed/22558947 http://dx.doi.org/10.1186/1471-2105-13-79
work_keys_str_mv	AT yuxueping inferringhighconfidencehumanproteinproteininteractions AT wallqvistanders inferringhighconfidencehumanproteinproteininteractions AT reifmanjaques inferringhighconfidencehumanproteinproteininteractions

Inferring high-confidence human protein-protein interactions

Ejemplares similares