Cargando…

Constructing a robust protein-protein interaction network by integrating multiple public databases

BACKGROUND: Protein-protein interactions (PPIs) are a critical component for many underlying biological processes. A PPI network can provide insight into the mechanisms of these processes, as well as the relationships among different proteins and toxicants that are potentially involved in the proces...

Descripción completa

Detalles Bibliográficos
Autores principales: Martha, Venkata-Swamy, Liu, Zhichao, Guo, Li, Su, Zhenqiang, Ye, Yanbin, Fang, Hong, Ding, Don, Tong, Weida, Xu, Xiaowei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3236850/
https://www.ncbi.nlm.nih.gov/pubmed/22165958
http://dx.doi.org/10.1186/1471-2105-12-S10-S7
_version_ 1782218796154486784
author Martha, Venkata-Swamy
Liu, Zhichao
Guo, Li
Su, Zhenqiang
Ye, Yanbin
Fang, Hong
Ding, Don
Tong, Weida
Xu, Xiaowei
author_facet Martha, Venkata-Swamy
Liu, Zhichao
Guo, Li
Su, Zhenqiang
Ye, Yanbin
Fang, Hong
Ding, Don
Tong, Weida
Xu, Xiaowei
author_sort Martha, Venkata-Swamy
collection PubMed
description BACKGROUND: Protein-protein interactions (PPIs) are a critical component for many underlying biological processes. A PPI network can provide insight into the mechanisms of these processes, as well as the relationships among different proteins and toxicants that are potentially involved in the processes. There are many PPI databases publicly available, each with a specific focus. The challenge is how to effectively combine their contents to generate a robust and biologically relevant PPI network. METHODS: In this study, seven public PPI databases, BioGRID, DIP, HPRD, IntAct, MINT, REACTOME, and SPIKE, were used to explore a powerful approach to combine multiple PPI databases for an integrated PPI network. We developed a novel method called k-votes to create seven different integrated networks by using values of k ranging from 1-7. Functional modules were mined by using SCAN, a Structural Clustering Algorithm for Networks. Overall module qualities were evaluated for each integrated network using the following statistical and biological measures: (1) modularity, (2) similarity-based modularity, (3) clustering score, and (4) enrichment. RESULTS: Each integrated human PPI network was constructed based on the number of votes (k) for a particular interaction from the committee of the original seven PPI databases. The performance of functional modules obtained by SCAN from each integrated network was evaluated. The optimal value for k was determined by the functional module analysis. Our results demonstrate that the k-votes method outperforms the traditional union approach in terms of both statistical significance and biological meaning. The best network is achieved at k=2, which is composed of interactions that are confirmed in at least two PPI databases. In contrast, the traditional union approach yields an integrated network that consists of all interactions of seven PPI databases, which might be subject to high false positives. CONCLUSIONS: We determined that the k-votes method for constructing a robust PPI network by integrating multiple public databases outperforms previously reported approaches and that a value of k=2 provides the best results. The developed strategies for combining databases show promise in the advancement of network construction and modeling.
format Online
Article
Text
id pubmed-3236850
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32368502011-12-14 Constructing a robust protein-protein interaction network by integrating multiple public databases Martha, Venkata-Swamy Liu, Zhichao Guo, Li Su, Zhenqiang Ye, Yanbin Fang, Hong Ding, Don Tong, Weida Xu, Xiaowei BMC Bioinformatics Proceedings BACKGROUND: Protein-protein interactions (PPIs) are a critical component for many underlying biological processes. A PPI network can provide insight into the mechanisms of these processes, as well as the relationships among different proteins and toxicants that are potentially involved in the processes. There are many PPI databases publicly available, each with a specific focus. The challenge is how to effectively combine their contents to generate a robust and biologically relevant PPI network. METHODS: In this study, seven public PPI databases, BioGRID, DIP, HPRD, IntAct, MINT, REACTOME, and SPIKE, were used to explore a powerful approach to combine multiple PPI databases for an integrated PPI network. We developed a novel method called k-votes to create seven different integrated networks by using values of k ranging from 1-7. Functional modules were mined by using SCAN, a Structural Clustering Algorithm for Networks. Overall module qualities were evaluated for each integrated network using the following statistical and biological measures: (1) modularity, (2) similarity-based modularity, (3) clustering score, and (4) enrichment. RESULTS: Each integrated human PPI network was constructed based on the number of votes (k) for a particular interaction from the committee of the original seven PPI databases. The performance of functional modules obtained by SCAN from each integrated network was evaluated. The optimal value for k was determined by the functional module analysis. Our results demonstrate that the k-votes method outperforms the traditional union approach in terms of both statistical significance and biological meaning. The best network is achieved at k=2, which is composed of interactions that are confirmed in at least two PPI databases. In contrast, the traditional union approach yields an integrated network that consists of all interactions of seven PPI databases, which might be subject to high false positives. CONCLUSIONS: We determined that the k-votes method for constructing a robust PPI network by integrating multiple public databases outperforms previously reported approaches and that a value of k=2 provides the best results. The developed strategies for combining databases show promise in the advancement of network construction and modeling. BioMed Central 2011-10-18 /pmc/articles/PMC3236850/ /pubmed/22165958 http://dx.doi.org/10.1186/1471-2105-12-S10-S7 Text en Copyright ©2011 Swamy et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Martha, Venkata-Swamy
Liu, Zhichao
Guo, Li
Su, Zhenqiang
Ye, Yanbin
Fang, Hong
Ding, Don
Tong, Weida
Xu, Xiaowei
Constructing a robust protein-protein interaction network by integrating multiple public databases
title Constructing a robust protein-protein interaction network by integrating multiple public databases
title_full Constructing a robust protein-protein interaction network by integrating multiple public databases
title_fullStr Constructing a robust protein-protein interaction network by integrating multiple public databases
title_full_unstemmed Constructing a robust protein-protein interaction network by integrating multiple public databases
title_short Constructing a robust protein-protein interaction network by integrating multiple public databases
title_sort constructing a robust protein-protein interaction network by integrating multiple public databases
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3236850/
https://www.ncbi.nlm.nih.gov/pubmed/22165958
http://dx.doi.org/10.1186/1471-2105-12-S10-S7
work_keys_str_mv AT marthavenkataswamy constructingarobustproteinproteininteractionnetworkbyintegratingmultiplepublicdatabases
AT liuzhichao constructingarobustproteinproteininteractionnetworkbyintegratingmultiplepublicdatabases
AT guoli constructingarobustproteinproteininteractionnetworkbyintegratingmultiplepublicdatabases
AT suzhenqiang constructingarobustproteinproteininteractionnetworkbyintegratingmultiplepublicdatabases
AT yeyanbin constructingarobustproteinproteininteractionnetworkbyintegratingmultiplepublicdatabases
AT fanghong constructingarobustproteinproteininteractionnetworkbyintegratingmultiplepublicdatabases
AT dingdon constructingarobustproteinproteininteractionnetworkbyintegratingmultiplepublicdatabases
AT tongweida constructingarobustproteinproteininteractionnetworkbyintegratingmultiplepublicdatabases
AT xuxiaowei constructingarobustproteinproteininteractionnetworkbyintegratingmultiplepublicdatabases