Reconstructing Genome-Wide Protein–Protein Interaction Networks Using Multiple Strategies with Homologous Mapping

BACKGROUND: One of the crucial steps toward understanding the biological functions of a cellular system is to investigate protein–protein interaction (PPI) networks. As an increasing number of reliable PPIs become available, there is a growing need for discovering PPIs to reconstruct PPI networks of...

Descripción completa

Detalles Bibliográficos
Autores principales: Lo, Yu-Shu, Huang, Sing-Han, Luo, Yong-Chun, Lin, Chun-Yu, Yang, Jinn-Moon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4300222/
https://www.ncbi.nlm.nih.gov/pubmed/25602759
http://dx.doi.org/10.1371/journal.pone.0116347
_version_ 1782353498294190080
author Lo, Yu-Shu
Huang, Sing-Han
Luo, Yong-Chun
Lin, Chun-Yu
Yang, Jinn-Moon
author_facet Lo, Yu-Shu
Huang, Sing-Han
Luo, Yong-Chun
Lin, Chun-Yu
Yang, Jinn-Moon
author_sort Lo, Yu-Shu
collection PubMed
description BACKGROUND: One of the crucial steps toward understanding the biological functions of a cellular system is to investigate protein–protein interaction (PPI) networks. As an increasing number of reliable PPIs become available, there is a growing need for discovering PPIs to reconstruct PPI networks of interesting organisms. Some interolog-based methods and homologous PPI families have been proposed for predicting PPIs from the known PPIs of source organisms. RESULTS: Here, we propose a multiple-strategy scoring method to identify reliable PPIs for reconstructing the mouse PPI network from two well-known organisms: human and fly. We firstly identified the PPI candidates of target organisms based on homologous PPIs, sharing significant sequence similarities (joint E-value ≤ 1 × 10(−40)), from source organisms using generalized interolog mapping. These PPI candidates were evaluated by our multiple-strategy scoring method, combining sequence similarities, normalized ranks, and conservation scores across multiple organisms. According to 106,825 PPI candidates in yeast derived from human and fly, our scoring method can achieve high prediction accuracy and outperform generalized interolog mapping. Experiment results show that our multiple-strategy score can avoid the influence of the protein family size and length to significantly improve PPI prediction accuracy and reflect the biological functions. In addition, the top-ranked and conserved PPIs are often orthologous/essential interactions and share the functional similarity. Based on these reliable predicted PPIs, we reconstructed a comprehensive mouse PPI network, which is a scale-free network and can reflect the biological functions and high connectivity of 292 KEGG modules, including 216 pathways and 76 structural complexes. CONCLUSIONS: Experimental results show that our scoring method can improve the predicting accuracy based on the normalized rank and evolutionary conservation from multiple organisms. Our predicted PPIs share similar biological processes and cellular components, and the reconstructed genome-wide PPI network can reflect network topology and modularity. We believe that our method is useful for inferring reliable PPIs and reconstructing a comprehensive PPI network of an interesting organism.
format Online
Article
Text
id pubmed-4300222
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-43002222015-01-30 Reconstructing Genome-Wide Protein–Protein Interaction Networks Using Multiple Strategies with Homologous Mapping Lo, Yu-Shu Huang, Sing-Han Luo, Yong-Chun Lin, Chun-Yu Yang, Jinn-Moon PLoS One Research Article BACKGROUND: One of the crucial steps toward understanding the biological functions of a cellular system is to investigate protein–protein interaction (PPI) networks. As an increasing number of reliable PPIs become available, there is a growing need for discovering PPIs to reconstruct PPI networks of interesting organisms. Some interolog-based methods and homologous PPI families have been proposed for predicting PPIs from the known PPIs of source organisms. RESULTS: Here, we propose a multiple-strategy scoring method to identify reliable PPIs for reconstructing the mouse PPI network from two well-known organisms: human and fly. We firstly identified the PPI candidates of target organisms based on homologous PPIs, sharing significant sequence similarities (joint E-value ≤ 1 × 10(−40)), from source organisms using generalized interolog mapping. These PPI candidates were evaluated by our multiple-strategy scoring method, combining sequence similarities, normalized ranks, and conservation scores across multiple organisms. According to 106,825 PPI candidates in yeast derived from human and fly, our scoring method can achieve high prediction accuracy and outperform generalized interolog mapping. Experiment results show that our multiple-strategy score can avoid the influence of the protein family size and length to significantly improve PPI prediction accuracy and reflect the biological functions. In addition, the top-ranked and conserved PPIs are often orthologous/essential interactions and share the functional similarity. Based on these reliable predicted PPIs, we reconstructed a comprehensive mouse PPI network, which is a scale-free network and can reflect the biological functions and high connectivity of 292 KEGG modules, including 216 pathways and 76 structural complexes. CONCLUSIONS: Experimental results show that our scoring method can improve the predicting accuracy based on the normalized rank and evolutionary conservation from multiple organisms. Our predicted PPIs share similar biological processes and cellular components, and the reconstructed genome-wide PPI network can reflect network topology and modularity. We believe that our method is useful for inferring reliable PPIs and reconstructing a comprehensive PPI network of an interesting organism. Public Library of Science 2015-01-20 /pmc/articles/PMC4300222/ /pubmed/25602759 http://dx.doi.org/10.1371/journal.pone.0116347 Text en © 2015 Lo et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Lo, Yu-Shu
Huang, Sing-Han
Luo, Yong-Chun
Lin, Chun-Yu
Yang, Jinn-Moon
Reconstructing Genome-Wide Protein–Protein Interaction Networks Using Multiple Strategies with Homologous Mapping
title Reconstructing Genome-Wide Protein–Protein Interaction Networks Using Multiple Strategies with Homologous Mapping
title_full Reconstructing Genome-Wide Protein–Protein Interaction Networks Using Multiple Strategies with Homologous Mapping
title_fullStr Reconstructing Genome-Wide Protein–Protein Interaction Networks Using Multiple Strategies with Homologous Mapping
title_full_unstemmed Reconstructing Genome-Wide Protein–Protein Interaction Networks Using Multiple Strategies with Homologous Mapping
title_short Reconstructing Genome-Wide Protein–Protein Interaction Networks Using Multiple Strategies with Homologous Mapping
title_sort reconstructing genome-wide protein–protein interaction networks using multiple strategies with homologous mapping
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4300222/
https://www.ncbi.nlm.nih.gov/pubmed/25602759
http://dx.doi.org/10.1371/journal.pone.0116347
work_keys_str_mv AT loyushu reconstructinggenomewideproteinproteininteractionnetworksusingmultiplestrategieswithhomologousmapping
AT huangsinghan reconstructinggenomewideproteinproteininteractionnetworksusingmultiplestrategieswithhomologousmapping
AT luoyongchun reconstructinggenomewideproteinproteininteractionnetworksusingmultiplestrategieswithhomologousmapping
AT linchunyu reconstructinggenomewideproteinproteininteractionnetworksusingmultiplestrategieswithhomologousmapping
AT yangjinnmoon reconstructinggenomewideproteinproteininteractionnetworksusingmultiplestrategieswithhomologousmapping