Cargando…

A NMF based approach for integrating multiple data sources to predict HIV-1–human PPIs

BACKGROUND: Predicting novel interactions between HIV-1 and human proteins contributes most promising area in HIV research. Prediction is generally guided by some classification and inference based methods using single biological source of information. RESULTS: In this article we have proposed a nov...

Descripción completa

Detalles Bibliográficos
Autores principales: Ray, Sumanta, Bandyopadhyay, Sanghamitra
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4784399/
https://www.ncbi.nlm.nih.gov/pubmed/26956556
http://dx.doi.org/10.1186/s12859-016-0952-6
_version_ 1782420261326290944
author Ray, Sumanta
Bandyopadhyay, Sanghamitra
author_facet Ray, Sumanta
Bandyopadhyay, Sanghamitra
author_sort Ray, Sumanta
collection PubMed
description BACKGROUND: Predicting novel interactions between HIV-1 and human proteins contributes most promising area in HIV research. Prediction is generally guided by some classification and inference based methods using single biological source of information. RESULTS: In this article we have proposed a novel framework to predict protein-protein interactions (PPIs) between HIV-1 and human proteins by integrating multiple biological sources of information through non negative matrix factorization (NMF). For this purpose, the multiple data sets are converted to biological networks, which are then utilized to predict modules. These modules are subsequently combined into meta-modules by using NMF based clustering method. The integrated meta-modules are used to predict novel interactions between HIV-1 and human proteins. We have analyzed the significant GO terms and KEGG pathways in which the human proteins of the meta-modules participate. Moreover, the topological properties of human proteins involved in the meta modules are investigated. We have also performed statistical significance test to evaluate the predictions. CONCLUSIONS: Here, we propose a novel approach based on integration of different biological data sources, for predicting PPIs between HIV-1 and human proteins. Here, the integration is achieved through non negative matrix factorization (NMF) technique. Most of the predicted interactions are found to be well supported by the existing literature in PUBMED. Moreover, human proteins in the predicted set emerge as ‘hubs’ and ‘bottlenecks’ in the analysis. Low p-value in the significance test also suggests that the predictions are statistically significant.
format Online
Article
Text
id pubmed-4784399
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-47843992016-03-10 A NMF based approach for integrating multiple data sources to predict HIV-1–human PPIs Ray, Sumanta Bandyopadhyay, Sanghamitra BMC Bioinformatics Methodology Article BACKGROUND: Predicting novel interactions between HIV-1 and human proteins contributes most promising area in HIV research. Prediction is generally guided by some classification and inference based methods using single biological source of information. RESULTS: In this article we have proposed a novel framework to predict protein-protein interactions (PPIs) between HIV-1 and human proteins by integrating multiple biological sources of information through non negative matrix factorization (NMF). For this purpose, the multiple data sets are converted to biological networks, which are then utilized to predict modules. These modules are subsequently combined into meta-modules by using NMF based clustering method. The integrated meta-modules are used to predict novel interactions between HIV-1 and human proteins. We have analyzed the significant GO terms and KEGG pathways in which the human proteins of the meta-modules participate. Moreover, the topological properties of human proteins involved in the meta modules are investigated. We have also performed statistical significance test to evaluate the predictions. CONCLUSIONS: Here, we propose a novel approach based on integration of different biological data sources, for predicting PPIs between HIV-1 and human proteins. Here, the integration is achieved through non negative matrix factorization (NMF) technique. Most of the predicted interactions are found to be well supported by the existing literature in PUBMED. Moreover, human proteins in the predicted set emerge as ‘hubs’ and ‘bottlenecks’ in the analysis. Low p-value in the significance test also suggests that the predictions are statistically significant. BioMed Central 2016-03-08 /pmc/articles/PMC4784399/ /pubmed/26956556 http://dx.doi.org/10.1186/s12859-016-0952-6 Text en © Ray and Bandyopadhyay. 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Ray, Sumanta
Bandyopadhyay, Sanghamitra
A NMF based approach for integrating multiple data sources to predict HIV-1–human PPIs
title A NMF based approach for integrating multiple data sources to predict HIV-1–human PPIs
title_full A NMF based approach for integrating multiple data sources to predict HIV-1–human PPIs
title_fullStr A NMF based approach for integrating multiple data sources to predict HIV-1–human PPIs
title_full_unstemmed A NMF based approach for integrating multiple data sources to predict HIV-1–human PPIs
title_short A NMF based approach for integrating multiple data sources to predict HIV-1–human PPIs
title_sort nmf based approach for integrating multiple data sources to predict hiv-1–human ppis
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4784399/
https://www.ncbi.nlm.nih.gov/pubmed/26956556
http://dx.doi.org/10.1186/s12859-016-0952-6
work_keys_str_mv AT raysumanta anmfbasedapproachforintegratingmultipledatasourcestopredicthiv1humanppis
AT bandyopadhyaysanghamitra anmfbasedapproachforintegratingmultipledatasourcestopredicthiv1humanppis
AT raysumanta nmfbasedapproachforintegratingmultipledatasourcestopredicthiv1humanppis
AT bandyopadhyaysanghamitra nmfbasedapproachforintegratingmultipledatasourcestopredicthiv1humanppis