Cargando…

Multimodal deep representation learning for protein interaction identification and protein family classification

BACKGROUND: Protein-protein interactions(PPIs) engage in dynamic pathological and biological procedures constantly in our life. Thus, it is crucial to comprehend the PPIs thoroughly such that we are able to illuminate the disease occurrence, achieve the optimal drug-target therapeutic effect and des...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Da, Kabuka, Mansur
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6886253/
https://www.ncbi.nlm.nih.gov/pubmed/31787089
http://dx.doi.org/10.1186/s12859-019-3084-y
_version_ 1783474845493755904
author Zhang, Da
Kabuka, Mansur
author_facet Zhang, Da
Kabuka, Mansur
author_sort Zhang, Da
collection PubMed
description BACKGROUND: Protein-protein interactions(PPIs) engage in dynamic pathological and biological procedures constantly in our life. Thus, it is crucial to comprehend the PPIs thoroughly such that we are able to illuminate the disease occurrence, achieve the optimal drug-target therapeutic effect and describe the protein complex structures. However, compared to the protein sequences obtainable from various species and organisms, the number of revealed protein-protein interactions is relatively limited. To address this dilemma, lots of research endeavor have investigated in it to facilitate the discovery of novel PPIs. Among these methods, PPI prediction techniques that merely rely on protein sequence data are more widespread than other methods which require extensive biological domain knowledge. RESULTS: In this paper, we propose a multi-modal deep representation learning structure by incorporating protein physicochemical features with the graph topological features from the PPI networks. Specifically, our method not only bears in mind the protein sequence information but also discerns the topological representations for each protein node in the PPI networks. In our paper, we construct a stacked auto-encoder architecture together with a continuous bag-of-words (CBOW) model based on generated metapaths to study the PPI predictions. Following by that, we utilize the supervised deep neural networks to identify the PPIs and classify the protein families. The PPI prediction accuracy for eight species ranged from 96.76% to 99.77%, which signifies that our multi-modal deep representation learning framework achieves superior performance compared to other computational methods. CONCLUSION: To the best of our knowledge, this is the first multi-modal deep representation learning framework for examining the PPI networks.
format Online
Article
Text
id pubmed-6886253
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-68862532019-12-11 Multimodal deep representation learning for protein interaction identification and protein family classification Zhang, Da Kabuka, Mansur BMC Bioinformatics Research BACKGROUND: Protein-protein interactions(PPIs) engage in dynamic pathological and biological procedures constantly in our life. Thus, it is crucial to comprehend the PPIs thoroughly such that we are able to illuminate the disease occurrence, achieve the optimal drug-target therapeutic effect and describe the protein complex structures. However, compared to the protein sequences obtainable from various species and organisms, the number of revealed protein-protein interactions is relatively limited. To address this dilemma, lots of research endeavor have investigated in it to facilitate the discovery of novel PPIs. Among these methods, PPI prediction techniques that merely rely on protein sequence data are more widespread than other methods which require extensive biological domain knowledge. RESULTS: In this paper, we propose a multi-modal deep representation learning structure by incorporating protein physicochemical features with the graph topological features from the PPI networks. Specifically, our method not only bears in mind the protein sequence information but also discerns the topological representations for each protein node in the PPI networks. In our paper, we construct a stacked auto-encoder architecture together with a continuous bag-of-words (CBOW) model based on generated metapaths to study the PPI predictions. Following by that, we utilize the supervised deep neural networks to identify the PPIs and classify the protein families. The PPI prediction accuracy for eight species ranged from 96.76% to 99.77%, which signifies that our multi-modal deep representation learning framework achieves superior performance compared to other computational methods. CONCLUSION: To the best of our knowledge, this is the first multi-modal deep representation learning framework for examining the PPI networks. BioMed Central 2019-12-02 /pmc/articles/PMC6886253/ /pubmed/31787089 http://dx.doi.org/10.1186/s12859-019-3084-y Text en © Zhang and Kabuka. 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Zhang, Da
Kabuka, Mansur
Multimodal deep representation learning for protein interaction identification and protein family classification
title Multimodal deep representation learning for protein interaction identification and protein family classification
title_full Multimodal deep representation learning for protein interaction identification and protein family classification
title_fullStr Multimodal deep representation learning for protein interaction identification and protein family classification
title_full_unstemmed Multimodal deep representation learning for protein interaction identification and protein family classification
title_short Multimodal deep representation learning for protein interaction identification and protein family classification
title_sort multimodal deep representation learning for protein interaction identification and protein family classification
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6886253/
https://www.ncbi.nlm.nih.gov/pubmed/31787089
http://dx.doi.org/10.1186/s12859-019-3084-y
work_keys_str_mv AT zhangda multimodaldeeprepresentationlearningforproteininteractionidentificationandproteinfamilyclassification
AT kabukamansur multimodaldeeprepresentationlearningforproteininteractionidentificationandproteinfamilyclassification