Cargando…
Multimodal deep representation learning for protein interaction identification and protein family classification
BACKGROUND: Protein-protein interactions(PPIs) engage in dynamic pathological and biological procedures constantly in our life. Thus, it is crucial to comprehend the PPIs thoroughly such that we are able to illuminate the disease occurrence, achieve the optimal drug-target therapeutic effect and des...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6886253/ https://www.ncbi.nlm.nih.gov/pubmed/31787089 http://dx.doi.org/10.1186/s12859-019-3084-y |
_version_ | 1783474845493755904 |
---|---|
author | Zhang, Da Kabuka, Mansur |
author_facet | Zhang, Da Kabuka, Mansur |
author_sort | Zhang, Da |
collection | PubMed |
description | BACKGROUND: Protein-protein interactions(PPIs) engage in dynamic pathological and biological procedures constantly in our life. Thus, it is crucial to comprehend the PPIs thoroughly such that we are able to illuminate the disease occurrence, achieve the optimal drug-target therapeutic effect and describe the protein complex structures. However, compared to the protein sequences obtainable from various species and organisms, the number of revealed protein-protein interactions is relatively limited. To address this dilemma, lots of research endeavor have investigated in it to facilitate the discovery of novel PPIs. Among these methods, PPI prediction techniques that merely rely on protein sequence data are more widespread than other methods which require extensive biological domain knowledge. RESULTS: In this paper, we propose a multi-modal deep representation learning structure by incorporating protein physicochemical features with the graph topological features from the PPI networks. Specifically, our method not only bears in mind the protein sequence information but also discerns the topological representations for each protein node in the PPI networks. In our paper, we construct a stacked auto-encoder architecture together with a continuous bag-of-words (CBOW) model based on generated metapaths to study the PPI predictions. Following by that, we utilize the supervised deep neural networks to identify the PPIs and classify the protein families. The PPI prediction accuracy for eight species ranged from 96.76% to 99.77%, which signifies that our multi-modal deep representation learning framework achieves superior performance compared to other computational methods. CONCLUSION: To the best of our knowledge, this is the first multi-modal deep representation learning framework for examining the PPI networks. |
format | Online Article Text |
id | pubmed-6886253 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-68862532019-12-11 Multimodal deep representation learning for protein interaction identification and protein family classification Zhang, Da Kabuka, Mansur BMC Bioinformatics Research BACKGROUND: Protein-protein interactions(PPIs) engage in dynamic pathological and biological procedures constantly in our life. Thus, it is crucial to comprehend the PPIs thoroughly such that we are able to illuminate the disease occurrence, achieve the optimal drug-target therapeutic effect and describe the protein complex structures. However, compared to the protein sequences obtainable from various species and organisms, the number of revealed protein-protein interactions is relatively limited. To address this dilemma, lots of research endeavor have investigated in it to facilitate the discovery of novel PPIs. Among these methods, PPI prediction techniques that merely rely on protein sequence data are more widespread than other methods which require extensive biological domain knowledge. RESULTS: In this paper, we propose a multi-modal deep representation learning structure by incorporating protein physicochemical features with the graph topological features from the PPI networks. Specifically, our method not only bears in mind the protein sequence information but also discerns the topological representations for each protein node in the PPI networks. In our paper, we construct a stacked auto-encoder architecture together with a continuous bag-of-words (CBOW) model based on generated metapaths to study the PPI predictions. Following by that, we utilize the supervised deep neural networks to identify the PPIs and classify the protein families. The PPI prediction accuracy for eight species ranged from 96.76% to 99.77%, which signifies that our multi-modal deep representation learning framework achieves superior performance compared to other computational methods. CONCLUSION: To the best of our knowledge, this is the first multi-modal deep representation learning framework for examining the PPI networks. BioMed Central 2019-12-02 /pmc/articles/PMC6886253/ /pubmed/31787089 http://dx.doi.org/10.1186/s12859-019-3084-y Text en © Zhang and Kabuka. 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Zhang, Da Kabuka, Mansur Multimodal deep representation learning for protein interaction identification and protein family classification |
title | Multimodal deep representation learning for protein interaction identification and protein family classification |
title_full | Multimodal deep representation learning for protein interaction identification and protein family classification |
title_fullStr | Multimodal deep representation learning for protein interaction identification and protein family classification |
title_full_unstemmed | Multimodal deep representation learning for protein interaction identification and protein family classification |
title_short | Multimodal deep representation learning for protein interaction identification and protein family classification |
title_sort | multimodal deep representation learning for protein interaction identification and protein family classification |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6886253/ https://www.ncbi.nlm.nih.gov/pubmed/31787089 http://dx.doi.org/10.1186/s12859-019-3084-y |
work_keys_str_mv | AT zhangda multimodaldeeprepresentationlearningforproteininteractionidentificationandproteinfamilyclassification AT kabukamansur multimodaldeeprepresentationlearningforproteininteractionidentificationandproteinfamilyclassification |