Cargando…

Multimodal deep representation learning for protein interaction identification and protein family classification

BACKGROUND: Protein-protein interactions(PPIs) engage in dynamic pathological and biological procedures constantly in our life. Thus, it is crucial to comprehend the PPIs thoroughly such that we are able to illuminate the disease occurrence, achieve the optimal drug-target therapeutic effect and des...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhang, Da, Kabuka, Mansur
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2019
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6886253/ https://www.ncbi.nlm.nih.gov/pubmed/31787089 http://dx.doi.org/10.1186/s12859-019-3084-y

_version_	1783474845493755904
author	Zhang, Da Kabuka, Mansur
author_facet	Zhang, Da Kabuka, Mansur
author_sort	Zhang, Da
collection	PubMed
description	BACKGROUND: Protein-protein interactions(PPIs) engage in dynamic pathological and biological procedures constantly in our life. Thus, it is crucial to comprehend the PPIs thoroughly such that we are able to illuminate the disease occurrence, achieve the optimal drug-target therapeutic effect and describe the protein complex structures. However, compared to the protein sequences obtainable from various species and organisms, the number of revealed protein-protein interactions is relatively limited. To address this dilemma, lots of research endeavor have investigated in it to facilitate the discovery of novel PPIs. Among these methods, PPI prediction techniques that merely rely on protein sequence data are more widespread than other methods which require extensive biological domain knowledge. RESULTS: In this paper, we propose a multi-modal deep representation learning structure by incorporating protein physicochemical features with the graph topological features from the PPI networks. Specifically, our method not only bears in mind the protein sequence information but also discerns the topological representations for each protein node in the PPI networks. In our paper, we construct a stacked auto-encoder architecture together with a continuous bag-of-words (CBOW) model based on generated metapaths to study the PPI predictions. Following by that, we utilize the supervised deep neural networks to identify the PPIs and classify the protein families. The PPI prediction accuracy for eight species ranged from 96.76% to 99.77%, which signifies that our multi-modal deep representation learning framework achieves superior performance compared to other computational methods. CONCLUSION: To the best of our knowledge, this is the first multi-modal deep representation learning framework for examining the PPI networks.
format	Online Article Text
id	pubmed-6886253
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-68862532019-12-11 Multimodal deep representation learning for protein interaction identification and protein family classification Zhang, Da Kabuka, Mansur BMC Bioinformatics Research BACKGROUND: Protein-protein interactions(PPIs) engage in dynamic pathological and biological procedures constantly in our life. Thus, it is crucial to comprehend the PPIs thoroughly such that we are able to illuminate the disease occurrence, achieve the optimal drug-target therapeutic effect and describe the protein complex structures. However, compared to the protein sequences obtainable from various species and organisms, the number of revealed protein-protein interactions is relatively limited. To address this dilemma, lots of research endeavor have investigated in it to facilitate the discovery of novel PPIs. Among these methods, PPI prediction techniques that merely rely on protein sequence data are more widespread than other methods which require extensive biological domain knowledge. RESULTS: In this paper, we propose a multi-modal deep representation learning structure by incorporating protein physicochemical features with the graph topological features from the PPI networks. Specifically, our method not only bears in mind the protein sequence information but also discerns the topological representations for each protein node in the PPI networks. In our paper, we construct a stacked auto-encoder architecture together with a continuous bag-of-words (CBOW) model based on generated metapaths to study the PPI predictions. Following by that, we utilize the supervised deep neural networks to identify the PPIs and classify the protein families. The PPI prediction accuracy for eight species ranged from 96.76% to 99.77%, which signifies that our multi-modal deep representation learning framework achieves superior performance compared to other computational methods. CONCLUSION: To the best of our knowledge, this is the first multi-modal deep representation learning framework for examining the PPI networks. BioMed Central 2019-12-02 /pmc/articles/PMC6886253/ /pubmed/31787089 http://dx.doi.org/10.1186/s12859-019-3084-y Text en © Zhang and Kabuka. 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Zhang, Da Kabuka, Mansur Multimodal deep representation learning for protein interaction identification and protein family classification
title	Multimodal deep representation learning for protein interaction identification and protein family classification
title_full	Multimodal deep representation learning for protein interaction identification and protein family classification
title_fullStr	Multimodal deep representation learning for protein interaction identification and protein family classification
title_full_unstemmed	Multimodal deep representation learning for protein interaction identification and protein family classification
title_short	Multimodal deep representation learning for protein interaction identification and protein family classification
title_sort	multimodal deep representation learning for protein interaction identification and protein family classification
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6886253/ https://www.ncbi.nlm.nih.gov/pubmed/31787089 http://dx.doi.org/10.1186/s12859-019-3084-y
work_keys_str_mv	AT zhangda multimodaldeeprepresentationlearningforproteininteractionidentificationandproteinfamilyclassification AT kabukamansur multimodaldeeprepresentationlearningforproteininteractionidentificationandproteinfamilyclassification

Multimodal deep representation learning for protein interaction identification and protein family classification

Ejemplares similares