Cargando…

Conjoint Feature Representation of GO and Protein Sequence for PPI Prediction Based on an Inception RNN Attention Network

Protein-protein interactions (PPIs) are pivotal for cellular functions and biological processes. In the past years, computational methods using amino acid sequences and gene ontology (GO) annotations of proteins for prioritizing PPIs have provided important references for biological experiments in t...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Lingling, Wang, Junjie, Hu, Yang, Cheng, Liang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society of Gene & Cell Therapy 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7515979/
https://www.ncbi.nlm.nih.gov/pubmed/33230427
http://dx.doi.org/10.1016/j.omtn.2020.08.025
_version_ 1783586915693363200
author Zhao, Lingling
Wang, Junjie
Hu, Yang
Cheng, Liang
author_facet Zhao, Lingling
Wang, Junjie
Hu, Yang
Cheng, Liang
author_sort Zhao, Lingling
collection PubMed
description Protein-protein interactions (PPIs) are pivotal for cellular functions and biological processes. In the past years, computational methods using amino acid sequences and gene ontology (GO) annotations of proteins for prioritizing PPIs have provided important references for biological experiments in the wet lab. Despite the current success, sequence information and ontological annotation in semantic representation have not been integrated into current methods. We propose a deep-learning-based PPI prediction methodology conjointly featuring sequence information and GO annotation. First, we adopt a word-embedding tool, the NCBI-blueBERT model pre-trained on PubMed, to map the GO terms into their semantic vectors. Then, the GO semantic vectors and protein sequence vector serve as the input of the proposed inception recurrent neural network (RNN) attention network (IRAN). The IRAN captures the spatial relationship and the potential sequential feature of the protein sequence and ontological annotation semantics. The extensive experimental results on 12 benchmarks demonstrate that our method achieves superiority over state-of-the-art baselines. In the yeast dataset of a binary PPI prediction, our method improved the performance with the Matthews correlation coefficient increasing from 94.2% to 98.2% and the accuracy from 97.1% to 98.2%. The analogous results were also obtained in other comparison evaluations.
format Online
Article
Text
id pubmed-7515979
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher American Society of Gene & Cell Therapy
record_format MEDLINE/PubMed
spelling pubmed-75159792020-09-30 Conjoint Feature Representation of GO and Protein Sequence for PPI Prediction Based on an Inception RNN Attention Network Zhao, Lingling Wang, Junjie Hu, Yang Cheng, Liang Mol Ther Nucleic Acids Original Article Protein-protein interactions (PPIs) are pivotal for cellular functions and biological processes. In the past years, computational methods using amino acid sequences and gene ontology (GO) annotations of proteins for prioritizing PPIs have provided important references for biological experiments in the wet lab. Despite the current success, sequence information and ontological annotation in semantic representation have not been integrated into current methods. We propose a deep-learning-based PPI prediction methodology conjointly featuring sequence information and GO annotation. First, we adopt a word-embedding tool, the NCBI-blueBERT model pre-trained on PubMed, to map the GO terms into their semantic vectors. Then, the GO semantic vectors and protein sequence vector serve as the input of the proposed inception recurrent neural network (RNN) attention network (IRAN). The IRAN captures the spatial relationship and the potential sequential feature of the protein sequence and ontological annotation semantics. The extensive experimental results on 12 benchmarks demonstrate that our method achieves superiority over state-of-the-art baselines. In the yeast dataset of a binary PPI prediction, our method improved the performance with the Matthews correlation coefficient increasing from 94.2% to 98.2% and the accuracy from 97.1% to 98.2%. The analogous results were also obtained in other comparison evaluations. American Society of Gene & Cell Therapy 2020-08-25 /pmc/articles/PMC7515979/ /pubmed/33230427 http://dx.doi.org/10.1016/j.omtn.2020.08.025 Text en © 2020 The Authors http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Original Article
Zhao, Lingling
Wang, Junjie
Hu, Yang
Cheng, Liang
Conjoint Feature Representation of GO and Protein Sequence for PPI Prediction Based on an Inception RNN Attention Network
title Conjoint Feature Representation of GO and Protein Sequence for PPI Prediction Based on an Inception RNN Attention Network
title_full Conjoint Feature Representation of GO and Protein Sequence for PPI Prediction Based on an Inception RNN Attention Network
title_fullStr Conjoint Feature Representation of GO and Protein Sequence for PPI Prediction Based on an Inception RNN Attention Network
title_full_unstemmed Conjoint Feature Representation of GO and Protein Sequence for PPI Prediction Based on an Inception RNN Attention Network
title_short Conjoint Feature Representation of GO and Protein Sequence for PPI Prediction Based on an Inception RNN Attention Network
title_sort conjoint feature representation of go and protein sequence for ppi prediction based on an inception rnn attention network
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7515979/
https://www.ncbi.nlm.nih.gov/pubmed/33230427
http://dx.doi.org/10.1016/j.omtn.2020.08.025
work_keys_str_mv AT zhaolingling conjointfeaturerepresentationofgoandproteinsequenceforppipredictionbasedonaninceptionrnnattentionnetwork
AT wangjunjie conjointfeaturerepresentationofgoandproteinsequenceforppipredictionbasedonaninceptionrnnattentionnetwork
AT huyang conjointfeaturerepresentationofgoandproteinsequenceforppipredictionbasedonaninceptionrnnattentionnetwork
AT chengliang conjointfeaturerepresentationofgoandproteinsequenceforppipredictionbasedonaninceptionrnnattentionnetwork