Cargando…

InfersentPPI: Prediction of Protein-Protein Interaction Using Protein Sentence Embedding With Gene Ontology Information

Protein-protein interaction (PPI) prediction is meaningful work for deciphering cellular behaviors. Although many kinds of data and machine learning algorithms have been used in PPI prediction, the performance still needs to be improved. In this paper, we propose InferSentPPI, a sentence embedding b...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Meijing, Jiang, Yingying, Ryu, Keun Ho
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8995897/
https://www.ncbi.nlm.nih.gov/pubmed/35419026
http://dx.doi.org/10.3389/fgene.2022.827540
_version_ 1784684382694408192
author Li, Meijing
Jiang, Yingying
Ryu, Keun Ho
author_facet Li, Meijing
Jiang, Yingying
Ryu, Keun Ho
author_sort Li, Meijing
collection PubMed
description Protein-protein interaction (PPI) prediction is meaningful work for deciphering cellular behaviors. Although many kinds of data and machine learning algorithms have been used in PPI prediction, the performance still needs to be improved. In this paper, we propose InferSentPPI, a sentence embedding based text mining method with gene ontology (GO) information for PPI prediction. First, we design a novel weighting GO term-based protein sentence representation method to generate protein sentences including multi-semantic information in the preprocessing. Gene ontology annotation (GOA) provides the reliability of relationships between proteins and GO terms for PPI prediction. Thus, GO term-based protein sentence can help to improve the prediction performance. Then we also propose an InferSent_PN algorithm based on the protein sentences and InferSent algorithm to extract relations between proteins. In the experiments, we evaluate the effectiveness of InferSentPPI with several benchmarking datasets. The result shows our proposed method has performed better than the state-of-the-art methods for a large PPI dataset.
format Online
Article
Text
id pubmed-8995897
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-89958972022-04-12 InfersentPPI: Prediction of Protein-Protein Interaction Using Protein Sentence Embedding With Gene Ontology Information Li, Meijing Jiang, Yingying Ryu, Keun Ho Front Genet Genetics Protein-protein interaction (PPI) prediction is meaningful work for deciphering cellular behaviors. Although many kinds of data and machine learning algorithms have been used in PPI prediction, the performance still needs to be improved. In this paper, we propose InferSentPPI, a sentence embedding based text mining method with gene ontology (GO) information for PPI prediction. First, we design a novel weighting GO term-based protein sentence representation method to generate protein sentences including multi-semantic information in the preprocessing. Gene ontology annotation (GOA) provides the reliability of relationships between proteins and GO terms for PPI prediction. Thus, GO term-based protein sentence can help to improve the prediction performance. Then we also propose an InferSent_PN algorithm based on the protein sentences and InferSent algorithm to extract relations between proteins. In the experiments, we evaluate the effectiveness of InferSentPPI with several benchmarking datasets. The result shows our proposed method has performed better than the state-of-the-art methods for a large PPI dataset. Frontiers Media S.A. 2022-03-28 /pmc/articles/PMC8995897/ /pubmed/35419026 http://dx.doi.org/10.3389/fgene.2022.827540 Text en Copyright © 2022 Li, Jiang and Ryu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Li, Meijing
Jiang, Yingying
Ryu, Keun Ho
InfersentPPI: Prediction of Protein-Protein Interaction Using Protein Sentence Embedding With Gene Ontology Information
title InfersentPPI: Prediction of Protein-Protein Interaction Using Protein Sentence Embedding With Gene Ontology Information
title_full InfersentPPI: Prediction of Protein-Protein Interaction Using Protein Sentence Embedding With Gene Ontology Information
title_fullStr InfersentPPI: Prediction of Protein-Protein Interaction Using Protein Sentence Embedding With Gene Ontology Information
title_full_unstemmed InfersentPPI: Prediction of Protein-Protein Interaction Using Protein Sentence Embedding With Gene Ontology Information
title_short InfersentPPI: Prediction of Protein-Protein Interaction Using Protein Sentence Embedding With Gene Ontology Information
title_sort infersentppi: prediction of protein-protein interaction using protein sentence embedding with gene ontology information
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8995897/
https://www.ncbi.nlm.nih.gov/pubmed/35419026
http://dx.doi.org/10.3389/fgene.2022.827540
work_keys_str_mv AT limeijing infersentppipredictionofproteinproteininteractionusingproteinsentenceembeddingwithgeneontologyinformation
AT jiangyingying infersentppipredictionofproteinproteininteractionusingproteinsentenceembeddingwithgeneontologyinformation
AT ryukeunho infersentppipredictionofproteinproteininteractionusingproteinsentenceembeddingwithgeneontologyinformation