Cargando…

Graph embeddings on gene ontology annotations for protein–protein interaction prediction

BACKGROUND: Protein–protein interaction (PPI) prediction is an important task towards the understanding of many bioinformatics functions and applications, such as predicting protein functions, gene-disease associations and disease-drug associations. However, many previous PPI prediction researches d...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhong, Xiaoshi, Rajapakse, Jagath C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7739483/
https://www.ncbi.nlm.nih.gov/pubmed/33323115
http://dx.doi.org/10.1186/s12859-020-03816-8
_version_ 1783623340106186752
author Zhong, Xiaoshi
Rajapakse, Jagath C.
author_facet Zhong, Xiaoshi
Rajapakse, Jagath C.
author_sort Zhong, Xiaoshi
collection PubMed
description BACKGROUND: Protein–protein interaction (PPI) prediction is an important task towards the understanding of many bioinformatics functions and applications, such as predicting protein functions, gene-disease associations and disease-drug associations. However, many previous PPI prediction researches do not consider missing and spurious interactions inherent in PPI networks. To address these two issues, we define two corresponding tasks, namely missing PPI prediction and spurious PPI prediction, and propose a method that employs graph embeddings that learn vector representations from constructed Gene Ontology Annotation (GOA) graphs and then use embedded vectors to achieve the two tasks. Our method leverages on information from both term–term relations among GO terms and term-protein annotations between GO terms and proteins, and preserves properties of both local and global structural information of the GO annotation graph. RESULTS: We compare our method with those methods that are based on information content (IC) and one method that is based on word embeddings, with experiments on three PPI datasets from STRING database. Experimental results demonstrate that our method is more effective than those compared methods. CONCLUSION: Our experimental results demonstrate the effectiveness of using graph embeddings to learn vector representations from undirected GOA graphs for our defined missing and spurious PPI tasks.
format Online
Article
Text
id pubmed-7739483
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-77394832020-12-17 Graph embeddings on gene ontology annotations for protein–protein interaction prediction Zhong, Xiaoshi Rajapakse, Jagath C. BMC Bioinformatics Research BACKGROUND: Protein–protein interaction (PPI) prediction is an important task towards the understanding of many bioinformatics functions and applications, such as predicting protein functions, gene-disease associations and disease-drug associations. However, many previous PPI prediction researches do not consider missing and spurious interactions inherent in PPI networks. To address these two issues, we define two corresponding tasks, namely missing PPI prediction and spurious PPI prediction, and propose a method that employs graph embeddings that learn vector representations from constructed Gene Ontology Annotation (GOA) graphs and then use embedded vectors to achieve the two tasks. Our method leverages on information from both term–term relations among GO terms and term-protein annotations between GO terms and proteins, and preserves properties of both local and global structural information of the GO annotation graph. RESULTS: We compare our method with those methods that are based on information content (IC) and one method that is based on word embeddings, with experiments on three PPI datasets from STRING database. Experimental results demonstrate that our method is more effective than those compared methods. CONCLUSION: Our experimental results demonstrate the effectiveness of using graph embeddings to learn vector representations from undirected GOA graphs for our defined missing and spurious PPI tasks. BioMed Central 2020-12-16 /pmc/articles/PMC7739483/ /pubmed/33323115 http://dx.doi.org/10.1186/s12859-020-03816-8 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Zhong, Xiaoshi
Rajapakse, Jagath C.
Graph embeddings on gene ontology annotations for protein–protein interaction prediction
title Graph embeddings on gene ontology annotations for protein–protein interaction prediction
title_full Graph embeddings on gene ontology annotations for protein–protein interaction prediction
title_fullStr Graph embeddings on gene ontology annotations for protein–protein interaction prediction
title_full_unstemmed Graph embeddings on gene ontology annotations for protein–protein interaction prediction
title_short Graph embeddings on gene ontology annotations for protein–protein interaction prediction
title_sort graph embeddings on gene ontology annotations for protein–protein interaction prediction
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7739483/
https://www.ncbi.nlm.nih.gov/pubmed/33323115
http://dx.doi.org/10.1186/s12859-020-03816-8
work_keys_str_mv AT zhongxiaoshi graphembeddingsongeneontologyannotationsforproteinproteininteractionprediction
AT rajapaksejagathc graphembeddingsongeneontologyannotationsforproteinproteininteractionprediction