Cargando…
Graph embeddings on gene ontology annotations for protein–protein interaction prediction
BACKGROUND: Protein–protein interaction (PPI) prediction is an important task towards the understanding of many bioinformatics functions and applications, such as predicting protein functions, gene-disease associations and disease-drug associations. However, many previous PPI prediction researches d...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7739483/ https://www.ncbi.nlm.nih.gov/pubmed/33323115 http://dx.doi.org/10.1186/s12859-020-03816-8 |
_version_ | 1783623340106186752 |
---|---|
author | Zhong, Xiaoshi Rajapakse, Jagath C. |
author_facet | Zhong, Xiaoshi Rajapakse, Jagath C. |
author_sort | Zhong, Xiaoshi |
collection | PubMed |
description | BACKGROUND: Protein–protein interaction (PPI) prediction is an important task towards the understanding of many bioinformatics functions and applications, such as predicting protein functions, gene-disease associations and disease-drug associations. However, many previous PPI prediction researches do not consider missing and spurious interactions inherent in PPI networks. To address these two issues, we define two corresponding tasks, namely missing PPI prediction and spurious PPI prediction, and propose a method that employs graph embeddings that learn vector representations from constructed Gene Ontology Annotation (GOA) graphs and then use embedded vectors to achieve the two tasks. Our method leverages on information from both term–term relations among GO terms and term-protein annotations between GO terms and proteins, and preserves properties of both local and global structural information of the GO annotation graph. RESULTS: We compare our method with those methods that are based on information content (IC) and one method that is based on word embeddings, with experiments on three PPI datasets from STRING database. Experimental results demonstrate that our method is more effective than those compared methods. CONCLUSION: Our experimental results demonstrate the effectiveness of using graph embeddings to learn vector representations from undirected GOA graphs for our defined missing and spurious PPI tasks. |
format | Online Article Text |
id | pubmed-7739483 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-77394832020-12-17 Graph embeddings on gene ontology annotations for protein–protein interaction prediction Zhong, Xiaoshi Rajapakse, Jagath C. BMC Bioinformatics Research BACKGROUND: Protein–protein interaction (PPI) prediction is an important task towards the understanding of many bioinformatics functions and applications, such as predicting protein functions, gene-disease associations and disease-drug associations. However, many previous PPI prediction researches do not consider missing and spurious interactions inherent in PPI networks. To address these two issues, we define two corresponding tasks, namely missing PPI prediction and spurious PPI prediction, and propose a method that employs graph embeddings that learn vector representations from constructed Gene Ontology Annotation (GOA) graphs and then use embedded vectors to achieve the two tasks. Our method leverages on information from both term–term relations among GO terms and term-protein annotations between GO terms and proteins, and preserves properties of both local and global structural information of the GO annotation graph. RESULTS: We compare our method with those methods that are based on information content (IC) and one method that is based on word embeddings, with experiments on three PPI datasets from STRING database. Experimental results demonstrate that our method is more effective than those compared methods. CONCLUSION: Our experimental results demonstrate the effectiveness of using graph embeddings to learn vector representations from undirected GOA graphs for our defined missing and spurious PPI tasks. BioMed Central 2020-12-16 /pmc/articles/PMC7739483/ /pubmed/33323115 http://dx.doi.org/10.1186/s12859-020-03816-8 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Zhong, Xiaoshi Rajapakse, Jagath C. Graph embeddings on gene ontology annotations for protein–protein interaction prediction |
title | Graph embeddings on gene ontology annotations for protein–protein interaction prediction |
title_full | Graph embeddings on gene ontology annotations for protein–protein interaction prediction |
title_fullStr | Graph embeddings on gene ontology annotations for protein–protein interaction prediction |
title_full_unstemmed | Graph embeddings on gene ontology annotations for protein–protein interaction prediction |
title_short | Graph embeddings on gene ontology annotations for protein–protein interaction prediction |
title_sort | graph embeddings on gene ontology annotations for protein–protein interaction prediction |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7739483/ https://www.ncbi.nlm.nih.gov/pubmed/33323115 http://dx.doi.org/10.1186/s12859-020-03816-8 |
work_keys_str_mv | AT zhongxiaoshi graphembeddingsongeneontologyannotationsforproteinproteininteractionprediction AT rajapaksejagathc graphembeddingsongeneontologyannotationsforproteinproteininteractionprediction |