Predicting biomedical relationships using the knowledge and graph embedding cascade model

Advances in machine learning and deep learning methods, together with the increasing availability of large-scale pharmacological, genomic, and chemical datasets, have created opportunities for identifying potentially useful relationships within biochemical networks. Knowledge embedding models have b...

Bibliographic Details
Main authors: Liang, Xiaomin; Li, Daifeng; Song, Min; Madden, Andrew; Ding, Ying; Bu, Yi
Format: Online Article Text
Language: English
Published: Public Library of Science 2019
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6565371/
https://www.ncbi.nlm.nih.gov/pubmed/31194807
http://dx.doi.org/10.1371/journal.pone.0218264
author Liang, Xiaomin
Li, Daifeng
Song, Min
Madden, Andrew
Ding, Ying
Bu, Yi
collection PubMed
description Advances in machine learning and deep learning methods, together with the increasing availability of large-scale pharmacological, genomic, and chemical datasets, have created opportunities for identifying potentially useful relationships within biochemical networks. Knowledge embedding models have been found to have value in detecting knowledge-based correlations among entities, but little effort has been made to apply them to networks of biochemical entities. This is because such networks tend to be unbalanced and sparse, and knowledge embedding models do not work well on them. However, to some extent, the shortcomings of knowledge embedding models can be compensated for if they are used in association with graph embedding. In this paper, we combine knowledge embedding and graph embedding to represent biochemical entities and their relations as dense, low-dimensional vectors. We build a cascade learning framework that incorporates semantic features from the knowledge embedding model and graph features from the graph embedding model to score the probability of a link. The proposed method performs noticeably better than the models with which it is compared. It predicted links and entities with an accuracy of 93%, and its hits@10 score shows an average absolute improvement of 8.6% over the original knowledge embedding model and of 1.1% to 9.7% over other knowledge and graph embedding algorithms. In addition, we designed a meta-path algorithm to detect path relations in the biomedical network. Case studies further verify the value of the proposed model in finding potential relationships between diseases, drugs, genes, treatments, etc. Among the findings of the proposed model is the suggestion that VDR (vitamin D receptor) may be linked to prostate cancer. This is backed by evidence from medical databases and published research, supporting the suggestion that our proposed model could be of value to biomedical researchers.
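The hits@10 figure quoted in the description is conventionally the fraction of test queries for which the correct entity appears among the ten highest-scoring candidates. The following is a minimal sketch of that calculation, not the authors' evaluation code: the function names `rank_of_true` and `hits_at_k` and the example ranks are hypothetical, and ranks are assumed to be 1-based with higher scores meaning better candidates.

```python
from typing import Sequence


def rank_of_true(scores: Sequence[float], true_index: int) -> int:
    """Return the 1-based rank of the true candidate (higher score = better)."""
    true_score = scores[true_index]
    # Every candidate scoring strictly higher pushes the true one down a place.
    return 1 + sum(1 for s in scores if s > true_score)


def hits_at_k(ranks: Sequence[int], k: int = 10) -> float:
    """Return the fraction of queries whose true entity is ranked within the top k."""
    if not ranks:
        raise ValueError("ranks must be non-empty")
    return sum(1 for r in ranks if r <= k) / len(ranks)


# Illustrative (made-up) ranks of the correct entity for five test queries.
example_ranks = [1, 3, 12, 7, 50]
print(hits_at_k(example_ranks, k=10))
```

Under this definition, an "absolute improvement" is the difference between two such fractions expressed in percentage points, rather than a relative change.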
format Online
Article
Text
id pubmed-6565371
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-6565371 2019-06-20 PLoS One Research Article Public Library of Science 2019-06-13 /pmc/articles/PMC6565371/ /pubmed/31194807 http://dx.doi.org/10.1371/journal.pone.0218264 Text en © 2019 Liang et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
title Predicting biomedical relationships using the knowledge and graph embedding cascade model
topic Research Article