Cargando…

A Method to Learn Embedding of a Probabilistic Medical Knowledge Graph: Algorithm Development

BACKGROUND: Knowledge graph embedding is an effective semantic representation method for entities and relations in knowledge graphs. Several translation-based algorithms, including TransE, TransH, TransR, TransD, and TranSparse, have been proposed to learn effective embedding vectors from typical kn...

Descripción completa

Detalles Bibliográficos
Autores principales:	Li, Linfeng, Wang, Peng, Wang, Yao, Wang, Shenghui, Yan, Jun, Jiang, Jinpeng, Tang, Buzhou, Wang, Chengliang, Liu, Yuting
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	JMIR Publications 2020
Materias:	Original Paper
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7273238/ https://www.ncbi.nlm.nih.gov/pubmed/32436854 http://dx.doi.org/10.2196/17645

_version_	1783542361271304192
author	Li, Linfeng Wang, Peng Wang, Yao Wang, Shenghui Yan, Jun Jiang, Jinpeng Tang, Buzhou Wang, Chengliang Liu, Yuting
author_facet	Li, Linfeng Wang, Peng Wang, Yao Wang, Shenghui Yan, Jun Jiang, Jinpeng Tang, Buzhou Wang, Chengliang Liu, Yuting
author_sort	Li, Linfeng
collection	PubMed
description	BACKGROUND: Knowledge graph embedding is an effective semantic representation method for entities and relations in knowledge graphs. Several translation-based algorithms, including TransE, TransH, TransR, TransD, and TranSparse, have been proposed to learn effective embedding vectors from typical knowledge graphs in which the relations between head and tail entities are deterministic. However, in medical knowledge graphs, the relations between head and tail entities are inherently probabilistic. This difference introduces a challenge in embedding medical knowledge graphs. OBJECTIVE: We aimed to address the challenge of how to learn the probability values of triplets into representation vectors by making enhancements to existing TransX (where X is E, H, R, D, or Sparse) algorithms, including the following: (1) constructing a mapping function between the score value and the probability, and (2) introducing probability-based loss of triplets into the original margin-based loss function. METHODS: We performed the proposed PrTransX algorithm on a medical knowledge graph that we built from large-scale real-world electronic medical records data. We evaluated the embeddings using link prediction task. RESULTS: Compared with the corresponding TransX algorithms, the proposed PrTransX performed better than the TransX model in all evaluation indicators, achieving a higher proportion of corrected entities ranked in the top 10 and normalized discounted cumulative gain of the top 10 predicted tail entities, and lower mean rank. CONCLUSIONS: The proposed PrTransX successfully incorporated the uncertainty of the knowledge triplets into the embedding vectors.
format	Online Article Text
id	pubmed-7273238
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	JMIR Publications
record_format	MEDLINE/PubMed
spelling	pubmed-72732382020-06-05 A Method to Learn Embedding of a Probabilistic Medical Knowledge Graph: Algorithm Development Li, Linfeng Wang, Peng Wang, Yao Wang, Shenghui Yan, Jun Jiang, Jinpeng Tang, Buzhou Wang, Chengliang Liu, Yuting JMIR Med Inform Original Paper BACKGROUND: Knowledge graph embedding is an effective semantic representation method for entities and relations in knowledge graphs. Several translation-based algorithms, including TransE, TransH, TransR, TransD, and TranSparse, have been proposed to learn effective embedding vectors from typical knowledge graphs in which the relations between head and tail entities are deterministic. However, in medical knowledge graphs, the relations between head and tail entities are inherently probabilistic. This difference introduces a challenge in embedding medical knowledge graphs. OBJECTIVE: We aimed to address the challenge of how to learn the probability values of triplets into representation vectors by making enhancements to existing TransX (where X is E, H, R, D, or Sparse) algorithms, including the following: (1) constructing a mapping function between the score value and the probability, and (2) introducing probability-based loss of triplets into the original margin-based loss function. METHODS: We performed the proposed PrTransX algorithm on a medical knowledge graph that we built from large-scale real-world electronic medical records data. We evaluated the embeddings using link prediction task. RESULTS: Compared with the corresponding TransX algorithms, the proposed PrTransX performed better than the TransX model in all evaluation indicators, achieving a higher proportion of corrected entities ranked in the top 10 and normalized discounted cumulative gain of the top 10 predicted tail entities, and lower mean rank. CONCLUSIONS: The proposed PrTransX successfully incorporated the uncertainty of the knowledge triplets into the embedding vectors. JMIR Publications 2020-05-21 /pmc/articles/PMC7273238/ /pubmed/32436854 http://dx.doi.org/10.2196/17645 Text en ©Linfeng Li, Peng Wang, Yao Wang, Shenghui Wang, Jun Yan, Jinpeng Jiang, Buzhou Tang, Chengliang Wang, Yuting Liu. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 21.05.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle	Original Paper Li, Linfeng Wang, Peng Wang, Yao Wang, Shenghui Yan, Jun Jiang, Jinpeng Tang, Buzhou Wang, Chengliang Liu, Yuting A Method to Learn Embedding of a Probabilistic Medical Knowledge Graph: Algorithm Development
title	A Method to Learn Embedding of a Probabilistic Medical Knowledge Graph: Algorithm Development
title_full	A Method to Learn Embedding of a Probabilistic Medical Knowledge Graph: Algorithm Development
title_fullStr	A Method to Learn Embedding of a Probabilistic Medical Knowledge Graph: Algorithm Development
title_full_unstemmed	A Method to Learn Embedding of a Probabilistic Medical Knowledge Graph: Algorithm Development
title_short	A Method to Learn Embedding of a Probabilistic Medical Knowledge Graph: Algorithm Development
title_sort	method to learn embedding of a probabilistic medical knowledge graph: algorithm development
topic	Original Paper
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7273238/ https://www.ncbi.nlm.nih.gov/pubmed/32436854 http://dx.doi.org/10.2196/17645
work_keys_str_mv	AT lilinfeng amethodtolearnembeddingofaprobabilisticmedicalknowledgegraphalgorithmdevelopment AT wangpeng amethodtolearnembeddingofaprobabilisticmedicalknowledgegraphalgorithmdevelopment AT wangyao amethodtolearnembeddingofaprobabilisticmedicalknowledgegraphalgorithmdevelopment AT wangshenghui amethodtolearnembeddingofaprobabilisticmedicalknowledgegraphalgorithmdevelopment AT yanjun amethodtolearnembeddingofaprobabilisticmedicalknowledgegraphalgorithmdevelopment AT jiangjinpeng amethodtolearnembeddingofaprobabilisticmedicalknowledgegraphalgorithmdevelopment AT tangbuzhou amethodtolearnembeddingofaprobabilisticmedicalknowledgegraphalgorithmdevelopment AT wangchengliang amethodtolearnembeddingofaprobabilisticmedicalknowledgegraphalgorithmdevelopment AT liuyuting amethodtolearnembeddingofaprobabilisticmedicalknowledgegraphalgorithmdevelopment AT lilinfeng methodtolearnembeddingofaprobabilisticmedicalknowledgegraphalgorithmdevelopment AT wangpeng methodtolearnembeddingofaprobabilisticmedicalknowledgegraphalgorithmdevelopment AT wangyao methodtolearnembeddingofaprobabilisticmedicalknowledgegraphalgorithmdevelopment AT wangshenghui methodtolearnembeddingofaprobabilisticmedicalknowledgegraphalgorithmdevelopment AT yanjun methodtolearnembeddingofaprobabilisticmedicalknowledgegraphalgorithmdevelopment AT jiangjinpeng methodtolearnembeddingofaprobabilisticmedicalknowledgegraphalgorithmdevelopment AT tangbuzhou methodtolearnembeddingofaprobabilisticmedicalknowledgegraphalgorithmdevelopment AT wangchengliang methodtolearnembeddingofaprobabilisticmedicalknowledgegraphalgorithmdevelopment AT liuyuting methodtolearnembeddingofaprobabilisticmedicalknowledgegraphalgorithmdevelopment

A Method to Learn Embedding of a Probabilistic Medical Knowledge Graph: Algorithm Development

Ejemplares similares