Cargando…

Path-based knowledge reasoning with textual semantic information for medical knowledge graph completion

BACKGROUND: Knowledge graphs (KGs), especially medical knowledge graphs, are often significantly incomplete, so it necessitating a demand for medical knowledge graph completion (MedKGC). MedKGC can find new facts based on the existed knowledge in the KGs. The path-based knowledge reasoning algorithm...

Descripción completa

Detalles Bibliográficos
Autores principales: Lan, Yinyu, He, Shizhu, Liu, Kang, Zeng, Xiangrong, Liu, Shengping, Zhao, Jun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8628388/
https://www.ncbi.nlm.nih.gov/pubmed/34844576
http://dx.doi.org/10.1186/s12911-021-01622-7
_version_ 1784606996581842944
author Lan, Yinyu
He, Shizhu
Liu, Kang
Zeng, Xiangrong
Liu, Shengping
Zhao, Jun
author_facet Lan, Yinyu
He, Shizhu
Liu, Kang
Zeng, Xiangrong
Liu, Shengping
Zhao, Jun
author_sort Lan, Yinyu
collection PubMed
description BACKGROUND: Knowledge graphs (KGs), especially medical knowledge graphs, are often significantly incomplete, so it necessitating a demand for medical knowledge graph completion (MedKGC). MedKGC can find new facts based on the existed knowledge in the KGs. The path-based knowledge reasoning algorithm is one of the most important approaches to this task. This type of method has received great attention in recent years because of its high performance and interpretability. In fact, traditional methods such as path ranking algorithm take the paths between an entity pair as atomic features. However, the medical KGs are very sparse, which makes it difficult to model effective semantic representation for extremely sparse path features. The sparsity in the medical KGs is mainly reflected in the long-tailed distribution of entities and paths. Previous methods merely consider the context structure in the paths of knowledge graph and ignore the textual semantics of the symbols in the path. Therefore, their performance cannot be further improved due to the two aspects of entity sparseness and path sparseness. METHODS: To address the above issues, this paper proposes two novel path-based reasoning methods to solve the sparsity issues of entity and path respectively, which adopts the textual semantic information of entities and paths for MedKGC. By using the pre-trained model BERT, combining the textual semantic representations of the entities and the relationships, we model the task of symbolic reasoning in the medical KG as a numerical computing issue in textual semantic representation. RESULTS: Experiments results on the publicly authoritative Chinese symptom knowledge graph demonstrated that the proposed method is significantly better than the state-of-the-art path-based knowledge graph reasoning methods, and the average performance is improved by 5.83% for all relations. CONCLUSIONS: In this paper, we propose two new knowledge graph reasoning algorithms, which adopt textual semantic information of entities and paths and can effectively alleviate the sparsity problem of entities and paths in the MedKGC. As far as we know, it is the first method to use pre-trained language models and text path representations for medical knowledge reasoning. Our method can complete the impaired symptom knowledge graph in an interpretable way, and it outperforms the state-of-the-art path-based reasoning methods.
format Online
Article
Text
id pubmed-8628388
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-86283882021-12-01 Path-based knowledge reasoning with textual semantic information for medical knowledge graph completion Lan, Yinyu He, Shizhu Liu, Kang Zeng, Xiangrong Liu, Shengping Zhao, Jun BMC Med Inform Decis Mak Review BACKGROUND: Knowledge graphs (KGs), especially medical knowledge graphs, are often significantly incomplete, so it necessitating a demand for medical knowledge graph completion (MedKGC). MedKGC can find new facts based on the existed knowledge in the KGs. The path-based knowledge reasoning algorithm is one of the most important approaches to this task. This type of method has received great attention in recent years because of its high performance and interpretability. In fact, traditional methods such as path ranking algorithm take the paths between an entity pair as atomic features. However, the medical KGs are very sparse, which makes it difficult to model effective semantic representation for extremely sparse path features. The sparsity in the medical KGs is mainly reflected in the long-tailed distribution of entities and paths. Previous methods merely consider the context structure in the paths of knowledge graph and ignore the textual semantics of the symbols in the path. Therefore, their performance cannot be further improved due to the two aspects of entity sparseness and path sparseness. METHODS: To address the above issues, this paper proposes two novel path-based reasoning methods to solve the sparsity issues of entity and path respectively, which adopts the textual semantic information of entities and paths for MedKGC. By using the pre-trained model BERT, combining the textual semantic representations of the entities and the relationships, we model the task of symbolic reasoning in the medical KG as a numerical computing issue in textual semantic representation. RESULTS: Experiments results on the publicly authoritative Chinese symptom knowledge graph demonstrated that the proposed method is significantly better than the state-of-the-art path-based knowledge graph reasoning methods, and the average performance is improved by 5.83% for all relations. CONCLUSIONS: In this paper, we propose two new knowledge graph reasoning algorithms, which adopt textual semantic information of entities and paths and can effectively alleviate the sparsity problem of entities and paths in the MedKGC. As far as we know, it is the first method to use pre-trained language models and text path representations for medical knowledge reasoning. Our method can complete the impaired symptom knowledge graph in an interpretable way, and it outperforms the state-of-the-art path-based reasoning methods. BioMed Central 2021-11-29 /pmc/articles/PMC8628388/ /pubmed/34844576 http://dx.doi.org/10.1186/s12911-021-01622-7 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Review
Lan, Yinyu
He, Shizhu
Liu, Kang
Zeng, Xiangrong
Liu, Shengping
Zhao, Jun
Path-based knowledge reasoning with textual semantic information for medical knowledge graph completion
title Path-based knowledge reasoning with textual semantic information for medical knowledge graph completion
title_full Path-based knowledge reasoning with textual semantic information for medical knowledge graph completion
title_fullStr Path-based knowledge reasoning with textual semantic information for medical knowledge graph completion
title_full_unstemmed Path-based knowledge reasoning with textual semantic information for medical knowledge graph completion
title_short Path-based knowledge reasoning with textual semantic information for medical knowledge graph completion
title_sort path-based knowledge reasoning with textual semantic information for medical knowledge graph completion
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8628388/
https://www.ncbi.nlm.nih.gov/pubmed/34844576
http://dx.doi.org/10.1186/s12911-021-01622-7
work_keys_str_mv AT lanyinyu pathbasedknowledgereasoningwithtextualsemanticinformationformedicalknowledgegraphcompletion
AT heshizhu pathbasedknowledgereasoningwithtextualsemanticinformationformedicalknowledgegraphcompletion
AT liukang pathbasedknowledgereasoningwithtextualsemanticinformationformedicalknowledgegraphcompletion
AT zengxiangrong pathbasedknowledgereasoningwithtextualsemanticinformationformedicalknowledgegraphcompletion
AT liushengping pathbasedknowledgereasoningwithtextualsemanticinformationformedicalknowledgegraphcompletion
AT zhaojun pathbasedknowledgereasoningwithtextualsemanticinformationformedicalknowledgegraphcompletion