Cargando…
Head and Tail Entity Fusion Model in Medical Knowledge Graph Construction: Case Study for Pituitary Adenoma
BACKGROUND: Pituitary adenoma is one of the most common central nervous system tumors. The diagnosis and treatment of pituitary adenoma remain very difficult. Misdiagnosis and recurrence often occur, and experienced neurosurgeons are in serious shortage. A knowledge graph can help interns quickly un...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8367125/ https://www.ncbi.nlm.nih.gov/pubmed/34057414 http://dx.doi.org/10.2196/28218 |
_version_ | 1783739015210467328 |
---|---|
author | Fang, An Lou, Pei Hu, Jiahui Zhao, Wanqing Feng, Ming Ren, Huiling Chen, Xianlai |
author_facet | Fang, An Lou, Pei Hu, Jiahui Zhao, Wanqing Feng, Ming Ren, Huiling Chen, Xianlai |
author_sort | Fang, An |
collection | PubMed |
description | BACKGROUND: Pituitary adenoma is one of the most common central nervous system tumors. The diagnosis and treatment of pituitary adenoma remain very difficult. Misdiagnosis and recurrence often occur, and experienced neurosurgeons are in serious shortage. A knowledge graph can help interns quickly understand the medical knowledge related to pituitary tumor. OBJECTIVE: The aim of this study was to develop a data fusion method suitable for medical data using data of pituitary adenomas integrated from different sources. The overall goal was to construct a knowledge graph for pituitary adenoma (KGPA) to be used for knowledge discovery. METHODS: A complete framework suitable for the construction of a medical knowledge graph was developed, which was used to build the KGPA. The schema of the KGPA was manually constructed. Information of pituitary adenoma was automatically extracted from Chinese electronic medical records (CEMRs) and medical websites through a conditional random field model and newly designed web wrappers. An entity fusion method is proposed based on the head-and-tail entity fusion model to fuse the data from heterogeneous sources. RESULTS: Data were extracted from 300 CEMRs of pituitary adenoma and 4 health portals. Entity fusion was carried out using the proposed data fusion model. The F1 scores of the head and tail entity fusions were 97.32% and 98.57%, respectively. Triples from the constructed KGPA were selected for evaluation, demonstrating 95.4% accuracy. CONCLUSIONS: This paper introduces an approach to fuse triples extracted from heterogeneous data sources, which can be used to build a knowledge graph. The evaluation results showed that the data in the KGPA are of high quality. The constructed KGPA can help physicians in clinical practice. |
format | Online Article Text |
id | pubmed-8367125 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-83671252021-08-24 Head and Tail Entity Fusion Model in Medical Knowledge Graph Construction: Case Study for Pituitary Adenoma Fang, An Lou, Pei Hu, Jiahui Zhao, Wanqing Feng, Ming Ren, Huiling Chen, Xianlai JMIR Med Inform Original Paper BACKGROUND: Pituitary adenoma is one of the most common central nervous system tumors. The diagnosis and treatment of pituitary adenoma remain very difficult. Misdiagnosis and recurrence often occur, and experienced neurosurgeons are in serious shortage. A knowledge graph can help interns quickly understand the medical knowledge related to pituitary tumor. OBJECTIVE: The aim of this study was to develop a data fusion method suitable for medical data using data of pituitary adenomas integrated from different sources. The overall goal was to construct a knowledge graph for pituitary adenoma (KGPA) to be used for knowledge discovery. METHODS: A complete framework suitable for the construction of a medical knowledge graph was developed, which was used to build the KGPA. The schema of the KGPA was manually constructed. Information of pituitary adenoma was automatically extracted from Chinese electronic medical records (CEMRs) and medical websites through a conditional random field model and newly designed web wrappers. An entity fusion method is proposed based on the head-and-tail entity fusion model to fuse the data from heterogeneous sources. RESULTS: Data were extracted from 300 CEMRs of pituitary adenoma and 4 health portals. Entity fusion was carried out using the proposed data fusion model. The F1 scores of the head and tail entity fusions were 97.32% and 98.57%, respectively. Triples from the constructed KGPA were selected for evaluation, demonstrating 95.4% accuracy. CONCLUSIONS: This paper introduces an approach to fuse triples extracted from heterogeneous data sources, which can be used to build a knowledge graph. The evaluation results showed that the data in the KGPA are of high quality. The constructed KGPA can help physicians in clinical practice. JMIR Publications 2021-07-22 /pmc/articles/PMC8367125/ /pubmed/34057414 http://dx.doi.org/10.2196/28218 Text en ©An Fang, Pei Lou, Jiahui Hu, Wanqing Zhao, Ming Feng, Huiling Ren, Xianlai Chen. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 22.07.2021. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Fang, An Lou, Pei Hu, Jiahui Zhao, Wanqing Feng, Ming Ren, Huiling Chen, Xianlai Head and Tail Entity Fusion Model in Medical Knowledge Graph Construction: Case Study for Pituitary Adenoma |
title | Head and Tail Entity Fusion Model in Medical Knowledge Graph Construction: Case Study for Pituitary Adenoma |
title_full | Head and Tail Entity Fusion Model in Medical Knowledge Graph Construction: Case Study for Pituitary Adenoma |
title_fullStr | Head and Tail Entity Fusion Model in Medical Knowledge Graph Construction: Case Study for Pituitary Adenoma |
title_full_unstemmed | Head and Tail Entity Fusion Model in Medical Knowledge Graph Construction: Case Study for Pituitary Adenoma |
title_short | Head and Tail Entity Fusion Model in Medical Knowledge Graph Construction: Case Study for Pituitary Adenoma |
title_sort | head and tail entity fusion model in medical knowledge graph construction: case study for pituitary adenoma |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8367125/ https://www.ncbi.nlm.nih.gov/pubmed/34057414 http://dx.doi.org/10.2196/28218 |
work_keys_str_mv | AT fangan headandtailentityfusionmodelinmedicalknowledgegraphconstructioncasestudyforpituitaryadenoma AT loupei headandtailentityfusionmodelinmedicalknowledgegraphconstructioncasestudyforpituitaryadenoma AT hujiahui headandtailentityfusionmodelinmedicalknowledgegraphconstructioncasestudyforpituitaryadenoma AT zhaowanqing headandtailentityfusionmodelinmedicalknowledgegraphconstructioncasestudyforpituitaryadenoma AT fengming headandtailentityfusionmodelinmedicalknowledgegraphconstructioncasestudyforpituitaryadenoma AT renhuiling headandtailentityfusionmodelinmedicalknowledgegraphconstructioncasestudyforpituitaryadenoma AT chenxianlai headandtailentityfusionmodelinmedicalknowledgegraphconstructioncasestudyforpituitaryadenoma |