Cargando…

Head and Tail Entity Fusion Model in Medical Knowledge Graph Construction: Case Study for Pituitary Adenoma

BACKGROUND: Pituitary adenoma is one of the most common central nervous system tumors. The diagnosis and treatment of pituitary adenoma remain very difficult. Misdiagnosis and recurrence often occur, and experienced neurosurgeons are in serious shortage. A knowledge graph can help interns quickly un...

Descripción completa

Detalles Bibliográficos
Autores principales: Fang, An, Lou, Pei, Hu, Jiahui, Zhao, Wanqing, Feng, Ming, Ren, Huiling, Chen, Xianlai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8367125/
https://www.ncbi.nlm.nih.gov/pubmed/34057414
http://dx.doi.org/10.2196/28218
_version_ 1783739015210467328
author Fang, An
Lou, Pei
Hu, Jiahui
Zhao, Wanqing
Feng, Ming
Ren, Huiling
Chen, Xianlai
author_facet Fang, An
Lou, Pei
Hu, Jiahui
Zhao, Wanqing
Feng, Ming
Ren, Huiling
Chen, Xianlai
author_sort Fang, An
collection PubMed
description BACKGROUND: Pituitary adenoma is one of the most common central nervous system tumors. The diagnosis and treatment of pituitary adenoma remain very difficult. Misdiagnosis and recurrence often occur, and experienced neurosurgeons are in serious shortage. A knowledge graph can help interns quickly understand the medical knowledge related to pituitary tumor. OBJECTIVE: The aim of this study was to develop a data fusion method suitable for medical data using data of pituitary adenomas integrated from different sources. The overall goal was to construct a knowledge graph for pituitary adenoma (KGPA) to be used for knowledge discovery. METHODS: A complete framework suitable for the construction of a medical knowledge graph was developed, which was used to build the KGPA. The schema of the KGPA was manually constructed. Information of pituitary adenoma was automatically extracted from Chinese electronic medical records (CEMRs) and medical websites through a conditional random field model and newly designed web wrappers. An entity fusion method is proposed based on the head-and-tail entity fusion model to fuse the data from heterogeneous sources. RESULTS: Data were extracted from 300 CEMRs of pituitary adenoma and 4 health portals. Entity fusion was carried out using the proposed data fusion model. The F1 scores of the head and tail entity fusions were 97.32% and 98.57%, respectively. Triples from the constructed KGPA were selected for evaluation, demonstrating 95.4% accuracy. CONCLUSIONS: This paper introduces an approach to fuse triples extracted from heterogeneous data sources, which can be used to build a knowledge graph. The evaluation results showed that the data in the KGPA are of high quality. The constructed KGPA can help physicians in clinical practice.
format Online
Article
Text
id pubmed-8367125
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-83671252021-08-24 Head and Tail Entity Fusion Model in Medical Knowledge Graph Construction: Case Study for Pituitary Adenoma Fang, An Lou, Pei Hu, Jiahui Zhao, Wanqing Feng, Ming Ren, Huiling Chen, Xianlai JMIR Med Inform Original Paper BACKGROUND: Pituitary adenoma is one of the most common central nervous system tumors. The diagnosis and treatment of pituitary adenoma remain very difficult. Misdiagnosis and recurrence often occur, and experienced neurosurgeons are in serious shortage. A knowledge graph can help interns quickly understand the medical knowledge related to pituitary tumor. OBJECTIVE: The aim of this study was to develop a data fusion method suitable for medical data using data of pituitary adenomas integrated from different sources. The overall goal was to construct a knowledge graph for pituitary adenoma (KGPA) to be used for knowledge discovery. METHODS: A complete framework suitable for the construction of a medical knowledge graph was developed, which was used to build the KGPA. The schema of the KGPA was manually constructed. Information of pituitary adenoma was automatically extracted from Chinese electronic medical records (CEMRs) and medical websites through a conditional random field model and newly designed web wrappers. An entity fusion method is proposed based on the head-and-tail entity fusion model to fuse the data from heterogeneous sources. RESULTS: Data were extracted from 300 CEMRs of pituitary adenoma and 4 health portals. Entity fusion was carried out using the proposed data fusion model. The F1 scores of the head and tail entity fusions were 97.32% and 98.57%, respectively. Triples from the constructed KGPA were selected for evaluation, demonstrating 95.4% accuracy. CONCLUSIONS: This paper introduces an approach to fuse triples extracted from heterogeneous data sources, which can be used to build a knowledge graph. The evaluation results showed that the data in the KGPA are of high quality. The constructed KGPA can help physicians in clinical practice. JMIR Publications 2021-07-22 /pmc/articles/PMC8367125/ /pubmed/34057414 http://dx.doi.org/10.2196/28218 Text en ©An Fang, Pei Lou, Jiahui Hu, Wanqing Zhao, Ming Feng, Huiling Ren, Xianlai Chen. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 22.07.2021. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Fang, An
Lou, Pei
Hu, Jiahui
Zhao, Wanqing
Feng, Ming
Ren, Huiling
Chen, Xianlai
Head and Tail Entity Fusion Model in Medical Knowledge Graph Construction: Case Study for Pituitary Adenoma
title Head and Tail Entity Fusion Model in Medical Knowledge Graph Construction: Case Study for Pituitary Adenoma
title_full Head and Tail Entity Fusion Model in Medical Knowledge Graph Construction: Case Study for Pituitary Adenoma
title_fullStr Head and Tail Entity Fusion Model in Medical Knowledge Graph Construction: Case Study for Pituitary Adenoma
title_full_unstemmed Head and Tail Entity Fusion Model in Medical Knowledge Graph Construction: Case Study for Pituitary Adenoma
title_short Head and Tail Entity Fusion Model in Medical Knowledge Graph Construction: Case Study for Pituitary Adenoma
title_sort head and tail entity fusion model in medical knowledge graph construction: case study for pituitary adenoma
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8367125/
https://www.ncbi.nlm.nih.gov/pubmed/34057414
http://dx.doi.org/10.2196/28218
work_keys_str_mv AT fangan headandtailentityfusionmodelinmedicalknowledgegraphconstructioncasestudyforpituitaryadenoma
AT loupei headandtailentityfusionmodelinmedicalknowledgegraphconstructioncasestudyforpituitaryadenoma
AT hujiahui headandtailentityfusionmodelinmedicalknowledgegraphconstructioncasestudyforpituitaryadenoma
AT zhaowanqing headandtailentityfusionmodelinmedicalknowledgegraphconstructioncasestudyforpituitaryadenoma
AT fengming headandtailentityfusionmodelinmedicalknowledgegraphconstructioncasestudyforpituitaryadenoma
AT renhuiling headandtailentityfusionmodelinmedicalknowledgegraphconstructioncasestudyforpituitaryadenoma
AT chenxianlai headandtailentityfusionmodelinmedicalknowledgegraphconstructioncasestudyforpituitaryadenoma