Cargando…

Unsupervised Medical Entity Recognition and Linking in Chinese Online Medical Text

Online medical text is full of references to medical entities (MEs), which are valuable in many applications, including medical knowledge-based (KB) construction, decision support systems, and the treatment of diseases. However, the diverse and ambiguous nature of the surface forms gives rise to a g...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Jing, Gan, Liang, Cheng, Mian, Wu, Quanyuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5932451/
https://www.ncbi.nlm.nih.gov/pubmed/29849994
http://dx.doi.org/10.1155/2018/2548537
_version_ 1783319818244456448
author Xu, Jing
Gan, Liang
Cheng, Mian
Wu, Quanyuan
author_facet Xu, Jing
Gan, Liang
Cheng, Mian
Wu, Quanyuan
author_sort Xu, Jing
collection PubMed
description Online medical text is full of references to medical entities (MEs), which are valuable in many applications, including medical knowledge-based (KB) construction, decision support systems, and the treatment of diseases. However, the diverse and ambiguous nature of the surface forms gives rise to a great difficulty for ME identification. Many existing solutions have focused on supervised approaches, which are often task-dependent. In other words, applying them to different kinds of corpora or identifying new entity categories requires major effort in data annotation and feature definition. In this paper, we propose unMERL, an unsupervised framework for recognizing and linking medical entities mentioned in Chinese online medical text. For ME recognition, unMERL first exploits a knowledge-driven approach to extract candidate entities from free text. Then, the categories of the candidate entities are determined using a distributed semantic-based approach. For ME linking, we propose a collaborative inference approach which takes full advantage of heterogenous entity knowledge and unstructured information in KB. Experimental results on real corpora demonstrate significant benefits compared to recent approaches with respect to both ME recognition and linking.
format Online
Article
Text
id pubmed-5932451
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-59324512018-05-30 Unsupervised Medical Entity Recognition and Linking in Chinese Online Medical Text Xu, Jing Gan, Liang Cheng, Mian Wu, Quanyuan J Healthc Eng Research Article Online medical text is full of references to medical entities (MEs), which are valuable in many applications, including medical knowledge-based (KB) construction, decision support systems, and the treatment of diseases. However, the diverse and ambiguous nature of the surface forms gives rise to a great difficulty for ME identification. Many existing solutions have focused on supervised approaches, which are often task-dependent. In other words, applying them to different kinds of corpora or identifying new entity categories requires major effort in data annotation and feature definition. In this paper, we propose unMERL, an unsupervised framework for recognizing and linking medical entities mentioned in Chinese online medical text. For ME recognition, unMERL first exploits a knowledge-driven approach to extract candidate entities from free text. Then, the categories of the candidate entities are determined using a distributed semantic-based approach. For ME linking, we propose a collaborative inference approach which takes full advantage of heterogenous entity knowledge and unstructured information in KB. Experimental results on real corpora demonstrate significant benefits compared to recent approaches with respect to both ME recognition and linking. Hindawi 2018-04-18 /pmc/articles/PMC5932451/ /pubmed/29849994 http://dx.doi.org/10.1155/2018/2548537 Text en Copyright © 2018 Jing Xu et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Xu, Jing
Gan, Liang
Cheng, Mian
Wu, Quanyuan
Unsupervised Medical Entity Recognition and Linking in Chinese Online Medical Text
title Unsupervised Medical Entity Recognition and Linking in Chinese Online Medical Text
title_full Unsupervised Medical Entity Recognition and Linking in Chinese Online Medical Text
title_fullStr Unsupervised Medical Entity Recognition and Linking in Chinese Online Medical Text
title_full_unstemmed Unsupervised Medical Entity Recognition and Linking in Chinese Online Medical Text
title_short Unsupervised Medical Entity Recognition and Linking in Chinese Online Medical Text
title_sort unsupervised medical entity recognition and linking in chinese online medical text
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5932451/
https://www.ncbi.nlm.nih.gov/pubmed/29849994
http://dx.doi.org/10.1155/2018/2548537
work_keys_str_mv AT xujing unsupervisedmedicalentityrecognitionandlinkinginchineseonlinemedicaltext
AT ganliang unsupervisedmedicalentityrecognitionandlinkinginchineseonlinemedicaltext
AT chengmian unsupervisedmedicalentityrecognitionandlinkinginchineseonlinemedicaltext
AT wuquanyuan unsupervisedmedicalentityrecognitionandlinkinginchineseonlinemedicaltext