Cargando…

Multi-task learning for Chinese clinical named entity recognition with external knowledge

BACKGROUND: Named entity recognition (NER) on Chinese electronic medical/healthcare records has attracted significantly attentions as it can be applied to building applications to understand these records. Most previous methods have been purely data-driven, requiring high-quality and large-scale lab...

Descripción completa

Detalles Bibliográficos
Autores principales: Cheng, Ming, Xiong, Shufeng, Li, Fei, Liang, Pan, Gao, Jianbo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8719412/
https://www.ncbi.nlm.nih.gov/pubmed/34972505
http://dx.doi.org/10.1186/s12911-021-01717-1
Descripción
Sumario:BACKGROUND: Named entity recognition (NER) on Chinese electronic medical/healthcare records has attracted significantly attentions as it can be applied to building applications to understand these records. Most previous methods have been purely data-driven, requiring high-quality and large-scale labeled medical data. However, labeled data is expensive to obtain, and these data-driven methods are difficult to handle rare and unseen entities. METHODS: To tackle these problems, this study presents a novel multi-task deep neural network model for Chinese NER in the medical domain. We incorporate dictionary features into neural networks, and a general secondary named entity segmentation is used as auxiliary task to improve the performance of the primary task of named entity recognition. RESULTS: In order to evaluate the proposed method, we compare it with other currently popular methods, on three benchmark datasets. Two of the datasets are publicly available, and the other one is constructed by us. Experimental results show that the proposed model achieves 91.07% average f-measure on the two public datasets and 87.05% f-measure on private dataset. CONCLUSIONS: The comparison results of different models demonstrated the effectiveness of our model. The proposed model outperformed traditional statistical models.