Cargando…

A semi-supervised approach for extracting TCM clinical terms based on feature words

BACKGROUND: A semi-supervised model is proposed for extracting clinical terms of Traditional Chinese Medicine using feature words. METHODS: The extraction model is based on BiLSTM-CRF and combined with semi-supervised learning and feature word set, which reduces the cost of manual annotation and lev...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Liangliang, Wu, Xiaojing, Liu, Hui, Cao, Xinyu, Wang, Haitao, Zhou, Hongwei, Xie, Qi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7477860/
https://www.ncbi.nlm.nih.gov/pubmed/32646408
http://dx.doi.org/10.1186/s12911-020-1108-1
Descripción
Sumario:BACKGROUND: A semi-supervised model is proposed for extracting clinical terms of Traditional Chinese Medicine using feature words. METHODS: The extraction model is based on BiLSTM-CRF and combined with semi-supervised learning and feature word set, which reduces the cost of manual annotation and leverage extraction results. RESULTS: Experiment results show that the proposed model improves the extraction of five types of TCM clinical terms, including traditional Chinese medicine, symptoms, patterns, diseases and formulas. The best F1-value of the experiment reaches 78.70% on the test dataset. CONCLUSIONS: This method can reduce the cost of manual labeling and improve the result in the NER research of TCM clinical terms.