Cargando…

Named Entity Recognition of Medical Text Based on the Deep Neural Network

Medical text data records detailed clinical data; named entity recognition is the basis of text information processing and an important part of mining valuable information in medical texts. The named entity recognition technology can accurately identify the information needed in medical texts and he...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Tianjiao, He, Ying, Yang, Ning
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8920682/
https://www.ncbi.nlm.nih.gov/pubmed/35295179
http://dx.doi.org/10.1155/2022/3990563
Descripción
Sumario:Medical text data records detailed clinical data; named entity recognition is the basis of text information processing and an important part of mining valuable information in medical texts. The named entity recognition technology can accurately identify the information needed in medical texts and help medical staff make clinical decision-making, evidence-based medicine, and epidemic disease monitoring. This paper proposes a hybrid neural network medical text named entity recognition model. First, a coding method based on a fully self-attentive mechanism is proposed. The vector representation of each word is related to the entire sentence through the attention mechanism. It determines the weight distribution by scoring the characters or words in all positions and obtains the position information in the sentence that needs the most attention. The encoding vector at each position is integrated with the context information of full sentence, which solves the ambiguity problem. Second, a multivariate convolutional decoding method is proposed. This method can effectively pay attention to the characteristics of medical text named entity recognition in the decoding process. It uses two-dimensional convolutional decoding to associate the current position word with surrounding words to improve decoding efficiency while extracting features from the logic of the preceding and following words. Using the same number of convolution kernels as the entity category, it can effectively extract effective features from the label dimension. Besides, according to the characteristics of the named entity recognition task, a special mixed loss is designed. The experimental results verify that the proposed method is effective, and it is improved compared with some existing medical text named entity recognition methods.