Cargando…

An imConvNet-based deep learning model for Chinese medical named entity recognition

BACKGROUND: With the development of current medical technology, information management becomes perfect in the medical field. Medical big data analysis is based on a large amount of medical and health data stored in the electronic medical system, such as electronic medical records and medical reports...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zheng, Yuchen, Han, Zhenggong, Cai, Yimin, Duan, Xubo, Sun, Jiangling, Yang, Wei, Huang, Haisong
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2022
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9677659/ https://www.ncbi.nlm.nih.gov/pubmed/36411432 http://dx.doi.org/10.1186/s12911-022-02049-4

_version_	1784833852668116992
author	Zheng, Yuchen Han, Zhenggong Cai, Yimin Duan, Xubo Sun, Jiangling Yang, Wei Huang, Haisong
author_facet	Zheng, Yuchen Han, Zhenggong Cai, Yimin Duan, Xubo Sun, Jiangling Yang, Wei Huang, Haisong
author_sort	Zheng, Yuchen
collection	PubMed
description	BACKGROUND: With the development of current medical technology, information management becomes perfect in the medical field. Medical big data analysis is based on a large amount of medical and health data stored in the electronic medical system, such as electronic medical records and medical reports. How to fully exploit the resources of information included in these medical data has always been the subject of research by many scholars. The basis for text mining is named entity recognition (NER), which has its particularities in the medical field, where issues such as inadequate text resources and a large number of professional domain terms continue to face significant challenges in medical NER. METHODS: We improved the convolutional neural network model (imConvNet) to obtain additional text features. Concurrently, we continue to use the classical Bert pre-training model and BiLSTM model for named entity recognition. We use imConvNet model to extract additional word vector features and improve named entity recognition accuracy. The proposed model, named BERT-imConvNet-BiLSTM-CRF, is composed of four layers: BERT embedding layer—getting word embedding vector; imConvNet layer—capturing the context feature of each character; BiLSTM (Bidirectional Long Short-Term Memory) layer—capturing the long-distance dependencies; CRF (Conditional Random Field) layer—labeling characters based on their features and transfer rules. RESULTS: The average F1 score on the public medical data set yidu-s4k reached 91.38% when combined with the classical model; when real electronic medical record text in impacted wisdom teeth is used as the experimental object, the model's F1 score is 93.89%. They all show better results than classical models. CONCLUSIONS: The suggested novel model (imConvNet) significantly improves the recognition accuracy of Chinese medical named entities and applies to various medical corpora.
format	Online Article Text
id	pubmed-9677659
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-96776592022-11-22 An imConvNet-based deep learning model for Chinese medical named entity recognition Zheng, Yuchen Han, Zhenggong Cai, Yimin Duan, Xubo Sun, Jiangling Yang, Wei Huang, Haisong BMC Med Inform Decis Mak Research BACKGROUND: With the development of current medical technology, information management becomes perfect in the medical field. Medical big data analysis is based on a large amount of medical and health data stored in the electronic medical system, such as electronic medical records and medical reports. How to fully exploit the resources of information included in these medical data has always been the subject of research by many scholars. The basis for text mining is named entity recognition (NER), which has its particularities in the medical field, where issues such as inadequate text resources and a large number of professional domain terms continue to face significant challenges in medical NER. METHODS: We improved the convolutional neural network model (imConvNet) to obtain additional text features. Concurrently, we continue to use the classical Bert pre-training model and BiLSTM model for named entity recognition. We use imConvNet model to extract additional word vector features and improve named entity recognition accuracy. The proposed model, named BERT-imConvNet-BiLSTM-CRF, is composed of four layers: BERT embedding layer—getting word embedding vector; imConvNet layer—capturing the context feature of each character; BiLSTM (Bidirectional Long Short-Term Memory) layer—capturing the long-distance dependencies; CRF (Conditional Random Field) layer—labeling characters based on their features and transfer rules. RESULTS: The average F1 score on the public medical data set yidu-s4k reached 91.38% when combined with the classical model; when real electronic medical record text in impacted wisdom teeth is used as the experimental object, the model's F1 score is 93.89%. They all show better results than classical models. CONCLUSIONS: The suggested novel model (imConvNet) significantly improves the recognition accuracy of Chinese medical named entities and applies to various medical corpora. BioMed Central 2022-11-21 /pmc/articles/PMC9677659/ /pubmed/36411432 http://dx.doi.org/10.1186/s12911-022-02049-4 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle	Research Zheng, Yuchen Han, Zhenggong Cai, Yimin Duan, Xubo Sun, Jiangling Yang, Wei Huang, Haisong An imConvNet-based deep learning model for Chinese medical named entity recognition
title	An imConvNet-based deep learning model for Chinese medical named entity recognition
title_full	An imConvNet-based deep learning model for Chinese medical named entity recognition
title_fullStr	An imConvNet-based deep learning model for Chinese medical named entity recognition
title_full_unstemmed	An imConvNet-based deep learning model for Chinese medical named entity recognition
title_short	An imConvNet-based deep learning model for Chinese medical named entity recognition
title_sort	imconvnet-based deep learning model for chinese medical named entity recognition
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9677659/ https://www.ncbi.nlm.nih.gov/pubmed/36411432 http://dx.doi.org/10.1186/s12911-022-02049-4
work_keys_str_mv	AT zhengyuchen animconvnetbaseddeeplearningmodelforchinesemedicalnamedentityrecognition AT hanzhenggong animconvnetbaseddeeplearningmodelforchinesemedicalnamedentityrecognition AT caiyimin animconvnetbaseddeeplearningmodelforchinesemedicalnamedentityrecognition AT duanxubo animconvnetbaseddeeplearningmodelforchinesemedicalnamedentityrecognition AT sunjiangling animconvnetbaseddeeplearningmodelforchinesemedicalnamedentityrecognition AT yangwei animconvnetbaseddeeplearningmodelforchinesemedicalnamedentityrecognition AT huanghaisong animconvnetbaseddeeplearningmodelforchinesemedicalnamedentityrecognition AT zhengyuchen imconvnetbaseddeeplearningmodelforchinesemedicalnamedentityrecognition AT hanzhenggong imconvnetbaseddeeplearningmodelforchinesemedicalnamedentityrecognition AT caiyimin imconvnetbaseddeeplearningmodelforchinesemedicalnamedentityrecognition AT duanxubo imconvnetbaseddeeplearningmodelforchinesemedicalnamedentityrecognition AT sunjiangling imconvnetbaseddeeplearningmodelforchinesemedicalnamedentityrecognition AT yangwei imconvnetbaseddeeplearningmodelforchinesemedicalnamedentityrecognition AT huanghaisong imconvnetbaseddeeplearningmodelforchinesemedicalnamedentityrecognition

An imConvNet-based deep learning model for Chinese medical named entity recognition

Ejemplares similares