Cargando…

Constructing a Chinese electronic medical record corpus for named entity recognition on resident admit notes

BACKGROUND: Electronic Medical Records(EMRs) contain much medical information about patients. Medical named entity extracting from EMRs can provide value information to support doctors’ decision making. The research on information extraction of Chinese Electronic Medical Records is still behind that...

Descripción completa

Detalles Bibliográficos
Autores principales: Gao, Yan, Gu, Lei, Wang, Yefeng, Wang, Yandong, Yang, Feng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6454673/
https://www.ncbi.nlm.nih.gov/pubmed/30961596
http://dx.doi.org/10.1186/s12911-019-0759-2
_version_ 1783409585478959104
author Gao, Yan
Gu, Lei
Wang, Yefeng
Wang, Yandong
Yang, Feng
author_facet Gao, Yan
Gu, Lei
Wang, Yefeng
Wang, Yandong
Yang, Feng
author_sort Gao, Yan
collection PubMed
description BACKGROUND: Electronic Medical Records(EMRs) contain much medical information about patients. Medical named entity extracting from EMRs can provide value information to support doctors’ decision making. The research on information extraction of Chinese Electronic Medical Records is still behind that has done in English. METHODS: This paper proposed a practical annotation scheme for medical entity extraction on Resident Admit Notes (RANs), and a model which can automatic extract medical entity. Nine types of clinical entities, four types of clinical relationships were defined in our annotation scheme. An end-to-end deep neural network with convolution neural network and long-short term memory units was applied in our model for the medical named entity recognition(NER). RESULT: We annotated RANs in three rounds. The overall F-score of annotation consistency was up to 97.73%. And our NER model on the 255 annotated RANs achieved the best F-score of 91.08%. CONCLUSION: The annotation scheme and the model for NER in this paper are effective to extract medical named entity from RANs and provide the basis for fully excavating the patient’s information.
format Online
Article
Text
id pubmed-6454673
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-64546732019-04-19 Constructing a Chinese electronic medical record corpus for named entity recognition on resident admit notes Gao, Yan Gu, Lei Wang, Yefeng Wang, Yandong Yang, Feng BMC Med Inform Decis Mak Research BACKGROUND: Electronic Medical Records(EMRs) contain much medical information about patients. Medical named entity extracting from EMRs can provide value information to support doctors’ decision making. The research on information extraction of Chinese Electronic Medical Records is still behind that has done in English. METHODS: This paper proposed a practical annotation scheme for medical entity extraction on Resident Admit Notes (RANs), and a model which can automatic extract medical entity. Nine types of clinical entities, four types of clinical relationships were defined in our annotation scheme. An end-to-end deep neural network with convolution neural network and long-short term memory units was applied in our model for the medical named entity recognition(NER). RESULT: We annotated RANs in three rounds. The overall F-score of annotation consistency was up to 97.73%. And our NER model on the 255 annotated RANs achieved the best F-score of 91.08%. CONCLUSION: The annotation scheme and the model for NER in this paper are effective to extract medical named entity from RANs and provide the basis for fully excavating the patient’s information. BioMed Central 2019-04-09 /pmc/articles/PMC6454673/ /pubmed/30961596 http://dx.doi.org/10.1186/s12911-019-0759-2 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Gao, Yan
Gu, Lei
Wang, Yefeng
Wang, Yandong
Yang, Feng
Constructing a Chinese electronic medical record corpus for named entity recognition on resident admit notes
title Constructing a Chinese electronic medical record corpus for named entity recognition on resident admit notes
title_full Constructing a Chinese electronic medical record corpus for named entity recognition on resident admit notes
title_fullStr Constructing a Chinese electronic medical record corpus for named entity recognition on resident admit notes
title_full_unstemmed Constructing a Chinese electronic medical record corpus for named entity recognition on resident admit notes
title_short Constructing a Chinese electronic medical record corpus for named entity recognition on resident admit notes
title_sort constructing a chinese electronic medical record corpus for named entity recognition on resident admit notes
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6454673/
https://www.ncbi.nlm.nih.gov/pubmed/30961596
http://dx.doi.org/10.1186/s12911-019-0759-2
work_keys_str_mv AT gaoyan constructingachineseelectronicmedicalrecordcorpusfornamedentityrecognitiononresidentadmitnotes
AT gulei constructingachineseelectronicmedicalrecordcorpusfornamedentityrecognitiononresidentadmitnotes
AT wangyefeng constructingachineseelectronicmedicalrecordcorpusfornamedentityrecognitiononresidentadmitnotes
AT wangyandong constructingachineseelectronicmedicalrecordcorpusfornamedentityrecognitiononresidentadmitnotes
AT yangfeng constructingachineseelectronicmedicalrecordcorpusfornamedentityrecognitiononresidentadmitnotes