Cargando…

Temporal indexing of medical entity in Chinese clinical notes

BACKGROUND: The goal of temporal indexing is to select an occurred time or time interval for each medical entity in clinical notes, so that all medical entities can be indexed on a united timeline, which could assist the understanding of clinical notes and the further application of medical entities...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Zengjian, Wang, Xiaolong, Chen, Qingcai, Tang, Buzhou, Xu, Hua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6354334/
https://www.ncbi.nlm.nih.gov/pubmed/30700331
http://dx.doi.org/10.1186/s12911-019-0735-x
_version_ 1783391164193308672
author Liu, Zengjian
Wang, Xiaolong
Chen, Qingcai
Tang, Buzhou
Xu, Hua
author_facet Liu, Zengjian
Wang, Xiaolong
Chen, Qingcai
Tang, Buzhou
Xu, Hua
author_sort Liu, Zengjian
collection PubMed
description BACKGROUND: The goal of temporal indexing is to select an occurred time or time interval for each medical entity in clinical notes, so that all medical entities can be indexed on a united timeline, which could assist the understanding of clinical notes and the further application of medical entities. Some temporal relation shared tasks for the medical entity in English clinical notes have been organized in the past few years, such as the 2012 i2b2 NLP challenge, 2015 and 2016 clinical TempEval challenges. In these tasks, many heuristics rule-based and machine learning-based systems have been developed. In recent years, the deep neural network models have shown great potential on many problems including the relation classification. METHODS: In this paper, we propose a recurrent convolutional neural network (RNN-CNN) model for the temporal indexing task, which consists of four layers: input layer – generates representation for each context word of medical entities or temporal expressions; LSTM (long-short term memory) layer – learns the context information of each word in a sentence and outputs a new word representation sequence; CNN layer – extracts meaningful features from a sentence and outputs a new representation for medical entity or temporal expression; Output layer – takes the representations of medical entity, temporal expression and relation features as input and classifies the temporal relation. Finally, the time or time interval for each medical entity can be directly selected according to the probability of each temporal relation predicted by above model. RESULTS: To investigate the performance of our RNN-CNN model for the temporal indexing task, several baseline methods were also employed, such as the rule-based, support vector machine (SVM), convolutional neural network (CNN) and recurrent neural network (RNN) methods. Experiments conducted on a manually annotated corpus (including 563 clinical notes with 12,611 medical entities and 4006 temporal expressions) show that RNN-CNN model achieves the best F1-score of 75.97% for temporal relation classification and the best accuracy of 71.96% for temporal indexing. CONCLUSIONS: Neural network methods perform much better than the traditional rule-based and SVM-based method, which can capture more semantic information from the context of medical entities and temporal expressions. Besides, all our methods perform much better for the accurate time indexing than the time interval indexing, so how to improve the performance for time interval indexing will be the main focus in our future work.
format Online
Article
Text
id pubmed-6354334
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-63543342019-02-06 Temporal indexing of medical entity in Chinese clinical notes Liu, Zengjian Wang, Xiaolong Chen, Qingcai Tang, Buzhou Xu, Hua BMC Med Inform Decis Mak Research BACKGROUND: The goal of temporal indexing is to select an occurred time or time interval for each medical entity in clinical notes, so that all medical entities can be indexed on a united timeline, which could assist the understanding of clinical notes and the further application of medical entities. Some temporal relation shared tasks for the medical entity in English clinical notes have been organized in the past few years, such as the 2012 i2b2 NLP challenge, 2015 and 2016 clinical TempEval challenges. In these tasks, many heuristics rule-based and machine learning-based systems have been developed. In recent years, the deep neural network models have shown great potential on many problems including the relation classification. METHODS: In this paper, we propose a recurrent convolutional neural network (RNN-CNN) model for the temporal indexing task, which consists of four layers: input layer – generates representation for each context word of medical entities or temporal expressions; LSTM (long-short term memory) layer – learns the context information of each word in a sentence and outputs a new word representation sequence; CNN layer – extracts meaningful features from a sentence and outputs a new representation for medical entity or temporal expression; Output layer – takes the representations of medical entity, temporal expression and relation features as input and classifies the temporal relation. Finally, the time or time interval for each medical entity can be directly selected according to the probability of each temporal relation predicted by above model. RESULTS: To investigate the performance of our RNN-CNN model for the temporal indexing task, several baseline methods were also employed, such as the rule-based, support vector machine (SVM), convolutional neural network (CNN) and recurrent neural network (RNN) methods. Experiments conducted on a manually annotated corpus (including 563 clinical notes with 12,611 medical entities and 4006 temporal expressions) show that RNN-CNN model achieves the best F1-score of 75.97% for temporal relation classification and the best accuracy of 71.96% for temporal indexing. CONCLUSIONS: Neural network methods perform much better than the traditional rule-based and SVM-based method, which can capture more semantic information from the context of medical entities and temporal expressions. Besides, all our methods perform much better for the accurate time indexing than the time interval indexing, so how to improve the performance for time interval indexing will be the main focus in our future work. BioMed Central 2019-01-31 /pmc/articles/PMC6354334/ /pubmed/30700331 http://dx.doi.org/10.1186/s12911-019-0735-x Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Liu, Zengjian
Wang, Xiaolong
Chen, Qingcai
Tang, Buzhou
Xu, Hua
Temporal indexing of medical entity in Chinese clinical notes
title Temporal indexing of medical entity in Chinese clinical notes
title_full Temporal indexing of medical entity in Chinese clinical notes
title_fullStr Temporal indexing of medical entity in Chinese clinical notes
title_full_unstemmed Temporal indexing of medical entity in Chinese clinical notes
title_short Temporal indexing of medical entity in Chinese clinical notes
title_sort temporal indexing of medical entity in chinese clinical notes
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6354334/
https://www.ncbi.nlm.nih.gov/pubmed/30700331
http://dx.doi.org/10.1186/s12911-019-0735-x
work_keys_str_mv AT liuzengjian temporalindexingofmedicalentityinchineseclinicalnotes
AT wangxiaolong temporalindexingofmedicalentityinchineseclinicalnotes
AT chenqingcai temporalindexingofmedicalentityinchineseclinicalnotes
AT tangbuzhou temporalindexingofmedicalentityinchineseclinicalnotes
AT xuhua temporalindexingofmedicalentityinchineseclinicalnotes