Cargando…
An approach for medical event detection in Chinese clinical notes of electronic health records
BACKGROUND: Medical event detection in narrative clinical notes of electronic health records (EHRs) is a task designed for reading text and extracting information. Most of the previous work of medical event detection treats the task as extracting concepts at word granularity, which omits the overall...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6454668/ https://www.ncbi.nlm.nih.gov/pubmed/30961587 http://dx.doi.org/10.1186/s12911-019-0756-5 |
_version_ | 1783409584288825344 |
---|---|
author | Zhou, Xuesi Xiong, Haoqi Zeng, Sihan Fu, Xiangling Wu, Ji |
author_facet | Zhou, Xuesi Xiong, Haoqi Zeng, Sihan Fu, Xiangling Wu, Ji |
author_sort | Zhou, Xuesi |
collection | PubMed |
description | BACKGROUND: Medical event detection in narrative clinical notes of electronic health records (EHRs) is a task designed for reading text and extracting information. Most of the previous work of medical event detection treats the task as extracting concepts at word granularity, which omits the overall structural information of the clinical notes. In this work, we treat each clinical note as a sequence of short sentences and propose an end-to-end deep neural network framework. METHODS: We redefined the task as a sequence labelling task at short sentence granularity, and proposed a novel tag system correspondingly. The dataset were derived from a third-level grade-A hospital, consisting of 2000 annotated clinical notes according to our proposed tag system. The proposed end-to-end deep neural network framework consists of a feature extractor and a sequence labeller, and we explored different implementations respectively. We additionally proposed a smoothed Viterbi decoder as sequence labeller without additional parameter training, which can be a good alternative to conditional random field (CRF) when computing resources are limited. RESULTS: Our sequence labelling models were compared to four baselines which treat the task as text classification of short sentences. Experimental results showed that our approach significantly outperforms the baselines. The best result was obtained by using the convolutional neural networks (CNNs) feature extractor and the sequential CRF sequence labeller, achieving an accuracy of 92.6%. Our proposed smoothed Viterbi decoder achieved a comparable accuracy of 90.07% with reduced training parameters, and brought more balanced performance across all categories, which means better generalization ability. CONCLUSIONS: Evaluated on our annotated dataset, the comparison results demonstrated the effectiveness of our approach for medical event detection in Chinese clinical notes of EHRs. The best feature extractor is the CNNs feature extractor, and the best sequence labeller is the sequential CRF decoder. And it was empirically verified that our proposed smoothed Viterbi decoder could bring better generalization ability while achieving comparable performance to the sequential CRF decoder. |
format | Online Article Text |
id | pubmed-6454668 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-64546682019-04-19 An approach for medical event detection in Chinese clinical notes of electronic health records Zhou, Xuesi Xiong, Haoqi Zeng, Sihan Fu, Xiangling Wu, Ji BMC Med Inform Decis Mak Research BACKGROUND: Medical event detection in narrative clinical notes of electronic health records (EHRs) is a task designed for reading text and extracting information. Most of the previous work of medical event detection treats the task as extracting concepts at word granularity, which omits the overall structural information of the clinical notes. In this work, we treat each clinical note as a sequence of short sentences and propose an end-to-end deep neural network framework. METHODS: We redefined the task as a sequence labelling task at short sentence granularity, and proposed a novel tag system correspondingly. The dataset were derived from a third-level grade-A hospital, consisting of 2000 annotated clinical notes according to our proposed tag system. The proposed end-to-end deep neural network framework consists of a feature extractor and a sequence labeller, and we explored different implementations respectively. We additionally proposed a smoothed Viterbi decoder as sequence labeller without additional parameter training, which can be a good alternative to conditional random field (CRF) when computing resources are limited. RESULTS: Our sequence labelling models were compared to four baselines which treat the task as text classification of short sentences. Experimental results showed that our approach significantly outperforms the baselines. The best result was obtained by using the convolutional neural networks (CNNs) feature extractor and the sequential CRF sequence labeller, achieving an accuracy of 92.6%. Our proposed smoothed Viterbi decoder achieved a comparable accuracy of 90.07% with reduced training parameters, and brought more balanced performance across all categories, which means better generalization ability. CONCLUSIONS: Evaluated on our annotated dataset, the comparison results demonstrated the effectiveness of our approach for medical event detection in Chinese clinical notes of EHRs. The best feature extractor is the CNNs feature extractor, and the best sequence labeller is the sequential CRF decoder. And it was empirically verified that our proposed smoothed Viterbi decoder could bring better generalization ability while achieving comparable performance to the sequential CRF decoder. BioMed Central 2019-04-09 /pmc/articles/PMC6454668/ /pubmed/30961587 http://dx.doi.org/10.1186/s12911-019-0756-5 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Zhou, Xuesi Xiong, Haoqi Zeng, Sihan Fu, Xiangling Wu, Ji An approach for medical event detection in Chinese clinical notes of electronic health records |
title | An approach for medical event detection in Chinese clinical notes of electronic health records |
title_full | An approach for medical event detection in Chinese clinical notes of electronic health records |
title_fullStr | An approach for medical event detection in Chinese clinical notes of electronic health records |
title_full_unstemmed | An approach for medical event detection in Chinese clinical notes of electronic health records |
title_short | An approach for medical event detection in Chinese clinical notes of electronic health records |
title_sort | approach for medical event detection in chinese clinical notes of electronic health records |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6454668/ https://www.ncbi.nlm.nih.gov/pubmed/30961587 http://dx.doi.org/10.1186/s12911-019-0756-5 |
work_keys_str_mv | AT zhouxuesi anapproachformedicaleventdetectioninchineseclinicalnotesofelectronichealthrecords AT xionghaoqi anapproachformedicaleventdetectioninchineseclinicalnotesofelectronichealthrecords AT zengsihan anapproachformedicaleventdetectioninchineseclinicalnotesofelectronichealthrecords AT fuxiangling anapproachformedicaleventdetectioninchineseclinicalnotesofelectronichealthrecords AT wuji anapproachformedicaleventdetectioninchineseclinicalnotesofelectronichealthrecords AT zhouxuesi approachformedicaleventdetectioninchineseclinicalnotesofelectronichealthrecords AT xionghaoqi approachformedicaleventdetectioninchineseclinicalnotesofelectronichealthrecords AT zengsihan approachformedicaleventdetectioninchineseclinicalnotesofelectronichealthrecords AT fuxiangling approachformedicaleventdetectioninchineseclinicalnotesofelectronichealthrecords AT wuji approachformedicaleventdetectioninchineseclinicalnotesofelectronichealthrecords |