Cargando…
Recognizing Temporal Information in Korean Clinical Narratives through Text Normalization
OBJECTIVES: Acquiring temporal information is important because knowledge in clinical narratives is time-sensitive. In this paper, we describe an approach that can be used to extract the temporal information found in Korean clinical narrative texts. METHODS: We developed a two-stage system, which em...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Korean Society of Medical Informatics
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3212741/ https://www.ncbi.nlm.nih.gov/pubmed/22084809 http://dx.doi.org/10.4258/hir.2011.17.3.150 |
_version_ | 1782216015608807424 |
---|---|
author | Kim, Youngho Choi, Jinwook |
author_facet | Kim, Youngho Choi, Jinwook |
author_sort | Kim, Youngho |
collection | PubMed |
description | OBJECTIVES: Acquiring temporal information is important because knowledge in clinical narratives is time-sensitive. In this paper, we describe an approach that can be used to extract the temporal information found in Korean clinical narrative texts. METHODS: We developed a two-stage system, which employs an exhaustive text analysis phase and a temporal expression recognition phase. Since our target document may include tokens that are made up of both Korean and English text joined together, the minimal semantic units are analyzed and then separated from the concatenated phrases and linguistic derivations within a token using a corpus-based approach to decompose complex tokens. A finite state machine is then used on the minimal semantic units in order to find phrases that possess time-related information. RESULTS: In the experiment, the temporal expressions within Korean clinical narratives were extracted using our system. The system performance was evaluated through the use of 100 discharge summaries from Seoul National University Hospital containing a total of 805 temporal expressions. Our system scored a phrase-level precision and recall of 0.895 and 0.919, respectively. CONCLUSIONS: Finding information in Korean clinical narrative is challenging task, since the text is written in both Korean and English and frequently omits syntactic elements and word spacing, which makes it extremely noisy. This study presents an effective method that can be used to aquire the temporal information found in Korean clinical documents. |
format | Online Article Text |
id | pubmed-3212741 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Korean Society of Medical Informatics |
record_format | MEDLINE/PubMed |
spelling | pubmed-32127412011-11-16 Recognizing Temporal Information in Korean Clinical Narratives through Text Normalization Kim, Youngho Choi, Jinwook Healthc Inform Res Original Article OBJECTIVES: Acquiring temporal information is important because knowledge in clinical narratives is time-sensitive. In this paper, we describe an approach that can be used to extract the temporal information found in Korean clinical narrative texts. METHODS: We developed a two-stage system, which employs an exhaustive text analysis phase and a temporal expression recognition phase. Since our target document may include tokens that are made up of both Korean and English text joined together, the minimal semantic units are analyzed and then separated from the concatenated phrases and linguistic derivations within a token using a corpus-based approach to decompose complex tokens. A finite state machine is then used on the minimal semantic units in order to find phrases that possess time-related information. RESULTS: In the experiment, the temporal expressions within Korean clinical narratives were extracted using our system. The system performance was evaluated through the use of 100 discharge summaries from Seoul National University Hospital containing a total of 805 temporal expressions. Our system scored a phrase-level precision and recall of 0.895 and 0.919, respectively. CONCLUSIONS: Finding information in Korean clinical narrative is challenging task, since the text is written in both Korean and English and frequently omits syntactic elements and word spacing, which makes it extremely noisy. This study presents an effective method that can be used to aquire the temporal information found in Korean clinical documents. Korean Society of Medical Informatics 2011-09 2011-09-30 /pmc/articles/PMC3212741/ /pubmed/22084809 http://dx.doi.org/10.4258/hir.2011.17.3.150 Text en © 2011 The Korean Society of Medical Informatics http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Article Kim, Youngho Choi, Jinwook Recognizing Temporal Information in Korean Clinical Narratives through Text Normalization |
title | Recognizing Temporal Information in Korean Clinical Narratives through Text Normalization |
title_full | Recognizing Temporal Information in Korean Clinical Narratives through Text Normalization |
title_fullStr | Recognizing Temporal Information in Korean Clinical Narratives through Text Normalization |
title_full_unstemmed | Recognizing Temporal Information in Korean Clinical Narratives through Text Normalization |
title_short | Recognizing Temporal Information in Korean Clinical Narratives through Text Normalization |
title_sort | recognizing temporal information in korean clinical narratives through text normalization |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3212741/ https://www.ncbi.nlm.nih.gov/pubmed/22084809 http://dx.doi.org/10.4258/hir.2011.17.3.150 |
work_keys_str_mv | AT kimyoungho recognizingtemporalinformationinkoreanclinicalnarrativesthroughtextnormalization AT choijinwook recognizingtemporalinformationinkoreanclinicalnarrativesthroughtextnormalization |