Cargando…

Recognizing Temporal Information in Korean Clinical Narratives through Text Normalization

OBJECTIVES: Acquiring temporal information is important because knowledge in clinical narratives is time-sensitive. In this paper, we describe an approach that can be used to extract the temporal information found in Korean clinical narrative texts. METHODS: We developed a two-stage system, which em...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Youngho, Choi, Jinwook
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Korean Society of Medical Informatics 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3212741/
https://www.ncbi.nlm.nih.gov/pubmed/22084809
http://dx.doi.org/10.4258/hir.2011.17.3.150
_version_ 1782216015608807424
author Kim, Youngho
Choi, Jinwook
author_facet Kim, Youngho
Choi, Jinwook
author_sort Kim, Youngho
collection PubMed
description OBJECTIVES: Acquiring temporal information is important because knowledge in clinical narratives is time-sensitive. In this paper, we describe an approach that can be used to extract the temporal information found in Korean clinical narrative texts. METHODS: We developed a two-stage system, which employs an exhaustive text analysis phase and a temporal expression recognition phase. Since our target document may include tokens that are made up of both Korean and English text joined together, the minimal semantic units are analyzed and then separated from the concatenated phrases and linguistic derivations within a token using a corpus-based approach to decompose complex tokens. A finite state machine is then used on the minimal semantic units in order to find phrases that possess time-related information. RESULTS: In the experiment, the temporal expressions within Korean clinical narratives were extracted using our system. The system performance was evaluated through the use of 100 discharge summaries from Seoul National University Hospital containing a total of 805 temporal expressions. Our system scored a phrase-level precision and recall of 0.895 and 0.919, respectively. CONCLUSIONS: Finding information in Korean clinical narrative is challenging task, since the text is written in both Korean and English and frequently omits syntactic elements and word spacing, which makes it extremely noisy. This study presents an effective method that can be used to aquire the temporal information found in Korean clinical documents.
format Online
Article
Text
id pubmed-3212741
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Korean Society of Medical Informatics
record_format MEDLINE/PubMed
spelling pubmed-32127412011-11-16 Recognizing Temporal Information in Korean Clinical Narratives through Text Normalization Kim, Youngho Choi, Jinwook Healthc Inform Res Original Article OBJECTIVES: Acquiring temporal information is important because knowledge in clinical narratives is time-sensitive. In this paper, we describe an approach that can be used to extract the temporal information found in Korean clinical narrative texts. METHODS: We developed a two-stage system, which employs an exhaustive text analysis phase and a temporal expression recognition phase. Since our target document may include tokens that are made up of both Korean and English text joined together, the minimal semantic units are analyzed and then separated from the concatenated phrases and linguistic derivations within a token using a corpus-based approach to decompose complex tokens. A finite state machine is then used on the minimal semantic units in order to find phrases that possess time-related information. RESULTS: In the experiment, the temporal expressions within Korean clinical narratives were extracted using our system. The system performance was evaluated through the use of 100 discharge summaries from Seoul National University Hospital containing a total of 805 temporal expressions. Our system scored a phrase-level precision and recall of 0.895 and 0.919, respectively. CONCLUSIONS: Finding information in Korean clinical narrative is challenging task, since the text is written in both Korean and English and frequently omits syntactic elements and word spacing, which makes it extremely noisy. This study presents an effective method that can be used to aquire the temporal information found in Korean clinical documents. Korean Society of Medical Informatics 2011-09 2011-09-30 /pmc/articles/PMC3212741/ /pubmed/22084809 http://dx.doi.org/10.4258/hir.2011.17.3.150 Text en © 2011 The Korean Society of Medical Informatics http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Kim, Youngho
Choi, Jinwook
Recognizing Temporal Information in Korean Clinical Narratives through Text Normalization
title Recognizing Temporal Information in Korean Clinical Narratives through Text Normalization
title_full Recognizing Temporal Information in Korean Clinical Narratives through Text Normalization
title_fullStr Recognizing Temporal Information in Korean Clinical Narratives through Text Normalization
title_full_unstemmed Recognizing Temporal Information in Korean Clinical Narratives through Text Normalization
title_short Recognizing Temporal Information in Korean Clinical Narratives through Text Normalization
title_sort recognizing temporal information in korean clinical narratives through text normalization
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3212741/
https://www.ncbi.nlm.nih.gov/pubmed/22084809
http://dx.doi.org/10.4258/hir.2011.17.3.150
work_keys_str_mv AT kimyoungho recognizingtemporalinformationinkoreanclinicalnarrativesthroughtextnormalization
AT choijinwook recognizingtemporalinformationinkoreanclinicalnarrativesthroughtextnormalization