Cargando…

Temporal Expression Classification and Normalization From Chinese Narrative Clinical Texts: Pattern Learning Approach

BACKGROUND: Temporal information frequently exists in the representation of the disease progress, prescription, medication, surgery progress, or discharge summary in narrative clinical text. The accurate extraction and normalization of temporal expressions can positively boost the analysis and under...

Descripción completa

Detalles Bibliográficos
Autores principales: Pan, Xiaoyi, Chen, Boyu, Weng, Heng, Gong, Yongyi, Qu, Yingying
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7418025/
https://www.ncbi.nlm.nih.gov/pubmed/32716307
http://dx.doi.org/10.2196/17652
_version_ 1783569617912856576
author Pan, Xiaoyi
Chen, Boyu
Weng, Heng
Gong, Yongyi
Qu, Yingying
author_facet Pan, Xiaoyi
Chen, Boyu
Weng, Heng
Gong, Yongyi
Qu, Yingying
author_sort Pan, Xiaoyi
collection PubMed
description BACKGROUND: Temporal information frequently exists in the representation of the disease progress, prescription, medication, surgery progress, or discharge summary in narrative clinical text. The accurate extraction and normalization of temporal expressions can positively boost the analysis and understanding of narrative clinical texts to promote clinical research and practice. OBJECTIVE: The goal of the study was to propose a novel approach for extracting and normalizing temporal expressions from Chinese narrative clinical text. METHODS: TNorm, a rule-based and pattern learning-based approach, has been developed for automatic temporal expression extraction and normalization from unstructured Chinese clinical text data. TNorm consists of three stages: extraction, classification, and normalization. It applies a set of heuristic rules and automatically generated patterns for temporal expression identification and extraction of clinical texts. Then, it collects the features of extracted temporal expressions for temporal type prediction and classification by using machine learning algorithms. Finally, the features are combined with the rule-based and a pattern learning-based approach to normalize the extracted temporal expressions. RESULTS: The evaluation dataset is a set of narrative clinical texts in Chinese containing 1459 discharge summaries of a domestic Grade A Class 3 hospital. The results show that TNorm, combined with temporal expressions extraction and temporal types prediction, achieves a precision of 0.8491, a recall of 0.8328, and a F1 score of 0.8409 in temporal expressions normalization. CONCLUSIONS: This study illustrates an automatic approach, TNorm, that extracts and normalizes temporal expression from Chinese narrative clinical texts. TNorm was evaluated on the basis of discharge summary data, and results demonstrate its effectiveness on temporal expression normalization.
format Online
Article
Text
id pubmed-7418025
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-74180252020-08-20 Temporal Expression Classification and Normalization From Chinese Narrative Clinical Texts: Pattern Learning Approach Pan, Xiaoyi Chen, Boyu Weng, Heng Gong, Yongyi Qu, Yingying JMIR Med Inform Original Paper BACKGROUND: Temporal information frequently exists in the representation of the disease progress, prescription, medication, surgery progress, or discharge summary in narrative clinical text. The accurate extraction and normalization of temporal expressions can positively boost the analysis and understanding of narrative clinical texts to promote clinical research and practice. OBJECTIVE: The goal of the study was to propose a novel approach for extracting and normalizing temporal expressions from Chinese narrative clinical text. METHODS: TNorm, a rule-based and pattern learning-based approach, has been developed for automatic temporal expression extraction and normalization from unstructured Chinese clinical text data. TNorm consists of three stages: extraction, classification, and normalization. It applies a set of heuristic rules and automatically generated patterns for temporal expression identification and extraction of clinical texts. Then, it collects the features of extracted temporal expressions for temporal type prediction and classification by using machine learning algorithms. Finally, the features are combined with the rule-based and a pattern learning-based approach to normalize the extracted temporal expressions. RESULTS: The evaluation dataset is a set of narrative clinical texts in Chinese containing 1459 discharge summaries of a domestic Grade A Class 3 hospital. The results show that TNorm, combined with temporal expressions extraction and temporal types prediction, achieves a precision of 0.8491, a recall of 0.8328, and a F1 score of 0.8409 in temporal expressions normalization. CONCLUSIONS: This study illustrates an automatic approach, TNorm, that extracts and normalizes temporal expression from Chinese narrative clinical texts. TNorm was evaluated on the basis of discharge summary data, and results demonstrate its effectiveness on temporal expression normalization. JMIR Publications 2020-07-27 /pmc/articles/PMC7418025/ /pubmed/32716307 http://dx.doi.org/10.2196/17652 Text en ©Xiaoyi Pan, Boyu Chen, Heng Weng, Yongyi Gong, Yingying Qu. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 27.07.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Pan, Xiaoyi
Chen, Boyu
Weng, Heng
Gong, Yongyi
Qu, Yingying
Temporal Expression Classification and Normalization From Chinese Narrative Clinical Texts: Pattern Learning Approach
title Temporal Expression Classification and Normalization From Chinese Narrative Clinical Texts: Pattern Learning Approach
title_full Temporal Expression Classification and Normalization From Chinese Narrative Clinical Texts: Pattern Learning Approach
title_fullStr Temporal Expression Classification and Normalization From Chinese Narrative Clinical Texts: Pattern Learning Approach
title_full_unstemmed Temporal Expression Classification and Normalization From Chinese Narrative Clinical Texts: Pattern Learning Approach
title_short Temporal Expression Classification and Normalization From Chinese Narrative Clinical Texts: Pattern Learning Approach
title_sort temporal expression classification and normalization from chinese narrative clinical texts: pattern learning approach
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7418025/
https://www.ncbi.nlm.nih.gov/pubmed/32716307
http://dx.doi.org/10.2196/17652
work_keys_str_mv AT panxiaoyi temporalexpressionclassificationandnormalizationfromchinesenarrativeclinicaltextspatternlearningapproach
AT chenboyu temporalexpressionclassificationandnormalizationfromchinesenarrativeclinicaltextspatternlearningapproach
AT wengheng temporalexpressionclassificationandnormalizationfromchinesenarrativeclinicaltextspatternlearningapproach
AT gongyongyi temporalexpressionclassificationandnormalizationfromchinesenarrativeclinicaltextspatternlearningapproach
AT quyingying temporalexpressionclassificationandnormalizationfromchinesenarrativeclinicaltextspatternlearningapproach