Cargando…

Multi-Task Joint Learning Model for Chinese Word Segmentation and Syndrome Differentiation in Traditional Chinese Medicine

Evidence-based treatment is the basis of traditional Chinese medicine (TCM), and the accurate differentiation of syndromes is important for treatment in this context. The automatic differentiation of syndromes of unstructured medical records requires two important steps: Chinese word segmentation an...

Descripción completa

Detalles Bibliográficos
Autores principales: Hu, Chenyuan, Zhang, Shuoyan, Gu, Tianyu, Yan, Zhuangzhi, Jiang, Jiehui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9103751/
https://www.ncbi.nlm.nih.gov/pubmed/35564995
http://dx.doi.org/10.3390/ijerph19095601
_version_ 1784707627287052288
author Hu, Chenyuan
Zhang, Shuoyan
Gu, Tianyu
Yan, Zhuangzhi
Jiang, Jiehui
author_facet Hu, Chenyuan
Zhang, Shuoyan
Gu, Tianyu
Yan, Zhuangzhi
Jiang, Jiehui
author_sort Hu, Chenyuan
collection PubMed
description Evidence-based treatment is the basis of traditional Chinese medicine (TCM), and the accurate differentiation of syndromes is important for treatment in this context. The automatic differentiation of syndromes of unstructured medical records requires two important steps: Chinese word segmentation and text classification. Due to the ambiguity of the Chinese language and the peculiarities of syndrome differentiation, these tasks pose a daunting challenge. We use text classification to model syndrome differentiation for TCM, and use multi-task learning (MTL) and deep learning to accomplish the two challenging tasks of Chinese word segmentation and syndrome differentiation. Two classic deep neural networks—bidirectional long short-term memory (Bi-LSTM) and text-based convolutional neural networks (TextCNN)—are fused into MTL to simultaneously carry out these two tasks. We used our proposed method to conduct a large number of comparative experiments. The experimental comparisons showed that it was superior to other methods on both tasks. Our model yielded values of accuracy, specificity, and sensitivity of 0.93, 0.94, and 0.90, and 0.80, 0.82, and 0.78 on the Chinese word segmentation task and the syndrome differentiation task, respectively. Moreover, statistical analyses showed that the accuracies of the non-joint and joint models were both within the 95% confidence interval, with [Formula: see text] < 0.05. The experimental comparison showed that our method is superior to prevalent methods on both tasks. The work here can help modernize TCM through intelligent differentiation.
format Online
Article
Text
id pubmed-9103751
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-91037512022-05-14 Multi-Task Joint Learning Model for Chinese Word Segmentation and Syndrome Differentiation in Traditional Chinese Medicine Hu, Chenyuan Zhang, Shuoyan Gu, Tianyu Yan, Zhuangzhi Jiang, Jiehui Int J Environ Res Public Health Article Evidence-based treatment is the basis of traditional Chinese medicine (TCM), and the accurate differentiation of syndromes is important for treatment in this context. The automatic differentiation of syndromes of unstructured medical records requires two important steps: Chinese word segmentation and text classification. Due to the ambiguity of the Chinese language and the peculiarities of syndrome differentiation, these tasks pose a daunting challenge. We use text classification to model syndrome differentiation for TCM, and use multi-task learning (MTL) and deep learning to accomplish the two challenging tasks of Chinese word segmentation and syndrome differentiation. Two classic deep neural networks—bidirectional long short-term memory (Bi-LSTM) and text-based convolutional neural networks (TextCNN)—are fused into MTL to simultaneously carry out these two tasks. We used our proposed method to conduct a large number of comparative experiments. The experimental comparisons showed that it was superior to other methods on both tasks. Our model yielded values of accuracy, specificity, and sensitivity of 0.93, 0.94, and 0.90, and 0.80, 0.82, and 0.78 on the Chinese word segmentation task and the syndrome differentiation task, respectively. Moreover, statistical analyses showed that the accuracies of the non-joint and joint models were both within the 95% confidence interval, with [Formula: see text] < 0.05. The experimental comparison showed that our method is superior to prevalent methods on both tasks. The work here can help modernize TCM through intelligent differentiation. MDPI 2022-05-05 /pmc/articles/PMC9103751/ /pubmed/35564995 http://dx.doi.org/10.3390/ijerph19095601 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Hu, Chenyuan
Zhang, Shuoyan
Gu, Tianyu
Yan, Zhuangzhi
Jiang, Jiehui
Multi-Task Joint Learning Model for Chinese Word Segmentation and Syndrome Differentiation in Traditional Chinese Medicine
title Multi-Task Joint Learning Model for Chinese Word Segmentation and Syndrome Differentiation in Traditional Chinese Medicine
title_full Multi-Task Joint Learning Model for Chinese Word Segmentation and Syndrome Differentiation in Traditional Chinese Medicine
title_fullStr Multi-Task Joint Learning Model for Chinese Word Segmentation and Syndrome Differentiation in Traditional Chinese Medicine
title_full_unstemmed Multi-Task Joint Learning Model for Chinese Word Segmentation and Syndrome Differentiation in Traditional Chinese Medicine
title_short Multi-Task Joint Learning Model for Chinese Word Segmentation and Syndrome Differentiation in Traditional Chinese Medicine
title_sort multi-task joint learning model for chinese word segmentation and syndrome differentiation in traditional chinese medicine
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9103751/
https://www.ncbi.nlm.nih.gov/pubmed/35564995
http://dx.doi.org/10.3390/ijerph19095601
work_keys_str_mv AT huchenyuan multitaskjointlearningmodelforchinesewordsegmentationandsyndromedifferentiationintraditionalchinesemedicine
AT zhangshuoyan multitaskjointlearningmodelforchinesewordsegmentationandsyndromedifferentiationintraditionalchinesemedicine
AT gutianyu multitaskjointlearningmodelforchinesewordsegmentationandsyndromedifferentiationintraditionalchinesemedicine
AT yanzhuangzhi multitaskjointlearningmodelforchinesewordsegmentationandsyndromedifferentiationintraditionalchinesemedicine
AT jiangjiehui multitaskjointlearningmodelforchinesewordsegmentationandsyndromedifferentiationintraditionalchinesemedicine