Cargando…

End-to-End Models to Imitate Traditional Chinese Medicine Syndrome Differentiation in Lung Cancer Diagnosis: Model Development and Validation

BACKGROUND: Traditional Chinese medicine (TCM) has been shown to be an efficient mode to manage advanced lung cancer, and accurate syndrome differentiation is crucial to treatment. Documented evidence of TCM treatment cases and the progress of artificial intelligence technology are enabling the deve...

Descripción completa

Detalles Bibliográficos
Autores principales:	Liu, Ziqing, He, Haiyang, Yan, Shixing, Wang, Yong, Yang, Tao, Li, Guo-Zheng
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	JMIR Publications 2020
Materias:	Original Paper
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7327597/ https://www.ncbi.nlm.nih.gov/pubmed/32543445 http://dx.doi.org/10.2196/17821

_version_	1783552577471774720
author	Liu, Ziqing He, Haiyang Yan, Shixing Wang, Yong Yang, Tao Li, Guo-Zheng
author_facet	Liu, Ziqing He, Haiyang Yan, Shixing Wang, Yong Yang, Tao Li, Guo-Zheng
author_sort	Liu, Ziqing
collection	PubMed
description	BACKGROUND: Traditional Chinese medicine (TCM) has been shown to be an efficient mode to manage advanced lung cancer, and accurate syndrome differentiation is crucial to treatment. Documented evidence of TCM treatment cases and the progress of artificial intelligence technology are enabling the development of intelligent TCM syndrome differentiation models. This is expected to expand the benefits of TCM to lung cancer patients. OBJECTIVE: The objective of this work was to establish end-to-end TCM diagnostic models to imitate lung cancer syndrome differentiation. The proposed models used unstructured medical records as inputs to capitalize on data collected for practical TCM treatment cases by lung cancer experts. The resulting models were expected to be more efficient than approaches that leverage structured TCM datasets. METHODS: We approached lung cancer TCM syndrome differentiation as a multilabel text classification problem. First, entity representation was conducted with Bidirectional Encoder Representations from Transformers and conditional random fields models. Then, five deep learning–based text classification models were applied to the construction of a medical record multilabel classifier, during which two data augmentation strategies were adopted to address overfitting issues. Finally, a fusion model approach was used to elevate the performance of the models. RESULTS: The F1 score of the recurrent convolutional neural network (RCNN) model with augmentation was 0.8650, a 2.41% improvement over the unaugmented model. The Hamming loss for RCNN with augmentation was 0.0987, which is 1.8% lower than that of the same model without augmentation. Among the models, the text-hierarchical attention network (Text-HAN) model achieved the highest F1 scores of 0.8676 and 0.8751. The mean average precision for the word encoding–based RCNN was 10% higher than that of the character encoding–based representation. A fusion model of the text-convolutional neural network, text-recurrent neural network, and Text-HAN models achieved an F1 score of 0.8884, which showed the best performance among the models. CONCLUSIONS: Medical records could be used more productively by constructing end-to-end models to facilitate TCM diagnosis. With the aid of entity-level representation, data augmentation, and model fusion, deep learning–based multilabel classification approaches can better imitate TCM syndrome differentiation in complex cases such as advanced lung cancer.
format	Online Article Text
id	pubmed-7327597
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	JMIR Publications
record_format	MEDLINE/PubMed
spelling	pubmed-73275972020-07-06 End-to-End Models to Imitate Traditional Chinese Medicine Syndrome Differentiation in Lung Cancer Diagnosis: Model Development and Validation Liu, Ziqing He, Haiyang Yan, Shixing Wang, Yong Yang, Tao Li, Guo-Zheng JMIR Med Inform Original Paper BACKGROUND: Traditional Chinese medicine (TCM) has been shown to be an efficient mode to manage advanced lung cancer, and accurate syndrome differentiation is crucial to treatment. Documented evidence of TCM treatment cases and the progress of artificial intelligence technology are enabling the development of intelligent TCM syndrome differentiation models. This is expected to expand the benefits of TCM to lung cancer patients. OBJECTIVE: The objective of this work was to establish end-to-end TCM diagnostic models to imitate lung cancer syndrome differentiation. The proposed models used unstructured medical records as inputs to capitalize on data collected for practical TCM treatment cases by lung cancer experts. The resulting models were expected to be more efficient than approaches that leverage structured TCM datasets. METHODS: We approached lung cancer TCM syndrome differentiation as a multilabel text classification problem. First, entity representation was conducted with Bidirectional Encoder Representations from Transformers and conditional random fields models. Then, five deep learning–based text classification models were applied to the construction of a medical record multilabel classifier, during which two data augmentation strategies were adopted to address overfitting issues. Finally, a fusion model approach was used to elevate the performance of the models. RESULTS: The F1 score of the recurrent convolutional neural network (RCNN) model with augmentation was 0.8650, a 2.41% improvement over the unaugmented model. The Hamming loss for RCNN with augmentation was 0.0987, which is 1.8% lower than that of the same model without augmentation. Among the models, the text-hierarchical attention network (Text-HAN) model achieved the highest F1 scores of 0.8676 and 0.8751. The mean average precision for the word encoding–based RCNN was 10% higher than that of the character encoding–based representation. A fusion model of the text-convolutional neural network, text-recurrent neural network, and Text-HAN models achieved an F1 score of 0.8884, which showed the best performance among the models. CONCLUSIONS: Medical records could be used more productively by constructing end-to-end models to facilitate TCM diagnosis. With the aid of entity-level representation, data augmentation, and model fusion, deep learning–based multilabel classification approaches can better imitate TCM syndrome differentiation in complex cases such as advanced lung cancer. JMIR Publications 2020-06-16 /pmc/articles/PMC7327597/ /pubmed/32543445 http://dx.doi.org/10.2196/17821 Text en ©Ziqing Liu, Haiyang He, Shixing Yan, Yong Wang, Tao Yang, Guo-Zheng Li. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 16.06.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle	Original Paper Liu, Ziqing He, Haiyang Yan, Shixing Wang, Yong Yang, Tao Li, Guo-Zheng End-to-End Models to Imitate Traditional Chinese Medicine Syndrome Differentiation in Lung Cancer Diagnosis: Model Development and Validation
title	End-to-End Models to Imitate Traditional Chinese Medicine Syndrome Differentiation in Lung Cancer Diagnosis: Model Development and Validation
title_full	End-to-End Models to Imitate Traditional Chinese Medicine Syndrome Differentiation in Lung Cancer Diagnosis: Model Development and Validation
title_fullStr	End-to-End Models to Imitate Traditional Chinese Medicine Syndrome Differentiation in Lung Cancer Diagnosis: Model Development and Validation
title_full_unstemmed	End-to-End Models to Imitate Traditional Chinese Medicine Syndrome Differentiation in Lung Cancer Diagnosis: Model Development and Validation
title_short	End-to-End Models to Imitate Traditional Chinese Medicine Syndrome Differentiation in Lung Cancer Diagnosis: Model Development and Validation
title_sort	end-to-end models to imitate traditional chinese medicine syndrome differentiation in lung cancer diagnosis: model development and validation
topic	Original Paper
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7327597/ https://www.ncbi.nlm.nih.gov/pubmed/32543445 http://dx.doi.org/10.2196/17821
work_keys_str_mv	AT liuziqing endtoendmodelstoimitatetraditionalchinesemedicinesyndromedifferentiationinlungcancerdiagnosismodeldevelopmentandvalidation AT hehaiyang endtoendmodelstoimitatetraditionalchinesemedicinesyndromedifferentiationinlungcancerdiagnosismodeldevelopmentandvalidation AT yanshixing endtoendmodelstoimitatetraditionalchinesemedicinesyndromedifferentiationinlungcancerdiagnosismodeldevelopmentandvalidation AT wangyong endtoendmodelstoimitatetraditionalchinesemedicinesyndromedifferentiationinlungcancerdiagnosismodeldevelopmentandvalidation AT yangtao endtoendmodelstoimitatetraditionalchinesemedicinesyndromedifferentiationinlungcancerdiagnosismodeldevelopmentandvalidation AT liguozheng endtoendmodelstoimitatetraditionalchinesemedicinesyndromedifferentiationinlungcancerdiagnosismodeldevelopmentandvalidation

End-to-End Models to Imitate Traditional Chinese Medicine Syndrome Differentiation in Lung Cancer Diagnosis: Model Development and Validation

Ejemplares similares