Cargando…
Intelligent diagnosis with Chinese electronic medical records based on convolutional neural networks
BACKGROUND: Benefiting from big data, powerful computation and new algorithmic techniques, we have been witnessing the renaissance of deep learning, particularly the combination of natural language processing (NLP) and deep neural networks. The advent of electronic medical records (EMRs) has not onl...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6359854/ https://www.ncbi.nlm.nih.gov/pubmed/30709336 http://dx.doi.org/10.1186/s12859-019-2617-8 |
_version_ | 1783392371632766976 |
---|---|
author | Li, Xiaozheng Wang, Huazhen He, Huixin Du, Jixiang Chen, Jian Wu, Jinzhun |
author_facet | Li, Xiaozheng Wang, Huazhen He, Huixin Du, Jixiang Chen, Jian Wu, Jinzhun |
author_sort | Li, Xiaozheng |
collection | PubMed |
description | BACKGROUND: Benefiting from big data, powerful computation and new algorithmic techniques, we have been witnessing the renaissance of deep learning, particularly the combination of natural language processing (NLP) and deep neural networks. The advent of electronic medical records (EMRs) has not only changed the format of medical records but also helped users to obtain information faster. However, there are many challenges regarding researching directly using Chinese EMRs, such as low quality, huge quantity, imbalance, semi-structure and non-structure, particularly the high density of the Chinese language compared with English. Therefore, effective word segmentation, word representation and model architecture are the core technologies in the literature on Chinese EMRs. RESULTS: In this paper, we propose a deep learning framework to study intelligent diagnosis using Chinese EMR data, which incorporates a convolutional neural network (CNN) into an EMR classification application. The novelty of this paper is reflected in the following: (1) We construct a pediatric medical dictionary based on Chinese EMRs. (2) Word2vec adopted in word embedding is used to achieve the semantic description of the content of Chinese EMRs. (3) A fine-tuning CNN model is constructed to feed the pediatric diagnosis with Chinese EMR data. Our results on real-world pediatric Chinese EMRs demonstrate that the average accuracy and F1-score of the CNN models are up to 81%, which indicates the effectiveness of the CNN model for the classification of EMRs. Particularly, a fine-tuning one-layer CNN performs best among all CNNs, recurrent neural network (RNN) (long short-term memory, gated recurrent unit) and CNN-RNN models, and the average accuracy and F1-score are both up to 83%. CONCLUSION: The CNN framework that includes word segmentation, word embedding and model training can serve as an intelligent auxiliary diagnosis tool for pediatricians. Particularly, a fine-tuning one-layer CNN performs well, which indicates that word order does not appear to have a useful effect on our Chinese EMRs. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2617-8) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6359854 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-63598542019-02-07 Intelligent diagnosis with Chinese electronic medical records based on convolutional neural networks Li, Xiaozheng Wang, Huazhen He, Huixin Du, Jixiang Chen, Jian Wu, Jinzhun BMC Bioinformatics Research Article BACKGROUND: Benefiting from big data, powerful computation and new algorithmic techniques, we have been witnessing the renaissance of deep learning, particularly the combination of natural language processing (NLP) and deep neural networks. The advent of electronic medical records (EMRs) has not only changed the format of medical records but also helped users to obtain information faster. However, there are many challenges regarding researching directly using Chinese EMRs, such as low quality, huge quantity, imbalance, semi-structure and non-structure, particularly the high density of the Chinese language compared with English. Therefore, effective word segmentation, word representation and model architecture are the core technologies in the literature on Chinese EMRs. RESULTS: In this paper, we propose a deep learning framework to study intelligent diagnosis using Chinese EMR data, which incorporates a convolutional neural network (CNN) into an EMR classification application. The novelty of this paper is reflected in the following: (1) We construct a pediatric medical dictionary based on Chinese EMRs. (2) Word2vec adopted in word embedding is used to achieve the semantic description of the content of Chinese EMRs. (3) A fine-tuning CNN model is constructed to feed the pediatric diagnosis with Chinese EMR data. Our results on real-world pediatric Chinese EMRs demonstrate that the average accuracy and F1-score of the CNN models are up to 81%, which indicates the effectiveness of the CNN model for the classification of EMRs. Particularly, a fine-tuning one-layer CNN performs best among all CNNs, recurrent neural network (RNN) (long short-term memory, gated recurrent unit) and CNN-RNN models, and the average accuracy and F1-score are both up to 83%. CONCLUSION: The CNN framework that includes word segmentation, word embedding and model training can serve as an intelligent auxiliary diagnosis tool for pediatricians. Particularly, a fine-tuning one-layer CNN performs well, which indicates that word order does not appear to have a useful effect on our Chinese EMRs. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2617-8) contains supplementary material, which is available to authorized users. BioMed Central 2019-02-01 /pmc/articles/PMC6359854/ /pubmed/30709336 http://dx.doi.org/10.1186/s12859-019-2617-8 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Li, Xiaozheng Wang, Huazhen He, Huixin Du, Jixiang Chen, Jian Wu, Jinzhun Intelligent diagnosis with Chinese electronic medical records based on convolutional neural networks |
title | Intelligent diagnosis with Chinese electronic medical records based on convolutional neural networks |
title_full | Intelligent diagnosis with Chinese electronic medical records based on convolutional neural networks |
title_fullStr | Intelligent diagnosis with Chinese electronic medical records based on convolutional neural networks |
title_full_unstemmed | Intelligent diagnosis with Chinese electronic medical records based on convolutional neural networks |
title_short | Intelligent diagnosis with Chinese electronic medical records based on convolutional neural networks |
title_sort | intelligent diagnosis with chinese electronic medical records based on convolutional neural networks |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6359854/ https://www.ncbi.nlm.nih.gov/pubmed/30709336 http://dx.doi.org/10.1186/s12859-019-2617-8 |
work_keys_str_mv | AT lixiaozheng intelligentdiagnosiswithchineseelectronicmedicalrecordsbasedonconvolutionalneuralnetworks AT wanghuazhen intelligentdiagnosiswithchineseelectronicmedicalrecordsbasedonconvolutionalneuralnetworks AT hehuixin intelligentdiagnosiswithchineseelectronicmedicalrecordsbasedonconvolutionalneuralnetworks AT dujixiang intelligentdiagnosiswithchineseelectronicmedicalrecordsbasedonconvolutionalneuralnetworks AT chenjian intelligentdiagnosiswithchineseelectronicmedicalrecordsbasedonconvolutionalneuralnetworks AT wujinzhun intelligentdiagnosiswithchineseelectronicmedicalrecordsbasedonconvolutionalneuralnetworks |