Cargando…
An intelligent prediagnosis system for disease prediction and examination recommendation based on electronic medical record and a medical-semantic-aware convolution neural network (MSCNN) for pediatric chronic cough
BACKGROUND: Due to the phenotypic similarities among different pediatric respiratory diseases with chronic cough, primary doctors often misdiagnose and the misuse of examinations is prevalent. In the pre-diagnosis stage, the patients' chief complaints and other information in the electronic med...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
AME Publishing Company
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9360821/ https://www.ncbi.nlm.nih.gov/pubmed/35958012 http://dx.doi.org/10.21037/tp-22-275 |
_version_ | 1784764409870024704 |
---|---|
author | Zhu, Zhu Li, Jing Huang, Jian Li, Zheming Zhang, Hongjian Chen, Siyu Zhong, Qianhui Xie, Yulan Hu, Shasha Wang, Yinshuo Wang, Dejian Yu, Gang |
author_facet | Zhu, Zhu Li, Jing Huang, Jian Li, Zheming Zhang, Hongjian Chen, Siyu Zhong, Qianhui Xie, Yulan Hu, Shasha Wang, Yinshuo Wang, Dejian Yu, Gang |
author_sort | Zhu, Zhu |
collection | PubMed |
description | BACKGROUND: Due to the phenotypic similarities among different pediatric respiratory diseases with chronic cough, primary doctors often misdiagnose and the misuse of examinations is prevalent. In the pre-diagnosis stage, the patients' chief complaints and other information in the electronic medical record (EMR) provide a powerful reference for respiratory experts to make preliminary disease judgment and examination plan. In this paper, we proposed an intelligent prediagnosis system to predict disease diagnosis and recommend examinations based on EMR text. METHODS: We examined the clinical notes of 178,293 children with chronic cough symptoms from retrospective EMR data. The dataset is split into 7:3 for training and testing. From the testing set, we also extract 5% of samples for validation. We proposed a medical-semantic-aware convolution neural network (MSCNN) framework that can accomplish two downstream tasks from the same medical language model through transfer learning. First, a medical language model based on the word2vec algorithm was built to generate embeddings for the text data. Then, text convolutional neural network (TextCNN) was used to build models for disease prediction and examination recommendation. RESULTS: We implemented 5 algorithms for disease prediction. In the disease prediction task, our algorithm outperformed the baseline methods on all metrics, with a top-1 accuracy (AC) of 0.68 and a top-3 AC of 0.923 on the testing set. By adding data enhancement, the top-3 AC reached 0.926. In the examination recommendation task, the overall AC on the testing set was 0.93 and the macro average (MA) F1-score was 0.88. The average area under the curve (AUC) on the training set was 0.97 while on the testing set it was 0.86. CONCLUSIONS: We constructed an intelligent prediagnosis system with an MSCNN framework that can predict diseases and make examination recommendations based on EMR data. Our approach achieved good results on a retrospective clinical dataset and thus has great potential for the application of automated diagnosis assist in clinical practice during pre-diagnosis stage, which will provide help for primary level doctors or doctors in basic-level hospitals. Due to the generality of the proposed framework, it can be straight forwardly extended to prediagnosis for other diseases. |
format | Online Article Text |
id | pubmed-9360821 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | AME Publishing Company |
record_format | MEDLINE/PubMed |
spelling | pubmed-93608212022-08-10 An intelligent prediagnosis system for disease prediction and examination recommendation based on electronic medical record and a medical-semantic-aware convolution neural network (MSCNN) for pediatric chronic cough Zhu, Zhu Li, Jing Huang, Jian Li, Zheming Zhang, Hongjian Chen, Siyu Zhong, Qianhui Xie, Yulan Hu, Shasha Wang, Yinshuo Wang, Dejian Yu, Gang Transl Pediatr Original Article BACKGROUND: Due to the phenotypic similarities among different pediatric respiratory diseases with chronic cough, primary doctors often misdiagnose and the misuse of examinations is prevalent. In the pre-diagnosis stage, the patients' chief complaints and other information in the electronic medical record (EMR) provide a powerful reference for respiratory experts to make preliminary disease judgment and examination plan. In this paper, we proposed an intelligent prediagnosis system to predict disease diagnosis and recommend examinations based on EMR text. METHODS: We examined the clinical notes of 178,293 children with chronic cough symptoms from retrospective EMR data. The dataset is split into 7:3 for training and testing. From the testing set, we also extract 5% of samples for validation. We proposed a medical-semantic-aware convolution neural network (MSCNN) framework that can accomplish two downstream tasks from the same medical language model through transfer learning. First, a medical language model based on the word2vec algorithm was built to generate embeddings for the text data. Then, text convolutional neural network (TextCNN) was used to build models for disease prediction and examination recommendation. RESULTS: We implemented 5 algorithms for disease prediction. In the disease prediction task, our algorithm outperformed the baseline methods on all metrics, with a top-1 accuracy (AC) of 0.68 and a top-3 AC of 0.923 on the testing set. By adding data enhancement, the top-3 AC reached 0.926. In the examination recommendation task, the overall AC on the testing set was 0.93 and the macro average (MA) F1-score was 0.88. The average area under the curve (AUC) on the training set was 0.97 while on the testing set it was 0.86. CONCLUSIONS: We constructed an intelligent prediagnosis system with an MSCNN framework that can predict diseases and make examination recommendations based on EMR data. Our approach achieved good results on a retrospective clinical dataset and thus has great potential for the application of automated diagnosis assist in clinical practice during pre-diagnosis stage, which will provide help for primary level doctors or doctors in basic-level hospitals. Due to the generality of the proposed framework, it can be straight forwardly extended to prediagnosis for other diseases. AME Publishing Company 2022-07 /pmc/articles/PMC9360821/ /pubmed/35958012 http://dx.doi.org/10.21037/tp-22-275 Text en 2022 Translational Pediatrics. All rights reserved. https://creativecommons.org/licenses/by-nc-nd/4.0/Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0 (https://creativecommons.org/licenses/by-nc-nd/4.0/) . |
spellingShingle | Original Article Zhu, Zhu Li, Jing Huang, Jian Li, Zheming Zhang, Hongjian Chen, Siyu Zhong, Qianhui Xie, Yulan Hu, Shasha Wang, Yinshuo Wang, Dejian Yu, Gang An intelligent prediagnosis system for disease prediction and examination recommendation based on electronic medical record and a medical-semantic-aware convolution neural network (MSCNN) for pediatric chronic cough |
title | An intelligent prediagnosis system for disease prediction and examination recommendation based on electronic medical record and a medical-semantic-aware convolution neural network (MSCNN) for pediatric chronic cough |
title_full | An intelligent prediagnosis system for disease prediction and examination recommendation based on electronic medical record and a medical-semantic-aware convolution neural network (MSCNN) for pediatric chronic cough |
title_fullStr | An intelligent prediagnosis system for disease prediction and examination recommendation based on electronic medical record and a medical-semantic-aware convolution neural network (MSCNN) for pediatric chronic cough |
title_full_unstemmed | An intelligent prediagnosis system for disease prediction and examination recommendation based on electronic medical record and a medical-semantic-aware convolution neural network (MSCNN) for pediatric chronic cough |
title_short | An intelligent prediagnosis system for disease prediction and examination recommendation based on electronic medical record and a medical-semantic-aware convolution neural network (MSCNN) for pediatric chronic cough |
title_sort | intelligent prediagnosis system for disease prediction and examination recommendation based on electronic medical record and a medical-semantic-aware convolution neural network (mscnn) for pediatric chronic cough |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9360821/ https://www.ncbi.nlm.nih.gov/pubmed/35958012 http://dx.doi.org/10.21037/tp-22-275 |
work_keys_str_mv | AT zhuzhu anintelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough AT lijing anintelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough AT huangjian anintelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough AT lizheming anintelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough AT zhanghongjian anintelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough AT chensiyu anintelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough AT zhongqianhui anintelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough AT xieyulan anintelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough AT hushasha anintelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough AT wangyinshuo anintelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough AT wangdejian anintelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough AT yugang anintelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough AT zhuzhu intelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough AT lijing intelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough AT huangjian intelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough AT lizheming intelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough AT zhanghongjian intelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough AT chensiyu intelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough AT zhongqianhui intelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough AT xieyulan intelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough AT hushasha intelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough AT wangyinshuo intelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough AT wangdejian intelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough AT yugang intelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough |