Cargando…

An intelligent prediagnosis system for disease prediction and examination recommendation based on electronic medical record and a medical-semantic-aware convolution neural network (MSCNN) for pediatric chronic cough

BACKGROUND: Due to the phenotypic similarities among different pediatric respiratory diseases with chronic cough, primary doctors often misdiagnose and the misuse of examinations is prevalent. In the pre-diagnosis stage, the patients' chief complaints and other information in the electronic med...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhu, Zhu, Li, Jing, Huang, Jian, Li, Zheming, Zhang, Hongjian, Chen, Siyu, Zhong, Qianhui, Xie, Yulan, Hu, Shasha, Wang, Yinshuo, Wang, Dejian, Yu, Gang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: AME Publishing Company 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9360821/
https://www.ncbi.nlm.nih.gov/pubmed/35958012
http://dx.doi.org/10.21037/tp-22-275
_version_ 1784764409870024704
author Zhu, Zhu
Li, Jing
Huang, Jian
Li, Zheming
Zhang, Hongjian
Chen, Siyu
Zhong, Qianhui
Xie, Yulan
Hu, Shasha
Wang, Yinshuo
Wang, Dejian
Yu, Gang
author_facet Zhu, Zhu
Li, Jing
Huang, Jian
Li, Zheming
Zhang, Hongjian
Chen, Siyu
Zhong, Qianhui
Xie, Yulan
Hu, Shasha
Wang, Yinshuo
Wang, Dejian
Yu, Gang
author_sort Zhu, Zhu
collection PubMed
description BACKGROUND: Due to the phenotypic similarities among different pediatric respiratory diseases with chronic cough, primary doctors often misdiagnose and the misuse of examinations is prevalent. In the pre-diagnosis stage, the patients' chief complaints and other information in the electronic medical record (EMR) provide a powerful reference for respiratory experts to make preliminary disease judgment and examination plan. In this paper, we proposed an intelligent prediagnosis system to predict disease diagnosis and recommend examinations based on EMR text. METHODS: We examined the clinical notes of 178,293 children with chronic cough symptoms from retrospective EMR data. The dataset is split into 7:3 for training and testing. From the testing set, we also extract 5% of samples for validation. We proposed a medical-semantic-aware convolution neural network (MSCNN) framework that can accomplish two downstream tasks from the same medical language model through transfer learning. First, a medical language model based on the word2vec algorithm was built to generate embeddings for the text data. Then, text convolutional neural network (TextCNN) was used to build models for disease prediction and examination recommendation. RESULTS: We implemented 5 algorithms for disease prediction. In the disease prediction task, our algorithm outperformed the baseline methods on all metrics, with a top-1 accuracy (AC) of 0.68 and a top-3 AC of 0.923 on the testing set. By adding data enhancement, the top-3 AC reached 0.926. In the examination recommendation task, the overall AC on the testing set was 0.93 and the macro average (MA) F1-score was 0.88. The average area under the curve (AUC) on the training set was 0.97 while on the testing set it was 0.86. CONCLUSIONS: We constructed an intelligent prediagnosis system with an MSCNN framework that can predict diseases and make examination recommendations based on EMR data. Our approach achieved good results on a retrospective clinical dataset and thus has great potential for the application of automated diagnosis assist in clinical practice during pre-diagnosis stage, which will provide help for primary level doctors or doctors in basic-level hospitals. Due to the generality of the proposed framework, it can be straight forwardly extended to prediagnosis for other diseases.
format Online
Article
Text
id pubmed-9360821
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher AME Publishing Company
record_format MEDLINE/PubMed
spelling pubmed-93608212022-08-10 An intelligent prediagnosis system for disease prediction and examination recommendation based on electronic medical record and a medical-semantic-aware convolution neural network (MSCNN) for pediatric chronic cough Zhu, Zhu Li, Jing Huang, Jian Li, Zheming Zhang, Hongjian Chen, Siyu Zhong, Qianhui Xie, Yulan Hu, Shasha Wang, Yinshuo Wang, Dejian Yu, Gang Transl Pediatr Original Article BACKGROUND: Due to the phenotypic similarities among different pediatric respiratory diseases with chronic cough, primary doctors often misdiagnose and the misuse of examinations is prevalent. In the pre-diagnosis stage, the patients' chief complaints and other information in the electronic medical record (EMR) provide a powerful reference for respiratory experts to make preliminary disease judgment and examination plan. In this paper, we proposed an intelligent prediagnosis system to predict disease diagnosis and recommend examinations based on EMR text. METHODS: We examined the clinical notes of 178,293 children with chronic cough symptoms from retrospective EMR data. The dataset is split into 7:3 for training and testing. From the testing set, we also extract 5% of samples for validation. We proposed a medical-semantic-aware convolution neural network (MSCNN) framework that can accomplish two downstream tasks from the same medical language model through transfer learning. First, a medical language model based on the word2vec algorithm was built to generate embeddings for the text data. Then, text convolutional neural network (TextCNN) was used to build models for disease prediction and examination recommendation. RESULTS: We implemented 5 algorithms for disease prediction. In the disease prediction task, our algorithm outperformed the baseline methods on all metrics, with a top-1 accuracy (AC) of 0.68 and a top-3 AC of 0.923 on the testing set. By adding data enhancement, the top-3 AC reached 0.926. In the examination recommendation task, the overall AC on the testing set was 0.93 and the macro average (MA) F1-score was 0.88. The average area under the curve (AUC) on the training set was 0.97 while on the testing set it was 0.86. CONCLUSIONS: We constructed an intelligent prediagnosis system with an MSCNN framework that can predict diseases and make examination recommendations based on EMR data. Our approach achieved good results on a retrospective clinical dataset and thus has great potential for the application of automated diagnosis assist in clinical practice during pre-diagnosis stage, which will provide help for primary level doctors or doctors in basic-level hospitals. Due to the generality of the proposed framework, it can be straight forwardly extended to prediagnosis for other diseases. AME Publishing Company 2022-07 /pmc/articles/PMC9360821/ /pubmed/35958012 http://dx.doi.org/10.21037/tp-22-275 Text en 2022 Translational Pediatrics. All rights reserved. https://creativecommons.org/licenses/by-nc-nd/4.0/Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0 (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Original Article
Zhu, Zhu
Li, Jing
Huang, Jian
Li, Zheming
Zhang, Hongjian
Chen, Siyu
Zhong, Qianhui
Xie, Yulan
Hu, Shasha
Wang, Yinshuo
Wang, Dejian
Yu, Gang
An intelligent prediagnosis system for disease prediction and examination recommendation based on electronic medical record and a medical-semantic-aware convolution neural network (MSCNN) for pediatric chronic cough
title An intelligent prediagnosis system for disease prediction and examination recommendation based on electronic medical record and a medical-semantic-aware convolution neural network (MSCNN) for pediatric chronic cough
title_full An intelligent prediagnosis system for disease prediction and examination recommendation based on electronic medical record and a medical-semantic-aware convolution neural network (MSCNN) for pediatric chronic cough
title_fullStr An intelligent prediagnosis system for disease prediction and examination recommendation based on electronic medical record and a medical-semantic-aware convolution neural network (MSCNN) for pediatric chronic cough
title_full_unstemmed An intelligent prediagnosis system for disease prediction and examination recommendation based on electronic medical record and a medical-semantic-aware convolution neural network (MSCNN) for pediatric chronic cough
title_short An intelligent prediagnosis system for disease prediction and examination recommendation based on electronic medical record and a medical-semantic-aware convolution neural network (MSCNN) for pediatric chronic cough
title_sort intelligent prediagnosis system for disease prediction and examination recommendation based on electronic medical record and a medical-semantic-aware convolution neural network (mscnn) for pediatric chronic cough
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9360821/
https://www.ncbi.nlm.nih.gov/pubmed/35958012
http://dx.doi.org/10.21037/tp-22-275
work_keys_str_mv AT zhuzhu anintelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough
AT lijing anintelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough
AT huangjian anintelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough
AT lizheming anintelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough
AT zhanghongjian anintelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough
AT chensiyu anintelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough
AT zhongqianhui anintelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough
AT xieyulan anintelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough
AT hushasha anintelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough
AT wangyinshuo anintelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough
AT wangdejian anintelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough
AT yugang anintelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough
AT zhuzhu intelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough
AT lijing intelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough
AT huangjian intelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough
AT lizheming intelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough
AT zhanghongjian intelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough
AT chensiyu intelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough
AT zhongqianhui intelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough
AT xieyulan intelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough
AT hushasha intelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough
AT wangyinshuo intelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough
AT wangdejian intelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough
AT yugang intelligentprediagnosissystemfordiseasepredictionandexaminationrecommendationbasedonelectronicmedicalrecordandamedicalsemanticawareconvolutionneuralnetworkmscnnforpediatricchroniccough