Cargando…
Identifying diagnosis evidence of cardiogenic stroke from Chinese echocardiograph reports
BACKGROUND: Cardiogenic stroke has increasing morbidity in China and brought economic burden to patient families. In cardiogenic stroke diagnosis, echocardiograph examination is one of the most important examinations. Sonographers will investigate patients’ heart via echocardiograph, and describe th...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7346320/ https://www.ncbi.nlm.nih.gov/pubmed/32646410 http://dx.doi.org/10.1186/s12911-020-1106-3 |
_version_ | 1783556383344427008 |
---|---|
author | Qin, Lu Xu, Xiaowei Ding, Lingling Li, Zixiao Li, Jiao |
author_facet | Qin, Lu Xu, Xiaowei Ding, Lingling Li, Zixiao Li, Jiao |
author_sort | Qin, Lu |
collection | PubMed |
description | BACKGROUND: Cardiogenic stroke has increasing morbidity in China and brought economic burden to patient families. In cardiogenic stroke diagnosis, echocardiograph examination is one of the most important examinations. Sonographers will investigate patients’ heart via echocardiograph, and describe them in the echocardiograph reports. In this study, we developed a machine learning model to automatically identify diagnosis evidences of cardiogenic stroke providing to neurologist for clinical decision making. METHODS: We collected 4188 Chinese echocardiograph reports of 4018 patients, with average length 177 Chinese characters in free-text style. Collaborating with neurologists and sonographers, we summarized 149 phrases on diagnosis evidence of cardiogenic stroke such as “二尖瓣重度狭窄” (severe mitral stenosis), “主动脉瓣退行性变” (aortic valve degeneration) and so on. Furthermore, we developed an annotated corpus via mapping 149 phrases to the 4188 reports. We selected 11 most frequent diagnosis evidence types such as “二尖瓣狭窄” (mitral stenosis) for further identifying. The generated corpus is divided into training set and testing set in the ratio of 8:2, which is used to train and validate a machine learning model to identify the evidence of cardiogenic stroke using BiLSTM-CRF algorithm. RESULTS: Our machine learning method achieved the average performance on the diagnosis evidence identification is 98.03, 90.17 and 93.94% respectively. In addition, our method is capable to identify the novel diagnosis evidence of cardiogenic stroke description such as “二尖瓣中-重度狭窄” (mitral stenosis), “主动脉瓣退行性病变” (aortic valve calcification) et al. CONCLUSIONS: In this study, we analyze the structure of the echocardiograph reports and summarized 149 phrases on diagnosis evidence of cardiogenic stroke. We use the phrases to generate an annotated corpus automatically, which greatly reduces the cost of manual annotation. The model trained based on the corpus also has a good performance on the testing set. The method of automatically identifying diagnosis evidence of cardiogenic stroke proposed in this study will be further refined in the practice. |
format | Online Article Text |
id | pubmed-7346320 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-73463202020-07-14 Identifying diagnosis evidence of cardiogenic stroke from Chinese echocardiograph reports Qin, Lu Xu, Xiaowei Ding, Lingling Li, Zixiao Li, Jiao BMC Med Inform Decis Mak Research BACKGROUND: Cardiogenic stroke has increasing morbidity in China and brought economic burden to patient families. In cardiogenic stroke diagnosis, echocardiograph examination is one of the most important examinations. Sonographers will investigate patients’ heart via echocardiograph, and describe them in the echocardiograph reports. In this study, we developed a machine learning model to automatically identify diagnosis evidences of cardiogenic stroke providing to neurologist for clinical decision making. METHODS: We collected 4188 Chinese echocardiograph reports of 4018 patients, with average length 177 Chinese characters in free-text style. Collaborating with neurologists and sonographers, we summarized 149 phrases on diagnosis evidence of cardiogenic stroke such as “二尖瓣重度狭窄” (severe mitral stenosis), “主动脉瓣退行性变” (aortic valve degeneration) and so on. Furthermore, we developed an annotated corpus via mapping 149 phrases to the 4188 reports. We selected 11 most frequent diagnosis evidence types such as “二尖瓣狭窄” (mitral stenosis) for further identifying. The generated corpus is divided into training set and testing set in the ratio of 8:2, which is used to train and validate a machine learning model to identify the evidence of cardiogenic stroke using BiLSTM-CRF algorithm. RESULTS: Our machine learning method achieved the average performance on the diagnosis evidence identification is 98.03, 90.17 and 93.94% respectively. In addition, our method is capable to identify the novel diagnosis evidence of cardiogenic stroke description such as “二尖瓣中-重度狭窄” (mitral stenosis), “主动脉瓣退行性病变” (aortic valve calcification) et al. CONCLUSIONS: In this study, we analyze the structure of the echocardiograph reports and summarized 149 phrases on diagnosis evidence of cardiogenic stroke. We use the phrases to generate an annotated corpus automatically, which greatly reduces the cost of manual annotation. The model trained based on the corpus also has a good performance on the testing set. The method of automatically identifying diagnosis evidence of cardiogenic stroke proposed in this study will be further refined in the practice. BioMed Central 2020-07-09 /pmc/articles/PMC7346320/ /pubmed/32646410 http://dx.doi.org/10.1186/s12911-020-1106-3 Text en © The Author(s). 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Qin, Lu Xu, Xiaowei Ding, Lingling Li, Zixiao Li, Jiao Identifying diagnosis evidence of cardiogenic stroke from Chinese echocardiograph reports |
title | Identifying diagnosis evidence of cardiogenic stroke from Chinese echocardiograph reports |
title_full | Identifying diagnosis evidence of cardiogenic stroke from Chinese echocardiograph reports |
title_fullStr | Identifying diagnosis evidence of cardiogenic stroke from Chinese echocardiograph reports |
title_full_unstemmed | Identifying diagnosis evidence of cardiogenic stroke from Chinese echocardiograph reports |
title_short | Identifying diagnosis evidence of cardiogenic stroke from Chinese echocardiograph reports |
title_sort | identifying diagnosis evidence of cardiogenic stroke from chinese echocardiograph reports |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7346320/ https://www.ncbi.nlm.nih.gov/pubmed/32646410 http://dx.doi.org/10.1186/s12911-020-1106-3 |
work_keys_str_mv | AT qinlu identifyingdiagnosisevidenceofcardiogenicstrokefromchineseechocardiographreports AT xuxiaowei identifyingdiagnosisevidenceofcardiogenicstrokefromchineseechocardiographreports AT dinglingling identifyingdiagnosisevidenceofcardiogenicstrokefromchineseechocardiographreports AT lizixiao identifyingdiagnosisevidenceofcardiogenicstrokefromchineseechocardiographreports AT lijiao identifyingdiagnosisevidenceofcardiogenicstrokefromchineseechocardiographreports |