Cargando…

Identifying diagnosis evidence of cardiogenic stroke from Chinese echocardiograph reports

BACKGROUND: Cardiogenic stroke has increasing morbidity in China and brought economic burden to patient families. In cardiogenic stroke diagnosis, echocardiograph examination is one of the most important examinations. Sonographers will investigate patients’ heart via echocardiograph, and describe th...

Descripción completa

Detalles Bibliográficos
Autores principales: Qin, Lu, Xu, Xiaowei, Ding, Lingling, Li, Zixiao, Li, Jiao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7346320/
https://www.ncbi.nlm.nih.gov/pubmed/32646410
http://dx.doi.org/10.1186/s12911-020-1106-3
_version_ 1783556383344427008
author Qin, Lu
Xu, Xiaowei
Ding, Lingling
Li, Zixiao
Li, Jiao
author_facet Qin, Lu
Xu, Xiaowei
Ding, Lingling
Li, Zixiao
Li, Jiao
author_sort Qin, Lu
collection PubMed
description BACKGROUND: Cardiogenic stroke has increasing morbidity in China and brought economic burden to patient families. In cardiogenic stroke diagnosis, echocardiograph examination is one of the most important examinations. Sonographers will investigate patients’ heart via echocardiograph, and describe them in the echocardiograph reports. In this study, we developed a machine learning model to automatically identify diagnosis evidences of cardiogenic stroke providing to neurologist for clinical decision making. METHODS: We collected 4188 Chinese echocardiograph reports of 4018 patients, with average length 177 Chinese characters in free-text style. Collaborating with neurologists and sonographers, we summarized 149 phrases on diagnosis evidence of cardiogenic stroke such as “二尖瓣重度狭窄” (severe mitral stenosis), “主动脉瓣退行性变” (aortic valve degeneration) and so on. Furthermore, we developed an annotated corpus via mapping 149 phrases to the 4188 reports. We selected 11 most frequent diagnosis evidence types such as “二尖瓣狭窄” (mitral stenosis) for further identifying. The generated corpus is divided into training set and testing set in the ratio of 8:2, which is used to train and validate a machine learning model to identify the evidence of cardiogenic stroke using BiLSTM-CRF algorithm. RESULTS: Our machine learning method achieved the average performance on the diagnosis evidence identification is 98.03, 90.17 and 93.94% respectively. In addition, our method is capable to identify the novel diagnosis evidence of cardiogenic stroke description such as “二尖瓣中-重度狭窄” (mitral stenosis), “主动脉瓣退行性病变” (aortic valve calcification) et al. CONCLUSIONS: In this study, we analyze the structure of the echocardiograph reports and summarized 149 phrases on diagnosis evidence of cardiogenic stroke. We use the phrases to generate an annotated corpus automatically, which greatly reduces the cost of manual annotation. The model trained based on the corpus also has a good performance on the testing set. The method of automatically identifying diagnosis evidence of cardiogenic stroke proposed in this study will be further refined in the practice.
format Online
Article
Text
id pubmed-7346320
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-73463202020-07-14 Identifying diagnosis evidence of cardiogenic stroke from Chinese echocardiograph reports Qin, Lu Xu, Xiaowei Ding, Lingling Li, Zixiao Li, Jiao BMC Med Inform Decis Mak Research BACKGROUND: Cardiogenic stroke has increasing morbidity in China and brought economic burden to patient families. In cardiogenic stroke diagnosis, echocardiograph examination is one of the most important examinations. Sonographers will investigate patients’ heart via echocardiograph, and describe them in the echocardiograph reports. In this study, we developed a machine learning model to automatically identify diagnosis evidences of cardiogenic stroke providing to neurologist for clinical decision making. METHODS: We collected 4188 Chinese echocardiograph reports of 4018 patients, with average length 177 Chinese characters in free-text style. Collaborating with neurologists and sonographers, we summarized 149 phrases on diagnosis evidence of cardiogenic stroke such as “二尖瓣重度狭窄” (severe mitral stenosis), “主动脉瓣退行性变” (aortic valve degeneration) and so on. Furthermore, we developed an annotated corpus via mapping 149 phrases to the 4188 reports. We selected 11 most frequent diagnosis evidence types such as “二尖瓣狭窄” (mitral stenosis) for further identifying. The generated corpus is divided into training set and testing set in the ratio of 8:2, which is used to train and validate a machine learning model to identify the evidence of cardiogenic stroke using BiLSTM-CRF algorithm. RESULTS: Our machine learning method achieved the average performance on the diagnosis evidence identification is 98.03, 90.17 and 93.94% respectively. In addition, our method is capable to identify the novel diagnosis evidence of cardiogenic stroke description such as “二尖瓣中-重度狭窄” (mitral stenosis), “主动脉瓣退行性病变” (aortic valve calcification) et al. CONCLUSIONS: In this study, we analyze the structure of the echocardiograph reports and summarized 149 phrases on diagnosis evidence of cardiogenic stroke. We use the phrases to generate an annotated corpus automatically, which greatly reduces the cost of manual annotation. The model trained based on the corpus also has a good performance on the testing set. The method of automatically identifying diagnosis evidence of cardiogenic stroke proposed in this study will be further refined in the practice. BioMed Central 2020-07-09 /pmc/articles/PMC7346320/ /pubmed/32646410 http://dx.doi.org/10.1186/s12911-020-1106-3 Text en © The Author(s). 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Qin, Lu
Xu, Xiaowei
Ding, Lingling
Li, Zixiao
Li, Jiao
Identifying diagnosis evidence of cardiogenic stroke from Chinese echocardiograph reports
title Identifying diagnosis evidence of cardiogenic stroke from Chinese echocardiograph reports
title_full Identifying diagnosis evidence of cardiogenic stroke from Chinese echocardiograph reports
title_fullStr Identifying diagnosis evidence of cardiogenic stroke from Chinese echocardiograph reports
title_full_unstemmed Identifying diagnosis evidence of cardiogenic stroke from Chinese echocardiograph reports
title_short Identifying diagnosis evidence of cardiogenic stroke from Chinese echocardiograph reports
title_sort identifying diagnosis evidence of cardiogenic stroke from chinese echocardiograph reports
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7346320/
https://www.ncbi.nlm.nih.gov/pubmed/32646410
http://dx.doi.org/10.1186/s12911-020-1106-3
work_keys_str_mv AT qinlu identifyingdiagnosisevidenceofcardiogenicstrokefromchineseechocardiographreports
AT xuxiaowei identifyingdiagnosisevidenceofcardiogenicstrokefromchineseechocardiographreports
AT dinglingling identifyingdiagnosisevidenceofcardiogenicstrokefromchineseechocardiographreports
AT lizixiao identifyingdiagnosisevidenceofcardiogenicstrokefromchineseechocardiographreports
AT lijiao identifyingdiagnosisevidenceofcardiogenicstrokefromchineseechocardiographreports