Cargando…

Accurately Identifying Cerebroarterial Stenosis from Angiography Reports Using Natural Language Processing Approaches

Patients with intracranial artery stenosis show high incidence of stroke. Angiography reports contain rich but underutilized information that can enable the detection of cerebrovascular diseases. This study evaluated various natural language processing (NLP) techniques to accurately identify eleven...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Ching-Heng, Hsu, Kai-Cheng, Liang, Chih-Kuang, Lee, Tsong-Hai, Shih, Ching-Sen, Fann, Yang C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9406429/
https://www.ncbi.nlm.nih.gov/pubmed/36010232
http://dx.doi.org/10.3390/diagnostics12081882
_version_ 1784774118779912192
author Lin, Ching-Heng
Hsu, Kai-Cheng
Liang, Chih-Kuang
Lee, Tsong-Hai
Shih, Ching-Sen
Fann, Yang C.
author_facet Lin, Ching-Heng
Hsu, Kai-Cheng
Liang, Chih-Kuang
Lee, Tsong-Hai
Shih, Ching-Sen
Fann, Yang C.
author_sort Lin, Ching-Heng
collection PubMed
description Patients with intracranial artery stenosis show high incidence of stroke. Angiography reports contain rich but underutilized information that can enable the detection of cerebrovascular diseases. This study evaluated various natural language processing (NLP) techniques to accurately identify eleven intracranial artery stenosis from angiography reports. Three NLP models, including a rule-based model, a recurrent neural network (RNN), and a contextualized language model, XLNet, were developed and evaluated by internal–external cross-validation. In this study, angiography reports from two independent medical centers (9614 for training and internal validation testing and 315 as external validation) were assessed. The internal testing results showed that XLNet had the best performance, with a receiver operating characteristic curve (AUROC) ranging from 0.97 to 0.99 using eleven targeted arteries. The rule-based model attained an AUROC from 0.92 to 0.96, and the RNN long short-term memory model attained an AUROC from 0.95 to 0.97. The study showed the potential application of NLP techniques such as the XLNet model for the routine and automatic screening of patients with high risk of intracranial artery stenosis using angiography reports. However, the NLP models were investigated based on relatively small sample sizes with very different report writing styles and a prevalence of stenosis case distributions, revealing challenges for model generalization.
format Online
Article
Text
id pubmed-9406429
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-94064292022-08-26 Accurately Identifying Cerebroarterial Stenosis from Angiography Reports Using Natural Language Processing Approaches Lin, Ching-Heng Hsu, Kai-Cheng Liang, Chih-Kuang Lee, Tsong-Hai Shih, Ching-Sen Fann, Yang C. Diagnostics (Basel) Article Patients with intracranial artery stenosis show high incidence of stroke. Angiography reports contain rich but underutilized information that can enable the detection of cerebrovascular diseases. This study evaluated various natural language processing (NLP) techniques to accurately identify eleven intracranial artery stenosis from angiography reports. Three NLP models, including a rule-based model, a recurrent neural network (RNN), and a contextualized language model, XLNet, were developed and evaluated by internal–external cross-validation. In this study, angiography reports from two independent medical centers (9614 for training and internal validation testing and 315 as external validation) were assessed. The internal testing results showed that XLNet had the best performance, with a receiver operating characteristic curve (AUROC) ranging from 0.97 to 0.99 using eleven targeted arteries. The rule-based model attained an AUROC from 0.92 to 0.96, and the RNN long short-term memory model attained an AUROC from 0.95 to 0.97. The study showed the potential application of NLP techniques such as the XLNet model for the routine and automatic screening of patients with high risk of intracranial artery stenosis using angiography reports. However, the NLP models were investigated based on relatively small sample sizes with very different report writing styles and a prevalence of stenosis case distributions, revealing challenges for model generalization. MDPI 2022-08-03 /pmc/articles/PMC9406429/ /pubmed/36010232 http://dx.doi.org/10.3390/diagnostics12081882 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Lin, Ching-Heng
Hsu, Kai-Cheng
Liang, Chih-Kuang
Lee, Tsong-Hai
Shih, Ching-Sen
Fann, Yang C.
Accurately Identifying Cerebroarterial Stenosis from Angiography Reports Using Natural Language Processing Approaches
title Accurately Identifying Cerebroarterial Stenosis from Angiography Reports Using Natural Language Processing Approaches
title_full Accurately Identifying Cerebroarterial Stenosis from Angiography Reports Using Natural Language Processing Approaches
title_fullStr Accurately Identifying Cerebroarterial Stenosis from Angiography Reports Using Natural Language Processing Approaches
title_full_unstemmed Accurately Identifying Cerebroarterial Stenosis from Angiography Reports Using Natural Language Processing Approaches
title_short Accurately Identifying Cerebroarterial Stenosis from Angiography Reports Using Natural Language Processing Approaches
title_sort accurately identifying cerebroarterial stenosis from angiography reports using natural language processing approaches
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9406429/
https://www.ncbi.nlm.nih.gov/pubmed/36010232
http://dx.doi.org/10.3390/diagnostics12081882
work_keys_str_mv AT linchingheng accuratelyidentifyingcerebroarterialstenosisfromangiographyreportsusingnaturallanguageprocessingapproaches
AT hsukaicheng accuratelyidentifyingcerebroarterialstenosisfromangiographyreportsusingnaturallanguageprocessingapproaches
AT liangchihkuang accuratelyidentifyingcerebroarterialstenosisfromangiographyreportsusingnaturallanguageprocessingapproaches
AT leetsonghai accuratelyidentifyingcerebroarterialstenosisfromangiographyreportsusingnaturallanguageprocessingapproaches
AT shihchingsen accuratelyidentifyingcerebroarterialstenosisfromangiographyreportsusingnaturallanguageprocessingapproaches
AT fannyangc accuratelyidentifyingcerebroarterialstenosisfromangiographyreportsusingnaturallanguageprocessingapproaches