Cargando…
Accurately Identifying Cerebroarterial Stenosis from Angiography Reports Using Natural Language Processing Approaches
Patients with intracranial artery stenosis show high incidence of stroke. Angiography reports contain rich but underutilized information that can enable the detection of cerebrovascular diseases. This study evaluated various natural language processing (NLP) techniques to accurately identify eleven...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9406429/ https://www.ncbi.nlm.nih.gov/pubmed/36010232 http://dx.doi.org/10.3390/diagnostics12081882 |
_version_ | 1784774118779912192 |
---|---|
author | Lin, Ching-Heng Hsu, Kai-Cheng Liang, Chih-Kuang Lee, Tsong-Hai Shih, Ching-Sen Fann, Yang C. |
author_facet | Lin, Ching-Heng Hsu, Kai-Cheng Liang, Chih-Kuang Lee, Tsong-Hai Shih, Ching-Sen Fann, Yang C. |
author_sort | Lin, Ching-Heng |
collection | PubMed |
description | Patients with intracranial artery stenosis show high incidence of stroke. Angiography reports contain rich but underutilized information that can enable the detection of cerebrovascular diseases. This study evaluated various natural language processing (NLP) techniques to accurately identify eleven intracranial artery stenosis from angiography reports. Three NLP models, including a rule-based model, a recurrent neural network (RNN), and a contextualized language model, XLNet, were developed and evaluated by internal–external cross-validation. In this study, angiography reports from two independent medical centers (9614 for training and internal validation testing and 315 as external validation) were assessed. The internal testing results showed that XLNet had the best performance, with a receiver operating characteristic curve (AUROC) ranging from 0.97 to 0.99 using eleven targeted arteries. The rule-based model attained an AUROC from 0.92 to 0.96, and the RNN long short-term memory model attained an AUROC from 0.95 to 0.97. The study showed the potential application of NLP techniques such as the XLNet model for the routine and automatic screening of patients with high risk of intracranial artery stenosis using angiography reports. However, the NLP models were investigated based on relatively small sample sizes with very different report writing styles and a prevalence of stenosis case distributions, revealing challenges for model generalization. |
format | Online Article Text |
id | pubmed-9406429 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-94064292022-08-26 Accurately Identifying Cerebroarterial Stenosis from Angiography Reports Using Natural Language Processing Approaches Lin, Ching-Heng Hsu, Kai-Cheng Liang, Chih-Kuang Lee, Tsong-Hai Shih, Ching-Sen Fann, Yang C. Diagnostics (Basel) Article Patients with intracranial artery stenosis show high incidence of stroke. Angiography reports contain rich but underutilized information that can enable the detection of cerebrovascular diseases. This study evaluated various natural language processing (NLP) techniques to accurately identify eleven intracranial artery stenosis from angiography reports. Three NLP models, including a rule-based model, a recurrent neural network (RNN), and a contextualized language model, XLNet, were developed and evaluated by internal–external cross-validation. In this study, angiography reports from two independent medical centers (9614 for training and internal validation testing and 315 as external validation) were assessed. The internal testing results showed that XLNet had the best performance, with a receiver operating characteristic curve (AUROC) ranging from 0.97 to 0.99 using eleven targeted arteries. The rule-based model attained an AUROC from 0.92 to 0.96, and the RNN long short-term memory model attained an AUROC from 0.95 to 0.97. The study showed the potential application of NLP techniques such as the XLNet model for the routine and automatic screening of patients with high risk of intracranial artery stenosis using angiography reports. However, the NLP models were investigated based on relatively small sample sizes with very different report writing styles and a prevalence of stenosis case distributions, revealing challenges for model generalization. MDPI 2022-08-03 /pmc/articles/PMC9406429/ /pubmed/36010232 http://dx.doi.org/10.3390/diagnostics12081882 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Lin, Ching-Heng Hsu, Kai-Cheng Liang, Chih-Kuang Lee, Tsong-Hai Shih, Ching-Sen Fann, Yang C. Accurately Identifying Cerebroarterial Stenosis from Angiography Reports Using Natural Language Processing Approaches |
title | Accurately Identifying Cerebroarterial Stenosis from Angiography Reports Using Natural Language Processing Approaches |
title_full | Accurately Identifying Cerebroarterial Stenosis from Angiography Reports Using Natural Language Processing Approaches |
title_fullStr | Accurately Identifying Cerebroarterial Stenosis from Angiography Reports Using Natural Language Processing Approaches |
title_full_unstemmed | Accurately Identifying Cerebroarterial Stenosis from Angiography Reports Using Natural Language Processing Approaches |
title_short | Accurately Identifying Cerebroarterial Stenosis from Angiography Reports Using Natural Language Processing Approaches |
title_sort | accurately identifying cerebroarterial stenosis from angiography reports using natural language processing approaches |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9406429/ https://www.ncbi.nlm.nih.gov/pubmed/36010232 http://dx.doi.org/10.3390/diagnostics12081882 |
work_keys_str_mv | AT linchingheng accuratelyidentifyingcerebroarterialstenosisfromangiographyreportsusingnaturallanguageprocessingapproaches AT hsukaicheng accuratelyidentifyingcerebroarterialstenosisfromangiographyreportsusingnaturallanguageprocessingapproaches AT liangchihkuang accuratelyidentifyingcerebroarterialstenosisfromangiographyreportsusingnaturallanguageprocessingapproaches AT leetsonghai accuratelyidentifyingcerebroarterialstenosisfromangiographyreportsusingnaturallanguageprocessingapproaches AT shihchingsen accuratelyidentifyingcerebroarterialstenosisfromangiographyreportsusingnaturallanguageprocessingapproaches AT fannyangc accuratelyidentifyingcerebroarterialstenosisfromangiographyreportsusingnaturallanguageprocessingapproaches |