Cargando…

Exploring the Applicability of Using Natural Language Processing to Support Nationwide Venous Thromboembolism Surveillance: Model Evaluation Study

BACKGROUND: Venous thromboembolism (VTE) is a preventable, common vascular disease that has been estimated to affect up to 900,000 people per year. It has been associated with risk factors such as recent surgery, cancer, and hospitalization. VTE surveillance for patient management and safety can be...

Descripción completa

Detalles Bibliográficos
Autores principales: Wendelboe, Aaron, Saber, Ibrahim, Dvorak, Justin, Adamski, Alys, Feland, Natalie, Reyes, Nimia, Abe, Karon, Ortel, Thomas, Raskob, Gary
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10193259/
https://www.ncbi.nlm.nih.gov/pubmed/37206160
http://dx.doi.org/10.2196/36877
_version_ 1785043799633821696
author Wendelboe, Aaron
Saber, Ibrahim
Dvorak, Justin
Adamski, Alys
Feland, Natalie
Reyes, Nimia
Abe, Karon
Ortel, Thomas
Raskob, Gary
author_facet Wendelboe, Aaron
Saber, Ibrahim
Dvorak, Justin
Adamski, Alys
Feland, Natalie
Reyes, Nimia
Abe, Karon
Ortel, Thomas
Raskob, Gary
author_sort Wendelboe, Aaron
collection PubMed
description BACKGROUND: Venous thromboembolism (VTE) is a preventable, common vascular disease that has been estimated to affect up to 900,000 people per year. It has been associated with risk factors such as recent surgery, cancer, and hospitalization. VTE surveillance for patient management and safety can be improved via natural language processing (NLP). NLP tools have the ability to access electronic medical records, identify patients that meet the VTE case definition, and subsequently enter the relevant information into a database for hospital review. OBJECTIVE: We aimed to evaluate the performance of a VTE identification model of IDEAL-X (Information and Data Extraction Using Adaptive Learning; Emory University)—an NLP tool—in automatically classifying cases of VTE by “reading” unstructured text from diagnostic imaging records collected from 2012 to 2014. METHODS: After accessing imaging records from pilot surveillance systems for VTE from Duke University and the University of Oklahoma Health Sciences Center (OUHSC), we used a VTE identification model of IDEAL-X to classify cases of VTE that had previously been manually classified. Experts reviewed the technicians’ comments in each record to determine if a VTE event occurred. The performance measures calculated (with 95% CIs) were accuracy, sensitivity, specificity, and positive and negative predictive values. Chi-square tests of homogeneity were conducted to evaluate differences in performance measures by site, using a significance level of .05. RESULTS: The VTE model of IDEAL-X “read” 1591 records from Duke University and 1487 records from the OUHSC, for a total of 3078 records. The combined performance measures were 93.7% accuracy (95% CI 93.7%−93.8%), 96.3% sensitivity (95% CI 96.2%−96.4%), 92% specificity (95% CI 91.9%−92%), an 89.1% positive predictive value (95% CI 89%−89.2%), and a 97.3% negative predictive value (95% CI 97.3%−97.4%). The sensitivity was higher at Duke University (97.9%, 95% CI 97.8%−98%) than at the OUHSC (93.3%, 95% CI 93.1%−93.4%; P<.001), but the specificity was higher at the OUHSC (95.9%, 95% CI 95.8%−96%) than at Duke University (86.5%, 95% CI 86.4%−86.7%; P<.001). CONCLUSIONS: The VTE model of IDEAL-X accurately classified cases of VTE from the pilot surveillance systems of two separate health systems in Durham, North Carolina, and Oklahoma City, Oklahoma. NLP is a promising tool for the design and implementation of an automated, cost-effective national surveillance system for VTE. Conducting public health surveillance at a national scale is important for measuring disease burden and the impact of prevention measures. We recommend additional studies to identify how integrating IDEAL-X in a medical record system could further automate the surveillance process.
format Online
Article
Text
id pubmed-10193259
institution National Center for Biotechnology Information
language English
publishDate 2022
record_format MEDLINE/PubMed
spelling pubmed-101932592023-05-18 Exploring the Applicability of Using Natural Language Processing to Support Nationwide Venous Thromboembolism Surveillance: Model Evaluation Study Wendelboe, Aaron Saber, Ibrahim Dvorak, Justin Adamski, Alys Feland, Natalie Reyes, Nimia Abe, Karon Ortel, Thomas Raskob, Gary JMIR Bioinform Biotech Article BACKGROUND: Venous thromboembolism (VTE) is a preventable, common vascular disease that has been estimated to affect up to 900,000 people per year. It has been associated with risk factors such as recent surgery, cancer, and hospitalization. VTE surveillance for patient management and safety can be improved via natural language processing (NLP). NLP tools have the ability to access electronic medical records, identify patients that meet the VTE case definition, and subsequently enter the relevant information into a database for hospital review. OBJECTIVE: We aimed to evaluate the performance of a VTE identification model of IDEAL-X (Information and Data Extraction Using Adaptive Learning; Emory University)—an NLP tool—in automatically classifying cases of VTE by “reading” unstructured text from diagnostic imaging records collected from 2012 to 2014. METHODS: After accessing imaging records from pilot surveillance systems for VTE from Duke University and the University of Oklahoma Health Sciences Center (OUHSC), we used a VTE identification model of IDEAL-X to classify cases of VTE that had previously been manually classified. Experts reviewed the technicians’ comments in each record to determine if a VTE event occurred. The performance measures calculated (with 95% CIs) were accuracy, sensitivity, specificity, and positive and negative predictive values. Chi-square tests of homogeneity were conducted to evaluate differences in performance measures by site, using a significance level of .05. RESULTS: The VTE model of IDEAL-X “read” 1591 records from Duke University and 1487 records from the OUHSC, for a total of 3078 records. The combined performance measures were 93.7% accuracy (95% CI 93.7%−93.8%), 96.3% sensitivity (95% CI 96.2%−96.4%), 92% specificity (95% CI 91.9%−92%), an 89.1% positive predictive value (95% CI 89%−89.2%), and a 97.3% negative predictive value (95% CI 97.3%−97.4%). The sensitivity was higher at Duke University (97.9%, 95% CI 97.8%−98%) than at the OUHSC (93.3%, 95% CI 93.1%−93.4%; P<.001), but the specificity was higher at the OUHSC (95.9%, 95% CI 95.8%−96%) than at Duke University (86.5%, 95% CI 86.4%−86.7%; P<.001). CONCLUSIONS: The VTE model of IDEAL-X accurately classified cases of VTE from the pilot surveillance systems of two separate health systems in Durham, North Carolina, and Oklahoma City, Oklahoma. NLP is a promising tool for the design and implementation of an automated, cost-effective national surveillance system for VTE. Conducting public health surveillance at a national scale is important for measuring disease burden and the impact of prevention measures. We recommend additional studies to identify how integrating IDEAL-X in a medical record system could further automate the surveillance process. 2022-05-08 /pmc/articles/PMC10193259/ /pubmed/37206160 http://dx.doi.org/10.2196/36877 Text en https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Bioinformatics and Biotechnology, is properly cited. The complete bibliographic information, a link to the original publication on https://bioinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Article
Wendelboe, Aaron
Saber, Ibrahim
Dvorak, Justin
Adamski, Alys
Feland, Natalie
Reyes, Nimia
Abe, Karon
Ortel, Thomas
Raskob, Gary
Exploring the Applicability of Using Natural Language Processing to Support Nationwide Venous Thromboembolism Surveillance: Model Evaluation Study
title Exploring the Applicability of Using Natural Language Processing to Support Nationwide Venous Thromboembolism Surveillance: Model Evaluation Study
title_full Exploring the Applicability of Using Natural Language Processing to Support Nationwide Venous Thromboembolism Surveillance: Model Evaluation Study
title_fullStr Exploring the Applicability of Using Natural Language Processing to Support Nationwide Venous Thromboembolism Surveillance: Model Evaluation Study
title_full_unstemmed Exploring the Applicability of Using Natural Language Processing to Support Nationwide Venous Thromboembolism Surveillance: Model Evaluation Study
title_short Exploring the Applicability of Using Natural Language Processing to Support Nationwide Venous Thromboembolism Surveillance: Model Evaluation Study
title_sort exploring the applicability of using natural language processing to support nationwide venous thromboembolism surveillance: model evaluation study
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10193259/
https://www.ncbi.nlm.nih.gov/pubmed/37206160
http://dx.doi.org/10.2196/36877
work_keys_str_mv AT wendelboeaaron exploringtheapplicabilityofusingnaturallanguageprocessingtosupportnationwidevenousthromboembolismsurveillancemodelevaluationstudy
AT saberibrahim exploringtheapplicabilityofusingnaturallanguageprocessingtosupportnationwidevenousthromboembolismsurveillancemodelevaluationstudy
AT dvorakjustin exploringtheapplicabilityofusingnaturallanguageprocessingtosupportnationwidevenousthromboembolismsurveillancemodelevaluationstudy
AT adamskialys exploringtheapplicabilityofusingnaturallanguageprocessingtosupportnationwidevenousthromboembolismsurveillancemodelevaluationstudy
AT felandnatalie exploringtheapplicabilityofusingnaturallanguageprocessingtosupportnationwidevenousthromboembolismsurveillancemodelevaluationstudy
AT reyesnimia exploringtheapplicabilityofusingnaturallanguageprocessingtosupportnationwidevenousthromboembolismsurveillancemodelevaluationstudy
AT abekaron exploringtheapplicabilityofusingnaturallanguageprocessingtosupportnationwidevenousthromboembolismsurveillancemodelevaluationstudy
AT ortelthomas exploringtheapplicabilityofusingnaturallanguageprocessingtosupportnationwidevenousthromboembolismsurveillancemodelevaluationstudy
AT raskobgary exploringtheapplicabilityofusingnaturallanguageprocessingtosupportnationwidevenousthromboembolismsurveillancemodelevaluationstudy