Cargando…

Using Artificial Intelligence With Natural Language Processing to Combine Electronic Health Record’s Structured and Free Text Data to Identify Nonvalvular Atrial Fibrillation to Decrease Strokes and Death: Evaluation and Case-Control Study

BACKGROUND: Nonvalvular atrial fibrillation (NVAF) affects almost 6 million Americans and is a major contributor to stroke but is significantly undiagnosed and undertreated despite explicit guidelines for oral anticoagulation. OBJECTIVE: The aim of this study is to investigate whether the use of sem...

Descripción completa

Detalles Bibliográficos
Autores principales: Elkin, Peter L, Mullin, Sarah, Mardekian, Jack, Crowner, Christopher, Sakilay, Sylvester, Sinha, Shyamashree, Brady, Gary, Wright, Marcia, Nolen, Kimberly, Trainer, JoAnn, Koppel, Ross, Schlegel, Daniel, Kaushik, Sashank, Zhao, Jane, Song, Buer, Anand, Edwin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8663460/
https://www.ncbi.nlm.nih.gov/pubmed/34751659
http://dx.doi.org/10.2196/28946
_version_ 1784613641984671744
author Elkin, Peter L
Mullin, Sarah
Mardekian, Jack
Crowner, Christopher
Sakilay, Sylvester
Sinha, Shyamashree
Brady, Gary
Wright, Marcia
Nolen, Kimberly
Trainer, JoAnn
Koppel, Ross
Schlegel, Daniel
Kaushik, Sashank
Zhao, Jane
Song, Buer
Anand, Edwin
author_facet Elkin, Peter L
Mullin, Sarah
Mardekian, Jack
Crowner, Christopher
Sakilay, Sylvester
Sinha, Shyamashree
Brady, Gary
Wright, Marcia
Nolen, Kimberly
Trainer, JoAnn
Koppel, Ross
Schlegel, Daniel
Kaushik, Sashank
Zhao, Jane
Song, Buer
Anand, Edwin
author_sort Elkin, Peter L
collection PubMed
description BACKGROUND: Nonvalvular atrial fibrillation (NVAF) affects almost 6 million Americans and is a major contributor to stroke but is significantly undiagnosed and undertreated despite explicit guidelines for oral anticoagulation. OBJECTIVE: The aim of this study is to investigate whether the use of semisupervised natural language processing (NLP) of electronic health record’s (EHR) free-text information combined with structured EHR data improves NVAF discovery and treatment and perhaps offers a method to prevent thousands of deaths and save billions of dollars. METHODS: We abstracted 96,681 participants from the University of Buffalo faculty practice’s EHR. NLP was used to index the notes and compare the ability to identify NVAF, congestive heart failure, hypertension, age ≥75 years, diabetes mellitus, stroke or transient ischemic attack, vascular disease, age 65 to 74 years, sex category (CHA(2)DS(2)-VASc), and Hypertension, Abnormal liver/renal function, Stroke history, Bleeding history or predisposition, Labile INR, Elderly, Drug/alcohol usage (HAS-BLED) scores using unstructured data (International Classification of Diseases codes) versus structured and unstructured data from clinical notes. In addition, we analyzed data from 63,296,120 participants in the Optum and Truven databases to determine the NVAF frequency, rates of CHA(2)DS(2)‑VASc ≥2, and no contraindications to oral anticoagulants, rates of stroke and death in the untreated population, and first year’s costs after stroke. RESULTS: The structured-plus-unstructured method would have identified 3,976,056 additional true NVAF cases (P<.001) and improved sensitivity for CHA(2)DS(2)-VASc and HAS-BLED scores compared with the structured data alone (P=.002 and P<.001, respectively), causing a 32.1% improvement. For the United States, this method would prevent an estimated 176,537 strokes, save 10,575 lives, and save >US $13.5 billion. CONCLUSIONS: Artificial intelligence–informed bio-surveillance combining NLP of free-text information with structured EHR data improves data completeness, prevents thousands of strokes, and saves lives and funds. This method is applicable to many disorders with profound public health consequences.
format Online
Article
Text
id pubmed-8663460
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-86634602022-01-05 Using Artificial Intelligence With Natural Language Processing to Combine Electronic Health Record’s Structured and Free Text Data to Identify Nonvalvular Atrial Fibrillation to Decrease Strokes and Death: Evaluation and Case-Control Study Elkin, Peter L Mullin, Sarah Mardekian, Jack Crowner, Christopher Sakilay, Sylvester Sinha, Shyamashree Brady, Gary Wright, Marcia Nolen, Kimberly Trainer, JoAnn Koppel, Ross Schlegel, Daniel Kaushik, Sashank Zhao, Jane Song, Buer Anand, Edwin J Med Internet Res Original Paper BACKGROUND: Nonvalvular atrial fibrillation (NVAF) affects almost 6 million Americans and is a major contributor to stroke but is significantly undiagnosed and undertreated despite explicit guidelines for oral anticoagulation. OBJECTIVE: The aim of this study is to investigate whether the use of semisupervised natural language processing (NLP) of electronic health record’s (EHR) free-text information combined with structured EHR data improves NVAF discovery and treatment and perhaps offers a method to prevent thousands of deaths and save billions of dollars. METHODS: We abstracted 96,681 participants from the University of Buffalo faculty practice’s EHR. NLP was used to index the notes and compare the ability to identify NVAF, congestive heart failure, hypertension, age ≥75 years, diabetes mellitus, stroke or transient ischemic attack, vascular disease, age 65 to 74 years, sex category (CHA(2)DS(2)-VASc), and Hypertension, Abnormal liver/renal function, Stroke history, Bleeding history or predisposition, Labile INR, Elderly, Drug/alcohol usage (HAS-BLED) scores using unstructured data (International Classification of Diseases codes) versus structured and unstructured data from clinical notes. In addition, we analyzed data from 63,296,120 participants in the Optum and Truven databases to determine the NVAF frequency, rates of CHA(2)DS(2)‑VASc ≥2, and no contraindications to oral anticoagulants, rates of stroke and death in the untreated population, and first year’s costs after stroke. RESULTS: The structured-plus-unstructured method would have identified 3,976,056 additional true NVAF cases (P<.001) and improved sensitivity for CHA(2)DS(2)-VASc and HAS-BLED scores compared with the structured data alone (P=.002 and P<.001, respectively), causing a 32.1% improvement. For the United States, this method would prevent an estimated 176,537 strokes, save 10,575 lives, and save >US $13.5 billion. CONCLUSIONS: Artificial intelligence–informed bio-surveillance combining NLP of free-text information with structured EHR data improves data completeness, prevents thousands of strokes, and saves lives and funds. This method is applicable to many disorders with profound public health consequences. JMIR Publications 2021-11-09 /pmc/articles/PMC8663460/ /pubmed/34751659 http://dx.doi.org/10.2196/28946 Text en ©Peter L Elkin, Sarah Mullin, Jack Mardekian, Christopher Crowner, Sylvester Sakilay, Shyamashree Sinha, Gary Brady, Marcia Wright, Kimberly Nolen, JoAnn Trainer, Ross Koppel, Daniel Schlegel, Sashank Kaushik, Jane Zhao, Buer Song, Edwin Anand. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 09.11.2021. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Elkin, Peter L
Mullin, Sarah
Mardekian, Jack
Crowner, Christopher
Sakilay, Sylvester
Sinha, Shyamashree
Brady, Gary
Wright, Marcia
Nolen, Kimberly
Trainer, JoAnn
Koppel, Ross
Schlegel, Daniel
Kaushik, Sashank
Zhao, Jane
Song, Buer
Anand, Edwin
Using Artificial Intelligence With Natural Language Processing to Combine Electronic Health Record’s Structured and Free Text Data to Identify Nonvalvular Atrial Fibrillation to Decrease Strokes and Death: Evaluation and Case-Control Study
title Using Artificial Intelligence With Natural Language Processing to Combine Electronic Health Record’s Structured and Free Text Data to Identify Nonvalvular Atrial Fibrillation to Decrease Strokes and Death: Evaluation and Case-Control Study
title_full Using Artificial Intelligence With Natural Language Processing to Combine Electronic Health Record’s Structured and Free Text Data to Identify Nonvalvular Atrial Fibrillation to Decrease Strokes and Death: Evaluation and Case-Control Study
title_fullStr Using Artificial Intelligence With Natural Language Processing to Combine Electronic Health Record’s Structured and Free Text Data to Identify Nonvalvular Atrial Fibrillation to Decrease Strokes and Death: Evaluation and Case-Control Study
title_full_unstemmed Using Artificial Intelligence With Natural Language Processing to Combine Electronic Health Record’s Structured and Free Text Data to Identify Nonvalvular Atrial Fibrillation to Decrease Strokes and Death: Evaluation and Case-Control Study
title_short Using Artificial Intelligence With Natural Language Processing to Combine Electronic Health Record’s Structured and Free Text Data to Identify Nonvalvular Atrial Fibrillation to Decrease Strokes and Death: Evaluation and Case-Control Study
title_sort using artificial intelligence with natural language processing to combine electronic health record’s structured and free text data to identify nonvalvular atrial fibrillation to decrease strokes and death: evaluation and case-control study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8663460/
https://www.ncbi.nlm.nih.gov/pubmed/34751659
http://dx.doi.org/10.2196/28946
work_keys_str_mv AT elkinpeterl usingartificialintelligencewithnaturallanguageprocessingtocombineelectronichealthrecordsstructuredandfreetextdatatoidentifynonvalvularatrialfibrillationtodecreasestrokesanddeathevaluationandcasecontrolstudy
AT mullinsarah usingartificialintelligencewithnaturallanguageprocessingtocombineelectronichealthrecordsstructuredandfreetextdatatoidentifynonvalvularatrialfibrillationtodecreasestrokesanddeathevaluationandcasecontrolstudy
AT mardekianjack usingartificialintelligencewithnaturallanguageprocessingtocombineelectronichealthrecordsstructuredandfreetextdatatoidentifynonvalvularatrialfibrillationtodecreasestrokesanddeathevaluationandcasecontrolstudy
AT crownerchristopher usingartificialintelligencewithnaturallanguageprocessingtocombineelectronichealthrecordsstructuredandfreetextdatatoidentifynonvalvularatrialfibrillationtodecreasestrokesanddeathevaluationandcasecontrolstudy
AT sakilaysylvester usingartificialintelligencewithnaturallanguageprocessingtocombineelectronichealthrecordsstructuredandfreetextdatatoidentifynonvalvularatrialfibrillationtodecreasestrokesanddeathevaluationandcasecontrolstudy
AT sinhashyamashree usingartificialintelligencewithnaturallanguageprocessingtocombineelectronichealthrecordsstructuredandfreetextdatatoidentifynonvalvularatrialfibrillationtodecreasestrokesanddeathevaluationandcasecontrolstudy
AT bradygary usingartificialintelligencewithnaturallanguageprocessingtocombineelectronichealthrecordsstructuredandfreetextdatatoidentifynonvalvularatrialfibrillationtodecreasestrokesanddeathevaluationandcasecontrolstudy
AT wrightmarcia usingartificialintelligencewithnaturallanguageprocessingtocombineelectronichealthrecordsstructuredandfreetextdatatoidentifynonvalvularatrialfibrillationtodecreasestrokesanddeathevaluationandcasecontrolstudy
AT nolenkimberly usingartificialintelligencewithnaturallanguageprocessingtocombineelectronichealthrecordsstructuredandfreetextdatatoidentifynonvalvularatrialfibrillationtodecreasestrokesanddeathevaluationandcasecontrolstudy
AT trainerjoann usingartificialintelligencewithnaturallanguageprocessingtocombineelectronichealthrecordsstructuredandfreetextdatatoidentifynonvalvularatrialfibrillationtodecreasestrokesanddeathevaluationandcasecontrolstudy
AT koppelross usingartificialintelligencewithnaturallanguageprocessingtocombineelectronichealthrecordsstructuredandfreetextdatatoidentifynonvalvularatrialfibrillationtodecreasestrokesanddeathevaluationandcasecontrolstudy
AT schlegeldaniel usingartificialintelligencewithnaturallanguageprocessingtocombineelectronichealthrecordsstructuredandfreetextdatatoidentifynonvalvularatrialfibrillationtodecreasestrokesanddeathevaluationandcasecontrolstudy
AT kaushiksashank usingartificialintelligencewithnaturallanguageprocessingtocombineelectronichealthrecordsstructuredandfreetextdatatoidentifynonvalvularatrialfibrillationtodecreasestrokesanddeathevaluationandcasecontrolstudy
AT zhaojane usingartificialintelligencewithnaturallanguageprocessingtocombineelectronichealthrecordsstructuredandfreetextdatatoidentifynonvalvularatrialfibrillationtodecreasestrokesanddeathevaluationandcasecontrolstudy
AT songbuer usingartificialintelligencewithnaturallanguageprocessingtocombineelectronichealthrecordsstructuredandfreetextdatatoidentifynonvalvularatrialfibrillationtodecreasestrokesanddeathevaluationandcasecontrolstudy
AT anandedwin usingartificialintelligencewithnaturallanguageprocessingtocombineelectronichealthrecordsstructuredandfreetextdatatoidentifynonvalvularatrialfibrillationtodecreasestrokesanddeathevaluationandcasecontrolstudy