Cargando…

Finding Potential Adverse Events in the Unstructured Text of Electronic Health Care Records: Development of the Shakespeare Method

BACKGROUND: Big data tools provide opportunities to monitor adverse events (patient harm associated with medical care) (AEs) in the unstructured text of electronic health care records (EHRs). Writers may explicitly state an apparent association between treatment and adverse outcome (“attributed”) or...

Descripción completa

Detalles Bibliográficos
Autores principales: Bright, Roselie A, Rankin, Summer K, Dowdy, Katherine, Blok, Sergey V, Bright, Susan J, Palmer, Lee Anne M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10414364/
https://www.ncbi.nlm.nih.gov/pubmed/37725533
http://dx.doi.org/10.2196/27017
_version_ 1785087321973981184
author Bright, Roselie A
Rankin, Summer K
Dowdy, Katherine
Blok, Sergey V
Bright, Susan J
Palmer, Lee Anne M
author_facet Bright, Roselie A
Rankin, Summer K
Dowdy, Katherine
Blok, Sergey V
Bright, Susan J
Palmer, Lee Anne M
author_sort Bright, Roselie A
collection PubMed
description BACKGROUND: Big data tools provide opportunities to monitor adverse events (patient harm associated with medical care) (AEs) in the unstructured text of electronic health care records (EHRs). Writers may explicitly state an apparent association between treatment and adverse outcome (“attributed”) or state the simple treatment and outcome without an association (“unattributed”). Many methods for finding AEs in text rely on predefining possible AEs before searching for prespecified words and phrases or manual labeling (standardization) by investigators. We developed a method to identify possible AEs, even if unknown or unattributed, without any prespecifications or standardization of notes. Our method was inspired by word-frequency analysis methods used to uncover the true authorship of disputed works credited to William Shakespeare. We chose two use cases, “transfusion” and “time-based.” Transfusion was chosen because new transfusion AE types were becoming recognized during the study data period; therefore, we anticipated an opportunity to find unattributed potential AEs (PAEs) in the notes. With the time-based case, we wanted to simulate near real-time surveillance. We chose time periods in the hope of detecting PAEs due to contaminated heparin from mid-2007 to mid-2008 that were announced in early 2008. We hypothesized that the prevalence of contaminated heparin may have been widespread enough to manifest in EHRs through symptoms related to heparin AEs, independent of clinicians’ documentation of attributed AEs. OBJECTIVE: We aimed to develop a new method to identify attributed and unattributed PAEs using the unstructured text of EHRs. METHODS: We used EHRs for adult critical care admissions at a major teaching hospital (2001-2012). For each case, we formed a group of interest and a comparison group. We concatenated the text notes for each admission into one document sorted by date, and deleted replicate sentences and lists. We identified statistically significant words in the group of interest versus the comparison group. Documents in the group of interest were filtered to those words, followed by topic modeling on the filtered documents to produce topics. For each topic, the three documents with the maximum topic scores were manually reviewed to identify PAEs. RESULTS: Topics centered around medical conditions that were unique to or more common in the group of interest, including PAEs. In each use case, most PAEs were unattributed in the notes. Among the transfusion PAEs was unattributed evidence of transfusion-associated cardiac overload and transfusion-related acute lung injury. Some of the PAEs from mid-2007 to mid-2008 were increased unattributed events consistent with AEs related to heparin contamination. CONCLUSIONS: The Shakespeare method could be a useful supplement to AE reporting and surveillance of structured EHR data. Future improvements should include automation of the manual review process.
format Online
Article
Text
id pubmed-10414364
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-104143642023-09-12 Finding Potential Adverse Events in the Unstructured Text of Electronic Health Care Records: Development of the Shakespeare Method Bright, Roselie A Rankin, Summer K Dowdy, Katherine Blok, Sergey V Bright, Susan J Palmer, Lee Anne M JMIRx Med Original Paper BACKGROUND: Big data tools provide opportunities to monitor adverse events (patient harm associated with medical care) (AEs) in the unstructured text of electronic health care records (EHRs). Writers may explicitly state an apparent association between treatment and adverse outcome (“attributed”) or state the simple treatment and outcome without an association (“unattributed”). Many methods for finding AEs in text rely on predefining possible AEs before searching for prespecified words and phrases or manual labeling (standardization) by investigators. We developed a method to identify possible AEs, even if unknown or unattributed, without any prespecifications or standardization of notes. Our method was inspired by word-frequency analysis methods used to uncover the true authorship of disputed works credited to William Shakespeare. We chose two use cases, “transfusion” and “time-based.” Transfusion was chosen because new transfusion AE types were becoming recognized during the study data period; therefore, we anticipated an opportunity to find unattributed potential AEs (PAEs) in the notes. With the time-based case, we wanted to simulate near real-time surveillance. We chose time periods in the hope of detecting PAEs due to contaminated heparin from mid-2007 to mid-2008 that were announced in early 2008. We hypothesized that the prevalence of contaminated heparin may have been widespread enough to manifest in EHRs through symptoms related to heparin AEs, independent of clinicians’ documentation of attributed AEs. OBJECTIVE: We aimed to develop a new method to identify attributed and unattributed PAEs using the unstructured text of EHRs. METHODS: We used EHRs for adult critical care admissions at a major teaching hospital (2001-2012). For each case, we formed a group of interest and a comparison group. We concatenated the text notes for each admission into one document sorted by date, and deleted replicate sentences and lists. We identified statistically significant words in the group of interest versus the comparison group. Documents in the group of interest were filtered to those words, followed by topic modeling on the filtered documents to produce topics. For each topic, the three documents with the maximum topic scores were manually reviewed to identify PAEs. RESULTS: Topics centered around medical conditions that were unique to or more common in the group of interest, including PAEs. In each use case, most PAEs were unattributed in the notes. Among the transfusion PAEs was unattributed evidence of transfusion-associated cardiac overload and transfusion-related acute lung injury. Some of the PAEs from mid-2007 to mid-2008 were increased unattributed events consistent with AEs related to heparin contamination. CONCLUSIONS: The Shakespeare method could be a useful supplement to AE reporting and surveillance of structured EHR data. Future improvements should include automation of the manual review process. JMIR Publications 2021-08-11 /pmc/articles/PMC10414364/ /pubmed/37725533 http://dx.doi.org/10.2196/27017 Text en ©Roselie A Bright, Summer K Rankin, Katherine Dowdy, Sergey V Blok, Susan J Bright, Lee Anne M Palmer. Originally published in JMIRx Med (https://med.jmirx.org), 11.08.2021. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIRx Med, is properly cited. The complete bibliographic information, a link to the original publication on https://med.jmirx.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Bright, Roselie A
Rankin, Summer K
Dowdy, Katherine
Blok, Sergey V
Bright, Susan J
Palmer, Lee Anne M
Finding Potential Adverse Events in the Unstructured Text of Electronic Health Care Records: Development of the Shakespeare Method
title Finding Potential Adverse Events in the Unstructured Text of Electronic Health Care Records: Development of the Shakespeare Method
title_full Finding Potential Adverse Events in the Unstructured Text of Electronic Health Care Records: Development of the Shakespeare Method
title_fullStr Finding Potential Adverse Events in the Unstructured Text of Electronic Health Care Records: Development of the Shakespeare Method
title_full_unstemmed Finding Potential Adverse Events in the Unstructured Text of Electronic Health Care Records: Development of the Shakespeare Method
title_short Finding Potential Adverse Events in the Unstructured Text of Electronic Health Care Records: Development of the Shakespeare Method
title_sort finding potential adverse events in the unstructured text of electronic health care records: development of the shakespeare method
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10414364/
https://www.ncbi.nlm.nih.gov/pubmed/37725533
http://dx.doi.org/10.2196/27017
work_keys_str_mv AT brightroseliea findingpotentialadverseeventsintheunstructuredtextofelectronichealthcarerecordsdevelopmentoftheshakespearemethod
AT rankinsummerk findingpotentialadverseeventsintheunstructuredtextofelectronichealthcarerecordsdevelopmentoftheshakespearemethod
AT dowdykatherine findingpotentialadverseeventsintheunstructuredtextofelectronichealthcarerecordsdevelopmentoftheshakespearemethod
AT bloksergeyv findingpotentialadverseeventsintheunstructuredtextofelectronichealthcarerecordsdevelopmentoftheshakespearemethod
AT brightsusanj findingpotentialadverseeventsintheunstructuredtextofelectronichealthcarerecordsdevelopmentoftheshakespearemethod
AT palmerleeannem findingpotentialadverseeventsintheunstructuredtextofelectronichealthcarerecordsdevelopmentoftheshakespearemethod