Cargando…

Mining Severe Drug Hypersensitivity Reaction Cases in Pediatric Electronic Health Records: Methodology Development and Applications

BACKGROUND: Severe drug hypersensitivity reactions (DHRs) refer to allergic reactions caused by drugs and usually present with severe skin rashes and internal damage as the main symptoms. Reporting of severe DHRs in hospitals now solely occurs through spontaneous reporting systems (SRSs), which clin...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Yuncui, Zhao, Qiuye, Cao, Wang, Wang, Xiaochuan, Li, Yanming, Xie, Yuefeng, Wang, Xiaoling
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9516376/
https://www.ncbi.nlm.nih.gov/pubmed/36099001
http://dx.doi.org/10.2196/37812
_version_ 1784798696237432832
author Yu, Yuncui
Zhao, Qiuye
Cao, Wang
Wang, Xiaochuan
Li, Yanming
Xie, Yuefeng
Wang, Xiaoling
author_facet Yu, Yuncui
Zhao, Qiuye
Cao, Wang
Wang, Xiaochuan
Li, Yanming
Xie, Yuefeng
Wang, Xiaoling
author_sort Yu, Yuncui
collection PubMed
description BACKGROUND: Severe drug hypersensitivity reactions (DHRs) refer to allergic reactions caused by drugs and usually present with severe skin rashes and internal damage as the main symptoms. Reporting of severe DHRs in hospitals now solely occurs through spontaneous reporting systems (SRSs), which clinicians in charge operate. An automatic identification system scrutinizes clinical notes and reports potential severe DHR cases. OBJECTIVE: The goal of the research was to develop an automatic identification system for mining severe DHR cases and discover more DHR cases for further study. The proposed method was applied to 9 years of data in pediatrics electronic health records (EHRs) of Beijing Children’s Hospital. METHODS: The phenotyping task was approached as a document classification problem. A DHR dataset containing tagged documents for training was prepared. Each document contains all the clinical notes generated during 1 inpatient visit in this data set. Document-level tags correspond to DHR types and a negative category. Strategies were evaluated for long document classification on the openly available National NLP Clinical Challenges 2016 smoking task. Four strategies were evaluated in this work: document truncation, hierarchy representation, efficient self-attention, and key sentence selection. In-domain and open-domain pretrained embeddings were evaluated on the DHR dataset. An automatic grid search was performed to tune statistical classifiers for the best performance over the transformed data. Inference efficiency and memory requirements of the best performing models were analyzed. The most efficient model for mining DHR cases from millions of documents in the EHR system was run. RESULTS: For long document classification, key sentence selection with guideline keywords achieved the best performance and was 9 times faster than hierarchy representation models for inference. The best model discovered 1155 DHR cases in Beijing Children’s Hospital EHR system. After double-checking by clinician experts, 357 cases of severe DHRs were finally identified. For the smoking challenge, our model reached the record of state-of-the-art performance (94.1% vs 94.2%). CONCLUSIONS: The proposed method discovered 357 positive DHR cases from a large archive of EHR records, about 90% of which were missed by SRSs. SRSs reported only 36 cases during the same period. The case analysis also found more suspected drugs associated with severe DHRs in pediatrics.
format Online
Article
Text
id pubmed-9516376
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-95163762022-09-29 Mining Severe Drug Hypersensitivity Reaction Cases in Pediatric Electronic Health Records: Methodology Development and Applications Yu, Yuncui Zhao, Qiuye Cao, Wang Wang, Xiaochuan Li, Yanming Xie, Yuefeng Wang, Xiaoling JMIR Med Inform Original Paper BACKGROUND: Severe drug hypersensitivity reactions (DHRs) refer to allergic reactions caused by drugs and usually present with severe skin rashes and internal damage as the main symptoms. Reporting of severe DHRs in hospitals now solely occurs through spontaneous reporting systems (SRSs), which clinicians in charge operate. An automatic identification system scrutinizes clinical notes and reports potential severe DHR cases. OBJECTIVE: The goal of the research was to develop an automatic identification system for mining severe DHR cases and discover more DHR cases for further study. The proposed method was applied to 9 years of data in pediatrics electronic health records (EHRs) of Beijing Children’s Hospital. METHODS: The phenotyping task was approached as a document classification problem. A DHR dataset containing tagged documents for training was prepared. Each document contains all the clinical notes generated during 1 inpatient visit in this data set. Document-level tags correspond to DHR types and a negative category. Strategies were evaluated for long document classification on the openly available National NLP Clinical Challenges 2016 smoking task. Four strategies were evaluated in this work: document truncation, hierarchy representation, efficient self-attention, and key sentence selection. In-domain and open-domain pretrained embeddings were evaluated on the DHR dataset. An automatic grid search was performed to tune statistical classifiers for the best performance over the transformed data. Inference efficiency and memory requirements of the best performing models were analyzed. The most efficient model for mining DHR cases from millions of documents in the EHR system was run. RESULTS: For long document classification, key sentence selection with guideline keywords achieved the best performance and was 9 times faster than hierarchy representation models for inference. The best model discovered 1155 DHR cases in Beijing Children’s Hospital EHR system. After double-checking by clinician experts, 357 cases of severe DHRs were finally identified. For the smoking challenge, our model reached the record of state-of-the-art performance (94.1% vs 94.2%). CONCLUSIONS: The proposed method discovered 357 positive DHR cases from a large archive of EHR records, about 90% of which were missed by SRSs. SRSs reported only 36 cases during the same period. The case analysis also found more suspected drugs associated with severe DHRs in pediatrics. JMIR Publications 2022-09-13 /pmc/articles/PMC9516376/ /pubmed/36099001 http://dx.doi.org/10.2196/37812 Text en ©Yuncui Yu, Qiuye Zhao, Wang Cao, Xiaochuan Wang, Yanming Li, Yuefeng Xie, Xiaoling Wang. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 13.09.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Yu, Yuncui
Zhao, Qiuye
Cao, Wang
Wang, Xiaochuan
Li, Yanming
Xie, Yuefeng
Wang, Xiaoling
Mining Severe Drug Hypersensitivity Reaction Cases in Pediatric Electronic Health Records: Methodology Development and Applications
title Mining Severe Drug Hypersensitivity Reaction Cases in Pediatric Electronic Health Records: Methodology Development and Applications
title_full Mining Severe Drug Hypersensitivity Reaction Cases in Pediatric Electronic Health Records: Methodology Development and Applications
title_fullStr Mining Severe Drug Hypersensitivity Reaction Cases in Pediatric Electronic Health Records: Methodology Development and Applications
title_full_unstemmed Mining Severe Drug Hypersensitivity Reaction Cases in Pediatric Electronic Health Records: Methodology Development and Applications
title_short Mining Severe Drug Hypersensitivity Reaction Cases in Pediatric Electronic Health Records: Methodology Development and Applications
title_sort mining severe drug hypersensitivity reaction cases in pediatric electronic health records: methodology development and applications
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9516376/
https://www.ncbi.nlm.nih.gov/pubmed/36099001
http://dx.doi.org/10.2196/37812
work_keys_str_mv AT yuyuncui miningseveredrughypersensitivityreactioncasesinpediatricelectronichealthrecordsmethodologydevelopmentandapplications
AT zhaoqiuye miningseveredrughypersensitivityreactioncasesinpediatricelectronichealthrecordsmethodologydevelopmentandapplications
AT caowang miningseveredrughypersensitivityreactioncasesinpediatricelectronichealthrecordsmethodologydevelopmentandapplications
AT wangxiaochuan miningseveredrughypersensitivityreactioncasesinpediatricelectronichealthrecordsmethodologydevelopmentandapplications
AT liyanming miningseveredrughypersensitivityreactioncasesinpediatricelectronichealthrecordsmethodologydevelopmentandapplications
AT xieyuefeng miningseveredrughypersensitivityreactioncasesinpediatricelectronichealthrecordsmethodologydevelopmentandapplications
AT wangxiaoling miningseveredrughypersensitivityreactioncasesinpediatricelectronichealthrecordsmethodologydevelopmentandapplications