Cargando…

Machine learning-based identification and rule-based normalization of adverse drug reactions in drug labels

BACKGROUND: Use of medication can cause adverse drug reactions (ADRs), unwanted or unexpected events, which are a major safety concern. Drug labels, or prescribing information or package inserts, describe ADRs. Therefore, systematically identifying ADR information from drug labels is critical in mul...

Descripción completa

Detalles Bibliográficos
Autores principales: Tiftikci, Mert, Özgür, Arzucan, He, Yongqun, Hur, Junguk
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6927101/
https://www.ncbi.nlm.nih.gov/pubmed/31865904
http://dx.doi.org/10.1186/s12859-019-3195-5
_version_ 1783482239524274176
author Tiftikci, Mert
Özgür, Arzucan
He, Yongqun
Hur, Junguk
author_facet Tiftikci, Mert
Özgür, Arzucan
He, Yongqun
Hur, Junguk
author_sort Tiftikci, Mert
collection PubMed
description BACKGROUND: Use of medication can cause adverse drug reactions (ADRs), unwanted or unexpected events, which are a major safety concern. Drug labels, or prescribing information or package inserts, describe ADRs. Therefore, systematically identifying ADR information from drug labels is critical in multiple aspects; however, this task is challenging due to the nature of the natural language of drug labels. RESULTS: In this paper, we present a machine learning- and rule-based system for the identification of ADR entity mentions in the text of drug labels and their normalization through the Medical Dictionary for Regulatory Activities (MedDRA) dictionary. The machine learning approach is based on a recently proposed deep learning architecture, which integrates bi-directional Long Short-Term Memory (Bi-LSTM), Convolutional Neural Network (CNN), and Conditional Random Fields (CRF) for entity recognition. The rule-based approach, used for normalizing the identified ADR mentions to MedDRA terms, is based on an extension of our in-house text-mining system, SciMiner. We evaluated our system on the Text Analysis Conference (TAC) Adverse Drug Reaction 2017 challenge test data set, consisting of 200 manually curated US FDA drug labels. Our ML-based system achieved 77.0% F1 score on the task of ADR mention recognition and 82.6% micro-averaged F1 score on the task of ADR normalization, while rule-based system achieved 67.4 and 77.6% F1 scores, respectively. CONCLUSION: Our study demonstrates that a system composed of a deep learning architecture for entity recognition and a rule-based model for entity normalization is a promising approach for ADR extraction from drug labels.
format Online
Article
Text
id pubmed-6927101
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-69271012019-12-30 Machine learning-based identification and rule-based normalization of adverse drug reactions in drug labels Tiftikci, Mert Özgür, Arzucan He, Yongqun Hur, Junguk BMC Bioinformatics Research BACKGROUND: Use of medication can cause adverse drug reactions (ADRs), unwanted or unexpected events, which are a major safety concern. Drug labels, or prescribing information or package inserts, describe ADRs. Therefore, systematically identifying ADR information from drug labels is critical in multiple aspects; however, this task is challenging due to the nature of the natural language of drug labels. RESULTS: In this paper, we present a machine learning- and rule-based system for the identification of ADR entity mentions in the text of drug labels and their normalization through the Medical Dictionary for Regulatory Activities (MedDRA) dictionary. The machine learning approach is based on a recently proposed deep learning architecture, which integrates bi-directional Long Short-Term Memory (Bi-LSTM), Convolutional Neural Network (CNN), and Conditional Random Fields (CRF) for entity recognition. The rule-based approach, used for normalizing the identified ADR mentions to MedDRA terms, is based on an extension of our in-house text-mining system, SciMiner. We evaluated our system on the Text Analysis Conference (TAC) Adverse Drug Reaction 2017 challenge test data set, consisting of 200 manually curated US FDA drug labels. Our ML-based system achieved 77.0% F1 score on the task of ADR mention recognition and 82.6% micro-averaged F1 score on the task of ADR normalization, while rule-based system achieved 67.4 and 77.6% F1 scores, respectively. CONCLUSION: Our study demonstrates that a system composed of a deep learning architecture for entity recognition and a rule-based model for entity normalization is a promising approach for ADR extraction from drug labels. BioMed Central 2019-12-23 /pmc/articles/PMC6927101/ /pubmed/31865904 http://dx.doi.org/10.1186/s12859-019-3195-5 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Tiftikci, Mert
Özgür, Arzucan
He, Yongqun
Hur, Junguk
Machine learning-based identification and rule-based normalization of adverse drug reactions in drug labels
title Machine learning-based identification and rule-based normalization of adverse drug reactions in drug labels
title_full Machine learning-based identification and rule-based normalization of adverse drug reactions in drug labels
title_fullStr Machine learning-based identification and rule-based normalization of adverse drug reactions in drug labels
title_full_unstemmed Machine learning-based identification and rule-based normalization of adverse drug reactions in drug labels
title_short Machine learning-based identification and rule-based normalization of adverse drug reactions in drug labels
title_sort machine learning-based identification and rule-based normalization of adverse drug reactions in drug labels
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6927101/
https://www.ncbi.nlm.nih.gov/pubmed/31865904
http://dx.doi.org/10.1186/s12859-019-3195-5
work_keys_str_mv AT tiftikcimert machinelearningbasedidentificationandrulebasednormalizationofadversedrugreactionsindruglabels
AT ozgurarzucan machinelearningbasedidentificationandrulebasednormalizationofadversedrugreactionsindruglabels
AT heyongqun machinelearningbasedidentificationandrulebasednormalizationofadversedrugreactionsindruglabels
AT hurjunguk machinelearningbasedidentificationandrulebasednormalizationofadversedrugreactionsindruglabels