Cargando…

DeepADEMiner: a deep learning pharmacovigilance pipeline for extraction and normalization of adverse drug event mentions on Twitter

OBJECTIVE: Research on pharmacovigilance from social media data has focused on mining adverse drug events (ADEs) using annotated datasets, with publications generally focusing on 1 of 3 tasks: ADE classification, named entity recognition for identifying the span of ADE mentions, and ADE mention norm...

Descripción completa

Detalles Bibliográficos
Autores principales: Magge, Arjun, Tutubalina, Elena, Miftahutdinov, Zulfat, Alimova, Ilseyar, Dirkson, Anne, Verberne, Suzan, Weissenbacher, Davy, Gonzalez-Hernandez, Graciela
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8449608/
https://www.ncbi.nlm.nih.gov/pubmed/34270701
http://dx.doi.org/10.1093/jamia/ocab114
_version_ 1784569453330038784
author Magge, Arjun
Tutubalina, Elena
Miftahutdinov, Zulfat
Alimova, Ilseyar
Dirkson, Anne
Verberne, Suzan
Weissenbacher, Davy
Gonzalez-Hernandez, Graciela
author_facet Magge, Arjun
Tutubalina, Elena
Miftahutdinov, Zulfat
Alimova, Ilseyar
Dirkson, Anne
Verberne, Suzan
Weissenbacher, Davy
Gonzalez-Hernandez, Graciela
author_sort Magge, Arjun
collection PubMed
description OBJECTIVE: Research on pharmacovigilance from social media data has focused on mining adverse drug events (ADEs) using annotated datasets, with publications generally focusing on 1 of 3 tasks: ADE classification, named entity recognition for identifying the span of ADE mentions, and ADE mention normalization to standardized terminologies. While the common goal of such systems is to detect ADE signals that can be used to inform public policy, it has been impeded largely by limited end-to-end solutions for large-scale analysis of social media reports for different drugs. MATERIALS AND METHODS: We present a dataset for training and evaluation of ADE pipelines where the ADE distribution is closer to the average ‘natural balance’ with ADEs present in about 7% of the tweets. The deep learning architecture involves an ADE extraction pipeline with individual components for all 3 tasks. RESULTS: The system presented achieved state-of-the-art performance on comparable datasets and scored a classification performance of F(1) = 0.63, span extraction performance of F(1) = 0.44 and an end-to-end entity resolution performance of F(1) = 0.34 on the presented dataset. DISCUSSION: The performance of the models continues to highlight multiple challenges when deploying pharmacovigilance systems that use social media data. We discuss the implications of such models in the downstream tasks of signal detection and suggest future enhancements. CONCLUSION: Mining ADEs from Twitter posts using a pipeline architecture requires the different components to be trained and tuned based on input data imbalance in order to ensure optimal performance on the end-to-end resolution task.
format Online
Article
Text
id pubmed-8449608
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-84496082021-09-20 DeepADEMiner: a deep learning pharmacovigilance pipeline for extraction and normalization of adverse drug event mentions on Twitter Magge, Arjun Tutubalina, Elena Miftahutdinov, Zulfat Alimova, Ilseyar Dirkson, Anne Verberne, Suzan Weissenbacher, Davy Gonzalez-Hernandez, Graciela J Am Med Inform Assoc Research and Applications OBJECTIVE: Research on pharmacovigilance from social media data has focused on mining adverse drug events (ADEs) using annotated datasets, with publications generally focusing on 1 of 3 tasks: ADE classification, named entity recognition for identifying the span of ADE mentions, and ADE mention normalization to standardized terminologies. While the common goal of such systems is to detect ADE signals that can be used to inform public policy, it has been impeded largely by limited end-to-end solutions for large-scale analysis of social media reports for different drugs. MATERIALS AND METHODS: We present a dataset for training and evaluation of ADE pipelines where the ADE distribution is closer to the average ‘natural balance’ with ADEs present in about 7% of the tweets. The deep learning architecture involves an ADE extraction pipeline with individual components for all 3 tasks. RESULTS: The system presented achieved state-of-the-art performance on comparable datasets and scored a classification performance of F(1) = 0.63, span extraction performance of F(1) = 0.44 and an end-to-end entity resolution performance of F(1) = 0.34 on the presented dataset. DISCUSSION: The performance of the models continues to highlight multiple challenges when deploying pharmacovigilance systems that use social media data. We discuss the implications of such models in the downstream tasks of signal detection and suggest future enhancements. CONCLUSION: Mining ADEs from Twitter posts using a pipeline architecture requires the different components to be trained and tuned based on input data imbalance in order to ensure optimal performance on the end-to-end resolution task. Oxford University Press 2021-07-16 /pmc/articles/PMC8449608/ /pubmed/34270701 http://dx.doi.org/10.1093/jamia/ocab114 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of the American Medical Informatics Association. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Research and Applications
Magge, Arjun
Tutubalina, Elena
Miftahutdinov, Zulfat
Alimova, Ilseyar
Dirkson, Anne
Verberne, Suzan
Weissenbacher, Davy
Gonzalez-Hernandez, Graciela
DeepADEMiner: a deep learning pharmacovigilance pipeline for extraction and normalization of adverse drug event mentions on Twitter
title DeepADEMiner: a deep learning pharmacovigilance pipeline for extraction and normalization of adverse drug event mentions on Twitter
title_full DeepADEMiner: a deep learning pharmacovigilance pipeline for extraction and normalization of adverse drug event mentions on Twitter
title_fullStr DeepADEMiner: a deep learning pharmacovigilance pipeline for extraction and normalization of adverse drug event mentions on Twitter
title_full_unstemmed DeepADEMiner: a deep learning pharmacovigilance pipeline for extraction and normalization of adverse drug event mentions on Twitter
title_short DeepADEMiner: a deep learning pharmacovigilance pipeline for extraction and normalization of adverse drug event mentions on Twitter
title_sort deepademiner: a deep learning pharmacovigilance pipeline for extraction and normalization of adverse drug event mentions on twitter
topic Research and Applications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8449608/
https://www.ncbi.nlm.nih.gov/pubmed/34270701
http://dx.doi.org/10.1093/jamia/ocab114
work_keys_str_mv AT maggearjun deepademineradeeplearningpharmacovigilancepipelineforextractionandnormalizationofadversedrugeventmentionsontwitter
AT tutubalinaelena deepademineradeeplearningpharmacovigilancepipelineforextractionandnormalizationofadversedrugeventmentionsontwitter
AT miftahutdinovzulfat deepademineradeeplearningpharmacovigilancepipelineforextractionandnormalizationofadversedrugeventmentionsontwitter
AT alimovailseyar deepademineradeeplearningpharmacovigilancepipelineforextractionandnormalizationofadversedrugeventmentionsontwitter
AT dirksonanne deepademineradeeplearningpharmacovigilancepipelineforextractionandnormalizationofadversedrugeventmentionsontwitter
AT verbernesuzan deepademineradeeplearningpharmacovigilancepipelineforextractionandnormalizationofadversedrugeventmentionsontwitter
AT weissenbacherdavy deepademineradeeplearningpharmacovigilancepipelineforextractionandnormalizationofadversedrugeventmentionsontwitter
AT gonzalezhernandezgraciela deepademineradeeplearningpharmacovigilancepipelineforextractionandnormalizationofadversedrugeventmentionsontwitter