Cargando…

Semi-Supervised Recurrent Neural Network for Adverse Drug Reaction mention extraction

BACKGROUND: Social media is a useful platform to share health-related information due to its vast reach. This makes it a good candidate for public-health monitoring tasks, specifically for pharmacovigilance. We study the problem of extraction of Adverse-Drug-Reaction (ADR) mentions from social media...

Descripción completa

Detalles Bibliográficos
Autores principales: Gupta, Shashank, Pawar, Sachin, Ramrakhiyani, Nitin, Palshikar, Girish Keshav, Varma, Vasudeva
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5998760/
https://www.ncbi.nlm.nih.gov/pubmed/29897321
http://dx.doi.org/10.1186/s12859-018-2192-4
_version_ 1783331292736126976
author Gupta, Shashank
Pawar, Sachin
Ramrakhiyani, Nitin
Palshikar, Girish Keshav
Varma, Vasudeva
author_facet Gupta, Shashank
Pawar, Sachin
Ramrakhiyani, Nitin
Palshikar, Girish Keshav
Varma, Vasudeva
author_sort Gupta, Shashank
collection PubMed
description BACKGROUND: Social media is a useful platform to share health-related information due to its vast reach. This makes it a good candidate for public-health monitoring tasks, specifically for pharmacovigilance. We study the problem of extraction of Adverse-Drug-Reaction (ADR) mentions from social media, particularly from Twitter. Medical information extraction from social media is challenging, mainly due to short and highly informal nature of text, as compared to more technical and formal medical reports. METHODS: Current methods in ADR mention extraction rely on supervised learning methods, which suffer from labeled data scarcity problem. The state-of-the-art method uses deep neural networks, specifically a class of Recurrent Neural Network (RNN) which is Long-Short-Term-Memory network (LSTM). Deep neural networks, due to their large number of free parameters rely heavily on large annotated corpora for learning the end task. But in the real-world, it is hard to get large labeled data, mainly due to the heavy cost associated with the manual annotation. RESULTS: To this end, we propose a novel semi-supervised learning based RNN model, which can leverage unlabeled data also present in abundance on social media. Through experiments we demonstrate the effectiveness of our method, achieving state-of-the-art performance in ADR mention extraction. CONCLUSION: In this study, we tackle the problem of labeled data scarcity for Adverse Drug Reaction mention extraction from social media and propose a novel semi-supervised learning based method which can leverage large unlabeled corpus available in abundance on the web. Through empirical study, we demonstrate that our proposed method outperforms fully supervised learning based baseline which relies on large manually annotated corpus for a good performance.
format Online
Article
Text
id pubmed-5998760
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-59987602018-06-25 Semi-Supervised Recurrent Neural Network for Adverse Drug Reaction mention extraction Gupta, Shashank Pawar, Sachin Ramrakhiyani, Nitin Palshikar, Girish Keshav Varma, Vasudeva BMC Bioinformatics Research BACKGROUND: Social media is a useful platform to share health-related information due to its vast reach. This makes it a good candidate for public-health monitoring tasks, specifically for pharmacovigilance. We study the problem of extraction of Adverse-Drug-Reaction (ADR) mentions from social media, particularly from Twitter. Medical information extraction from social media is challenging, mainly due to short and highly informal nature of text, as compared to more technical and formal medical reports. METHODS: Current methods in ADR mention extraction rely on supervised learning methods, which suffer from labeled data scarcity problem. The state-of-the-art method uses deep neural networks, specifically a class of Recurrent Neural Network (RNN) which is Long-Short-Term-Memory network (LSTM). Deep neural networks, due to their large number of free parameters rely heavily on large annotated corpora for learning the end task. But in the real-world, it is hard to get large labeled data, mainly due to the heavy cost associated with the manual annotation. RESULTS: To this end, we propose a novel semi-supervised learning based RNN model, which can leverage unlabeled data also present in abundance on social media. Through experiments we demonstrate the effectiveness of our method, achieving state-of-the-art performance in ADR mention extraction. CONCLUSION: In this study, we tackle the problem of labeled data scarcity for Adverse Drug Reaction mention extraction from social media and propose a novel semi-supervised learning based method which can leverage large unlabeled corpus available in abundance on the web. Through empirical study, we demonstrate that our proposed method outperforms fully supervised learning based baseline which relies on large manually annotated corpus for a good performance. BioMed Central 2018-06-13 /pmc/articles/PMC5998760/ /pubmed/29897321 http://dx.doi.org/10.1186/s12859-018-2192-4 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Gupta, Shashank
Pawar, Sachin
Ramrakhiyani, Nitin
Palshikar, Girish Keshav
Varma, Vasudeva
Semi-Supervised Recurrent Neural Network for Adverse Drug Reaction mention extraction
title Semi-Supervised Recurrent Neural Network for Adverse Drug Reaction mention extraction
title_full Semi-Supervised Recurrent Neural Network for Adverse Drug Reaction mention extraction
title_fullStr Semi-Supervised Recurrent Neural Network for Adverse Drug Reaction mention extraction
title_full_unstemmed Semi-Supervised Recurrent Neural Network for Adverse Drug Reaction mention extraction
title_short Semi-Supervised Recurrent Neural Network for Adverse Drug Reaction mention extraction
title_sort semi-supervised recurrent neural network for adverse drug reaction mention extraction
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5998760/
https://www.ncbi.nlm.nih.gov/pubmed/29897321
http://dx.doi.org/10.1186/s12859-018-2192-4
work_keys_str_mv AT guptashashank semisupervisedrecurrentneuralnetworkforadversedrugreactionmentionextraction
AT pawarsachin semisupervisedrecurrentneuralnetworkforadversedrugreactionmentionextraction
AT ramrakhiyaninitin semisupervisedrecurrentneuralnetworkforadversedrugreactionmentionextraction
AT palshikargirishkeshav semisupervisedrecurrentneuralnetworkforadversedrugreactionmentionextraction
AT varmavasudeva semisupervisedrecurrentneuralnetworkforadversedrugreactionmentionextraction