Cargando…

Mining Adverse Drug Reactions from online healthcare forums using Hidden Markov Model

BACKGROUND: Adverse Drug Reactions are one of the leading causes of injury or death among patients undergoing medical treatments. Not all Adverse Drug Reactions are identified before a drug is made available in the market. Current post-marketing drug surveillance methods, which are based purely on v...

Descripción completa

Detalles Bibliográficos
Autores principales: Sampathkumar, Hariprasad, Chen, Xue-wen, Luo, Bo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4283122/
https://www.ncbi.nlm.nih.gov/pubmed/25341686
http://dx.doi.org/10.1186/1472-6947-14-91
_version_ 1782351220763000832
author Sampathkumar, Hariprasad
Chen, Xue-wen
Luo, Bo
author_facet Sampathkumar, Hariprasad
Chen, Xue-wen
Luo, Bo
author_sort Sampathkumar, Hariprasad
collection PubMed
description BACKGROUND: Adverse Drug Reactions are one of the leading causes of injury or death among patients undergoing medical treatments. Not all Adverse Drug Reactions are identified before a drug is made available in the market. Current post-marketing drug surveillance methods, which are based purely on voluntary spontaneous reports, are unable to provide the early indications necessary to prevent the occurrence of such injuries or fatalities. The objective of this research is to extract reports of adverse drug side-effects from messages in online healthcare forums and use them as early indicators to assist in post-marketing drug surveillance. METHODS: We treat the task of extracting adverse side-effects of drugs from healthcare forum messages as a sequence labeling problem and present a Hidden Markov Model(HMM) based Text Mining system that can be used to classify a message as containing drug side-effect information and then extract the adverse side-effect mentions from it. A manually annotated dataset from http://www.medications.comis used in the training and validation of the HMM based Text Mining system. RESULTS: A 10-fold cross-validation on the manually annotated dataset yielded on average an F-Score of 0.76 from the HMM Classifier, in comparison to 0.575 from the Baseline classifier. Without the Plain Text Filter component as a part of the Text Processing module, the F-Score of the HMM Classifier was reduced to 0.378 on average, while absence of the HTML Filter component was found to have no impact. Reducing the Drug names dictionary size by half, on average reduced the F-Score of the HMM Classifier to 0.359, while a similar reduction to the side-effects dictionary yielded an F-Score of 0.651 on average. Adverse side-effects mined from http://www.medications.comand http://www.steadyhealth.comwere found to match the Adverse Drug Reactions on the Drug Package Labels of several drugs. In addition, some novel adverse side-effects, which can be potential Adverse Drug Reactions, were also identified. CONCLUSIONS: The results from the HMM based Text Miner are encouraging to pursue further enhancements to this approach. The mined novel side-effects can act as early indicators for health authorities to help focus their efforts in post-marketing drug surveillance.
format Online
Article
Text
id pubmed-4283122
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42831222015-01-06 Mining Adverse Drug Reactions from online healthcare forums using Hidden Markov Model Sampathkumar, Hariprasad Chen, Xue-wen Luo, Bo BMC Med Inform Decis Mak Research Article BACKGROUND: Adverse Drug Reactions are one of the leading causes of injury or death among patients undergoing medical treatments. Not all Adverse Drug Reactions are identified before a drug is made available in the market. Current post-marketing drug surveillance methods, which are based purely on voluntary spontaneous reports, are unable to provide the early indications necessary to prevent the occurrence of such injuries or fatalities. The objective of this research is to extract reports of adverse drug side-effects from messages in online healthcare forums and use them as early indicators to assist in post-marketing drug surveillance. METHODS: We treat the task of extracting adverse side-effects of drugs from healthcare forum messages as a sequence labeling problem and present a Hidden Markov Model(HMM) based Text Mining system that can be used to classify a message as containing drug side-effect information and then extract the adverse side-effect mentions from it. A manually annotated dataset from http://www.medications.comis used in the training and validation of the HMM based Text Mining system. RESULTS: A 10-fold cross-validation on the manually annotated dataset yielded on average an F-Score of 0.76 from the HMM Classifier, in comparison to 0.575 from the Baseline classifier. Without the Plain Text Filter component as a part of the Text Processing module, the F-Score of the HMM Classifier was reduced to 0.378 on average, while absence of the HTML Filter component was found to have no impact. Reducing the Drug names dictionary size by half, on average reduced the F-Score of the HMM Classifier to 0.359, while a similar reduction to the side-effects dictionary yielded an F-Score of 0.651 on average. Adverse side-effects mined from http://www.medications.comand http://www.steadyhealth.comwere found to match the Adverse Drug Reactions on the Drug Package Labels of several drugs. In addition, some novel adverse side-effects, which can be potential Adverse Drug Reactions, were also identified. CONCLUSIONS: The results from the HMM based Text Miner are encouraging to pursue further enhancements to this approach. The mined novel side-effects can act as early indicators for health authorities to help focus their efforts in post-marketing drug surveillance. BioMed Central 2014-10-23 /pmc/articles/PMC4283122/ /pubmed/25341686 http://dx.doi.org/10.1186/1472-6947-14-91 Text en © Sampathkumar et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Sampathkumar, Hariprasad
Chen, Xue-wen
Luo, Bo
Mining Adverse Drug Reactions from online healthcare forums using Hidden Markov Model
title Mining Adverse Drug Reactions from online healthcare forums using Hidden Markov Model
title_full Mining Adverse Drug Reactions from online healthcare forums using Hidden Markov Model
title_fullStr Mining Adverse Drug Reactions from online healthcare forums using Hidden Markov Model
title_full_unstemmed Mining Adverse Drug Reactions from online healthcare forums using Hidden Markov Model
title_short Mining Adverse Drug Reactions from online healthcare forums using Hidden Markov Model
title_sort mining adverse drug reactions from online healthcare forums using hidden markov model
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4283122/
https://www.ncbi.nlm.nih.gov/pubmed/25341686
http://dx.doi.org/10.1186/1472-6947-14-91
work_keys_str_mv AT sampathkumarhariprasad miningadversedrugreactionsfromonlinehealthcareforumsusinghiddenmarkovmodel
AT chenxuewen miningadversedrugreactionsfromonlinehealthcareforumsusinghiddenmarkovmodel
AT luobo miningadversedrugreactionsfromonlinehealthcareforumsusinghiddenmarkovmodel