Cargando…

Adverse Drug Event Discovery Using Biomedical Literature: A Big Data Neural Network Adventure

BACKGROUND: The study of adverse drug events (ADEs) is a tenured topic in medical literature. In recent years, increasing numbers of scientific articles and health-related social media posts have been generated and shared daily, albeit with very limited use for ADE study and with little known about...

Descripción completa

Detalles Bibliográficos
Autores principales: P Tafti, Ahmad, Badger, Jonathan, LaRose, Eric, Shirzadi, Ehsan, Mahnke, Andrea, Mayer, John, Ye, Zhan, Page, David, Peissig, Peggy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5741828/
https://www.ncbi.nlm.nih.gov/pubmed/29222076
http://dx.doi.org/10.2196/medinform.9170
_version_ 1783288259697180672
author P Tafti, Ahmad
Badger, Jonathan
LaRose, Eric
Shirzadi, Ehsan
Mahnke, Andrea
Mayer, John
Ye, Zhan
Page, David
Peissig, Peggy
author_facet P Tafti, Ahmad
Badger, Jonathan
LaRose, Eric
Shirzadi, Ehsan
Mahnke, Andrea
Mayer, John
Ye, Zhan
Page, David
Peissig, Peggy
author_sort P Tafti, Ahmad
collection PubMed
description BACKGROUND: The study of adverse drug events (ADEs) is a tenured topic in medical literature. In recent years, increasing numbers of scientific articles and health-related social media posts have been generated and shared daily, albeit with very limited use for ADE study and with little known about the content with respect to ADEs. OBJECTIVE: The aim of this study was to develop a big data analytics strategy that mines the content of scientific articles and health-related Web-based social media to detect and identify ADEs. METHODS: We analyzed the following two data sources: (1) biomedical articles and (2) health-related social media blog posts. We developed an intelligent and scalable text mining solution on big data infrastructures composed of Apache Spark, natural language processing, and machine learning. This was combined with an Elasticsearch No-SQL distributed database to explore and visualize ADEs. RESULTS: The accuracy, precision, recall, and area under receiver operating characteristic of the system were 92.7%, 93.6%, 93.0%, and 0.905, respectively, and showed better results in comparison with traditional approaches in the literature. This work not only detected and classified ADE sentences from big data biomedical literature but also scientifically visualized ADE interactions. CONCLUSIONS: To the best of our knowledge, this work is the first to investigate a big data machine learning strategy for ADE discovery on massive datasets downloaded from PubMed Central and social media. This contribution illustrates possible capacities in big data biomedical text analysis using advanced computational methods with real-time update from new data published on a daily basis.
format Online
Article
Text
id pubmed-5741828
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-57418282018-01-02 Adverse Drug Event Discovery Using Biomedical Literature: A Big Data Neural Network Adventure P Tafti, Ahmad Badger, Jonathan LaRose, Eric Shirzadi, Ehsan Mahnke, Andrea Mayer, John Ye, Zhan Page, David Peissig, Peggy JMIR Med Inform Original Paper BACKGROUND: The study of adverse drug events (ADEs) is a tenured topic in medical literature. In recent years, increasing numbers of scientific articles and health-related social media posts have been generated and shared daily, albeit with very limited use for ADE study and with little known about the content with respect to ADEs. OBJECTIVE: The aim of this study was to develop a big data analytics strategy that mines the content of scientific articles and health-related Web-based social media to detect and identify ADEs. METHODS: We analyzed the following two data sources: (1) biomedical articles and (2) health-related social media blog posts. We developed an intelligent and scalable text mining solution on big data infrastructures composed of Apache Spark, natural language processing, and machine learning. This was combined with an Elasticsearch No-SQL distributed database to explore and visualize ADEs. RESULTS: The accuracy, precision, recall, and area under receiver operating characteristic of the system were 92.7%, 93.6%, 93.0%, and 0.905, respectively, and showed better results in comparison with traditional approaches in the literature. This work not only detected and classified ADE sentences from big data biomedical literature but also scientifically visualized ADE interactions. CONCLUSIONS: To the best of our knowledge, this work is the first to investigate a big data machine learning strategy for ADE discovery on massive datasets downloaded from PubMed Central and social media. This contribution illustrates possible capacities in big data biomedical text analysis using advanced computational methods with real-time update from new data published on a daily basis. JMIR Publications 2017-12-08 /pmc/articles/PMC5741828/ /pubmed/29222076 http://dx.doi.org/10.2196/medinform.9170 Text en ©Ahmad P Tafti, Jonathan Badger, Eric LaRose, Ehsan Shirzadi, Andrea Mahnke, John Mayer, Zhan Ye, David Page, Peggy Peissig. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 08.12.2017. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
P Tafti, Ahmad
Badger, Jonathan
LaRose, Eric
Shirzadi, Ehsan
Mahnke, Andrea
Mayer, John
Ye, Zhan
Page, David
Peissig, Peggy
Adverse Drug Event Discovery Using Biomedical Literature: A Big Data Neural Network Adventure
title Adverse Drug Event Discovery Using Biomedical Literature: A Big Data Neural Network Adventure
title_full Adverse Drug Event Discovery Using Biomedical Literature: A Big Data Neural Network Adventure
title_fullStr Adverse Drug Event Discovery Using Biomedical Literature: A Big Data Neural Network Adventure
title_full_unstemmed Adverse Drug Event Discovery Using Biomedical Literature: A Big Data Neural Network Adventure
title_short Adverse Drug Event Discovery Using Biomedical Literature: A Big Data Neural Network Adventure
title_sort adverse drug event discovery using biomedical literature: a big data neural network adventure
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5741828/
https://www.ncbi.nlm.nih.gov/pubmed/29222076
http://dx.doi.org/10.2196/medinform.9170
work_keys_str_mv AT ptaftiahmad adversedrugeventdiscoveryusingbiomedicalliteratureabigdataneuralnetworkadventure
AT badgerjonathan adversedrugeventdiscoveryusingbiomedicalliteratureabigdataneuralnetworkadventure
AT laroseeric adversedrugeventdiscoveryusingbiomedicalliteratureabigdataneuralnetworkadventure
AT shirzadiehsan adversedrugeventdiscoveryusingbiomedicalliteratureabigdataneuralnetworkadventure
AT mahnkeandrea adversedrugeventdiscoveryusingbiomedicalliteratureabigdataneuralnetworkadventure
AT mayerjohn adversedrugeventdiscoveryusingbiomedicalliteratureabigdataneuralnetworkadventure
AT yezhan adversedrugeventdiscoveryusingbiomedicalliteratureabigdataneuralnetworkadventure
AT pagedavid adversedrugeventdiscoveryusingbiomedicalliteratureabigdataneuralnetworkadventure
AT peissigpeggy adversedrugeventdiscoveryusingbiomedicalliteratureabigdataneuralnetworkadventure