Cargando…

Mining hidden knowledge for drug safety assessment: topic modeling of LiverTox as a case study

BACKGROUND: Given the significant impact on public health and drug development, drug safety has been a focal point and research emphasis across multiple disciplines in addition to scientific investigation, including consumer advocates, drug developers and regulators. Such a concern and effort has le...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Ke, Zhang, Jie, Chen, Minjun, Xu, Xiaowei, Suzuki, Ayako, Ilic, Katarina, Tong, Weida
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4304199/
https://www.ncbi.nlm.nih.gov/pubmed/25559675
http://dx.doi.org/10.1186/1471-2105-15-S17-S6
_version_ 1782354054878330880
author Yu, Ke
Zhang, Jie
Chen, Minjun
Xu, Xiaowei
Suzuki, Ayako
Ilic, Katarina
Tong, Weida
author_facet Yu, Ke
Zhang, Jie
Chen, Minjun
Xu, Xiaowei
Suzuki, Ayako
Ilic, Katarina
Tong, Weida
author_sort Yu, Ke
collection PubMed
description BACKGROUND: Given the significant impact on public health and drug development, drug safety has been a focal point and research emphasis across multiple disciplines in addition to scientific investigation, including consumer advocates, drug developers and regulators. Such a concern and effort has led numerous databases with drug safety information available in the public domain and the majority of them contain substantial textual data. Text mining offers an opportunity to leverage the hidden knowledge within these textual data for the enhanced understanding of drug safety and thus improving public health. METHODS: In this proof-of-concept study, topic modeling, an unsupervised text mining approach, was performed on the LiverTox database developed by National Institutes of Health (NIH). The LiverTox structured one document per drug that contains multiple sections summarizing clinical information on drug-induced liver injury (DILI). We hypothesized that these documents might contain specific textual patterns that could be used to address key DILI issues. We placed the study on drug-induced acute liver failure (ALF) which was a severe form of DILI with limited treatment options. RESULTS: After topic modeling of the "Hepatotoxicity" sections of the LiverTox across 478 drug documents, we identified a hidden topic relevant to Hy's law that was a widely-accepted rule incriminating drugs with high risk of causing ALF in humans. Using this topic, a total of 127 drugs were further implicated, 77 of which had clear ALF relevant terms in the "Outcome and management" sections of the LiverTox. For the rest of 50 drugs, evidence supporting risk of ALF was found for 42 drugs from other public databases. CONCLUSION: In this case study, the knowledge buried in the textual data was extracted for identification of drugs with potential of causing ALF by applying topic modeling to the LiverTox database. The knowledge further guided identification of drugs with the similar potential and most of them could be verified and confirmed. This study highlights the utility of topic modeling to leverage information within textual drug safety databases, which provides new opportunities in the big data era to assess drug safety.
format Online
Article
Text
id pubmed-4304199
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-43041992015-02-09 Mining hidden knowledge for drug safety assessment: topic modeling of LiverTox as a case study Yu, Ke Zhang, Jie Chen, Minjun Xu, Xiaowei Suzuki, Ayako Ilic, Katarina Tong, Weida BMC Bioinformatics Research BACKGROUND: Given the significant impact on public health and drug development, drug safety has been a focal point and research emphasis across multiple disciplines in addition to scientific investigation, including consumer advocates, drug developers and regulators. Such a concern and effort has led numerous databases with drug safety information available in the public domain and the majority of them contain substantial textual data. Text mining offers an opportunity to leverage the hidden knowledge within these textual data for the enhanced understanding of drug safety and thus improving public health. METHODS: In this proof-of-concept study, topic modeling, an unsupervised text mining approach, was performed on the LiverTox database developed by National Institutes of Health (NIH). The LiverTox structured one document per drug that contains multiple sections summarizing clinical information on drug-induced liver injury (DILI). We hypothesized that these documents might contain specific textual patterns that could be used to address key DILI issues. We placed the study on drug-induced acute liver failure (ALF) which was a severe form of DILI with limited treatment options. RESULTS: After topic modeling of the "Hepatotoxicity" sections of the LiverTox across 478 drug documents, we identified a hidden topic relevant to Hy's law that was a widely-accepted rule incriminating drugs with high risk of causing ALF in humans. Using this topic, a total of 127 drugs were further implicated, 77 of which had clear ALF relevant terms in the "Outcome and management" sections of the LiverTox. For the rest of 50 drugs, evidence supporting risk of ALF was found for 42 drugs from other public databases. CONCLUSION: In this case study, the knowledge buried in the textual data was extracted for identification of drugs with potential of causing ALF by applying topic modeling to the LiverTox database. The knowledge further guided identification of drugs with the similar potential and most of them could be verified and confirmed. This study highlights the utility of topic modeling to leverage information within textual drug safety databases, which provides new opportunities in the big data era to assess drug safety. BioMed Central 2014-12-16 /pmc/articles/PMC4304199/ /pubmed/25559675 http://dx.doi.org/10.1186/1471-2105-15-S17-S6 Text en Copyright © 2014 Yu et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Yu, Ke
Zhang, Jie
Chen, Minjun
Xu, Xiaowei
Suzuki, Ayako
Ilic, Katarina
Tong, Weida
Mining hidden knowledge for drug safety assessment: topic modeling of LiverTox as a case study
title Mining hidden knowledge for drug safety assessment: topic modeling of LiverTox as a case study
title_full Mining hidden knowledge for drug safety assessment: topic modeling of LiverTox as a case study
title_fullStr Mining hidden knowledge for drug safety assessment: topic modeling of LiverTox as a case study
title_full_unstemmed Mining hidden knowledge for drug safety assessment: topic modeling of LiverTox as a case study
title_short Mining hidden knowledge for drug safety assessment: topic modeling of LiverTox as a case study
title_sort mining hidden knowledge for drug safety assessment: topic modeling of livertox as a case study
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4304199/
https://www.ncbi.nlm.nih.gov/pubmed/25559675
http://dx.doi.org/10.1186/1471-2105-15-S17-S6
work_keys_str_mv AT yuke mininghiddenknowledgefordrugsafetyassessmenttopicmodelingoflivertoxasacasestudy
AT zhangjie mininghiddenknowledgefordrugsafetyassessmenttopicmodelingoflivertoxasacasestudy
AT chenminjun mininghiddenknowledgefordrugsafetyassessmenttopicmodelingoflivertoxasacasestudy
AT xuxiaowei mininghiddenknowledgefordrugsafetyassessmenttopicmodelingoflivertoxasacasestudy
AT suzukiayako mininghiddenknowledgefordrugsafetyassessmenttopicmodelingoflivertoxasacasestudy
AT ilickatarina mininghiddenknowledgefordrugsafetyassessmenttopicmodelingoflivertoxasacasestudy
AT tongweida mininghiddenknowledgefordrugsafetyassessmenttopicmodelingoflivertoxasacasestudy