Cargando…
Generalized enrichment analysis improves the detection of adverse drug events from the biomedical literature
BACKGROUND: Identification of associations between marketed drugs and adverse events from the biomedical literature assists drug safety monitoring efforts. Assessing the significance of such literature-derived associations and determining the granularity at which they should be captured remains a ch...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4918084/ https://www.ncbi.nlm.nih.gov/pubmed/27333889 http://dx.doi.org/10.1186/s12859-016-1080-z |
_version_ | 1782439055820062720 |
---|---|
author | Winnenburg, Rainer Shah, Nigam H. |
author_facet | Winnenburg, Rainer Shah, Nigam H. |
author_sort | Winnenburg, Rainer |
collection | PubMed |
description | BACKGROUND: Identification of associations between marketed drugs and adverse events from the biomedical literature assists drug safety monitoring efforts. Assessing the significance of such literature-derived associations and determining the granularity at which they should be captured remains a challenge. Here, we assess how defining a selection of adverse event terms from MeSH, based on information content, can improve the detection of adverse events for drugs and drug classes. RESULTS: We analyze a set of 105,354 candidate drug adverse event pairs extracted from article indexes in MEDLINE. First, we harmonize extracted adverse event terms by aggregating them into higher-level MeSH terms based on the terms’ information content. Then, we determine statistical enrichment of adverse events associated with drug and drug classes using a conditional hypergeometric test that adjusts for dependencies among associated terms. We compare our results with methods based on disproportionality analysis (proportional reporting ratio, PRR) and quantify the improvement in signal detection with our generalized enrichment analysis (GEA) approach using a gold standard of drug-adverse event associations spanning 174 drugs and four events. For single drugs, the best GEA method (Precision: .92/Recall: .71/F1-measure: .80) outperforms the best PRR based method (.69/.69/.69) on all four adverse event outcomes in our gold standard. For drug classes, our GEA performs similarly (.85/.69/.74) when increasing the level of abstraction for adverse event terms. Finally, on examining the 1609 individual drugs in our MEDLINE set, which map to chemical substances in ATC, we find signals for 1379 drugs (10,122 unique adverse event associations) on applying GEA with p < 0.005. CONCLUSIONS: We present an approach based on generalized enrichment analysis that can be used to detect associations between drugs, drug classes and adverse events at a given level of granularity, at the same time correcting for known dependencies among events. Our study demonstrates the use of GEA, and the importance of choosing appropriate abstraction levels to complement current drug safety methods. We provide an R package for exploration of alternative abstraction levels of adverse event terms based on information content. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1080-z) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4918084 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-49180842016-06-28 Generalized enrichment analysis improves the detection of adverse drug events from the biomedical literature Winnenburg, Rainer Shah, Nigam H. BMC Bioinformatics Methodology Article BACKGROUND: Identification of associations between marketed drugs and adverse events from the biomedical literature assists drug safety monitoring efforts. Assessing the significance of such literature-derived associations and determining the granularity at which they should be captured remains a challenge. Here, we assess how defining a selection of adverse event terms from MeSH, based on information content, can improve the detection of adverse events for drugs and drug classes. RESULTS: We analyze a set of 105,354 candidate drug adverse event pairs extracted from article indexes in MEDLINE. First, we harmonize extracted adverse event terms by aggregating them into higher-level MeSH terms based on the terms’ information content. Then, we determine statistical enrichment of adverse events associated with drug and drug classes using a conditional hypergeometric test that adjusts for dependencies among associated terms. We compare our results with methods based on disproportionality analysis (proportional reporting ratio, PRR) and quantify the improvement in signal detection with our generalized enrichment analysis (GEA) approach using a gold standard of drug-adverse event associations spanning 174 drugs and four events. For single drugs, the best GEA method (Precision: .92/Recall: .71/F1-measure: .80) outperforms the best PRR based method (.69/.69/.69) on all four adverse event outcomes in our gold standard. For drug classes, our GEA performs similarly (.85/.69/.74) when increasing the level of abstraction for adverse event terms. Finally, on examining the 1609 individual drugs in our MEDLINE set, which map to chemical substances in ATC, we find signals for 1379 drugs (10,122 unique adverse event associations) on applying GEA with p < 0.005. CONCLUSIONS: We present an approach based on generalized enrichment analysis that can be used to detect associations between drugs, drug classes and adverse events at a given level of granularity, at the same time correcting for known dependencies among events. Our study demonstrates the use of GEA, and the importance of choosing appropriate abstraction levels to complement current drug safety methods. We provide an R package for exploration of alternative abstraction levels of adverse event terms based on information content. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1080-z) contains supplementary material, which is available to authorized users. BioMed Central 2016-06-23 /pmc/articles/PMC4918084/ /pubmed/27333889 http://dx.doi.org/10.1186/s12859-016-1080-z Text en © Winnenburg and Shah. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Winnenburg, Rainer Shah, Nigam H. Generalized enrichment analysis improves the detection of adverse drug events from the biomedical literature |
title | Generalized enrichment analysis improves the detection of adverse drug events from the biomedical literature |
title_full | Generalized enrichment analysis improves the detection of adverse drug events from the biomedical literature |
title_fullStr | Generalized enrichment analysis improves the detection of adverse drug events from the biomedical literature |
title_full_unstemmed | Generalized enrichment analysis improves the detection of adverse drug events from the biomedical literature |
title_short | Generalized enrichment analysis improves the detection of adverse drug events from the biomedical literature |
title_sort | generalized enrichment analysis improves the detection of adverse drug events from the biomedical literature |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4918084/ https://www.ncbi.nlm.nih.gov/pubmed/27333889 http://dx.doi.org/10.1186/s12859-016-1080-z |
work_keys_str_mv | AT winnenburgrainer generalizedenrichmentanalysisimprovesthedetectionofadversedrugeventsfromthebiomedicalliterature AT shahnigamh generalizedenrichmentanalysisimprovesthedetectionofadversedrugeventsfromthebiomedicalliterature |