Cargando…
Predicting adverse drug reactions of combined medication from heterogeneous pharmacologic databases
BACKGROUND: Early and accurate identification of potential adverse drug reactions (ADRs) for combined medication is vital for public health. Existing methods either rely on expensive wet-lab experiments or detecting existing associations from related records. Thus, they inevitably suffer under-repor...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6311930/ https://www.ncbi.nlm.nih.gov/pubmed/30598065 http://dx.doi.org/10.1186/s12859-018-2520-8 |
_version_ | 1783383703956750336 |
---|---|
author | Zheng, Yi Peng, Hui Zhang, Xiaocai Zhao, Zhixun Yin, Jie Li, Jinyan |
author_facet | Zheng, Yi Peng, Hui Zhang, Xiaocai Zhao, Zhixun Yin, Jie Li, Jinyan |
author_sort | Zheng, Yi |
collection | PubMed |
description | BACKGROUND: Early and accurate identification of potential adverse drug reactions (ADRs) for combined medication is vital for public health. Existing methods either rely on expensive wet-lab experiments or detecting existing associations from related records. Thus, they inevitably suffer under-reporting, delays in reporting, and inability to detect ADRs for new and rare drugs. The current application of machine learning methods is severely impeded by the lack of proper drug representation and credible negative samples. Therefore, a method to represent drugs properly and to select credible negative samples becomes vital in applying machine learning methods to this problem. RESULTS: In this work, we propose a machine learning method to predict ADRs of combined medication from pharmacologic databases by building up highly-credible negative samples (HCNS-ADR). Specifically, we fuse heterogeneous information from different databases and represent each drug as a multi-dimensional vector according to its chemical substructures, target proteins, substituents, and related pathways first. Then, a drug-pair vector is obtained by appending the vector of one drug to the other. Next, we construct a drug-disease-gene network and devise a scoring method to measure the interaction probability of every drug pair via network analysis. Drug pairs with lower interaction probability are preferentially selected as negative samples. Following that, the validated positive samples and the selected credible negative samples are projected into a lower-dimensional space using the principal component analysis. Finally, a classifier is built for each ADR using its positive and negative samples with reduced dimensions. The performance of the proposed method is evaluated on simulative prediction for 1276 ADRs and 1048 drugs, comparing using four machine learning algorithms and with two baseline approaches. Extensive experiments show that the proposed way to represent drugs characterizes drugs accurately. With highly-credible negative samples selected by HCNS-ADR, the four machine learning algorithms achieve significant performance improvements. HCNS-ADR is also shown to be able to predict both known and novel drug-drug-ADR associations, outperforming two other baseline approaches significantly. CONCLUSIONS: The results demonstrate that integration of different drug properties to represent drugs are valuable for ADR prediction of combined medication and the selection of highly-credible negative samples can significantly improve the prediction performance. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2520-8) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6311930 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-63119302019-01-07 Predicting adverse drug reactions of combined medication from heterogeneous pharmacologic databases Zheng, Yi Peng, Hui Zhang, Xiaocai Zhao, Zhixun Yin, Jie Li, Jinyan BMC Bioinformatics Research BACKGROUND: Early and accurate identification of potential adverse drug reactions (ADRs) for combined medication is vital for public health. Existing methods either rely on expensive wet-lab experiments or detecting existing associations from related records. Thus, they inevitably suffer under-reporting, delays in reporting, and inability to detect ADRs for new and rare drugs. The current application of machine learning methods is severely impeded by the lack of proper drug representation and credible negative samples. Therefore, a method to represent drugs properly and to select credible negative samples becomes vital in applying machine learning methods to this problem. RESULTS: In this work, we propose a machine learning method to predict ADRs of combined medication from pharmacologic databases by building up highly-credible negative samples (HCNS-ADR). Specifically, we fuse heterogeneous information from different databases and represent each drug as a multi-dimensional vector according to its chemical substructures, target proteins, substituents, and related pathways first. Then, a drug-pair vector is obtained by appending the vector of one drug to the other. Next, we construct a drug-disease-gene network and devise a scoring method to measure the interaction probability of every drug pair via network analysis. Drug pairs with lower interaction probability are preferentially selected as negative samples. Following that, the validated positive samples and the selected credible negative samples are projected into a lower-dimensional space using the principal component analysis. Finally, a classifier is built for each ADR using its positive and negative samples with reduced dimensions. The performance of the proposed method is evaluated on simulative prediction for 1276 ADRs and 1048 drugs, comparing using four machine learning algorithms and with two baseline approaches. Extensive experiments show that the proposed way to represent drugs characterizes drugs accurately. With highly-credible negative samples selected by HCNS-ADR, the four machine learning algorithms achieve significant performance improvements. HCNS-ADR is also shown to be able to predict both known and novel drug-drug-ADR associations, outperforming two other baseline approaches significantly. CONCLUSIONS: The results demonstrate that integration of different drug properties to represent drugs are valuable for ADR prediction of combined medication and the selection of highly-credible negative samples can significantly improve the prediction performance. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2520-8) contains supplementary material, which is available to authorized users. BioMed Central 2018-12-31 /pmc/articles/PMC6311930/ /pubmed/30598065 http://dx.doi.org/10.1186/s12859-018-2520-8 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Zheng, Yi Peng, Hui Zhang, Xiaocai Zhao, Zhixun Yin, Jie Li, Jinyan Predicting adverse drug reactions of combined medication from heterogeneous pharmacologic databases |
title | Predicting adverse drug reactions of combined medication from heterogeneous pharmacologic databases |
title_full | Predicting adverse drug reactions of combined medication from heterogeneous pharmacologic databases |
title_fullStr | Predicting adverse drug reactions of combined medication from heterogeneous pharmacologic databases |
title_full_unstemmed | Predicting adverse drug reactions of combined medication from heterogeneous pharmacologic databases |
title_short | Predicting adverse drug reactions of combined medication from heterogeneous pharmacologic databases |
title_sort | predicting adverse drug reactions of combined medication from heterogeneous pharmacologic databases |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6311930/ https://www.ncbi.nlm.nih.gov/pubmed/30598065 http://dx.doi.org/10.1186/s12859-018-2520-8 |
work_keys_str_mv | AT zhengyi predictingadversedrugreactionsofcombinedmedicationfromheterogeneouspharmacologicdatabases AT penghui predictingadversedrugreactionsofcombinedmedicationfromheterogeneouspharmacologicdatabases AT zhangxiaocai predictingadversedrugreactionsofcombinedmedicationfromheterogeneouspharmacologicdatabases AT zhaozhixun predictingadversedrugreactionsofcombinedmedicationfromheterogeneouspharmacologicdatabases AT yinjie predictingadversedrugreactionsofcombinedmedicationfromheterogeneouspharmacologicdatabases AT lijinyan predictingadversedrugreactionsofcombinedmedicationfromheterogeneouspharmacologicdatabases |