Cargando…

Predicting adverse drug reactions of combined medication from heterogeneous pharmacologic databases

BACKGROUND: Early and accurate identification of potential adverse drug reactions (ADRs) for combined medication is vital for public health. Existing methods either rely on expensive wet-lab experiments or detecting existing associations from related records. Thus, they inevitably suffer under-repor...

Descripción completa

Detalles Bibliográficos
Autores principales: Zheng, Yi, Peng, Hui, Zhang, Xiaocai, Zhao, Zhixun, Yin, Jie, Li, Jinyan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6311930/
https://www.ncbi.nlm.nih.gov/pubmed/30598065
http://dx.doi.org/10.1186/s12859-018-2520-8
_version_ 1783383703956750336
author Zheng, Yi
Peng, Hui
Zhang, Xiaocai
Zhao, Zhixun
Yin, Jie
Li, Jinyan
author_facet Zheng, Yi
Peng, Hui
Zhang, Xiaocai
Zhao, Zhixun
Yin, Jie
Li, Jinyan
author_sort Zheng, Yi
collection PubMed
description BACKGROUND: Early and accurate identification of potential adverse drug reactions (ADRs) for combined medication is vital for public health. Existing methods either rely on expensive wet-lab experiments or detecting existing associations from related records. Thus, they inevitably suffer under-reporting, delays in reporting, and inability to detect ADRs for new and rare drugs. The current application of machine learning methods is severely impeded by the lack of proper drug representation and credible negative samples. Therefore, a method to represent drugs properly and to select credible negative samples becomes vital in applying machine learning methods to this problem. RESULTS: In this work, we propose a machine learning method to predict ADRs of combined medication from pharmacologic databases by building up highly-credible negative samples (HCNS-ADR). Specifically, we fuse heterogeneous information from different databases and represent each drug as a multi-dimensional vector according to its chemical substructures, target proteins, substituents, and related pathways first. Then, a drug-pair vector is obtained by appending the vector of one drug to the other. Next, we construct a drug-disease-gene network and devise a scoring method to measure the interaction probability of every drug pair via network analysis. Drug pairs with lower interaction probability are preferentially selected as negative samples. Following that, the validated positive samples and the selected credible negative samples are projected into a lower-dimensional space using the principal component analysis. Finally, a classifier is built for each ADR using its positive and negative samples with reduced dimensions. The performance of the proposed method is evaluated on simulative prediction for 1276 ADRs and 1048 drugs, comparing using four machine learning algorithms and with two baseline approaches. Extensive experiments show that the proposed way to represent drugs characterizes drugs accurately. With highly-credible negative samples selected by HCNS-ADR, the four machine learning algorithms achieve significant performance improvements. HCNS-ADR is also shown to be able to predict both known and novel drug-drug-ADR associations, outperforming two other baseline approaches significantly. CONCLUSIONS: The results demonstrate that integration of different drug properties to represent drugs are valuable for ADR prediction of combined medication and the selection of highly-credible negative samples can significantly improve the prediction performance. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2520-8) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6311930
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-63119302019-01-07 Predicting adverse drug reactions of combined medication from heterogeneous pharmacologic databases Zheng, Yi Peng, Hui Zhang, Xiaocai Zhao, Zhixun Yin, Jie Li, Jinyan BMC Bioinformatics Research BACKGROUND: Early and accurate identification of potential adverse drug reactions (ADRs) for combined medication is vital for public health. Existing methods either rely on expensive wet-lab experiments or detecting existing associations from related records. Thus, they inevitably suffer under-reporting, delays in reporting, and inability to detect ADRs for new and rare drugs. The current application of machine learning methods is severely impeded by the lack of proper drug representation and credible negative samples. Therefore, a method to represent drugs properly and to select credible negative samples becomes vital in applying machine learning methods to this problem. RESULTS: In this work, we propose a machine learning method to predict ADRs of combined medication from pharmacologic databases by building up highly-credible negative samples (HCNS-ADR). Specifically, we fuse heterogeneous information from different databases and represent each drug as a multi-dimensional vector according to its chemical substructures, target proteins, substituents, and related pathways first. Then, a drug-pair vector is obtained by appending the vector of one drug to the other. Next, we construct a drug-disease-gene network and devise a scoring method to measure the interaction probability of every drug pair via network analysis. Drug pairs with lower interaction probability are preferentially selected as negative samples. Following that, the validated positive samples and the selected credible negative samples are projected into a lower-dimensional space using the principal component analysis. Finally, a classifier is built for each ADR using its positive and negative samples with reduced dimensions. The performance of the proposed method is evaluated on simulative prediction for 1276 ADRs and 1048 drugs, comparing using four machine learning algorithms and with two baseline approaches. Extensive experiments show that the proposed way to represent drugs characterizes drugs accurately. With highly-credible negative samples selected by HCNS-ADR, the four machine learning algorithms achieve significant performance improvements. HCNS-ADR is also shown to be able to predict both known and novel drug-drug-ADR associations, outperforming two other baseline approaches significantly. CONCLUSIONS: The results demonstrate that integration of different drug properties to represent drugs are valuable for ADR prediction of combined medication and the selection of highly-credible negative samples can significantly improve the prediction performance. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2520-8) contains supplementary material, which is available to authorized users. BioMed Central 2018-12-31 /pmc/articles/PMC6311930/ /pubmed/30598065 http://dx.doi.org/10.1186/s12859-018-2520-8 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Zheng, Yi
Peng, Hui
Zhang, Xiaocai
Zhao, Zhixun
Yin, Jie
Li, Jinyan
Predicting adverse drug reactions of combined medication from heterogeneous pharmacologic databases
title Predicting adverse drug reactions of combined medication from heterogeneous pharmacologic databases
title_full Predicting adverse drug reactions of combined medication from heterogeneous pharmacologic databases
title_fullStr Predicting adverse drug reactions of combined medication from heterogeneous pharmacologic databases
title_full_unstemmed Predicting adverse drug reactions of combined medication from heterogeneous pharmacologic databases
title_short Predicting adverse drug reactions of combined medication from heterogeneous pharmacologic databases
title_sort predicting adverse drug reactions of combined medication from heterogeneous pharmacologic databases
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6311930/
https://www.ncbi.nlm.nih.gov/pubmed/30598065
http://dx.doi.org/10.1186/s12859-018-2520-8
work_keys_str_mv AT zhengyi predictingadversedrugreactionsofcombinedmedicationfromheterogeneouspharmacologicdatabases
AT penghui predictingadversedrugreactionsofcombinedmedicationfromheterogeneouspharmacologicdatabases
AT zhangxiaocai predictingadversedrugreactionsofcombinedmedicationfromheterogeneouspharmacologicdatabases
AT zhaozhixun predictingadversedrugreactionsofcombinedmedicationfromheterogeneouspharmacologicdatabases
AT yinjie predictingadversedrugreactionsofcombinedmedicationfromheterogeneouspharmacologicdatabases
AT lijinyan predictingadversedrugreactionsofcombinedmedicationfromheterogeneouspharmacologicdatabases