Cargando…

A two-tiered unsupervised clustering approach for drug repositioning through heterogeneous data integration

BACKGROUND: Drug repositioning is the process of identifying new uses for existing drugs. Computational drug repositioning methods can reduce the time, costs and risks of drug development by automating the analysis of the relationships in pharmacology networks. Pharmacology networks are large and he...

Descripción completa

Detalles Bibliográficos
Autores principales: Hameed, Pathima Nusrath, Verspoor, Karin, Kusljic, Snezana, Halgamuge, Saman
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5896044/
https://www.ncbi.nlm.nih.gov/pubmed/29642848
http://dx.doi.org/10.1186/s12859-018-2123-4
_version_ 1783313761986150400
author Hameed, Pathima Nusrath
Verspoor, Karin
Kusljic, Snezana
Halgamuge, Saman
author_facet Hameed, Pathima Nusrath
Verspoor, Karin
Kusljic, Snezana
Halgamuge, Saman
author_sort Hameed, Pathima Nusrath
collection PubMed
description BACKGROUND: Drug repositioning is the process of identifying new uses for existing drugs. Computational drug repositioning methods can reduce the time, costs and risks of drug development by automating the analysis of the relationships in pharmacology networks. Pharmacology networks are large and heterogeneous. Clustering drugs into small groups can simplify large pharmacology networks, these subgroups can also be used as a starting point for repositioning drugs. In this paper, we propose a two-tiered drug-centric unsupervised clustering approach for drug repositioning, integrating heterogeneous drug data profiles: drug-chemical, drug-disease, drug-gene, drug-protein and drug-side effect relationships. RESULTS: The proposed drug repositioning approach is threefold; (i) clustering drugs based on their homogeneous profiles using the Growing Self Organizing Map (GSOM); (ii) clustering drugs based on drug-drug relation matrices based on the previous step, considering three state-of-the-art graph clustering methods; and (iii) inferring drug repositioning candidates and assigning a confidence value for each identified candidate. In this paper, we compare our two-tiered clustering approach against two existing heterogeneous data integration approaches with reference to the Anatomical Therapeutic Chemical (ATC) classification, using GSOM. Our approach yields Normalized Mutual Information (NMI) and Standardized Mutual Information (SMI) of 0.66 and 36.11, respectively, while the two existing methods yield NMI of 0.60 and 0.64 and SMI of 22.26 and 33.59. Moreover, the two existing approaches failed to produce useful cluster separations when using graph clustering algorithms while our approach is able to identify useful clusters for drug repositioning. Furthermore, we provide clinical evidence for four predicted results (Chlorthalidone, Indomethacin, Metformin and Thioridazine) to support that our proposed approach can be reliably used to infer ATC code and drug repositioning. CONCLUSION: The proposed two-tiered unsupervised clustering approach is suitable for drug clustering and enables heterogeneous data integration. It also enables identifying reliable repositioning drug candidates with reference to ATC therapeutic classification. The repositioning drug candidates identified consistently by multiple clustering algorithms and with high confidence have a higher possibility of being effective repositioning candidates. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2123-4) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5896044
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-58960442018-04-12 A two-tiered unsupervised clustering approach for drug repositioning through heterogeneous data integration Hameed, Pathima Nusrath Verspoor, Karin Kusljic, Snezana Halgamuge, Saman BMC Bioinformatics Research Article BACKGROUND: Drug repositioning is the process of identifying new uses for existing drugs. Computational drug repositioning methods can reduce the time, costs and risks of drug development by automating the analysis of the relationships in pharmacology networks. Pharmacology networks are large and heterogeneous. Clustering drugs into small groups can simplify large pharmacology networks, these subgroups can also be used as a starting point for repositioning drugs. In this paper, we propose a two-tiered drug-centric unsupervised clustering approach for drug repositioning, integrating heterogeneous drug data profiles: drug-chemical, drug-disease, drug-gene, drug-protein and drug-side effect relationships. RESULTS: The proposed drug repositioning approach is threefold; (i) clustering drugs based on their homogeneous profiles using the Growing Self Organizing Map (GSOM); (ii) clustering drugs based on drug-drug relation matrices based on the previous step, considering three state-of-the-art graph clustering methods; and (iii) inferring drug repositioning candidates and assigning a confidence value for each identified candidate. In this paper, we compare our two-tiered clustering approach against two existing heterogeneous data integration approaches with reference to the Anatomical Therapeutic Chemical (ATC) classification, using GSOM. Our approach yields Normalized Mutual Information (NMI) and Standardized Mutual Information (SMI) of 0.66 and 36.11, respectively, while the two existing methods yield NMI of 0.60 and 0.64 and SMI of 22.26 and 33.59. Moreover, the two existing approaches failed to produce useful cluster separations when using graph clustering algorithms while our approach is able to identify useful clusters for drug repositioning. Furthermore, we provide clinical evidence for four predicted results (Chlorthalidone, Indomethacin, Metformin and Thioridazine) to support that our proposed approach can be reliably used to infer ATC code and drug repositioning. CONCLUSION: The proposed two-tiered unsupervised clustering approach is suitable for drug clustering and enables heterogeneous data integration. It also enables identifying reliable repositioning drug candidates with reference to ATC therapeutic classification. The repositioning drug candidates identified consistently by multiple clustering algorithms and with high confidence have a higher possibility of being effective repositioning candidates. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2123-4) contains supplementary material, which is available to authorized users. BioMed Central 2018-04-11 /pmc/articles/PMC5896044/ /pubmed/29642848 http://dx.doi.org/10.1186/s12859-018-2123-4 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Hameed, Pathima Nusrath
Verspoor, Karin
Kusljic, Snezana
Halgamuge, Saman
A two-tiered unsupervised clustering approach for drug repositioning through heterogeneous data integration
title A two-tiered unsupervised clustering approach for drug repositioning through heterogeneous data integration
title_full A two-tiered unsupervised clustering approach for drug repositioning through heterogeneous data integration
title_fullStr A two-tiered unsupervised clustering approach for drug repositioning through heterogeneous data integration
title_full_unstemmed A two-tiered unsupervised clustering approach for drug repositioning through heterogeneous data integration
title_short A two-tiered unsupervised clustering approach for drug repositioning through heterogeneous data integration
title_sort two-tiered unsupervised clustering approach for drug repositioning through heterogeneous data integration
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5896044/
https://www.ncbi.nlm.nih.gov/pubmed/29642848
http://dx.doi.org/10.1186/s12859-018-2123-4
work_keys_str_mv AT hameedpathimanusrath atwotieredunsupervisedclusteringapproachfordrugrepositioningthroughheterogeneousdataintegration
AT verspoorkarin atwotieredunsupervisedclusteringapproachfordrugrepositioningthroughheterogeneousdataintegration
AT kusljicsnezana atwotieredunsupervisedclusteringapproachfordrugrepositioningthroughheterogeneousdataintegration
AT halgamugesaman atwotieredunsupervisedclusteringapproachfordrugrepositioningthroughheterogeneousdataintegration
AT hameedpathimanusrath twotieredunsupervisedclusteringapproachfordrugrepositioningthroughheterogeneousdataintegration
AT verspoorkarin twotieredunsupervisedclusteringapproachfordrugrepositioningthroughheterogeneousdataintegration
AT kusljicsnezana twotieredunsupervisedclusteringapproachfordrugrepositioningthroughheterogeneousdataintegration
AT halgamugesaman twotieredunsupervisedclusteringapproachfordrugrepositioningthroughheterogeneousdataintegration