Cargando…
A two-tiered unsupervised clustering approach for drug repositioning through heterogeneous data integration
BACKGROUND: Drug repositioning is the process of identifying new uses for existing drugs. Computational drug repositioning methods can reduce the time, costs and risks of drug development by automating the analysis of the relationships in pharmacology networks. Pharmacology networks are large and he...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5896044/ https://www.ncbi.nlm.nih.gov/pubmed/29642848 http://dx.doi.org/10.1186/s12859-018-2123-4 |
_version_ | 1783313761986150400 |
---|---|
author | Hameed, Pathima Nusrath Verspoor, Karin Kusljic, Snezana Halgamuge, Saman |
author_facet | Hameed, Pathima Nusrath Verspoor, Karin Kusljic, Snezana Halgamuge, Saman |
author_sort | Hameed, Pathima Nusrath |
collection | PubMed |
description | BACKGROUND: Drug repositioning is the process of identifying new uses for existing drugs. Computational drug repositioning methods can reduce the time, costs and risks of drug development by automating the analysis of the relationships in pharmacology networks. Pharmacology networks are large and heterogeneous. Clustering drugs into small groups can simplify large pharmacology networks, these subgroups can also be used as a starting point for repositioning drugs. In this paper, we propose a two-tiered drug-centric unsupervised clustering approach for drug repositioning, integrating heterogeneous drug data profiles: drug-chemical, drug-disease, drug-gene, drug-protein and drug-side effect relationships. RESULTS: The proposed drug repositioning approach is threefold; (i) clustering drugs based on their homogeneous profiles using the Growing Self Organizing Map (GSOM); (ii) clustering drugs based on drug-drug relation matrices based on the previous step, considering three state-of-the-art graph clustering methods; and (iii) inferring drug repositioning candidates and assigning a confidence value for each identified candidate. In this paper, we compare our two-tiered clustering approach against two existing heterogeneous data integration approaches with reference to the Anatomical Therapeutic Chemical (ATC) classification, using GSOM. Our approach yields Normalized Mutual Information (NMI) and Standardized Mutual Information (SMI) of 0.66 and 36.11, respectively, while the two existing methods yield NMI of 0.60 and 0.64 and SMI of 22.26 and 33.59. Moreover, the two existing approaches failed to produce useful cluster separations when using graph clustering algorithms while our approach is able to identify useful clusters for drug repositioning. Furthermore, we provide clinical evidence for four predicted results (Chlorthalidone, Indomethacin, Metformin and Thioridazine) to support that our proposed approach can be reliably used to infer ATC code and drug repositioning. CONCLUSION: The proposed two-tiered unsupervised clustering approach is suitable for drug clustering and enables heterogeneous data integration. It also enables identifying reliable repositioning drug candidates with reference to ATC therapeutic classification. The repositioning drug candidates identified consistently by multiple clustering algorithms and with high confidence have a higher possibility of being effective repositioning candidates. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2123-4) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5896044 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-58960442018-04-12 A two-tiered unsupervised clustering approach for drug repositioning through heterogeneous data integration Hameed, Pathima Nusrath Verspoor, Karin Kusljic, Snezana Halgamuge, Saman BMC Bioinformatics Research Article BACKGROUND: Drug repositioning is the process of identifying new uses for existing drugs. Computational drug repositioning methods can reduce the time, costs and risks of drug development by automating the analysis of the relationships in pharmacology networks. Pharmacology networks are large and heterogeneous. Clustering drugs into small groups can simplify large pharmacology networks, these subgroups can also be used as a starting point for repositioning drugs. In this paper, we propose a two-tiered drug-centric unsupervised clustering approach for drug repositioning, integrating heterogeneous drug data profiles: drug-chemical, drug-disease, drug-gene, drug-protein and drug-side effect relationships. RESULTS: The proposed drug repositioning approach is threefold; (i) clustering drugs based on their homogeneous profiles using the Growing Self Organizing Map (GSOM); (ii) clustering drugs based on drug-drug relation matrices based on the previous step, considering three state-of-the-art graph clustering methods; and (iii) inferring drug repositioning candidates and assigning a confidence value for each identified candidate. In this paper, we compare our two-tiered clustering approach against two existing heterogeneous data integration approaches with reference to the Anatomical Therapeutic Chemical (ATC) classification, using GSOM. Our approach yields Normalized Mutual Information (NMI) and Standardized Mutual Information (SMI) of 0.66 and 36.11, respectively, while the two existing methods yield NMI of 0.60 and 0.64 and SMI of 22.26 and 33.59. Moreover, the two existing approaches failed to produce useful cluster separations when using graph clustering algorithms while our approach is able to identify useful clusters for drug repositioning. Furthermore, we provide clinical evidence for four predicted results (Chlorthalidone, Indomethacin, Metformin and Thioridazine) to support that our proposed approach can be reliably used to infer ATC code and drug repositioning. CONCLUSION: The proposed two-tiered unsupervised clustering approach is suitable for drug clustering and enables heterogeneous data integration. It also enables identifying reliable repositioning drug candidates with reference to ATC therapeutic classification. The repositioning drug candidates identified consistently by multiple clustering algorithms and with high confidence have a higher possibility of being effective repositioning candidates. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2123-4) contains supplementary material, which is available to authorized users. BioMed Central 2018-04-11 /pmc/articles/PMC5896044/ /pubmed/29642848 http://dx.doi.org/10.1186/s12859-018-2123-4 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Hameed, Pathima Nusrath Verspoor, Karin Kusljic, Snezana Halgamuge, Saman A two-tiered unsupervised clustering approach for drug repositioning through heterogeneous data integration |
title | A two-tiered unsupervised clustering approach for drug repositioning through heterogeneous data integration |
title_full | A two-tiered unsupervised clustering approach for drug repositioning through heterogeneous data integration |
title_fullStr | A two-tiered unsupervised clustering approach for drug repositioning through heterogeneous data integration |
title_full_unstemmed | A two-tiered unsupervised clustering approach for drug repositioning through heterogeneous data integration |
title_short | A two-tiered unsupervised clustering approach for drug repositioning through heterogeneous data integration |
title_sort | two-tiered unsupervised clustering approach for drug repositioning through heterogeneous data integration |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5896044/ https://www.ncbi.nlm.nih.gov/pubmed/29642848 http://dx.doi.org/10.1186/s12859-018-2123-4 |
work_keys_str_mv | AT hameedpathimanusrath atwotieredunsupervisedclusteringapproachfordrugrepositioningthroughheterogeneousdataintegration AT verspoorkarin atwotieredunsupervisedclusteringapproachfordrugrepositioningthroughheterogeneousdataintegration AT kusljicsnezana atwotieredunsupervisedclusteringapproachfordrugrepositioningthroughheterogeneousdataintegration AT halgamugesaman atwotieredunsupervisedclusteringapproachfordrugrepositioningthroughheterogeneousdataintegration AT hameedpathimanusrath twotieredunsupervisedclusteringapproachfordrugrepositioningthroughheterogeneousdataintegration AT verspoorkarin twotieredunsupervisedclusteringapproachfordrugrepositioningthroughheterogeneousdataintegration AT kusljicsnezana twotieredunsupervisedclusteringapproachfordrugrepositioningthroughheterogeneousdataintegration AT halgamugesaman twotieredunsupervisedclusteringapproachfordrugrepositioningthroughheterogeneousdataintegration |