Cargando…
Predicting drug-disease interactions by semi-supervised graph cut algorithm and three-layer data integration
BACKGROUND: Prediction of drug-disease interactions is promising for either drug repositioning or disease treatment fields. The discovery of novel drug-disease interactions, on one hand can help to find novel indictions for the approved drugs; on the other hand can provide new therapeutic approaches...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5751445/ https://www.ncbi.nlm.nih.gov/pubmed/29297383 http://dx.doi.org/10.1186/s12920-017-0311-0 |
_version_ | 1783289946498400256 |
---|---|
author | Wu, Guangsheng Liu, Juan Wang, Caihua |
author_facet | Wu, Guangsheng Liu, Juan Wang, Caihua |
author_sort | Wu, Guangsheng |
collection | PubMed |
description | BACKGROUND: Prediction of drug-disease interactions is promising for either drug repositioning or disease treatment fields. The discovery of novel drug-disease interactions, on one hand can help to find novel indictions for the approved drugs; on the other hand can provide new therapeutic approaches for the diseases. Recently, computational methods for finding drug-disease interactions have attracted lots of attention because of their far more higher efficiency and lower cost than the traditional wet experiment methods. However, they still face several challenges, such as the organization of the heterogeneous data, the performance of the model, and so on. METHODS: In this work, we present to hierarchically integrate the heterogeneous data into three layers. The drug-drug and disease-disease similarities are first calculated separately in each layer, and then the similarities from three layers are linearly fused into comprehensive drug similarities and disease similarities, which can then be used to measure the similarities between two drug-disease pairs. We construct a novel weighted drug-disease pair network, where a node is a drug-disease pair with known or unknown treatment relation, an edge represents the node-node relation which is weighted with the similarity score between two pairs. Now that similar drug-disease pairs are supposed to show similar treatment patterns, we can find the optimal graph cut of the network. The drug-disease pair with unknown relation can then be considered to have similar treatment relation with that within the same cut. Therefore, we develop a semi-supervised graph cut algorithm, SSGC, to find the optimal graph cut, based on which we can identify the potential drug-disease treatment interactions. RESULTS: By comparing with three representative network-based methods, SSGC achieves the highest performances, in terms of both AUC score and the identification rates of true drug-disease pairs. The experiments with different integration strategies also demonstrate that considering several sources of data can improve the performances of the predictors. Further case studies on four diseases, the top-ranked drug-disease associations have been confirmed by KEGG, CTD database and the literature, illustrating the usefulness of SSGC. CONCLUSIONS: The proposed comprehensive similarity scores from multi-views and multiple layers and the graph-cut based algorithm can greatly improve the prediction performances of drug-disease associations. |
format | Online Article Text |
id | pubmed-5751445 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-57514452018-01-05 Predicting drug-disease interactions by semi-supervised graph cut algorithm and three-layer data integration Wu, Guangsheng Liu, Juan Wang, Caihua BMC Med Genomics Research BACKGROUND: Prediction of drug-disease interactions is promising for either drug repositioning or disease treatment fields. The discovery of novel drug-disease interactions, on one hand can help to find novel indictions for the approved drugs; on the other hand can provide new therapeutic approaches for the diseases. Recently, computational methods for finding drug-disease interactions have attracted lots of attention because of their far more higher efficiency and lower cost than the traditional wet experiment methods. However, they still face several challenges, such as the organization of the heterogeneous data, the performance of the model, and so on. METHODS: In this work, we present to hierarchically integrate the heterogeneous data into three layers. The drug-drug and disease-disease similarities are first calculated separately in each layer, and then the similarities from three layers are linearly fused into comprehensive drug similarities and disease similarities, which can then be used to measure the similarities between two drug-disease pairs. We construct a novel weighted drug-disease pair network, where a node is a drug-disease pair with known or unknown treatment relation, an edge represents the node-node relation which is weighted with the similarity score between two pairs. Now that similar drug-disease pairs are supposed to show similar treatment patterns, we can find the optimal graph cut of the network. The drug-disease pair with unknown relation can then be considered to have similar treatment relation with that within the same cut. Therefore, we develop a semi-supervised graph cut algorithm, SSGC, to find the optimal graph cut, based on which we can identify the potential drug-disease treatment interactions. RESULTS: By comparing with three representative network-based methods, SSGC achieves the highest performances, in terms of both AUC score and the identification rates of true drug-disease pairs. The experiments with different integration strategies also demonstrate that considering several sources of data can improve the performances of the predictors. Further case studies on four diseases, the top-ranked drug-disease associations have been confirmed by KEGG, CTD database and the literature, illustrating the usefulness of SSGC. CONCLUSIONS: The proposed comprehensive similarity scores from multi-views and multiple layers and the graph-cut based algorithm can greatly improve the prediction performances of drug-disease associations. BioMed Central 2017-12-28 /pmc/articles/PMC5751445/ /pubmed/29297383 http://dx.doi.org/10.1186/s12920-017-0311-0 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Wu, Guangsheng Liu, Juan Wang, Caihua Predicting drug-disease interactions by semi-supervised graph cut algorithm and three-layer data integration |
title | Predicting drug-disease interactions by semi-supervised graph cut algorithm and three-layer data integration |
title_full | Predicting drug-disease interactions by semi-supervised graph cut algorithm and three-layer data integration |
title_fullStr | Predicting drug-disease interactions by semi-supervised graph cut algorithm and three-layer data integration |
title_full_unstemmed | Predicting drug-disease interactions by semi-supervised graph cut algorithm and three-layer data integration |
title_short | Predicting drug-disease interactions by semi-supervised graph cut algorithm and three-layer data integration |
title_sort | predicting drug-disease interactions by semi-supervised graph cut algorithm and three-layer data integration |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5751445/ https://www.ncbi.nlm.nih.gov/pubmed/29297383 http://dx.doi.org/10.1186/s12920-017-0311-0 |
work_keys_str_mv | AT wuguangsheng predictingdrugdiseaseinteractionsbysemisupervisedgraphcutalgorithmandthreelayerdataintegration AT liujuan predictingdrugdiseaseinteractionsbysemisupervisedgraphcutalgorithmandthreelayerdataintegration AT wangcaihua predictingdrugdiseaseinteractionsbysemisupervisedgraphcutalgorithmandthreelayerdataintegration |