Cargando…
Drug-target interaction prediction with tree-ensemble learning and output space reconstruction
BACKGROUND: Computational prediction of drug-target interactions (DTI) is vital for drug discovery. The experimental identification of interactions between drugs and target proteins is very onerous. Modern technologies have mitigated the problem, leveraging the development of new drugs. However, dru...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7006075/ https://www.ncbi.nlm.nih.gov/pubmed/32033537 http://dx.doi.org/10.1186/s12859-020-3379-z |
_version_ | 1783495067387822080 |
---|---|
author | Pliakos, Konstantinos Vens, Celine |
author_facet | Pliakos, Konstantinos Vens, Celine |
author_sort | Pliakos, Konstantinos |
collection | PubMed |
description | BACKGROUND: Computational prediction of drug-target interactions (DTI) is vital for drug discovery. The experimental identification of interactions between drugs and target proteins is very onerous. Modern technologies have mitigated the problem, leveraging the development of new drugs. However, drug development remains extremely expensive and time consuming. Therefore, in silico DTI predictions based on machine learning can alleviate the burdensome task of drug development. Many machine learning approaches have been proposed over the years for DTI prediction. Nevertheless, prediction accuracy and efficiency are persisting problems that still need to be tackled. Here, we propose a new learning method which addresses DTI prediction as a multi-output prediction task by learning ensembles of multi-output bi-clustering trees (eBICT) on reconstructed networks. In our setting, the nodes of a DTI network (drugs and proteins) are represented by features (background information). The interactions between the nodes of a DTI network are modeled as an interaction matrix and compose the output space in our problem. The proposed approach integrates background information from both drug and target protein spaces into the same global network framework. RESULTS: We performed an empirical evaluation, comparing the proposed approach to state of the art DTI prediction methods and demonstrated the effectiveness of the proposed approach in different prediction settings. For evaluation purposes, we used several benchmark datasets that represent drug-protein networks. We show that output space reconstruction can boost the predictive performance of tree-ensemble learning methods, yielding more accurate DTI predictions. CONCLUSIONS: We proposed a new DTI prediction method where bi-clustering trees are built on reconstructed networks. Building tree-ensemble learning models with output space reconstruction leads to superior prediction results, while preserving the advantages of tree-ensembles, such as scalability, interpretability and inductive setting. |
format | Online Article Text |
id | pubmed-7006075 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-70060752020-02-11 Drug-target interaction prediction with tree-ensemble learning and output space reconstruction Pliakos, Konstantinos Vens, Celine BMC Bioinformatics Methodology Article BACKGROUND: Computational prediction of drug-target interactions (DTI) is vital for drug discovery. The experimental identification of interactions between drugs and target proteins is very onerous. Modern technologies have mitigated the problem, leveraging the development of new drugs. However, drug development remains extremely expensive and time consuming. Therefore, in silico DTI predictions based on machine learning can alleviate the burdensome task of drug development. Many machine learning approaches have been proposed over the years for DTI prediction. Nevertheless, prediction accuracy and efficiency are persisting problems that still need to be tackled. Here, we propose a new learning method which addresses DTI prediction as a multi-output prediction task by learning ensembles of multi-output bi-clustering trees (eBICT) on reconstructed networks. In our setting, the nodes of a DTI network (drugs and proteins) are represented by features (background information). The interactions between the nodes of a DTI network are modeled as an interaction matrix and compose the output space in our problem. The proposed approach integrates background information from both drug and target protein spaces into the same global network framework. RESULTS: We performed an empirical evaluation, comparing the proposed approach to state of the art DTI prediction methods and demonstrated the effectiveness of the proposed approach in different prediction settings. For evaluation purposes, we used several benchmark datasets that represent drug-protein networks. We show that output space reconstruction can boost the predictive performance of tree-ensemble learning methods, yielding more accurate DTI predictions. CONCLUSIONS: We proposed a new DTI prediction method where bi-clustering trees are built on reconstructed networks. Building tree-ensemble learning models with output space reconstruction leads to superior prediction results, while preserving the advantages of tree-ensembles, such as scalability, interpretability and inductive setting. BioMed Central 2020-02-07 /pmc/articles/PMC7006075/ /pubmed/32033537 http://dx.doi.org/10.1186/s12859-020-3379-z Text en © The Author(s) 2020 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Pliakos, Konstantinos Vens, Celine Drug-target interaction prediction with tree-ensemble learning and output space reconstruction |
title | Drug-target interaction prediction with tree-ensemble learning and output space reconstruction |
title_full | Drug-target interaction prediction with tree-ensemble learning and output space reconstruction |
title_fullStr | Drug-target interaction prediction with tree-ensemble learning and output space reconstruction |
title_full_unstemmed | Drug-target interaction prediction with tree-ensemble learning and output space reconstruction |
title_short | Drug-target interaction prediction with tree-ensemble learning and output space reconstruction |
title_sort | drug-target interaction prediction with tree-ensemble learning and output space reconstruction |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7006075/ https://www.ncbi.nlm.nih.gov/pubmed/32033537 http://dx.doi.org/10.1186/s12859-020-3379-z |
work_keys_str_mv | AT pliakoskonstantinos drugtargetinteractionpredictionwithtreeensemblelearningandoutputspacereconstruction AT vensceline drugtargetinteractionpredictionwithtreeensemblelearningandoutputspacereconstruction |