Cargando…

Drug-target interaction prediction with tree-ensemble learning and output space reconstruction

BACKGROUND: Computational prediction of drug-target interactions (DTI) is vital for drug discovery. The experimental identification of interactions between drugs and target proteins is very onerous. Modern technologies have mitigated the problem, leveraging the development of new drugs. However, dru...

Descripción completa

Detalles Bibliográficos
Autores principales: Pliakos, Konstantinos, Vens, Celine
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7006075/
https://www.ncbi.nlm.nih.gov/pubmed/32033537
http://dx.doi.org/10.1186/s12859-020-3379-z
_version_ 1783495067387822080
author Pliakos, Konstantinos
Vens, Celine
author_facet Pliakos, Konstantinos
Vens, Celine
author_sort Pliakos, Konstantinos
collection PubMed
description BACKGROUND: Computational prediction of drug-target interactions (DTI) is vital for drug discovery. The experimental identification of interactions between drugs and target proteins is very onerous. Modern technologies have mitigated the problem, leveraging the development of new drugs. However, drug development remains extremely expensive and time consuming. Therefore, in silico DTI predictions based on machine learning can alleviate the burdensome task of drug development. Many machine learning approaches have been proposed over the years for DTI prediction. Nevertheless, prediction accuracy and efficiency are persisting problems that still need to be tackled. Here, we propose a new learning method which addresses DTI prediction as a multi-output prediction task by learning ensembles of multi-output bi-clustering trees (eBICT) on reconstructed networks. In our setting, the nodes of a DTI network (drugs and proteins) are represented by features (background information). The interactions between the nodes of a DTI network are modeled as an interaction matrix and compose the output space in our problem. The proposed approach integrates background information from both drug and target protein spaces into the same global network framework. RESULTS: We performed an empirical evaluation, comparing the proposed approach to state of the art DTI prediction methods and demonstrated the effectiveness of the proposed approach in different prediction settings. For evaluation purposes, we used several benchmark datasets that represent drug-protein networks. We show that output space reconstruction can boost the predictive performance of tree-ensemble learning methods, yielding more accurate DTI predictions. CONCLUSIONS: We proposed a new DTI prediction method where bi-clustering trees are built on reconstructed networks. Building tree-ensemble learning models with output space reconstruction leads to superior prediction results, while preserving the advantages of tree-ensembles, such as scalability, interpretability and inductive setting.
format Online
Article
Text
id pubmed-7006075
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-70060752020-02-11 Drug-target interaction prediction with tree-ensemble learning and output space reconstruction Pliakos, Konstantinos Vens, Celine BMC Bioinformatics Methodology Article BACKGROUND: Computational prediction of drug-target interactions (DTI) is vital for drug discovery. The experimental identification of interactions between drugs and target proteins is very onerous. Modern technologies have mitigated the problem, leveraging the development of new drugs. However, drug development remains extremely expensive and time consuming. Therefore, in silico DTI predictions based on machine learning can alleviate the burdensome task of drug development. Many machine learning approaches have been proposed over the years for DTI prediction. Nevertheless, prediction accuracy and efficiency are persisting problems that still need to be tackled. Here, we propose a new learning method which addresses DTI prediction as a multi-output prediction task by learning ensembles of multi-output bi-clustering trees (eBICT) on reconstructed networks. In our setting, the nodes of a DTI network (drugs and proteins) are represented by features (background information). The interactions between the nodes of a DTI network are modeled as an interaction matrix and compose the output space in our problem. The proposed approach integrates background information from both drug and target protein spaces into the same global network framework. RESULTS: We performed an empirical evaluation, comparing the proposed approach to state of the art DTI prediction methods and demonstrated the effectiveness of the proposed approach in different prediction settings. For evaluation purposes, we used several benchmark datasets that represent drug-protein networks. We show that output space reconstruction can boost the predictive performance of tree-ensemble learning methods, yielding more accurate DTI predictions. CONCLUSIONS: We proposed a new DTI prediction method where bi-clustering trees are built on reconstructed networks. Building tree-ensemble learning models with output space reconstruction leads to superior prediction results, while preserving the advantages of tree-ensembles, such as scalability, interpretability and inductive setting. BioMed Central 2020-02-07 /pmc/articles/PMC7006075/ /pubmed/32033537 http://dx.doi.org/10.1186/s12859-020-3379-z Text en © The Author(s) 2020 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Pliakos, Konstantinos
Vens, Celine
Drug-target interaction prediction with tree-ensemble learning and output space reconstruction
title Drug-target interaction prediction with tree-ensemble learning and output space reconstruction
title_full Drug-target interaction prediction with tree-ensemble learning and output space reconstruction
title_fullStr Drug-target interaction prediction with tree-ensemble learning and output space reconstruction
title_full_unstemmed Drug-target interaction prediction with tree-ensemble learning and output space reconstruction
title_short Drug-target interaction prediction with tree-ensemble learning and output space reconstruction
title_sort drug-target interaction prediction with tree-ensemble learning and output space reconstruction
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7006075/
https://www.ncbi.nlm.nih.gov/pubmed/32033537
http://dx.doi.org/10.1186/s12859-020-3379-z
work_keys_str_mv AT pliakoskonstantinos drugtargetinteractionpredictionwithtreeensemblelearningandoutputspacereconstruction
AT vensceline drugtargetinteractionpredictionwithtreeensemblelearningandoutputspacereconstruction