Cargando…

A multi-label approach to target prediction taking ligand promiscuity into account

BACKGROUND: According to Cobanoglu et al., it is now widely acknowledged that the single target paradigm (one protein/target, one disease, one drug) that has been the dominant premise in drug development in the recent past is untenable. More often than not, a drug-like compound (ligand) can be promi...

Descripción completa

Detalles Bibliográficos
Autores principales:	Afzal, Avid M, Mussa, Hamse Y, Turner, Richard E, Bender, Andreas, Glen, Robert C
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer International Publishing 2015
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4461803/ https://www.ncbi.nlm.nih.gov/pubmed/26064191 http://dx.doi.org/10.1186/s13321-015-0071-9

_version_	1782375559025655808
author	Afzal, Avid M Mussa, Hamse Y Turner, Richard E Bender, Andreas Glen, Robert C
author_facet	Afzal, Avid M Mussa, Hamse Y Turner, Richard E Bender, Andreas Glen, Robert C
author_sort	Afzal, Avid M
collection	PubMed
description	BACKGROUND: According to Cobanoglu et al., it is now widely acknowledged that the single target paradigm (one protein/target, one disease, one drug) that has been the dominant premise in drug development in the recent past is untenable. More often than not, a drug-like compound (ligand) can be promiscuous – it can interact with more than one target protein. In recent years, in in silico target prediction methods the promiscuity issue has generally been approached computationally in three main ways: ligand-based methods; target-protein-based methods; and integrative schemes. In this study we confine attention to ligand-based target prediction machine learning approaches, commonly referred to as target-fishing. The target-fishing approaches that are currently ubiquitous in cheminformatics literature can be essentially viewed as single-label multi-classification schemes; these approaches inherently bank on the single target paradigm assumption that a ligand can zero in on one single target. In order to address the ligand promiscuity issue, one might be able to cast target-fishing as a multi-label multi-class classification problem. For illustrative and comparison purposes, single-label and multi-label Naïve Bayes classification models (denoted here by SMM and MMM, respectively) for target-fishing were implemented. The models were constructed and tested on 65,587 compounds/ligands and 308 targets retrieved from the ChEMBL17 database. RESULTS: On classifying 3,332 test multi-label (promiscuous) compounds, SMM and MMM performed differently. At the 0.05 significance level, a Wilcoxon signed rank test performed on the paired target predictions yielded by SMM and MMM for the test ligands gave a p-value < 5.1 × 10(−94) and test statistics value of 6.8 × 10(5), in favour of MMM. The two models performed differently when tested on four datasets comprising single-label (non-promiscuous) compounds; McNemar’s test yielded χ(2) values of 15.657, 16.500 and 16.405 (with corresponding p-values of 7.594 × 10(−05), 4.865 × 10(−05) and 5.115 × 10(−05)), respectively, for three test sets, in favour of MMM. The models performed similarly on the fourth set. CONCLUSIONS: The target prediction results obtained in this study indicate that multi-label multi-class approaches are more apt than the ubiquitous single-label multi-class schemes when it comes to the application of ligand-based classifiers to target-fishing. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13321-015-0071-9) contains supplementary material, which is available to authorized users.
format	Online Article Text
id	pubmed-4461803
institution	National Center for Biotechnology Information
language	English
publishDate	2015
publisher	Springer International Publishing
record_format	MEDLINE/PubMed
spelling	pubmed-44618032015-06-11 A multi-label approach to target prediction taking ligand promiscuity into account Afzal, Avid M Mussa, Hamse Y Turner, Richard E Bender, Andreas Glen, Robert C J Cheminform Research Article BACKGROUND: According to Cobanoglu et al., it is now widely acknowledged that the single target paradigm (one protein/target, one disease, one drug) that has been the dominant premise in drug development in the recent past is untenable. More often than not, a drug-like compound (ligand) can be promiscuous – it can interact with more than one target protein. In recent years, in in silico target prediction methods the promiscuity issue has generally been approached computationally in three main ways: ligand-based methods; target-protein-based methods; and integrative schemes. In this study we confine attention to ligand-based target prediction machine learning approaches, commonly referred to as target-fishing. The target-fishing approaches that are currently ubiquitous in cheminformatics literature can be essentially viewed as single-label multi-classification schemes; these approaches inherently bank on the single target paradigm assumption that a ligand can zero in on one single target. In order to address the ligand promiscuity issue, one might be able to cast target-fishing as a multi-label multi-class classification problem. For illustrative and comparison purposes, single-label and multi-label Naïve Bayes classification models (denoted here by SMM and MMM, respectively) for target-fishing were implemented. The models were constructed and tested on 65,587 compounds/ligands and 308 targets retrieved from the ChEMBL17 database. RESULTS: On classifying 3,332 test multi-label (promiscuous) compounds, SMM and MMM performed differently. At the 0.05 significance level, a Wilcoxon signed rank test performed on the paired target predictions yielded by SMM and MMM for the test ligands gave a p-value < 5.1 × 10(−94) and test statistics value of 6.8 × 10(5), in favour of MMM. The two models performed differently when tested on four datasets comprising single-label (non-promiscuous) compounds; McNemar’s test yielded χ(2) values of 15.657, 16.500 and 16.405 (with corresponding p-values of 7.594 × 10(−05), 4.865 × 10(−05) and 5.115 × 10(−05)), respectively, for three test sets, in favour of MMM. The models performed similarly on the fourth set. CONCLUSIONS: The target prediction results obtained in this study indicate that multi-label multi-class approaches are more apt than the ubiquitous single-label multi-class schemes when it comes to the application of ligand-based classifiers to target-fishing. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13321-015-0071-9) contains supplementary material, which is available to authorized users. Springer International Publishing 2015-05-30 /pmc/articles/PMC4461803/ /pubmed/26064191 http://dx.doi.org/10.1186/s13321-015-0071-9 Text en © Afzal et al. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Article Afzal, Avid M Mussa, Hamse Y Turner, Richard E Bender, Andreas Glen, Robert C A multi-label approach to target prediction taking ligand promiscuity into account
title	A multi-label approach to target prediction taking ligand promiscuity into account
title_full	A multi-label approach to target prediction taking ligand promiscuity into account
title_fullStr	A multi-label approach to target prediction taking ligand promiscuity into account
title_full_unstemmed	A multi-label approach to target prediction taking ligand promiscuity into account
title_short	A multi-label approach to target prediction taking ligand promiscuity into account
title_sort	multi-label approach to target prediction taking ligand promiscuity into account
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4461803/ https://www.ncbi.nlm.nih.gov/pubmed/26064191 http://dx.doi.org/10.1186/s13321-015-0071-9
work_keys_str_mv	AT afzalavidm amultilabelapproachtotargetpredictiontakingligandpromiscuityintoaccount AT mussahamsey amultilabelapproachtotargetpredictiontakingligandpromiscuityintoaccount AT turnerricharde amultilabelapproachtotargetpredictiontakingligandpromiscuityintoaccount AT benderandreas amultilabelapproachtotargetpredictiontakingligandpromiscuityintoaccount AT glenrobertc amultilabelapproachtotargetpredictiontakingligandpromiscuityintoaccount AT afzalavidm multilabelapproachtotargetpredictiontakingligandpromiscuityintoaccount AT mussahamsey multilabelapproachtotargetpredictiontakingligandpromiscuityintoaccount AT turnerricharde multilabelapproachtotargetpredictiontakingligandpromiscuityintoaccount AT benderandreas multilabelapproachtotargetpredictiontakingligandpromiscuityintoaccount AT glenrobertc multilabelapproachtotargetpredictiontakingligandpromiscuityintoaccount

A multi-label approach to target prediction taking ligand promiscuity into account

Ejemplares similares