Cargando…

Learning with multiple pairwise kernels for drug bioactivity prediction

MOTIVATION: Many inference problems in bioinformatics, including drug bioactivity prediction, can be formulated as pairwise learning problems, in which one is interested in making predictions for pairs of objects, e.g. drugs and their targets. Kernel-based approaches have emerged as powerful tools f...

Descripción completa

Detalles Bibliográficos
Autores principales: Cichonska, Anna, Pahikkala, Tapio, Szedmak, Sandor, Julkunen, Heli, Airola, Antti, Heinonen, Markus, Aittokallio, Tero, Rousu, Juho
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6022556/
https://www.ncbi.nlm.nih.gov/pubmed/29949975
http://dx.doi.org/10.1093/bioinformatics/bty277
_version_ 1783335703637131264
author Cichonska, Anna
Pahikkala, Tapio
Szedmak, Sandor
Julkunen, Heli
Airola, Antti
Heinonen, Markus
Aittokallio, Tero
Rousu, Juho
author_facet Cichonska, Anna
Pahikkala, Tapio
Szedmak, Sandor
Julkunen, Heli
Airola, Antti
Heinonen, Markus
Aittokallio, Tero
Rousu, Juho
author_sort Cichonska, Anna
collection PubMed
description MOTIVATION: Many inference problems in bioinformatics, including drug bioactivity prediction, can be formulated as pairwise learning problems, in which one is interested in making predictions for pairs of objects, e.g. drugs and their targets. Kernel-based approaches have emerged as powerful tools for solving problems of that kind, and especially multiple kernel learning (MKL) offers promising benefits as it enables integrating various types of complex biomedical information sources in the form of kernels, along with learning their importance for the prediction task. However, the immense size of pairwise kernel spaces remains a major bottleneck, making the existing MKL algorithms computationally infeasible even for small number of input pairs. RESULTS: We introduce pairwiseMKL, the first method for time- and memory-efficient learning with multiple pairwise kernels. pairwiseMKL first determines the mixture weights of the input pairwise kernels, and then learns the pairwise prediction function. Both steps are performed efficiently without explicit computation of the massive pairwise matrices, therefore making the method applicable to solving large pairwise learning problems. We demonstrate the performance of pairwiseMKL in two related tasks of quantitative drug bioactivity prediction using up to 167 995 bioactivity measurements and 3120 pairwise kernels: (i) prediction of anticancer efficacy of drug compounds across a large panel of cancer cell lines; and (ii) prediction of target profiles of anticancer compounds across their kinome-wide target spaces. We show that pairwiseMKL provides accurate predictions using sparse solutions in terms of selected kernels, and therefore it automatically identifies also data sources relevant for the prediction problem. AVAILABILITY AND IMPLEMENTATION: Code is available at https://github.com/aalto-ics-kepaco. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-6022556
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-60225562018-07-10 Learning with multiple pairwise kernels for drug bioactivity prediction Cichonska, Anna Pahikkala, Tapio Szedmak, Sandor Julkunen, Heli Airola, Antti Heinonen, Markus Aittokallio, Tero Rousu, Juho Bioinformatics Ismb 2018–Intelligent Systems for Molecular Biology Proceedings MOTIVATION: Many inference problems in bioinformatics, including drug bioactivity prediction, can be formulated as pairwise learning problems, in which one is interested in making predictions for pairs of objects, e.g. drugs and their targets. Kernel-based approaches have emerged as powerful tools for solving problems of that kind, and especially multiple kernel learning (MKL) offers promising benefits as it enables integrating various types of complex biomedical information sources in the form of kernels, along with learning their importance for the prediction task. However, the immense size of pairwise kernel spaces remains a major bottleneck, making the existing MKL algorithms computationally infeasible even for small number of input pairs. RESULTS: We introduce pairwiseMKL, the first method for time- and memory-efficient learning with multiple pairwise kernels. pairwiseMKL first determines the mixture weights of the input pairwise kernels, and then learns the pairwise prediction function. Both steps are performed efficiently without explicit computation of the massive pairwise matrices, therefore making the method applicable to solving large pairwise learning problems. We demonstrate the performance of pairwiseMKL in two related tasks of quantitative drug bioactivity prediction using up to 167 995 bioactivity measurements and 3120 pairwise kernels: (i) prediction of anticancer efficacy of drug compounds across a large panel of cancer cell lines; and (ii) prediction of target profiles of anticancer compounds across their kinome-wide target spaces. We show that pairwiseMKL provides accurate predictions using sparse solutions in terms of selected kernels, and therefore it automatically identifies also data sources relevant for the prediction problem. AVAILABILITY AND IMPLEMENTATION: Code is available at https://github.com/aalto-ics-kepaco. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2018-07-01 2018-06-27 /pmc/articles/PMC6022556/ /pubmed/29949975 http://dx.doi.org/10.1093/bioinformatics/bty277 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Ismb 2018–Intelligent Systems for Molecular Biology Proceedings
Cichonska, Anna
Pahikkala, Tapio
Szedmak, Sandor
Julkunen, Heli
Airola, Antti
Heinonen, Markus
Aittokallio, Tero
Rousu, Juho
Learning with multiple pairwise kernels for drug bioactivity prediction
title Learning with multiple pairwise kernels for drug bioactivity prediction
title_full Learning with multiple pairwise kernels for drug bioactivity prediction
title_fullStr Learning with multiple pairwise kernels for drug bioactivity prediction
title_full_unstemmed Learning with multiple pairwise kernels for drug bioactivity prediction
title_short Learning with multiple pairwise kernels for drug bioactivity prediction
title_sort learning with multiple pairwise kernels for drug bioactivity prediction
topic Ismb 2018–Intelligent Systems for Molecular Biology Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6022556/
https://www.ncbi.nlm.nih.gov/pubmed/29949975
http://dx.doi.org/10.1093/bioinformatics/bty277
work_keys_str_mv AT cichonskaanna learningwithmultiplepairwisekernelsfordrugbioactivityprediction
AT pahikkalatapio learningwithmultiplepairwisekernelsfordrugbioactivityprediction
AT szedmaksandor learningwithmultiplepairwisekernelsfordrugbioactivityprediction
AT julkunenheli learningwithmultiplepairwisekernelsfordrugbioactivityprediction
AT airolaantti learningwithmultiplepairwisekernelsfordrugbioactivityprediction
AT heinonenmarkus learningwithmultiplepairwisekernelsfordrugbioactivityprediction
AT aittokalliotero learningwithmultiplepairwisekernelsfordrugbioactivityprediction
AT rousujuho learningwithmultiplepairwisekernelsfordrugbioactivityprediction