Cargando…

Multi-task learning with a natural metric for quantitative structure activity relationship learning

The goal of quantitative structure activity relationship (QSAR) learning is to learn a function that, given the structure of a small molecule (a potential drug), outputs the predicted activity of the compound. We employed multi-task learning (MTL) to exploit commonalities in drug targets and assays....

Descripción completa

Detalles Bibliográficos
Autores principales: Sadawi, Noureddin, Olier, Ivan, Vanschoren, Joaquin, van Rijn, Jan N., Besnard, Jeremy, Bickerton, Richard, Grosan, Crina, Soldatova, Larisa, King, Ross D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6852942/
https://www.ncbi.nlm.nih.gov/pubmed/33430958
http://dx.doi.org/10.1186/s13321-019-0392-1
_version_ 1783469950650810368
author Sadawi, Noureddin
Olier, Ivan
Vanschoren, Joaquin
van Rijn, Jan N.
Besnard, Jeremy
Bickerton, Richard
Grosan, Crina
Soldatova, Larisa
King, Ross D.
author_facet Sadawi, Noureddin
Olier, Ivan
Vanschoren, Joaquin
van Rijn, Jan N.
Besnard, Jeremy
Bickerton, Richard
Grosan, Crina
Soldatova, Larisa
King, Ross D.
author_sort Sadawi, Noureddin
collection PubMed
description The goal of quantitative structure activity relationship (QSAR) learning is to learn a function that, given the structure of a small molecule (a potential drug), outputs the predicted activity of the compound. We employed multi-task learning (MTL) to exploit commonalities in drug targets and assays. We used datasets containing curated records about the activity of specific compounds on drug targets provided by ChEMBL. Totally, 1091 assays have been analysed. As a baseline, a single task learning approach that trains random forest to predict drug activity for each drug target individually was considered. We then carried out feature-based and instance-based MTL to predict drug activities. We introduced a natural metric of evolutionary distance between drug targets as a measure of tasks relatedness. Instance-based MTL significantly outperformed both, feature-based MTL and the base learner, on 741 drug targets out of 1091. Feature-based MTL won on 179 occasions and the base learner performed best on 171 drug targets. We conclude that MTL QSAR is improved by incorporating the evolutionary distance between targets. These results indicate that QSAR learning can be performed effectively, even if little data is available for specific drug targets, by leveraging what is known about similar drug targets.
format Online
Article
Text
id pubmed-6852942
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-68529422019-11-21 Multi-task learning with a natural metric for quantitative structure activity relationship learning Sadawi, Noureddin Olier, Ivan Vanschoren, Joaquin van Rijn, Jan N. Besnard, Jeremy Bickerton, Richard Grosan, Crina Soldatova, Larisa King, Ross D. J Cheminform Research Article The goal of quantitative structure activity relationship (QSAR) learning is to learn a function that, given the structure of a small molecule (a potential drug), outputs the predicted activity of the compound. We employed multi-task learning (MTL) to exploit commonalities in drug targets and assays. We used datasets containing curated records about the activity of specific compounds on drug targets provided by ChEMBL. Totally, 1091 assays have been analysed. As a baseline, a single task learning approach that trains random forest to predict drug activity for each drug target individually was considered. We then carried out feature-based and instance-based MTL to predict drug activities. We introduced a natural metric of evolutionary distance between drug targets as a measure of tasks relatedness. Instance-based MTL significantly outperformed both, feature-based MTL and the base learner, on 741 drug targets out of 1091. Feature-based MTL won on 179 occasions and the base learner performed best on 171 drug targets. We conclude that MTL QSAR is improved by incorporating the evolutionary distance between targets. These results indicate that QSAR learning can be performed effectively, even if little data is available for specific drug targets, by leveraging what is known about similar drug targets. Springer International Publishing 2019-11-12 /pmc/articles/PMC6852942/ /pubmed/33430958 http://dx.doi.org/10.1186/s13321-019-0392-1 Text en © The Author(s) 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Sadawi, Noureddin
Olier, Ivan
Vanschoren, Joaquin
van Rijn, Jan N.
Besnard, Jeremy
Bickerton, Richard
Grosan, Crina
Soldatova, Larisa
King, Ross D.
Multi-task learning with a natural metric for quantitative structure activity relationship learning
title Multi-task learning with a natural metric for quantitative structure activity relationship learning
title_full Multi-task learning with a natural metric for quantitative structure activity relationship learning
title_fullStr Multi-task learning with a natural metric for quantitative structure activity relationship learning
title_full_unstemmed Multi-task learning with a natural metric for quantitative structure activity relationship learning
title_short Multi-task learning with a natural metric for quantitative structure activity relationship learning
title_sort multi-task learning with a natural metric for quantitative structure activity relationship learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6852942/
https://www.ncbi.nlm.nih.gov/pubmed/33430958
http://dx.doi.org/10.1186/s13321-019-0392-1
work_keys_str_mv AT sadawinoureddin multitasklearningwithanaturalmetricforquantitativestructureactivityrelationshiplearning
AT olierivan multitasklearningwithanaturalmetricforquantitativestructureactivityrelationshiplearning
AT vanschorenjoaquin multitasklearningwithanaturalmetricforquantitativestructureactivityrelationshiplearning
AT vanrijnjann multitasklearningwithanaturalmetricforquantitativestructureactivityrelationshiplearning
AT besnardjeremy multitasklearningwithanaturalmetricforquantitativestructureactivityrelationshiplearning
AT bickertonrichard multitasklearningwithanaturalmetricforquantitativestructureactivityrelationshiplearning
AT grosancrina multitasklearningwithanaturalmetricforquantitativestructureactivityrelationshiplearning
AT soldatovalarisa multitasklearningwithanaturalmetricforquantitativestructureactivityrelationshiplearning
AT kingrossd multitasklearningwithanaturalmetricforquantitativestructureactivityrelationshiplearning