Cargando…
Multi-task learning with a natural metric for quantitative structure activity relationship learning
The goal of quantitative structure activity relationship (QSAR) learning is to learn a function that, given the structure of a small molecule (a potential drug), outputs the predicted activity of the compound. We employed multi-task learning (MTL) to exploit commonalities in drug targets and assays....
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6852942/ https://www.ncbi.nlm.nih.gov/pubmed/33430958 http://dx.doi.org/10.1186/s13321-019-0392-1 |
_version_ | 1783469950650810368 |
---|---|
author | Sadawi, Noureddin Olier, Ivan Vanschoren, Joaquin van Rijn, Jan N. Besnard, Jeremy Bickerton, Richard Grosan, Crina Soldatova, Larisa King, Ross D. |
author_facet | Sadawi, Noureddin Olier, Ivan Vanschoren, Joaquin van Rijn, Jan N. Besnard, Jeremy Bickerton, Richard Grosan, Crina Soldatova, Larisa King, Ross D. |
author_sort | Sadawi, Noureddin |
collection | PubMed |
description | The goal of quantitative structure activity relationship (QSAR) learning is to learn a function that, given the structure of a small molecule (a potential drug), outputs the predicted activity of the compound. We employed multi-task learning (MTL) to exploit commonalities in drug targets and assays. We used datasets containing curated records about the activity of specific compounds on drug targets provided by ChEMBL. Totally, 1091 assays have been analysed. As a baseline, a single task learning approach that trains random forest to predict drug activity for each drug target individually was considered. We then carried out feature-based and instance-based MTL to predict drug activities. We introduced a natural metric of evolutionary distance between drug targets as a measure of tasks relatedness. Instance-based MTL significantly outperformed both, feature-based MTL and the base learner, on 741 drug targets out of 1091. Feature-based MTL won on 179 occasions and the base learner performed best on 171 drug targets. We conclude that MTL QSAR is improved by incorporating the evolutionary distance between targets. These results indicate that QSAR learning can be performed effectively, even if little data is available for specific drug targets, by leveraging what is known about similar drug targets. |
format | Online Article Text |
id | pubmed-6852942 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-68529422019-11-21 Multi-task learning with a natural metric for quantitative structure activity relationship learning Sadawi, Noureddin Olier, Ivan Vanschoren, Joaquin van Rijn, Jan N. Besnard, Jeremy Bickerton, Richard Grosan, Crina Soldatova, Larisa King, Ross D. J Cheminform Research Article The goal of quantitative structure activity relationship (QSAR) learning is to learn a function that, given the structure of a small molecule (a potential drug), outputs the predicted activity of the compound. We employed multi-task learning (MTL) to exploit commonalities in drug targets and assays. We used datasets containing curated records about the activity of specific compounds on drug targets provided by ChEMBL. Totally, 1091 assays have been analysed. As a baseline, a single task learning approach that trains random forest to predict drug activity for each drug target individually was considered. We then carried out feature-based and instance-based MTL to predict drug activities. We introduced a natural metric of evolutionary distance between drug targets as a measure of tasks relatedness. Instance-based MTL significantly outperformed both, feature-based MTL and the base learner, on 741 drug targets out of 1091. Feature-based MTL won on 179 occasions and the base learner performed best on 171 drug targets. We conclude that MTL QSAR is improved by incorporating the evolutionary distance between targets. These results indicate that QSAR learning can be performed effectively, even if little data is available for specific drug targets, by leveraging what is known about similar drug targets. Springer International Publishing 2019-11-12 /pmc/articles/PMC6852942/ /pubmed/33430958 http://dx.doi.org/10.1186/s13321-019-0392-1 Text en © The Author(s) 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Sadawi, Noureddin Olier, Ivan Vanschoren, Joaquin van Rijn, Jan N. Besnard, Jeremy Bickerton, Richard Grosan, Crina Soldatova, Larisa King, Ross D. Multi-task learning with a natural metric for quantitative structure activity relationship learning |
title | Multi-task learning with a natural metric for quantitative structure activity relationship learning |
title_full | Multi-task learning with a natural metric for quantitative structure activity relationship learning |
title_fullStr | Multi-task learning with a natural metric for quantitative structure activity relationship learning |
title_full_unstemmed | Multi-task learning with a natural metric for quantitative structure activity relationship learning |
title_short | Multi-task learning with a natural metric for quantitative structure activity relationship learning |
title_sort | multi-task learning with a natural metric for quantitative structure activity relationship learning |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6852942/ https://www.ncbi.nlm.nih.gov/pubmed/33430958 http://dx.doi.org/10.1186/s13321-019-0392-1 |
work_keys_str_mv | AT sadawinoureddin multitasklearningwithanaturalmetricforquantitativestructureactivityrelationshiplearning AT olierivan multitasklearningwithanaturalmetricforquantitativestructureactivityrelationshiplearning AT vanschorenjoaquin multitasklearningwithanaturalmetricforquantitativestructureactivityrelationshiplearning AT vanrijnjann multitasklearningwithanaturalmetricforquantitativestructureactivityrelationshiplearning AT besnardjeremy multitasklearningwithanaturalmetricforquantitativestructureactivityrelationshiplearning AT bickertonrichard multitasklearningwithanaturalmetricforquantitativestructureactivityrelationshiplearning AT grosancrina multitasklearningwithanaturalmetricforquantitativestructureactivityrelationshiplearning AT soldatovalarisa multitasklearningwithanaturalmetricforquantitativestructureactivityrelationshiplearning AT kingrossd multitasklearningwithanaturalmetricforquantitativestructureactivityrelationshiplearning |