Cargando…

Meta-QSAR: a large-scale application of meta-learning to drug design and discovery

We investigate the learning of quantitative structure activity relationships (QSARs) as a case-study of meta-learning. This application area is of the highest societal importance, as it is a key step in the development of new medicines. The standard QSAR learning problem is: given a target (usually...

Descripción completa

Detalles Bibliográficos
Autores principales: Olier, Ivan, Sadawi, Noureddin, Bickerton, G. Richard, Vanschoren, Joaquin, Grosan, Crina, Soldatova, Larisa, King, Ross D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6956898/
https://www.ncbi.nlm.nih.gov/pubmed/31997851
http://dx.doi.org/10.1007/s10994-017-5685-x
_version_ 1783487222553509888
author Olier, Ivan
Sadawi, Noureddin
Bickerton, G. Richard
Vanschoren, Joaquin
Grosan, Crina
Soldatova, Larisa
King, Ross D.
author_facet Olier, Ivan
Sadawi, Noureddin
Bickerton, G. Richard
Vanschoren, Joaquin
Grosan, Crina
Soldatova, Larisa
King, Ross D.
author_sort Olier, Ivan
collection PubMed
description We investigate the learning of quantitative structure activity relationships (QSARs) as a case-study of meta-learning. This application area is of the highest societal importance, as it is a key step in the development of new medicines. The standard QSAR learning problem is: given a target (usually a protein) and a set of chemical compounds (small molecules) with associated bioactivities (e.g. inhibition of the target), learn a predictive mapping from molecular representation to activity. Although almost every type of machine learning method has been applied to QSAR learning there is no agreed single best way of learning QSARs, and therefore the problem area is well-suited to meta-learning. We first carried out the most comprehensive ever comparison of machine learning methods for QSAR learning: 18 regression methods, 3 molecular representations, applied to more than 2700 QSAR problems. (These results have been made publicly available on OpenML and represent a valuable resource for testing novel meta-learning methods.) We then investigated the utility of algorithm selection for QSAR problems. We found that this meta-learning approach outperformed the best individual QSAR learning method (random forests using a molecular fingerprint representation) by up to 13%, on average. We conclude that meta-learning outperforms base-learning methods for QSAR learning, and as this investigation is one of the most extensive ever comparisons of base and meta-learning methods ever made, it provides evidence for the general effectiveness of meta-learning over base-learning.
format Online
Article
Text
id pubmed-6956898
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-69568982020-01-27 Meta-QSAR: a large-scale application of meta-learning to drug design and discovery Olier, Ivan Sadawi, Noureddin Bickerton, G. Richard Vanschoren, Joaquin Grosan, Crina Soldatova, Larisa King, Ross D. Mach Learn Article We investigate the learning of quantitative structure activity relationships (QSARs) as a case-study of meta-learning. This application area is of the highest societal importance, as it is a key step in the development of new medicines. The standard QSAR learning problem is: given a target (usually a protein) and a set of chemical compounds (small molecules) with associated bioactivities (e.g. inhibition of the target), learn a predictive mapping from molecular representation to activity. Although almost every type of machine learning method has been applied to QSAR learning there is no agreed single best way of learning QSARs, and therefore the problem area is well-suited to meta-learning. We first carried out the most comprehensive ever comparison of machine learning methods for QSAR learning: 18 regression methods, 3 molecular representations, applied to more than 2700 QSAR problems. (These results have been made publicly available on OpenML and represent a valuable resource for testing novel meta-learning methods.) We then investigated the utility of algorithm selection for QSAR problems. We found that this meta-learning approach outperformed the best individual QSAR learning method (random forests using a molecular fingerprint representation) by up to 13%, on average. We conclude that meta-learning outperforms base-learning methods for QSAR learning, and as this investigation is one of the most extensive ever comparisons of base and meta-learning methods ever made, it provides evidence for the general effectiveness of meta-learning over base-learning. Springer US 2017-12-22 2018 /pmc/articles/PMC6956898/ /pubmed/31997851 http://dx.doi.org/10.1007/s10994-017-5685-x Text en © The Author(s) 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Article
Olier, Ivan
Sadawi, Noureddin
Bickerton, G. Richard
Vanschoren, Joaquin
Grosan, Crina
Soldatova, Larisa
King, Ross D.
Meta-QSAR: a large-scale application of meta-learning to drug design and discovery
title Meta-QSAR: a large-scale application of meta-learning to drug design and discovery
title_full Meta-QSAR: a large-scale application of meta-learning to drug design and discovery
title_fullStr Meta-QSAR: a large-scale application of meta-learning to drug design and discovery
title_full_unstemmed Meta-QSAR: a large-scale application of meta-learning to drug design and discovery
title_short Meta-QSAR: a large-scale application of meta-learning to drug design and discovery
title_sort meta-qsar: a large-scale application of meta-learning to drug design and discovery
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6956898/
https://www.ncbi.nlm.nih.gov/pubmed/31997851
http://dx.doi.org/10.1007/s10994-017-5685-x
work_keys_str_mv AT olierivan metaqsaralargescaleapplicationofmetalearningtodrugdesignanddiscovery
AT sadawinoureddin metaqsaralargescaleapplicationofmetalearningtodrugdesignanddiscovery
AT bickertongrichard metaqsaralargescaleapplicationofmetalearningtodrugdesignanddiscovery
AT vanschorenjoaquin metaqsaralargescaleapplicationofmetalearningtodrugdesignanddiscovery
AT grosancrina metaqsaralargescaleapplicationofmetalearningtodrugdesignanddiscovery
AT soldatovalarisa metaqsaralargescaleapplicationofmetalearningtodrugdesignanddiscovery
AT kingrossd metaqsaralargescaleapplicationofmetalearningtodrugdesignanddiscovery