Cargando…

Limits of Prediction for Machine Learning in Drug Discovery

In drug discovery, molecules are optimized towards desired properties. In this context, machine learning is used for extrapolation in drug discovery projects. The limits of extrapolation for regression models are known. However, a systematic analysis of the effectiveness of extrapolation in drug dis...

Descripción completa

Detalles Bibliográficos
Autores principales: von Korff, Modest, Sander, Thomas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8960959/
https://www.ncbi.nlm.nih.gov/pubmed/35359835
http://dx.doi.org/10.3389/fphar.2022.832120
_version_ 1784677494728687616
author von Korff, Modest
Sander, Thomas
author_facet von Korff, Modest
Sander, Thomas
author_sort von Korff, Modest
collection PubMed
description In drug discovery, molecules are optimized towards desired properties. In this context, machine learning is used for extrapolation in drug discovery projects. The limits of extrapolation for regression models are known. However, a systematic analysis of the effectiveness of extrapolation in drug discovery has not yet been performed. In response, this study examined the capabilities of six machine learning algorithms to extrapolate from 243 datasets. The response values calculated from the molecules in the datasets were molecular weight, cLogP, and the number of sp3-atoms. Three experimental set ups were chosen for response values. Shuffled data were used for interpolation, whereas data for extrapolation were sorted from high to low values, and the reverse. Extrapolation with sorted data resulted in much larger prediction errors than extrapolation with shuffled data. Additionally, this study demonstrated that linear machine learning methods are preferable for extrapolation.
format Online
Article
Text
id pubmed-8960959
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-89609592022-03-30 Limits of Prediction for Machine Learning in Drug Discovery von Korff, Modest Sander, Thomas Front Pharmacol Pharmacology In drug discovery, molecules are optimized towards desired properties. In this context, machine learning is used for extrapolation in drug discovery projects. The limits of extrapolation for regression models are known. However, a systematic analysis of the effectiveness of extrapolation in drug discovery has not yet been performed. In response, this study examined the capabilities of six machine learning algorithms to extrapolate from 243 datasets. The response values calculated from the molecules in the datasets were molecular weight, cLogP, and the number of sp3-atoms. Three experimental set ups were chosen for response values. Shuffled data were used for interpolation, whereas data for extrapolation were sorted from high to low values, and the reverse. Extrapolation with sorted data resulted in much larger prediction errors than extrapolation with shuffled data. Additionally, this study demonstrated that linear machine learning methods are preferable for extrapolation. Frontiers Media S.A. 2022-03-10 /pmc/articles/PMC8960959/ /pubmed/35359835 http://dx.doi.org/10.3389/fphar.2022.832120 Text en Copyright © 2022 von Korff and Sander. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Pharmacology
von Korff, Modest
Sander, Thomas
Limits of Prediction for Machine Learning in Drug Discovery
title Limits of Prediction for Machine Learning in Drug Discovery
title_full Limits of Prediction for Machine Learning in Drug Discovery
title_fullStr Limits of Prediction for Machine Learning in Drug Discovery
title_full_unstemmed Limits of Prediction for Machine Learning in Drug Discovery
title_short Limits of Prediction for Machine Learning in Drug Discovery
title_sort limits of prediction for machine learning in drug discovery
topic Pharmacology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8960959/
https://www.ncbi.nlm.nih.gov/pubmed/35359835
http://dx.doi.org/10.3389/fphar.2022.832120
work_keys_str_mv AT vonkorffmodest limitsofpredictionformachinelearningindrugdiscovery
AT sanderthomas limitsofpredictionformachinelearningindrugdiscovery