Machine Learning Strategies When Transitioning between Biological Assays

Machine learning is widely used in drug development to predict activity in biological assays based on chemical structure. However, the process of transitioning from one experimental setup to another for the same biological endpoint has not been extensively studied. In a retrospecti...

Bibliographic Details
Main authors: Arvidsson McShane, Staffan, Ahlberg, Ernst, Noeske, Tobias, Spjuth, Ola
Format: Online Article Text
Language: English
Published: American Chemical Society 2021
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8317157/
https://www.ncbi.nlm.nih.gov/pubmed/34152755
http://dx.doi.org/10.1021/acs.jcim.1c00293
_version_ 1783730016469647360
author Arvidsson McShane, Staffan
Ahlberg, Ernst
Noeske, Tobias
Spjuth, Ola
collection PubMed
description Machine learning is widely used in drug development to predict activity in biological assays based on chemical structure. However, the process of transitioning from one experimental setup to another for the same biological endpoint has not been extensively studied. In a retrospective study, we explore different modeling strategies for combining data from the old and new assays when training conformal prediction models, using data from hERG and Na(V) assays. We suggest continuously monitoring the validity and efficiency of models as more data are accumulated from the new assay and selecting a modeling strategy based on these metrics. To maximize the utility of data from the old assay, we propose a strategy that augments the proper training set of an inductive conformal predictor with data from the old assay while keeping only data from the new assay in the calibration set, which results in valid (well-calibrated) models with improved efficiency compared to other strategies. We study the results for varying sizes of the new and old assay data sets, allowing for discussion of different practical scenarios. We also conclude that our proposed assay transition strategy is more beneficial, and the value of data from the new assay is higher, for the harder case of regression than for classification problems.
format Online
Article
Text
id pubmed-8317157
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-8317157 2021-07-28 Machine Learning Strategies When Transitioning between Biological Assays Arvidsson McShane, Staffan Ahlberg, Ernst Noeske, Tobias Spjuth, Ola J Chem Inf Model American Chemical Society 2021-06-21 2021-07-26 /pmc/articles/PMC8317157/ /pubmed/34152755 http://dx.doi.org/10.1021/acs.jcim.1c00293 Text en © 2021 The Authors. Published by American Chemical Society. Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
title Machine Learning Strategies When Transitioning between Biological Assays
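
The assay transition strategy summarized in the description field can be illustrated with a small sketch. This is a toy under stated assumptions, not the authors' implementation: the record does not name the underlying model, descriptors, or nonconformity measure, so the code below substitutes a one-descriptor least-squares line and absolute residuals, and generates synthetic "old" and "new" assay data with a hypothetical constant offset between them. All function names (`fit_line`, `conformal_quantile`, `train_icp`, `demo`) are invented for illustration. What it shows is the split itself: old-assay data enter only the proper training set, the calibration set holds new-assay data only, and validity (coverage) and efficiency (interval width) are measured on held-out new-assay data.

```python
import math
import random


def fit_line(xs, ys):
    """Least-squares fit of y = a + b*x; a stand-in for any underlying model."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    b = sxy / sxx
    a = my - b * mx
    return lambda x: a + b * x


def conformal_quantile(scores, epsilon):
    """Return the ceil((n+1)(1-epsilon))-th smallest calibration score.

    If that rank exceeds n, the calibration set is too small for the
    requested confidence and the interval is unbounded.
    """
    n = len(scores)
    k = math.ceil((n + 1) * (1 - epsilon))
    if k > n:
        return math.inf
    return sorted(scores)[k - 1]


def train_icp(old, new_train, new_calib, epsilon):
    """Inductive conformal regressor with an augmented proper training set.

    old, new_train, new_calib: lists of (x, y) pairs.
    Old-assay data are used for model fitting only; calibration uses
    new-assay data only.
    """
    proper = old + new_train
    model = fit_line([x for x, _ in proper], [y for _, y in proper])
    scores = [abs(y - model(x)) for x, y in new_calib]
    q = conformal_quantile(scores, epsilon)
    return lambda x: (model(x) - q, model(x) + q)


def demo(seed=0, epsilon=0.2):
    """Synthetic check of validity (coverage) and efficiency (mean width)."""
    rng = random.Random(seed)

    def sample(n, offset):
        # Hypothetical assay: linear response plus Gaussian noise.
        data = []
        for _ in range(n):
            x = rng.uniform(0, 10)
            data.append((x, 2 * x + offset + rng.gauss(0, 1)))
        return data

    old = sample(500, 1.0)   # assumed old-assay data, shifted by an offset
    new = sample(300, 1.5)   # assumed new-assay data
    new_train, new_calib, new_test = new[:100], new[100:200], new[200:]

    predict = train_icp(old, new_train, new_calib, epsilon)
    intervals = [predict(x) for x, _ in new_test]
    covered = sum(lo <= y <= hi for (lo, hi), (_, y) in zip(intervals, new_test))
    coverage = covered / len(new_test)
    mean_width = sum(hi - lo for lo, hi in intervals) / len(intervals)
    return coverage, mean_width
```

Calling `demo()` returns the empirical coverage on the new-assay test split, which should sit near `1 - epsilon` if the predictor is valid, together with the mean interval width as a rough efficiency measure. Rerunning it with `old` set to an empty list, or with old-assay data also placed in the calibration set, gives a simple way to compare the alternative strategies the description mentions.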