Cargando…
Machine Learning Strategies When Transitioning between Biological Assays
[Image: see text] Machine learning is widely used in drug development to predict activity in biological assays based on chemical structure. However, the process of transitioning from one experimental setup to another for the same biological endpoint has not been extensively studied. In a retrospecti...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical
Society
2021
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8317157/ https://www.ncbi.nlm.nih.gov/pubmed/34152755 http://dx.doi.org/10.1021/acs.jcim.1c00293 |
_version_ | 1783730016469647360 |
---|---|
author | Arvidsson McShane, Staffan Ahlberg, Ernst Noeske, Tobias Spjuth, Ola |
author_facet | Arvidsson McShane, Staffan Ahlberg, Ernst Noeske, Tobias Spjuth, Ola |
author_sort | Arvidsson McShane, Staffan |
collection | PubMed |
description | [Image: see text] Machine learning is widely used in drug development to predict activity in biological assays based on chemical structure. However, the process of transitioning from one experimental setup to another for the same biological endpoint has not been extensively studied. In a retrospective study, we here explore different modeling strategies of how to combine data from the old and new assays when training conformal prediction models using data from hERG and Na(V) assays. We suggest to continuously monitor the validity and efficiency of models as more data is accumulated from the new assay and select a modeling strategy based on these metrics. In order to maximize the utility of data from the old assay, we propose a strategy that augments the proper training set of an inductive conformal predictor by adding data from the old assay but only having data from the new assay in the calibration set, which results in valid (well-calibrated) models with improved efficiency compared to other strategies. We study the results for varying sizes of new and old assays, allowing for discussion of different practical scenarios. We also conclude that our proposed assay transition strategy is more beneficial, and the value of data from the new assay is higher, for the harder case of regression compared to classification problems. |
format | Online Article Text |
id | pubmed-8317157 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | American Chemical
Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-83171572021-07-28 Machine Learning Strategies When Transitioning between Biological Assays Arvidsson McShane, Staffan Ahlberg, Ernst Noeske, Tobias Spjuth, Ola J Chem Inf Model [Image: see text] Machine learning is widely used in drug development to predict activity in biological assays based on chemical structure. However, the process of transitioning from one experimental setup to another for the same biological endpoint has not been extensively studied. In a retrospective study, we here explore different modeling strategies of how to combine data from the old and new assays when training conformal prediction models using data from hERG and Na(V) assays. We suggest to continuously monitor the validity and efficiency of models as more data is accumulated from the new assay and select a modeling strategy based on these metrics. In order to maximize the utility of data from the old assay, we propose a strategy that augments the proper training set of an inductive conformal predictor by adding data from the old assay but only having data from the new assay in the calibration set, which results in valid (well-calibrated) models with improved efficiency compared to other strategies. We study the results for varying sizes of new and old assays, allowing for discussion of different practical scenarios. We also conclude that our proposed assay transition strategy is more beneficial, and the value of data from the new assay is higher, for the harder case of regression compared to classification problems. American Chemical Society 2021-06-21 2021-07-26 /pmc/articles/PMC8317157/ /pubmed/34152755 http://dx.doi.org/10.1021/acs.jcim.1c00293 Text en © 2021 The Authors. Published by American Chemical Society Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Arvidsson McShane, Staffan Ahlberg, Ernst Noeske, Tobias Spjuth, Ola Machine Learning Strategies When Transitioning between Biological Assays |
title | Machine Learning Strategies When Transitioning between
Biological Assays |
title_full | Machine Learning Strategies When Transitioning between
Biological Assays |
title_fullStr | Machine Learning Strategies When Transitioning between
Biological Assays |
title_full_unstemmed | Machine Learning Strategies When Transitioning between
Biological Assays |
title_short | Machine Learning Strategies When Transitioning between
Biological Assays |
title_sort | machine learning strategies when transitioning between
biological assays |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8317157/ https://www.ncbi.nlm.nih.gov/pubmed/34152755 http://dx.doi.org/10.1021/acs.jcim.1c00293 |
work_keys_str_mv | AT arvidssonmcshanestaffan machinelearningstrategieswhentransitioningbetweenbiologicalassays AT ahlbergernst machinelearningstrategieswhentransitioningbetweenbiologicalassays AT noesketobias machinelearningstrategieswhentransitioningbetweenbiologicalassays AT spjuthola machinelearningstrategieswhentransitioningbetweenbiologicalassays |