Cargando…
Dynamic applicability domain (dAD): compound–target binding affinity estimates with local conformal prediction
MOTIVATION: Increasing efforts are being made in the field of machine learning to advance the learning of robust and accurate models from experimentally measured data and enable more efficient drug discovery processes. The prediction of binding affinity is one of the most frequent tasks of compound...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10457664/ https://www.ncbi.nlm.nih.gov/pubmed/37594752 http://dx.doi.org/10.1093/bioinformatics/btad465 |
_version_ | 1785096980534394880 |
---|---|
author | Oršolić, Davor Šmuc, Tomislav |
author_facet | Oršolić, Davor Šmuc, Tomislav |
author_sort | Oršolić, Davor |
collection | PubMed |
description | MOTIVATION: Increasing efforts are being made in the field of machine learning to advance the learning of robust and accurate models from experimentally measured data and enable more efficient drug discovery processes. The prediction of binding affinity is one of the most frequent tasks of compound bioactivity modelling. Learned models for binding affinity prediction are assessed by their average performance on unseen samples, but point predictions are typically not provided with a rigorous confidence assessment. Approaches, such as the conformal predictor framework equip conventional models with a more rigorous assessment of confidence for individual point predictions. In this article, we extend the inductive conformal prediction framework for interaction data, in particular the compound–target binding affinity prediction task. The new framework is based on dynamically defined calibration sets that are specific for each testing pair and provides prediction assessment in the context of calibration pairs from its compound–target neighbourhood, enabling improved estimates based on the local properties of the prediction model. RESULTS: The effectiveness of the approach is benchmarked on several publicly available datasets and tested in realistic use-case scenarios with increasing levels of difficulty on a complex compound–target binding affinity space. We demonstrate that in such scenarios, novel approach combining applicability domain paradigm with conformal prediction framework, produces superior confidence assessment with valid and more informative prediction regions compared to other ‘state-of-the-art’ conformal prediction approaches. AVAILABILITY AND IMPLEMENTATION: Dataset and the code are available on GitHub (https://github.com/mlkr-rbi/dAD). |
format | Online Article Text |
id | pubmed-10457664 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-104576642023-08-27 Dynamic applicability domain (dAD): compound–target binding affinity estimates with local conformal prediction Oršolić, Davor Šmuc, Tomislav Bioinformatics Original Paper MOTIVATION: Increasing efforts are being made in the field of machine learning to advance the learning of robust and accurate models from experimentally measured data and enable more efficient drug discovery processes. The prediction of binding affinity is one of the most frequent tasks of compound bioactivity modelling. Learned models for binding affinity prediction are assessed by their average performance on unseen samples, but point predictions are typically not provided with a rigorous confidence assessment. Approaches, such as the conformal predictor framework equip conventional models with a more rigorous assessment of confidence for individual point predictions. In this article, we extend the inductive conformal prediction framework for interaction data, in particular the compound–target binding affinity prediction task. The new framework is based on dynamically defined calibration sets that are specific for each testing pair and provides prediction assessment in the context of calibration pairs from its compound–target neighbourhood, enabling improved estimates based on the local properties of the prediction model. RESULTS: The effectiveness of the approach is benchmarked on several publicly available datasets and tested in realistic use-case scenarios with increasing levels of difficulty on a complex compound–target binding affinity space. We demonstrate that in such scenarios, novel approach combining applicability domain paradigm with conformal prediction framework, produces superior confidence assessment with valid and more informative prediction regions compared to other ‘state-of-the-art’ conformal prediction approaches. AVAILABILITY AND IMPLEMENTATION: Dataset and the code are available on GitHub (https://github.com/mlkr-rbi/dAD). Oxford University Press 2023-08-18 /pmc/articles/PMC10457664/ /pubmed/37594752 http://dx.doi.org/10.1093/bioinformatics/btad465 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Paper Oršolić, Davor Šmuc, Tomislav Dynamic applicability domain (dAD): compound–target binding affinity estimates with local conformal prediction |
title | Dynamic applicability domain (dAD): compound–target binding affinity estimates with local conformal prediction |
title_full | Dynamic applicability domain (dAD): compound–target binding affinity estimates with local conformal prediction |
title_fullStr | Dynamic applicability domain (dAD): compound–target binding affinity estimates with local conformal prediction |
title_full_unstemmed | Dynamic applicability domain (dAD): compound–target binding affinity estimates with local conformal prediction |
title_short | Dynamic applicability domain (dAD): compound–target binding affinity estimates with local conformal prediction |
title_sort | dynamic applicability domain (dad): compound–target binding affinity estimates with local conformal prediction |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10457664/ https://www.ncbi.nlm.nih.gov/pubmed/37594752 http://dx.doi.org/10.1093/bioinformatics/btad465 |
work_keys_str_mv | AT orsolicdavor dynamicapplicabilitydomaindadcompoundtargetbindingaffinityestimateswithlocalconformalprediction AT smuctomislav dynamicapplicabilitydomaindadcompoundtargetbindingaffinityestimateswithlocalconformalprediction |