Cargando…
Evaluating point-prediction uncertainties in neural networks for protein-ligand binding prediction
Neural Network (NN) models provide potential to speed up the drug discovery process and reduce its failure rates. The success of NN models requires uncertainty quantification (UQ) as drug discovery explores chemical space beyond the training data distribution. Standard NN models do not provide uncer...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10426331/ https://www.ncbi.nlm.nih.gov/pubmed/37583465 http://dx.doi.org/10.1016/j.aichem.2023.100004 |
_version_ | 1785090034773262336 |
---|---|
author | Fan, Ya Ju Allen, Jonathan E. McLoughlin, Kevin S. Shi, Da Bennion, Brian J. Zhang, Xiaohua Lightstone, Felice C. |
author_facet | Fan, Ya Ju Allen, Jonathan E. McLoughlin, Kevin S. Shi, Da Bennion, Brian J. Zhang, Xiaohua Lightstone, Felice C. |
author_sort | Fan, Ya Ju |
collection | PubMed |
description | Neural Network (NN) models provide potential to speed up the drug discovery process and reduce its failure rates. The success of NN models requires uncertainty quantification (UQ) as drug discovery explores chemical space beyond the training data distribution. Standard NN models do not provide uncertainty information. Some methods require changing the NN architecture or training procedure, limiting the selection of NN models. Moreover, predictive uncertainty can come from different sources. It is important to have the ability to separately model different types of predictive uncertainty, as the model can take assorted actions depending on the source of uncertainty. In this paper, we examine UQ methods that estimate different sources of predictive uncertainty for NN models aiming at protein-ligand binding prediction. We use our prior knowledge on chemical compounds to design the experiments. By utilizing a visualization method we create non-overlapping and chemically diverse partitions from a collection of chemical compounds. These partitions are used as training and test set splits to explore NN model uncertainty. We demonstrate how the uncertainties estimated by the selected methods describe different sources of uncertainty under different partitions and featurization schemes and the relationship to prediction error. |
format | Online Article Text |
id | pubmed-10426331 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
record_format | MEDLINE/PubMed |
spelling | pubmed-104263312023-08-15 Evaluating point-prediction uncertainties in neural networks for protein-ligand binding prediction Fan, Ya Ju Allen, Jonathan E. McLoughlin, Kevin S. Shi, Da Bennion, Brian J. Zhang, Xiaohua Lightstone, Felice C. Artif Intell Chem Article Neural Network (NN) models provide potential to speed up the drug discovery process and reduce its failure rates. The success of NN models requires uncertainty quantification (UQ) as drug discovery explores chemical space beyond the training data distribution. Standard NN models do not provide uncertainty information. Some methods require changing the NN architecture or training procedure, limiting the selection of NN models. Moreover, predictive uncertainty can come from different sources. It is important to have the ability to separately model different types of predictive uncertainty, as the model can take assorted actions depending on the source of uncertainty. In this paper, we examine UQ methods that estimate different sources of predictive uncertainty for NN models aiming at protein-ligand binding prediction. We use our prior knowledge on chemical compounds to design the experiments. By utilizing a visualization method we create non-overlapping and chemically diverse partitions from a collection of chemical compounds. These partitions are used as training and test set splits to explore NN model uncertainty. We demonstrate how the uncertainties estimated by the selected methods describe different sources of uncertainty under different partitions and featurization schemes and the relationship to prediction error. 2023-06 2023-06-03 /pmc/articles/PMC10426331/ /pubmed/37583465 http://dx.doi.org/10.1016/j.aichem.2023.100004 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) ). |
spellingShingle | Article Fan, Ya Ju Allen, Jonathan E. McLoughlin, Kevin S. Shi, Da Bennion, Brian J. Zhang, Xiaohua Lightstone, Felice C. Evaluating point-prediction uncertainties in neural networks for protein-ligand binding prediction |
title | Evaluating point-prediction uncertainties in neural networks for protein-ligand binding prediction |
title_full | Evaluating point-prediction uncertainties in neural networks for protein-ligand binding prediction |
title_fullStr | Evaluating point-prediction uncertainties in neural networks for protein-ligand binding prediction |
title_full_unstemmed | Evaluating point-prediction uncertainties in neural networks for protein-ligand binding prediction |
title_short | Evaluating point-prediction uncertainties in neural networks for protein-ligand binding prediction |
title_sort | evaluating point-prediction uncertainties in neural networks for protein-ligand binding prediction |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10426331/ https://www.ncbi.nlm.nih.gov/pubmed/37583465 http://dx.doi.org/10.1016/j.aichem.2023.100004 |
work_keys_str_mv | AT fanyaju evaluatingpointpredictionuncertaintiesinneuralnetworksforproteinligandbindingprediction AT allenjonathane evaluatingpointpredictionuncertaintiesinneuralnetworksforproteinligandbindingprediction AT mcloughlinkevins evaluatingpointpredictionuncertaintiesinneuralnetworksforproteinligandbindingprediction AT shida evaluatingpointpredictionuncertaintiesinneuralnetworksforproteinligandbindingprediction AT bennionbrianj evaluatingpointpredictionuncertaintiesinneuralnetworksforproteinligandbindingprediction AT zhangxiaohua evaluatingpointpredictionuncertaintiesinneuralnetworksforproteinligandbindingprediction AT lightstonefelicec evaluatingpointpredictionuncertaintiesinneuralnetworksforproteinligandbindingprediction |