Cargando…

Evaluating point-prediction uncertainties in neural networks for protein-ligand binding prediction

Neural Network (NN) models provide potential to speed up the drug discovery process and reduce its failure rates. The success of NN models requires uncertainty quantification (UQ) as drug discovery explores chemical space beyond the training data distribution. Standard NN models do not provide uncer...

Descripción completa

Detalles Bibliográficos
Autores principales: Fan, Ya Ju, Allen, Jonathan E., McLoughlin, Kevin S., Shi, Da, Bennion, Brian J., Zhang, Xiaohua, Lightstone, Felice C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10426331/
https://www.ncbi.nlm.nih.gov/pubmed/37583465
http://dx.doi.org/10.1016/j.aichem.2023.100004
_version_ 1785090034773262336
author Fan, Ya Ju
Allen, Jonathan E.
McLoughlin, Kevin S.
Shi, Da
Bennion, Brian J.
Zhang, Xiaohua
Lightstone, Felice C.
author_facet Fan, Ya Ju
Allen, Jonathan E.
McLoughlin, Kevin S.
Shi, Da
Bennion, Brian J.
Zhang, Xiaohua
Lightstone, Felice C.
author_sort Fan, Ya Ju
collection PubMed
description Neural Network (NN) models provide potential to speed up the drug discovery process and reduce its failure rates. The success of NN models requires uncertainty quantification (UQ) as drug discovery explores chemical space beyond the training data distribution. Standard NN models do not provide uncertainty information. Some methods require changing the NN architecture or training procedure, limiting the selection of NN models. Moreover, predictive uncertainty can come from different sources. It is important to have the ability to separately model different types of predictive uncertainty, as the model can take assorted actions depending on the source of uncertainty. In this paper, we examine UQ methods that estimate different sources of predictive uncertainty for NN models aiming at protein-ligand binding prediction. We use our prior knowledge on chemical compounds to design the experiments. By utilizing a visualization method we create non-overlapping and chemically diverse partitions from a collection of chemical compounds. These partitions are used as training and test set splits to explore NN model uncertainty. We demonstrate how the uncertainties estimated by the selected methods describe different sources of uncertainty under different partitions and featurization schemes and the relationship to prediction error.
format Online
Article
Text
id pubmed-10426331
institution National Center for Biotechnology Information
language English
publishDate 2023
record_format MEDLINE/PubMed
spelling pubmed-104263312023-08-15 Evaluating point-prediction uncertainties in neural networks for protein-ligand binding prediction Fan, Ya Ju Allen, Jonathan E. McLoughlin, Kevin S. Shi, Da Bennion, Brian J. Zhang, Xiaohua Lightstone, Felice C. Artif Intell Chem Article Neural Network (NN) models provide potential to speed up the drug discovery process and reduce its failure rates. The success of NN models requires uncertainty quantification (UQ) as drug discovery explores chemical space beyond the training data distribution. Standard NN models do not provide uncertainty information. Some methods require changing the NN architecture or training procedure, limiting the selection of NN models. Moreover, predictive uncertainty can come from different sources. It is important to have the ability to separately model different types of predictive uncertainty, as the model can take assorted actions depending on the source of uncertainty. In this paper, we examine UQ methods that estimate different sources of predictive uncertainty for NN models aiming at protein-ligand binding prediction. We use our prior knowledge on chemical compounds to design the experiments. By utilizing a visualization method we create non-overlapping and chemically diverse partitions from a collection of chemical compounds. These partitions are used as training and test set splits to explore NN model uncertainty. We demonstrate how the uncertainties estimated by the selected methods describe different sources of uncertainty under different partitions and featurization schemes and the relationship to prediction error. 2023-06 2023-06-03 /pmc/articles/PMC10426331/ /pubmed/37583465 http://dx.doi.org/10.1016/j.aichem.2023.100004 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) ).
spellingShingle Article
Fan, Ya Ju
Allen, Jonathan E.
McLoughlin, Kevin S.
Shi, Da
Bennion, Brian J.
Zhang, Xiaohua
Lightstone, Felice C.
Evaluating point-prediction uncertainties in neural networks for protein-ligand binding prediction
title Evaluating point-prediction uncertainties in neural networks for protein-ligand binding prediction
title_full Evaluating point-prediction uncertainties in neural networks for protein-ligand binding prediction
title_fullStr Evaluating point-prediction uncertainties in neural networks for protein-ligand binding prediction
title_full_unstemmed Evaluating point-prediction uncertainties in neural networks for protein-ligand binding prediction
title_short Evaluating point-prediction uncertainties in neural networks for protein-ligand binding prediction
title_sort evaluating point-prediction uncertainties in neural networks for protein-ligand binding prediction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10426331/
https://www.ncbi.nlm.nih.gov/pubmed/37583465
http://dx.doi.org/10.1016/j.aichem.2023.100004
work_keys_str_mv AT fanyaju evaluatingpointpredictionuncertaintiesinneuralnetworksforproteinligandbindingprediction
AT allenjonathane evaluatingpointpredictionuncertaintiesinneuralnetworksforproteinligandbindingprediction
AT mcloughlinkevins evaluatingpointpredictionuncertaintiesinneuralnetworksforproteinligandbindingprediction
AT shida evaluatingpointpredictionuncertaintiesinneuralnetworksforproteinligandbindingprediction
AT bennionbrianj evaluatingpointpredictionuncertaintiesinneuralnetworksforproteinligandbindingprediction
AT zhangxiaohua evaluatingpointpredictionuncertaintiesinneuralnetworksforproteinligandbindingprediction
AT lightstonefelicec evaluatingpointpredictionuncertaintiesinneuralnetworksforproteinligandbindingprediction