Cargando…

Bayesian semi-supervised learning for uncertainty-calibrated prediction of molecular properties and active learning

Predicting bioactivity and physical properties of small molecules is a central challenge in drug discovery. Deep learning is becoming the method of choice but studies to date focus on mean accuracy as the main metric. However, to replace costly and mission-critical experiments by models, a high mean...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Yao, Lee, Alpha A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Royal Society of Chemistry 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6837061/
https://www.ncbi.nlm.nih.gov/pubmed/31857882
http://dx.doi.org/10.1039/c9sc00616h
_version_ 1783467012905762816
author Zhang, Yao
Lee, Alpha A.
author_facet Zhang, Yao
Lee, Alpha A.
author_sort Zhang, Yao
collection PubMed
description Predicting bioactivity and physical properties of small molecules is a central challenge in drug discovery. Deep learning is becoming the method of choice but studies to date focus on mean accuracy as the main metric. However, to replace costly and mission-critical experiments by models, a high mean accuracy is not enough: outliers can derail a discovery campaign, thus models need to reliably predict when it will fail, even when the training data is biased; experiments are expensive, thus models need to be data-efficient and suggest informative training sets using active learning. We show that uncertainty quantification and active learning can be achieved by Bayesian semi-supervised graph convolutional neural networks. The Bayesian approach estimates uncertainty in a statistically principled way through sampling from the posterior distribution. Semi-supervised learning disentangles representation learning and regression, keeping uncertainty estimates accurate in the low data limit and allowing the model to start active learning from a small initial pool of training data. Our study highlights the promise of Bayesian deep learning for chemistry.
format Online
Article
Text
id pubmed-6837061
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Royal Society of Chemistry
record_format MEDLINE/PubMed
spelling pubmed-68370612019-12-19 Bayesian semi-supervised learning for uncertainty-calibrated prediction of molecular properties and active learning Zhang, Yao Lee, Alpha A. Chem Sci Chemistry Predicting bioactivity and physical properties of small molecules is a central challenge in drug discovery. Deep learning is becoming the method of choice but studies to date focus on mean accuracy as the main metric. However, to replace costly and mission-critical experiments by models, a high mean accuracy is not enough: outliers can derail a discovery campaign, thus models need to reliably predict when it will fail, even when the training data is biased; experiments are expensive, thus models need to be data-efficient and suggest informative training sets using active learning. We show that uncertainty quantification and active learning can be achieved by Bayesian semi-supervised graph convolutional neural networks. The Bayesian approach estimates uncertainty in a statistically principled way through sampling from the posterior distribution. Semi-supervised learning disentangles representation learning and regression, keeping uncertainty estimates accurate in the low data limit and allowing the model to start active learning from a small initial pool of training data. Our study highlights the promise of Bayesian deep learning for chemistry. Royal Society of Chemistry 2019-07-10 /pmc/articles/PMC6837061/ /pubmed/31857882 http://dx.doi.org/10.1039/c9sc00616h Text en This journal is © The Royal Society of Chemistry 2019 http://creativecommons.org/licenses/by/3.0/ This article is freely available. This article is licensed under a Creative Commons Attribution 3.0 Unported Licence (CC BY 3.0)
spellingShingle Chemistry
Zhang, Yao
Lee, Alpha A.
Bayesian semi-supervised learning for uncertainty-calibrated prediction of molecular properties and active learning
title Bayesian semi-supervised learning for uncertainty-calibrated prediction of molecular properties and active learning
title_full Bayesian semi-supervised learning for uncertainty-calibrated prediction of molecular properties and active learning
title_fullStr Bayesian semi-supervised learning for uncertainty-calibrated prediction of molecular properties and active learning
title_full_unstemmed Bayesian semi-supervised learning for uncertainty-calibrated prediction of molecular properties and active learning
title_short Bayesian semi-supervised learning for uncertainty-calibrated prediction of molecular properties and active learning
title_sort bayesian semi-supervised learning for uncertainty-calibrated prediction of molecular properties and active learning
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6837061/
https://www.ncbi.nlm.nih.gov/pubmed/31857882
http://dx.doi.org/10.1039/c9sc00616h
work_keys_str_mv AT zhangyao bayesiansemisupervisedlearningforuncertaintycalibratedpredictionofmolecularpropertiesandactivelearning
AT leealphaa bayesiansemisupervisedlearningforuncertaintycalibratedpredictionofmolecularpropertiesandactivelearning