Cargando…

Robustness under parameter and problem domain alterations of Bayesian optimization methods for chemical reactions

The related problems of chemical reaction optimization and reaction scope search concern the discovery of reaction pathways and conditions that provide the best percentage yield of a target product. The space of possible reaction pathways or conditions is too large to search in full, so identifying...

Descripción completa

Detalles Bibliográficos
Autores principales: Khondaker, Rubaiyat Mohammad, Gow, Stephen, Kanza, Samantha, Frey, Jeremy G, Niranjan, Mahesan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9434872/
https://www.ncbi.nlm.nih.gov/pubmed/36050750
http://dx.doi.org/10.1186/s13321-022-00641-4
_version_ 1784780979612680192
author Khondaker, Rubaiyat Mohammad
Gow, Stephen
Kanza, Samantha
Frey, Jeremy G
Niranjan, Mahesan
author_facet Khondaker, Rubaiyat Mohammad
Gow, Stephen
Kanza, Samantha
Frey, Jeremy G
Niranjan, Mahesan
author_sort Khondaker, Rubaiyat Mohammad
collection PubMed
description The related problems of chemical reaction optimization and reaction scope search concern the discovery of reaction pathways and conditions that provide the best percentage yield of a target product. The space of possible reaction pathways or conditions is too large to search in full, so identifying a globally optimal set of conditions must instead draw on mathematical methods to identify areas of the space that should be investigated. An intriguing contribution to this area of research is the recent development of the Experimental Design for Bayesian optimization (EDBO) optimizer [1]. Bayesian optimization works by building an approximation to the true function to be optimized based on a small set of simulations, and selecting the next point (or points) to be tested based on an acquisition function reflecting the value of different points within the input space. In this work, we evaluated the robustness of the EDBO optimizer under several changes to its specification. We investigated the effect on the performance of the optimizer of altering the acquisition function and batch size, applied the method to other existing reaction yield data sets, and considered its performance in the new problem domain of molecular power conversion efficiency in photovoltaic cells. Our results indicated that the EDBO optimizer broadly performs well under these changes; of particular note is the competitive performance of the computationally cheaper acquisition function Thompson Sampling when compared to the original Expected Improvement function, and some concerns around the method’s performance for “incomplete” input domains.
format Online
Article
Text
id pubmed-9434872
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-94348722022-09-02 Robustness under parameter and problem domain alterations of Bayesian optimization methods for chemical reactions Khondaker, Rubaiyat Mohammad Gow, Stephen Kanza, Samantha Frey, Jeremy G Niranjan, Mahesan J Cheminform Research The related problems of chemical reaction optimization and reaction scope search concern the discovery of reaction pathways and conditions that provide the best percentage yield of a target product. The space of possible reaction pathways or conditions is too large to search in full, so identifying a globally optimal set of conditions must instead draw on mathematical methods to identify areas of the space that should be investigated. An intriguing contribution to this area of research is the recent development of the Experimental Design for Bayesian optimization (EDBO) optimizer [1]. Bayesian optimization works by building an approximation to the true function to be optimized based on a small set of simulations, and selecting the next point (or points) to be tested based on an acquisition function reflecting the value of different points within the input space. In this work, we evaluated the robustness of the EDBO optimizer under several changes to its specification. We investigated the effect on the performance of the optimizer of altering the acquisition function and batch size, applied the method to other existing reaction yield data sets, and considered its performance in the new problem domain of molecular power conversion efficiency in photovoltaic cells. Our results indicated that the EDBO optimizer broadly performs well under these changes; of particular note is the competitive performance of the computationally cheaper acquisition function Thompson Sampling when compared to the original Expected Improvement function, and some concerns around the method’s performance for “incomplete” input domains. Springer International Publishing 2022-09-01 /pmc/articles/PMC9434872/ /pubmed/36050750 http://dx.doi.org/10.1186/s13321-022-00641-4 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Khondaker, Rubaiyat Mohammad
Gow, Stephen
Kanza, Samantha
Frey, Jeremy G
Niranjan, Mahesan
Robustness under parameter and problem domain alterations of Bayesian optimization methods for chemical reactions
title Robustness under parameter and problem domain alterations of Bayesian optimization methods for chemical reactions
title_full Robustness under parameter and problem domain alterations of Bayesian optimization methods for chemical reactions
title_fullStr Robustness under parameter and problem domain alterations of Bayesian optimization methods for chemical reactions
title_full_unstemmed Robustness under parameter and problem domain alterations of Bayesian optimization methods for chemical reactions
title_short Robustness under parameter and problem domain alterations of Bayesian optimization methods for chemical reactions
title_sort robustness under parameter and problem domain alterations of bayesian optimization methods for chemical reactions
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9434872/
https://www.ncbi.nlm.nih.gov/pubmed/36050750
http://dx.doi.org/10.1186/s13321-022-00641-4
work_keys_str_mv AT khondakerrubaiyatmohammad robustnessunderparameterandproblemdomainalterationsofbayesianoptimizationmethodsforchemicalreactions
AT gowstephen robustnessunderparameterandproblemdomainalterationsofbayesianoptimizationmethodsforchemicalreactions
AT kanzasamantha robustnessunderparameterandproblemdomainalterationsofbayesianoptimizationmethodsforchemicalreactions
AT freyjeremyg robustnessunderparameterandproblemdomainalterationsofbayesianoptimizationmethodsforchemicalreactions
AT niranjanmahesan robustnessunderparameterandproblemdomainalterationsofbayesianoptimizationmethodsforchemicalreactions