Cargando…

Autonomous design of new chemical reactions using a variational autoencoder

Artificial intelligence based chemistry models are a promising method of exploring chemical reaction design spaces. However, training datasets based on experimental synthesis are typically reported only for the optimal synthesis reactions. This leads to an inherited bias in the model predictions. Th...

Descripción completa

Detalles Bibliográficos
Autores principales: Tempke, Robert, Musho, Terence
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9814385/
https://www.ncbi.nlm.nih.gov/pubmed/36697652
http://dx.doi.org/10.1038/s42004-022-00647-x
_version_ 1784864120882855936
author Tempke, Robert
Musho, Terence
author_facet Tempke, Robert
Musho, Terence
author_sort Tempke, Robert
collection PubMed
description Artificial intelligence based chemistry models are a promising method of exploring chemical reaction design spaces. However, training datasets based on experimental synthesis are typically reported only for the optimal synthesis reactions. This leads to an inherited bias in the model predictions. Therefore, robust datasets that span the entirety of the solution space are necessary to remove inherited bias and permit complete training of the space. In this study, an artificial intelligence model based on a Variational AutoEncoder (VAE) has been developed and investigated to synthetically generate continuous datasets. The approach involves sampling the latent space to generate new chemical reactions. This developed technique is demonstrated by generating over 7,000,000 new reactions from a training dataset containing only 7,000 reactions. The generated reactions include molecular species that are larger and more diverse than the training set.
format Online
Article
Text
id pubmed-9814385
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-98143852023-01-10 Autonomous design of new chemical reactions using a variational autoencoder Tempke, Robert Musho, Terence Commun Chem Article Artificial intelligence based chemistry models are a promising method of exploring chemical reaction design spaces. However, training datasets based on experimental synthesis are typically reported only for the optimal synthesis reactions. This leads to an inherited bias in the model predictions. Therefore, robust datasets that span the entirety of the solution space are necessary to remove inherited bias and permit complete training of the space. In this study, an artificial intelligence model based on a Variational AutoEncoder (VAE) has been developed and investigated to synthetically generate continuous datasets. The approach involves sampling the latent space to generate new chemical reactions. This developed technique is demonstrated by generating over 7,000,000 new reactions from a training dataset containing only 7,000 reactions. The generated reactions include molecular species that are larger and more diverse than the training set. Nature Publishing Group UK 2022-03-22 /pmc/articles/PMC9814385/ /pubmed/36697652 http://dx.doi.org/10.1038/s42004-022-00647-x Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Tempke, Robert
Musho, Terence
Autonomous design of new chemical reactions using a variational autoencoder
title Autonomous design of new chemical reactions using a variational autoencoder
title_full Autonomous design of new chemical reactions using a variational autoencoder
title_fullStr Autonomous design of new chemical reactions using a variational autoencoder
title_full_unstemmed Autonomous design of new chemical reactions using a variational autoencoder
title_short Autonomous design of new chemical reactions using a variational autoencoder
title_sort autonomous design of new chemical reactions using a variational autoencoder
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9814385/
https://www.ncbi.nlm.nih.gov/pubmed/36697652
http://dx.doi.org/10.1038/s42004-022-00647-x
work_keys_str_mv AT tempkerobert autonomousdesignofnewchemicalreactionsusingavariationalautoencoder
AT mushoterence autonomousdesignofnewchemicalreactionsusingavariationalautoencoder