Cargando…

MACAW: An Accessible Tool for Molecular Embedding and Inverse Molecular Design

[Image: see text] The growing capabilities of synthetic biology and organic chemistry demand tools to guide syntheses toward useful molecules. Here, we present Molecular AutoenCoding Auto-Workaround (MACAW), a tool that uses a novel approach to generate molecules predicted to meet a desired property...

Descripción completa

Detalles Bibliográficos
Autores principales: Blay, Vincent, Radivojevic, Tijana, Allen, Jonathan E., Hudson, Corey M., Garcia Martin, Hector
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2022
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9364320/
https://www.ncbi.nlm.nih.gov/pubmed/35857932
http://dx.doi.org/10.1021/acs.jcim.2c00229
_version_ 1784765125859737600
author Blay, Vincent
Radivojevic, Tijana
Allen, Jonathan E.
Hudson, Corey M.
Garcia Martin, Hector
author_facet Blay, Vincent
Radivojevic, Tijana
Allen, Jonathan E.
Hudson, Corey M.
Garcia Martin, Hector
author_sort Blay, Vincent
collection PubMed
description [Image: see text] The growing capabilities of synthetic biology and organic chemistry demand tools to guide syntheses toward useful molecules. Here, we present Molecular AutoenCoding Auto-Workaround (MACAW), a tool that uses a novel approach to generate molecules predicted to meet a desired property specification (e.g., a binding affinity of 50 nM or an octane number of 90). MACAW describes molecules by embedding them into a smooth multidimensional numerical space, avoiding uninformative dimensions that previous methods often introduce. The coordinates in this embedding provide a natural choice of features for accurately predicting molecular properties, which we demonstrate with examples for cetane and octane numbers, flash points, and histamine H1 receptor binding affinity. The approach is computationally efficient and well-suited to the small- and medium-size datasets commonly used in biosciences. We showcase the utility of MACAW for virtual screening by identifying molecules with high predicted binding affinity to the histamine H1 receptor and limited affinity to the muscarinic M2 receptor, which are targets of medicinal relevance. Combining these predictive capabilities with a novel generative algorithm for molecules allows us to recommend molecules with a desired property value (i.e., inverse molecular design). We demonstrate this capability by recommending molecules with predicted octane numbers of 40, 80, and 120, which is an important characteristic of biofuels. Thus, MACAW augments classical retrosynthesis tools by providing recommendations for molecules on specification.
format Online
Article
Text
id pubmed-9364320
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-93643202022-08-11 MACAW: An Accessible Tool for Molecular Embedding and Inverse Molecular Design Blay, Vincent Radivojevic, Tijana Allen, Jonathan E. Hudson, Corey M. Garcia Martin, Hector J Chem Inf Model [Image: see text] The growing capabilities of synthetic biology and organic chemistry demand tools to guide syntheses toward useful molecules. Here, we present Molecular AutoenCoding Auto-Workaround (MACAW), a tool that uses a novel approach to generate molecules predicted to meet a desired property specification (e.g., a binding affinity of 50 nM or an octane number of 90). MACAW describes molecules by embedding them into a smooth multidimensional numerical space, avoiding uninformative dimensions that previous methods often introduce. The coordinates in this embedding provide a natural choice of features for accurately predicting molecular properties, which we demonstrate with examples for cetane and octane numbers, flash points, and histamine H1 receptor binding affinity. The approach is computationally efficient and well-suited to the small- and medium-size datasets commonly used in biosciences. We showcase the utility of MACAW for virtual screening by identifying molecules with high predicted binding affinity to the histamine H1 receptor and limited affinity to the muscarinic M2 receptor, which are targets of medicinal relevance. Combining these predictive capabilities with a novel generative algorithm for molecules allows us to recommend molecules with a desired property value (i.e., inverse molecular design). We demonstrate this capability by recommending molecules with predicted octane numbers of 40, 80, and 120, which is an important characteristic of biofuels. Thus, MACAW augments classical retrosynthesis tools by providing recommendations for molecules on specification. American Chemical Society 2022-07-20 2022-08-08 /pmc/articles/PMC9364320/ /pubmed/35857932 http://dx.doi.org/10.1021/acs.jcim.2c00229 Text en © 2022 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Blay, Vincent
Radivojevic, Tijana
Allen, Jonathan E.
Hudson, Corey M.
Garcia Martin, Hector
MACAW: An Accessible Tool for Molecular Embedding and Inverse Molecular Design
title MACAW: An Accessible Tool for Molecular Embedding and Inverse Molecular Design
title_full MACAW: An Accessible Tool for Molecular Embedding and Inverse Molecular Design
title_fullStr MACAW: An Accessible Tool for Molecular Embedding and Inverse Molecular Design
title_full_unstemmed MACAW: An Accessible Tool for Molecular Embedding and Inverse Molecular Design
title_short MACAW: An Accessible Tool for Molecular Embedding and Inverse Molecular Design
title_sort macaw: an accessible tool for molecular embedding and inverse molecular design
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9364320/
https://www.ncbi.nlm.nih.gov/pubmed/35857932
http://dx.doi.org/10.1021/acs.jcim.2c00229
work_keys_str_mv AT blayvincent macawanaccessibletoolformolecularembeddingandinversemoleculardesign
AT radivojevictijana macawanaccessibletoolformolecularembeddingandinversemoleculardesign
AT allenjonathane macawanaccessibletoolformolecularembeddingandinversemoleculardesign
AT hudsoncoreym macawanaccessibletoolformolecularembeddingandinversemoleculardesign
AT garciamartinhector macawanaccessibletoolformolecularembeddingandinversemoleculardesign