Cargando…

Explaining and avoiding failure modes in goal-directed generation of small molecules

Despite growing interest and success in automated in-silico molecular design, questions remain regarding the ability of goal-directed generation algorithms to perform unbiased exploration of novel chemical spaces. A specific phenomenon has recently been highlighted: goal-directed generation guided w...

Descripción completa

Detalles Bibliográficos
Autores principales: Langevin, Maxime, Vuilleumier, Rodolphe, Bianciotto, Marc
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8973583/
https://www.ncbi.nlm.nih.gov/pubmed/35365218
http://dx.doi.org/10.1186/s13321-022-00601-y
Descripción
Sumario:Despite growing interest and success in automated in-silico molecular design, questions remain regarding the ability of goal-directed generation algorithms to perform unbiased exploration of novel chemical spaces. A specific phenomenon has recently been highlighted: goal-directed generation guided with machine learning models produce molecules with high scores according to the optimization model, but low scores according to control models, even when trained on the same data distribution and the same target. In this work, we show that this worrisome behavior is actually due to issues with the predictive models and not the goal-directed generation algorithms. We show that with appropriate predictive models, this issue can be resolved, and molecules generated have high scores according to both the optimization and the control models. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-022-00601-y.