Cargando…

Magicmol: a light-weighted pipeline for drug-like molecule evolution and quick chemical space exploration

The flourishment of machine learning and deep learning methods has boosted the development of cheminformatics, especially regarding the application of drug discovery and new material exploration. Lower time and space expenses make it possible for scientists to search the enormous chemical space. Rec...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Lin, Shen, Qing, Lou, Jungang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10132416/
https://www.ncbi.nlm.nih.gov/pubmed/37101113
http://dx.doi.org/10.1186/s12859-023-05286-0
_version_ 1785031382707208192
author Chen, Lin
Shen, Qing
Lou, Jungang
author_facet Chen, Lin
Shen, Qing
Lou, Jungang
author_sort Chen, Lin
collection PubMed
description The flourishment of machine learning and deep learning methods has boosted the development of cheminformatics, especially regarding the application of drug discovery and new material exploration. Lower time and space expenses make it possible for scientists to search the enormous chemical space. Recently, some work combined reinforcement learning strategies with recurrent neural network (RNN)-based models to optimize the property of generated small molecules, which notably improved a batch of critical factors for these candidates. However, a common problem among these RNN-based methods is that several generated molecules have difficulty in synthesizing despite owning higher desired properties such as binding affinity. However, RNN-based framework better reproduces the molecule distribution among the training set than other categories of models during molecule exploration tasks. Thus, to optimize the whole exploration process and make it contribute to the optimization of specified molecules, we devised a light-weighted pipeline called Magicmol; this pipeline has a re-mastered RNN network and utilize SELFIES presentation instead of SMILES. Our backbone model achieved extraordinary performance while reducing the training cost; moreover, we devised reward truncate strategies to eliminate the model collapse problem. Additionally, adopting SELFIES presentation made it possible to combine STONED-SELFIES as a post-processing procedure for specified molecule optimization and quick chemical space exploration.
format Online
Article
Text
id pubmed-10132416
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-101324162023-04-27 Magicmol: a light-weighted pipeline for drug-like molecule evolution and quick chemical space exploration Chen, Lin Shen, Qing Lou, Jungang BMC Bioinformatics Research The flourishment of machine learning and deep learning methods has boosted the development of cheminformatics, especially regarding the application of drug discovery and new material exploration. Lower time and space expenses make it possible for scientists to search the enormous chemical space. Recently, some work combined reinforcement learning strategies with recurrent neural network (RNN)-based models to optimize the property of generated small molecules, which notably improved a batch of critical factors for these candidates. However, a common problem among these RNN-based methods is that several generated molecules have difficulty in synthesizing despite owning higher desired properties such as binding affinity. However, RNN-based framework better reproduces the molecule distribution among the training set than other categories of models during molecule exploration tasks. Thus, to optimize the whole exploration process and make it contribute to the optimization of specified molecules, we devised a light-weighted pipeline called Magicmol; this pipeline has a re-mastered RNN network and utilize SELFIES presentation instead of SMILES. Our backbone model achieved extraordinary performance while reducing the training cost; moreover, we devised reward truncate strategies to eliminate the model collapse problem. Additionally, adopting SELFIES presentation made it possible to combine STONED-SELFIES as a post-processing procedure for specified molecule optimization and quick chemical space exploration. BioMed Central 2023-04-26 /pmc/articles/PMC10132416/ /pubmed/37101113 http://dx.doi.org/10.1186/s12859-023-05286-0 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Chen, Lin
Shen, Qing
Lou, Jungang
Magicmol: a light-weighted pipeline for drug-like molecule evolution and quick chemical space exploration
title Magicmol: a light-weighted pipeline for drug-like molecule evolution and quick chemical space exploration
title_full Magicmol: a light-weighted pipeline for drug-like molecule evolution and quick chemical space exploration
title_fullStr Magicmol: a light-weighted pipeline for drug-like molecule evolution and quick chemical space exploration
title_full_unstemmed Magicmol: a light-weighted pipeline for drug-like molecule evolution and quick chemical space exploration
title_short Magicmol: a light-weighted pipeline for drug-like molecule evolution and quick chemical space exploration
title_sort magicmol: a light-weighted pipeline for drug-like molecule evolution and quick chemical space exploration
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10132416/
https://www.ncbi.nlm.nih.gov/pubmed/37101113
http://dx.doi.org/10.1186/s12859-023-05286-0
work_keys_str_mv AT chenlin magicmolalightweightedpipelinefordruglikemoleculeevolutionandquickchemicalspaceexploration
AT shenqing magicmolalightweightedpipelinefordruglikemoleculeevolutionandquickchemicalspaceexploration
AT loujungang magicmolalightweightedpipelinefordruglikemoleculeevolutionandquickchemicalspaceexploration