Cargando…

On modeling and utilizing chemical compound information with deep learning technologies: A task-oriented approach

A large number of chemical compounds are available in databases such as PubChem and ZINC. However, currently known compounds, though large, represent only a fraction of possible compounds, which is known as chemical space. Many of these compounds in the databases are annotated with properties and as...

Descripción completa

Detalles Bibliográficos
Autores principales: Lim, Sangsoo, Lee, Sangseon, Piao, Yinhua, Choi, MinGyu, Bang, Dongmin, Gu, Jeonghyeon, Kim, Sun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9399946/
https://www.ncbi.nlm.nih.gov/pubmed/36051875
http://dx.doi.org/10.1016/j.csbj.2022.07.049
_version_ 1784772641864810496
author Lim, Sangsoo
Lee, Sangseon
Piao, Yinhua
Choi, MinGyu
Bang, Dongmin
Gu, Jeonghyeon
Kim, Sun
author_facet Lim, Sangsoo
Lee, Sangseon
Piao, Yinhua
Choi, MinGyu
Bang, Dongmin
Gu, Jeonghyeon
Kim, Sun
author_sort Lim, Sangsoo
collection PubMed
description A large number of chemical compounds are available in databases such as PubChem and ZINC. However, currently known compounds, though large, represent only a fraction of possible compounds, which is known as chemical space. Many of these compounds in the databases are annotated with properties and assay data that can be used for drug discovery efforts. For this goal, a number of machine learning algorithms have been developed and recent deep learning technologies can be effectively used to navigate chemical space, especially for unknown chemical compounds, in terms of drug-related tasks. In this article, we survey how deep learning technologies can model and utilize chemical compound information in a task-oriented way by exploiting annotated properties and assay data in the chemical compounds databases. We first compile what kind of tasks are trying to be accomplished by machine learning methods. Then, we survey deep learning technologies to show their modeling power and current applications for accomplishing drug related tasks. Next, we survey deep learning techniques to address the insufficiency issue of annotated data for more effective navigation of chemical space. Chemical compound information alone may not be powerful enough for drug related tasks, thus we survey what kind of information, such as assay and gene expression data, can be used to improve the prediction power of deep learning models. Finally, we conclude this survey with four important newly developed technologies that are yet to be fully incorporated into computational analysis of chemical information.
format Online
Article
Text
id pubmed-9399946
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-93999462022-08-31 On modeling and utilizing chemical compound information with deep learning technologies: A task-oriented approach Lim, Sangsoo Lee, Sangseon Piao, Yinhua Choi, MinGyu Bang, Dongmin Gu, Jeonghyeon Kim, Sun Comput Struct Biotechnol J Mini Review A large number of chemical compounds are available in databases such as PubChem and ZINC. However, currently known compounds, though large, represent only a fraction of possible compounds, which is known as chemical space. Many of these compounds in the databases are annotated with properties and assay data that can be used for drug discovery efforts. For this goal, a number of machine learning algorithms have been developed and recent deep learning technologies can be effectively used to navigate chemical space, especially for unknown chemical compounds, in terms of drug-related tasks. In this article, we survey how deep learning technologies can model and utilize chemical compound information in a task-oriented way by exploiting annotated properties and assay data in the chemical compounds databases. We first compile what kind of tasks are trying to be accomplished by machine learning methods. Then, we survey deep learning technologies to show their modeling power and current applications for accomplishing drug related tasks. Next, we survey deep learning techniques to address the insufficiency issue of annotated data for more effective navigation of chemical space. Chemical compound information alone may not be powerful enough for drug related tasks, thus we survey what kind of information, such as assay and gene expression data, can be used to improve the prediction power of deep learning models. Finally, we conclude this survey with four important newly developed technologies that are yet to be fully incorporated into computational analysis of chemical information. Research Network of Computational and Structural Biotechnology 2022-08-05 /pmc/articles/PMC9399946/ /pubmed/36051875 http://dx.doi.org/10.1016/j.csbj.2022.07.049 Text en © 2022 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Mini Review
Lim, Sangsoo
Lee, Sangseon
Piao, Yinhua
Choi, MinGyu
Bang, Dongmin
Gu, Jeonghyeon
Kim, Sun
On modeling and utilizing chemical compound information with deep learning technologies: A task-oriented approach
title On modeling and utilizing chemical compound information with deep learning technologies: A task-oriented approach
title_full On modeling and utilizing chemical compound information with deep learning technologies: A task-oriented approach
title_fullStr On modeling and utilizing chemical compound information with deep learning technologies: A task-oriented approach
title_full_unstemmed On modeling and utilizing chemical compound information with deep learning technologies: A task-oriented approach
title_short On modeling and utilizing chemical compound information with deep learning technologies: A task-oriented approach
title_sort on modeling and utilizing chemical compound information with deep learning technologies: a task-oriented approach
topic Mini Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9399946/
https://www.ncbi.nlm.nih.gov/pubmed/36051875
http://dx.doi.org/10.1016/j.csbj.2022.07.049
work_keys_str_mv AT limsangsoo onmodelingandutilizingchemicalcompoundinformationwithdeeplearningtechnologiesataskorientedapproach
AT leesangseon onmodelingandutilizingchemicalcompoundinformationwithdeeplearningtechnologiesataskorientedapproach
AT piaoyinhua onmodelingandutilizingchemicalcompoundinformationwithdeeplearningtechnologiesataskorientedapproach
AT choimingyu onmodelingandutilizingchemicalcompoundinformationwithdeeplearningtechnologiesataskorientedapproach
AT bangdongmin onmodelingandutilizingchemicalcompoundinformationwithdeeplearningtechnologiesataskorientedapproach
AT gujeonghyeon onmodelingandutilizingchemicalcompoundinformationwithdeeplearningtechnologiesataskorientedapproach
AT kimsun onmodelingandutilizingchemicalcompoundinformationwithdeeplearningtechnologiesataskorientedapproach