Cargando…
Deep learning and generative methods in cheminformatics and chemical biology: navigating small molecule space intelligently
The number of ‘small’ molecules that may be of interest to chemical biologists — chemical space — is enormous, but the fraction that have ever been made is tiny. Most strategies are discriminative, i.e. have involved ‘forward’ problems (have molecule, establish properties). However, we normally wish...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Portland Press Ltd.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7733676/ https://www.ncbi.nlm.nih.gov/pubmed/33290527 http://dx.doi.org/10.1042/BCJ20200781 |
_version_ | 1783622322592153600 |
---|---|
author | Kell, Douglas B. Samanta, Soumitra Swainston, Neil |
author_facet | Kell, Douglas B. Samanta, Soumitra Swainston, Neil |
author_sort | Kell, Douglas B. |
collection | PubMed |
description | The number of ‘small’ molecules that may be of interest to chemical biologists — chemical space — is enormous, but the fraction that have ever been made is tiny. Most strategies are discriminative, i.e. have involved ‘forward’ problems (have molecule, establish properties). However, we normally wish to solve the much harder generative or inverse problem (describe desired properties, find molecule). ‘Deep’ (machine) learning based on large-scale neural networks underpins technologies such as computer vision, natural language processing, driverless cars, and world-leading performance in games such as Go; it can also be applied to the solution of inverse problems in chemical biology. In particular, recent developments in deep learning admit the in silico generation of candidate molecular structures and the prediction of their properties, thereby allowing one to navigate (bio)chemical space intelligently. These methods are revolutionary but require an understanding of both (bio)chemistry and computer science to be exploited to best advantage. We give a high-level (non-mathematical) background to the deep learning revolution, and set out the crucial issue for chemical biology and informatics as a two-way mapping from the discrete nature of individual molecules to the continuous but high-dimensional latent representation that may best reflect chemical space. A variety of architectures can do this; we focus on a particular type known as variational autoencoders. We then provide some examples of recent successes of these kinds of approach, and a look towards the future. |
format | Online Article Text |
id | pubmed-7733676 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Portland Press Ltd. |
record_format | MEDLINE/PubMed |
spelling | pubmed-77336762020-12-18 Deep learning and generative methods in cheminformatics and chemical biology: navigating small molecule space intelligently Kell, Douglas B. Samanta, Soumitra Swainston, Neil Biochem J Bioinformatics The number of ‘small’ molecules that may be of interest to chemical biologists — chemical space — is enormous, but the fraction that have ever been made is tiny. Most strategies are discriminative, i.e. have involved ‘forward’ problems (have molecule, establish properties). However, we normally wish to solve the much harder generative or inverse problem (describe desired properties, find molecule). ‘Deep’ (machine) learning based on large-scale neural networks underpins technologies such as computer vision, natural language processing, driverless cars, and world-leading performance in games such as Go; it can also be applied to the solution of inverse problems in chemical biology. In particular, recent developments in deep learning admit the in silico generation of candidate molecular structures and the prediction of their properties, thereby allowing one to navigate (bio)chemical space intelligently. These methods are revolutionary but require an understanding of both (bio)chemistry and computer science to be exploited to best advantage. We give a high-level (non-mathematical) background to the deep learning revolution, and set out the crucial issue for chemical biology and informatics as a two-way mapping from the discrete nature of individual molecules to the continuous but high-dimensional latent representation that may best reflect chemical space. A variety of architectures can do this; we focus on a particular type known as variational autoencoders. We then provide some examples of recent successes of these kinds of approach, and a look towards the future. Portland Press Ltd. 2020-12-11 2020-12-08 /pmc/articles/PMC7733676/ /pubmed/33290527 http://dx.doi.org/10.1042/BCJ20200781 Text en © 2020 The Author(s) https://creativecommons.org/licenses/by/4.0/ This is an open access article published by Portland Press Limited on behalf of the Biochemical Society and distributed under the Creative Commons Attribution License 4.0 (CC BY) (https://creativecommons.org/licenses/by/4.0/) . Open access for this article was enabled by the participation of University of Liverpool in an all-inclusive Read & Publish pilot with Portland Press and the Biochemical Society under a transformative agreement with JISC. |
spellingShingle | Bioinformatics Kell, Douglas B. Samanta, Soumitra Swainston, Neil Deep learning and generative methods in cheminformatics and chemical biology: navigating small molecule space intelligently |
title | Deep learning and generative methods in cheminformatics and chemical biology: navigating small molecule space intelligently |
title_full | Deep learning and generative methods in cheminformatics and chemical biology: navigating small molecule space intelligently |
title_fullStr | Deep learning and generative methods in cheminformatics and chemical biology: navigating small molecule space intelligently |
title_full_unstemmed | Deep learning and generative methods in cheminformatics and chemical biology: navigating small molecule space intelligently |
title_short | Deep learning and generative methods in cheminformatics and chemical biology: navigating small molecule space intelligently |
title_sort | deep learning and generative methods in cheminformatics and chemical biology: navigating small molecule space intelligently |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7733676/ https://www.ncbi.nlm.nih.gov/pubmed/33290527 http://dx.doi.org/10.1042/BCJ20200781 |
work_keys_str_mv | AT kelldouglasb deeplearningandgenerativemethodsincheminformaticsandchemicalbiologynavigatingsmallmoleculespaceintelligently AT samantasoumitra deeplearningandgenerativemethodsincheminformaticsandchemicalbiologynavigatingsmallmoleculespaceintelligently AT swainstonneil deeplearningandgenerativemethodsincheminformaticsandchemicalbiologynavigatingsmallmoleculespaceintelligently |