Cargando…

Deep learning and generative methods in cheminformatics and chemical biology: navigating small molecule space intelligently

The number of ‘small’ molecules that may be of interest to chemical biologists — chemical space — is enormous, but the fraction that have ever been made is tiny. Most strategies are discriminative, i.e. have involved ‘forward’ problems (have molecule, establish properties). However, we normally wish...

Descripción completa

Detalles Bibliográficos
Autores principales: Kell, Douglas B., Samanta, Soumitra, Swainston, Neil
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Portland Press Ltd. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7733676/
https://www.ncbi.nlm.nih.gov/pubmed/33290527
http://dx.doi.org/10.1042/BCJ20200781
_version_ 1783622322592153600
author Kell, Douglas B.
Samanta, Soumitra
Swainston, Neil
author_facet Kell, Douglas B.
Samanta, Soumitra
Swainston, Neil
author_sort Kell, Douglas B.
collection PubMed
description The number of ‘small’ molecules that may be of interest to chemical biologists — chemical space — is enormous, but the fraction that have ever been made is tiny. Most strategies are discriminative, i.e. have involved ‘forward’ problems (have molecule, establish properties). However, we normally wish to solve the much harder generative or inverse problem (describe desired properties, find molecule). ‘Deep’ (machine) learning based on large-scale neural networks underpins technologies such as computer vision, natural language processing, driverless cars, and world-leading performance in games such as Go; it can also be applied to the solution of inverse problems in chemical biology. In particular, recent developments in deep learning admit the in silico generation of candidate molecular structures and the prediction of their properties, thereby allowing one to navigate (bio)chemical space intelligently. These methods are revolutionary but require an understanding of both (bio)chemistry and computer science to be exploited to best advantage. We give a high-level (non-mathematical) background to the deep learning revolution, and set out the crucial issue for chemical biology and informatics as a two-way mapping from the discrete nature of individual molecules to the continuous but high-dimensional latent representation that may best reflect chemical space. A variety of architectures can do this; we focus on a particular type known as variational autoencoders. We then provide some examples of recent successes of these kinds of approach, and a look towards the future.
format Online
Article
Text
id pubmed-7733676
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Portland Press Ltd.
record_format MEDLINE/PubMed
spelling pubmed-77336762020-12-18 Deep learning and generative methods in cheminformatics and chemical biology: navigating small molecule space intelligently Kell, Douglas B. Samanta, Soumitra Swainston, Neil Biochem J Bioinformatics The number of ‘small’ molecules that may be of interest to chemical biologists — chemical space — is enormous, but the fraction that have ever been made is tiny. Most strategies are discriminative, i.e. have involved ‘forward’ problems (have molecule, establish properties). However, we normally wish to solve the much harder generative or inverse problem (describe desired properties, find molecule). ‘Deep’ (machine) learning based on large-scale neural networks underpins technologies such as computer vision, natural language processing, driverless cars, and world-leading performance in games such as Go; it can also be applied to the solution of inverse problems in chemical biology. In particular, recent developments in deep learning admit the in silico generation of candidate molecular structures and the prediction of their properties, thereby allowing one to navigate (bio)chemical space intelligently. These methods are revolutionary but require an understanding of both (bio)chemistry and computer science to be exploited to best advantage. We give a high-level (non-mathematical) background to the deep learning revolution, and set out the crucial issue for chemical biology and informatics as a two-way mapping from the discrete nature of individual molecules to the continuous but high-dimensional latent representation that may best reflect chemical space. A variety of architectures can do this; we focus on a particular type known as variational autoencoders. We then provide some examples of recent successes of these kinds of approach, and a look towards the future. Portland Press Ltd. 2020-12-11 2020-12-08 /pmc/articles/PMC7733676/ /pubmed/33290527 http://dx.doi.org/10.1042/BCJ20200781 Text en © 2020 The Author(s) https://creativecommons.org/licenses/by/4.0/ This is an open access article published by Portland Press Limited on behalf of the Biochemical Society and distributed under the Creative Commons Attribution License 4.0 (CC BY) (https://creativecommons.org/licenses/by/4.0/) . Open access for this article was enabled by the participation of University of Liverpool in an all-inclusive Read & Publish pilot with Portland Press and the Biochemical Society under a transformative agreement with JISC.
spellingShingle Bioinformatics
Kell, Douglas B.
Samanta, Soumitra
Swainston, Neil
Deep learning and generative methods in cheminformatics and chemical biology: navigating small molecule space intelligently
title Deep learning and generative methods in cheminformatics and chemical biology: navigating small molecule space intelligently
title_full Deep learning and generative methods in cheminformatics and chemical biology: navigating small molecule space intelligently
title_fullStr Deep learning and generative methods in cheminformatics and chemical biology: navigating small molecule space intelligently
title_full_unstemmed Deep learning and generative methods in cheminformatics and chemical biology: navigating small molecule space intelligently
title_short Deep learning and generative methods in cheminformatics and chemical biology: navigating small molecule space intelligently
title_sort deep learning and generative methods in cheminformatics and chemical biology: navigating small molecule space intelligently
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7733676/
https://www.ncbi.nlm.nih.gov/pubmed/33290527
http://dx.doi.org/10.1042/BCJ20200781
work_keys_str_mv AT kelldouglasb deeplearningandgenerativemethodsincheminformaticsandchemicalbiologynavigatingsmallmoleculespaceintelligently
AT samantasoumitra deeplearningandgenerativemethodsincheminformaticsandchemicalbiologynavigatingsmallmoleculespaceintelligently
AT swainstonneil deeplearningandgenerativemethodsincheminformaticsandchemicalbiologynavigatingsmallmoleculespaceintelligently