Cargando…

Applications of Deep-Learning in Exploiting Large-Scale and Heterogeneous Compound Data in Industrial Pharmaceutical Research

In recent years, the development of high-throughput screening (HTS) technologies and their establishment in an industrialized environment have given scientists the possibility to test millions of molecules and profile them against a multitude of biological targets in a short period of time, generati...

Descripción completa

Detalles Bibliográficos
Autores principales: David, Laurianne, Arús-Pous, Josep, Karlsson, Johan, Engkvist, Ola, Bjerrum, Esben Jannik, Kogej, Thierry, Kriegl, Jan M., Beck, Bernd, Chen, Hongming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6848277/
https://www.ncbi.nlm.nih.gov/pubmed/31749705
http://dx.doi.org/10.3389/fphar.2019.01303
_version_ 1783469063459045376
author David, Laurianne
Arús-Pous, Josep
Karlsson, Johan
Engkvist, Ola
Bjerrum, Esben Jannik
Kogej, Thierry
Kriegl, Jan M.
Beck, Bernd
Chen, Hongming
author_facet David, Laurianne
Arús-Pous, Josep
Karlsson, Johan
Engkvist, Ola
Bjerrum, Esben Jannik
Kogej, Thierry
Kriegl, Jan M.
Beck, Bernd
Chen, Hongming
author_sort David, Laurianne
collection PubMed
description In recent years, the development of high-throughput screening (HTS) technologies and their establishment in an industrialized environment have given scientists the possibility to test millions of molecules and profile them against a multitude of biological targets in a short period of time, generating data in a much faster pace and with a higher quality than before. Besides the structure activity data from traditional bioassays, more complex assays such as transcriptomics profiling or imaging have also been established as routine profiling experiments thanks to the advancement of Next Generation Sequencing or automated microscopy technologies. In industrial pharmaceutical research, these technologies are typically established in conjunction with automated platforms in order to enable efficient handling of screening collections of thousands to millions of compounds. To exploit the ever-growing amount of data that are generated by these approaches, computational techniques are constantly evolving. In this regard, artificial intelligence technologies such as deep learning and machine learning methods play a key role in cheminformatics and bio-image analytics fields to address activity prediction, scaffold hopping, de novo molecule design, reaction/retrosynthesis predictions, or high content screening analysis. Herein we summarize the current state of analyzing large-scale compound data in industrial pharmaceutical research and describe the impact it has had on the drug discovery process over the last two decades, with a specific focus on deep-learning technologies.
format Online
Article
Text
id pubmed-6848277
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-68482772019-11-20 Applications of Deep-Learning in Exploiting Large-Scale and Heterogeneous Compound Data in Industrial Pharmaceutical Research David, Laurianne Arús-Pous, Josep Karlsson, Johan Engkvist, Ola Bjerrum, Esben Jannik Kogej, Thierry Kriegl, Jan M. Beck, Bernd Chen, Hongming Front Pharmacol Pharmacology In recent years, the development of high-throughput screening (HTS) technologies and their establishment in an industrialized environment have given scientists the possibility to test millions of molecules and profile them against a multitude of biological targets in a short period of time, generating data in a much faster pace and with a higher quality than before. Besides the structure activity data from traditional bioassays, more complex assays such as transcriptomics profiling or imaging have also been established as routine profiling experiments thanks to the advancement of Next Generation Sequencing or automated microscopy technologies. In industrial pharmaceutical research, these technologies are typically established in conjunction with automated platforms in order to enable efficient handling of screening collections of thousands to millions of compounds. To exploit the ever-growing amount of data that are generated by these approaches, computational techniques are constantly evolving. In this regard, artificial intelligence technologies such as deep learning and machine learning methods play a key role in cheminformatics and bio-image analytics fields to address activity prediction, scaffold hopping, de novo molecule design, reaction/retrosynthesis predictions, or high content screening analysis. Herein we summarize the current state of analyzing large-scale compound data in industrial pharmaceutical research and describe the impact it has had on the drug discovery process over the last two decades, with a specific focus on deep-learning technologies. Frontiers Media S.A. 2019-11-05 /pmc/articles/PMC6848277/ /pubmed/31749705 http://dx.doi.org/10.3389/fphar.2019.01303 Text en Copyright © 2019 David, Arús-Pous, Karlsson, Engkvist, Bjerrum, Kogej, Kriegl, Beck and Chen http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Pharmacology
David, Laurianne
Arús-Pous, Josep
Karlsson, Johan
Engkvist, Ola
Bjerrum, Esben Jannik
Kogej, Thierry
Kriegl, Jan M.
Beck, Bernd
Chen, Hongming
Applications of Deep-Learning in Exploiting Large-Scale and Heterogeneous Compound Data in Industrial Pharmaceutical Research
title Applications of Deep-Learning in Exploiting Large-Scale and Heterogeneous Compound Data in Industrial Pharmaceutical Research
title_full Applications of Deep-Learning in Exploiting Large-Scale and Heterogeneous Compound Data in Industrial Pharmaceutical Research
title_fullStr Applications of Deep-Learning in Exploiting Large-Scale and Heterogeneous Compound Data in Industrial Pharmaceutical Research
title_full_unstemmed Applications of Deep-Learning in Exploiting Large-Scale and Heterogeneous Compound Data in Industrial Pharmaceutical Research
title_short Applications of Deep-Learning in Exploiting Large-Scale and Heterogeneous Compound Data in Industrial Pharmaceutical Research
title_sort applications of deep-learning in exploiting large-scale and heterogeneous compound data in industrial pharmaceutical research
topic Pharmacology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6848277/
https://www.ncbi.nlm.nih.gov/pubmed/31749705
http://dx.doi.org/10.3389/fphar.2019.01303
work_keys_str_mv AT davidlaurianne applicationsofdeeplearninginexploitinglargescaleandheterogeneouscompounddatainindustrialpharmaceuticalresearch
AT aruspousjosep applicationsofdeeplearninginexploitinglargescaleandheterogeneouscompounddatainindustrialpharmaceuticalresearch
AT karlssonjohan applicationsofdeeplearninginexploitinglargescaleandheterogeneouscompounddatainindustrialpharmaceuticalresearch
AT engkvistola applicationsofdeeplearninginexploitinglargescaleandheterogeneouscompounddatainindustrialpharmaceuticalresearch
AT bjerrumesbenjannik applicationsofdeeplearninginexploitinglargescaleandheterogeneouscompounddatainindustrialpharmaceuticalresearch
AT kogejthierry applicationsofdeeplearninginexploitinglargescaleandheterogeneouscompounddatainindustrialpharmaceuticalresearch
AT kriegljanm applicationsofdeeplearninginexploitinglargescaleandheterogeneouscompounddatainindustrialpharmaceuticalresearch
AT beckbernd applicationsofdeeplearninginexploitinglargescaleandheterogeneouscompounddatainindustrialpharmaceuticalresearch
AT chenhongming applicationsofdeeplearninginexploitinglargescaleandheterogeneouscompounddatainindustrialpharmaceuticalresearch