Cargando…

MEMES: Machine learning framework for Enhanced MolEcular Screening

In drug discovery applications, high throughput virtual screening exercises are routinely performed to determine an initial set of candidate molecules referred to as “hits”. In such an experiment, each molecule from a large small-molecule drug library is evaluated in terms of physical properties suc...

Descripción completa

Detalles Bibliográficos
Autores principales: Mehta, Sarvesh, Laghuvarapu, Siddhartha, Pathak, Yashaswi, Sethi, Aaftaab, Alvala, Mallika, Priyakumar, U. Deva
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society of Chemistry 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8442698/
https://www.ncbi.nlm.nih.gov/pubmed/34659706
http://dx.doi.org/10.1039/d1sc02783b
_version_ 1783753054936367104
author Mehta, Sarvesh
Laghuvarapu, Siddhartha
Pathak, Yashaswi
Sethi, Aaftaab
Alvala, Mallika
Priyakumar, U. Deva
author_facet Mehta, Sarvesh
Laghuvarapu, Siddhartha
Pathak, Yashaswi
Sethi, Aaftaab
Alvala, Mallika
Priyakumar, U. Deva
author_sort Mehta, Sarvesh
collection PubMed
description In drug discovery applications, high throughput virtual screening exercises are routinely performed to determine an initial set of candidate molecules referred to as “hits”. In such an experiment, each molecule from a large small-molecule drug library is evaluated in terms of physical properties such as the docking score against a target receptor. In real-life drug discovery experiments, drug libraries are extremely large but still there is only a minor representation of the essentially infinite chemical space, and evaluation of physical properties for each molecule in the library is not computationally feasible. In the current study, a novel Machine learning framework for Enhanced MolEcular Screening (MEMES) based on Bayesian optimization is proposed for efficient sampling of the chemical space. The proposed framework is demonstrated to identify 90% of the top-1000 molecules from a molecular library of size about 100 million, while calculating the docking score only for about 6% of the complete library. We believe that such a framework would tremendously help to reduce the computational effort in not only drug-discovery but also areas that require such high-throughput experiments.
format Online
Article
Text
id pubmed-8442698
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher The Royal Society of Chemistry
record_format MEDLINE/PubMed
spelling pubmed-84426982021-10-14 MEMES: Machine learning framework for Enhanced MolEcular Screening Mehta, Sarvesh Laghuvarapu, Siddhartha Pathak, Yashaswi Sethi, Aaftaab Alvala, Mallika Priyakumar, U. Deva Chem Sci Chemistry In drug discovery applications, high throughput virtual screening exercises are routinely performed to determine an initial set of candidate molecules referred to as “hits”. In such an experiment, each molecule from a large small-molecule drug library is evaluated in terms of physical properties such as the docking score against a target receptor. In real-life drug discovery experiments, drug libraries are extremely large but still there is only a minor representation of the essentially infinite chemical space, and evaluation of physical properties for each molecule in the library is not computationally feasible. In the current study, a novel Machine learning framework for Enhanced MolEcular Screening (MEMES) based on Bayesian optimization is proposed for efficient sampling of the chemical space. The proposed framework is demonstrated to identify 90% of the top-1000 molecules from a molecular library of size about 100 million, while calculating the docking score only for about 6% of the complete library. We believe that such a framework would tremendously help to reduce the computational effort in not only drug-discovery but also areas that require such high-throughput experiments. The Royal Society of Chemistry 2021-07-26 /pmc/articles/PMC8442698/ /pubmed/34659706 http://dx.doi.org/10.1039/d1sc02783b Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by-nc/3.0/
spellingShingle Chemistry
Mehta, Sarvesh
Laghuvarapu, Siddhartha
Pathak, Yashaswi
Sethi, Aaftaab
Alvala, Mallika
Priyakumar, U. Deva
MEMES: Machine learning framework for Enhanced MolEcular Screening
title MEMES: Machine learning framework for Enhanced MolEcular Screening
title_full MEMES: Machine learning framework for Enhanced MolEcular Screening
title_fullStr MEMES: Machine learning framework for Enhanced MolEcular Screening
title_full_unstemmed MEMES: Machine learning framework for Enhanced MolEcular Screening
title_short MEMES: Machine learning framework for Enhanced MolEcular Screening
title_sort memes: machine learning framework for enhanced molecular screening
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8442698/
https://www.ncbi.nlm.nih.gov/pubmed/34659706
http://dx.doi.org/10.1039/d1sc02783b
work_keys_str_mv AT mehtasarvesh memesmachinelearningframeworkforenhancedmolecularscreening
AT laghuvarapusiddhartha memesmachinelearningframeworkforenhancedmolecularscreening
AT pathakyashaswi memesmachinelearningframeworkforenhancedmolecularscreening
AT sethiaaftaab memesmachinelearningframeworkforenhancedmolecularscreening
AT alvalamallika memesmachinelearningframeworkforenhancedmolecularscreening
AT priyakumarudeva memesmachinelearningframeworkforenhancedmolecularscreening