Cargando…

Novel Big Data-Driven Machine Learning Models for Drug Discovery Application

Most contemporary drug discovery projects start with a ‘hit discovery’ phase where small chemicals are identified that have the capacity to interact, in a chemical sense, with a protein target involved in a given disease. To assist and accelerate this initial drug discovery process, ’virtual docking...

Descripción completa

Detalles Bibliográficos
Autores principales: Sripriya Akondi, Vishnu, Menon, Vineetha, Baudry, Jerome, Whittle, Jana
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8840520/
https://www.ncbi.nlm.nih.gov/pubmed/35163865
http://dx.doi.org/10.3390/molecules27030594
_version_ 1784650640428892160
author Sripriya Akondi, Vishnu
Menon, Vineetha
Baudry, Jerome
Whittle, Jana
author_facet Sripriya Akondi, Vishnu
Menon, Vineetha
Baudry, Jerome
Whittle, Jana
author_sort Sripriya Akondi, Vishnu
collection PubMed
description Most contemporary drug discovery projects start with a ‘hit discovery’ phase where small chemicals are identified that have the capacity to interact, in a chemical sense, with a protein target involved in a given disease. To assist and accelerate this initial drug discovery process, ’virtual docking calculations’ are routinely performed, where computational models of proteins and computational models of small chemicals are evaluated for their capacities to bind together. In cutting-edge, contemporary implementations of this process, several conformations of protein targets are independently assayed in parallel ‘ensemble docking’ calculations. Some of these protein conformations, a minority of them, will be capable of binding many chemicals, while other protein conformations, the majority of them, will not be able to do so. This fact that only some of the conformations accessible to a protein will be ’selected’ by chemicals is known as ’conformational selection’ process in biology. This work describes a machine learning approach to characterize and identify the properties of protein conformations that will be selected (i.e., bind to) chemicals, and classified as potential binding drug candidates, unlike the remaining non-binding drug candidate protein conformations. This work also addresses the class imbalance problem through advanced machine learning techniques that maximize the prediction rate of potential protein molecular conformations for the test case proteins ADORA2A (Adenosine A2a Receptor) and OPRK1 (Opioid Receptor Kappa 1), and subsequently reduces the failure rates and hastens the drug discovery process.
format Online
Article
Text
id pubmed-8840520
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-88405202022-02-13 Novel Big Data-Driven Machine Learning Models for Drug Discovery Application Sripriya Akondi, Vishnu Menon, Vineetha Baudry, Jerome Whittle, Jana Molecules Article Most contemporary drug discovery projects start with a ‘hit discovery’ phase where small chemicals are identified that have the capacity to interact, in a chemical sense, with a protein target involved in a given disease. To assist and accelerate this initial drug discovery process, ’virtual docking calculations’ are routinely performed, where computational models of proteins and computational models of small chemicals are evaluated for their capacities to bind together. In cutting-edge, contemporary implementations of this process, several conformations of protein targets are independently assayed in parallel ‘ensemble docking’ calculations. Some of these protein conformations, a minority of them, will be capable of binding many chemicals, while other protein conformations, the majority of them, will not be able to do so. This fact that only some of the conformations accessible to a protein will be ’selected’ by chemicals is known as ’conformational selection’ process in biology. This work describes a machine learning approach to characterize and identify the properties of protein conformations that will be selected (i.e., bind to) chemicals, and classified as potential binding drug candidates, unlike the remaining non-binding drug candidate protein conformations. This work also addresses the class imbalance problem through advanced machine learning techniques that maximize the prediction rate of potential protein molecular conformations for the test case proteins ADORA2A (Adenosine A2a Receptor) and OPRK1 (Opioid Receptor Kappa 1), and subsequently reduces the failure rates and hastens the drug discovery process. MDPI 2022-01-18 /pmc/articles/PMC8840520/ /pubmed/35163865 http://dx.doi.org/10.3390/molecules27030594 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Sripriya Akondi, Vishnu
Menon, Vineetha
Baudry, Jerome
Whittle, Jana
Novel Big Data-Driven Machine Learning Models for Drug Discovery Application
title Novel Big Data-Driven Machine Learning Models for Drug Discovery Application
title_full Novel Big Data-Driven Machine Learning Models for Drug Discovery Application
title_fullStr Novel Big Data-Driven Machine Learning Models for Drug Discovery Application
title_full_unstemmed Novel Big Data-Driven Machine Learning Models for Drug Discovery Application
title_short Novel Big Data-Driven Machine Learning Models for Drug Discovery Application
title_sort novel big data-driven machine learning models for drug discovery application
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8840520/
https://www.ncbi.nlm.nih.gov/pubmed/35163865
http://dx.doi.org/10.3390/molecules27030594
work_keys_str_mv AT sripriyaakondivishnu novelbigdatadrivenmachinelearningmodelsfordrugdiscoveryapplication
AT menonvineetha novelbigdatadrivenmachinelearningmodelsfordrugdiscoveryapplication
AT baudryjerome novelbigdatadrivenmachinelearningmodelsfordrugdiscoveryapplication
AT whittlejana novelbigdatadrivenmachinelearningmodelsfordrugdiscoveryapplication