Cargando…

Modeling structure–activity relationships with machine learning to identify GSK3-targeted small molecules as potential COVID-19 therapeutics

Coronaviruses induce severe upper respiratory tract infections, which can spread to the lungs. The nucleocapsid protein (N protein) plays an important role in genome replication, transcription, and virion assembly in SARS-CoV-2, the virus causing COVID-19, and in other coronaviruses. Glycogen syntha...

Descripción completa

Detalles Bibliográficos
Autores principales: Pirzada, Rameez Hassan, Ahmad, Bilal, Qayyum, Naila, Choi, Sangdun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10025526/
https://www.ncbi.nlm.nih.gov/pubmed/36950681
http://dx.doi.org/10.3389/fendo.2023.1084327
_version_ 1784909353154772992
author Pirzada, Rameez Hassan
Ahmad, Bilal
Qayyum, Naila
Choi, Sangdun
author_facet Pirzada, Rameez Hassan
Ahmad, Bilal
Qayyum, Naila
Choi, Sangdun
author_sort Pirzada, Rameez Hassan
collection PubMed
description Coronaviruses induce severe upper respiratory tract infections, which can spread to the lungs. The nucleocapsid protein (N protein) plays an important role in genome replication, transcription, and virion assembly in SARS-CoV-2, the virus causing COVID-19, and in other coronaviruses. Glycogen synthase kinase 3 (GSK3) activation phosphorylates the viral N protein. To combat COVID-19 and future coronavirus outbreaks, interference with the dependence of N protein on GSK3 may be a viable strategy. Toward this end, this study aimed to construct robust machine learning models to identify GSK3 inhibitors from Food and Drug Administration–approved and investigational drug libraries using the quantitative structure–activity relationship approach. A non-redundant dataset consisting of 495 and 3070 compounds for GSK3α and GSK3β, respectively, was acquired from the ChEMBL database. Twelve sets of molecular descriptors were used to define these inhibitors, and machine learning algorithms were selected using the LazyPredict package. Histogram-based gradient boosting and light gradient boosting machine algorithms were used to develop predictive models that were evaluated based on the root mean square error and R-squared value. Finally, the top two drugs (selinexor and ruboxistaurin) were selected for molecular dynamics simulation based on the highest predicted activity (negative log of the half-maximal inhibitory concentration, pIC(50) value) to further investigate the structural stability of the protein-ligand complexes. This artificial intelligence-based virtual high-throughput screening approach is an effective strategy for accelerating drug discovery and finding novel pharmacological targets while reducing the cost and time.
format Online
Article
Text
id pubmed-10025526
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-100255262023-03-21 Modeling structure–activity relationships with machine learning to identify GSK3-targeted small molecules as potential COVID-19 therapeutics Pirzada, Rameez Hassan Ahmad, Bilal Qayyum, Naila Choi, Sangdun Front Endocrinol (Lausanne) Endocrinology Coronaviruses induce severe upper respiratory tract infections, which can spread to the lungs. The nucleocapsid protein (N protein) plays an important role in genome replication, transcription, and virion assembly in SARS-CoV-2, the virus causing COVID-19, and in other coronaviruses. Glycogen synthase kinase 3 (GSK3) activation phosphorylates the viral N protein. To combat COVID-19 and future coronavirus outbreaks, interference with the dependence of N protein on GSK3 may be a viable strategy. Toward this end, this study aimed to construct robust machine learning models to identify GSK3 inhibitors from Food and Drug Administration–approved and investigational drug libraries using the quantitative structure–activity relationship approach. A non-redundant dataset consisting of 495 and 3070 compounds for GSK3α and GSK3β, respectively, was acquired from the ChEMBL database. Twelve sets of molecular descriptors were used to define these inhibitors, and machine learning algorithms were selected using the LazyPredict package. Histogram-based gradient boosting and light gradient boosting machine algorithms were used to develop predictive models that were evaluated based on the root mean square error and R-squared value. Finally, the top two drugs (selinexor and ruboxistaurin) were selected for molecular dynamics simulation based on the highest predicted activity (negative log of the half-maximal inhibitory concentration, pIC(50) value) to further investigate the structural stability of the protein-ligand complexes. This artificial intelligence-based virtual high-throughput screening approach is an effective strategy for accelerating drug discovery and finding novel pharmacological targets while reducing the cost and time. Frontiers Media S.A. 2023-03-06 /pmc/articles/PMC10025526/ /pubmed/36950681 http://dx.doi.org/10.3389/fendo.2023.1084327 Text en Copyright © 2023 Pirzada, Ahmad, Qayyum and Choi https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Endocrinology
Pirzada, Rameez Hassan
Ahmad, Bilal
Qayyum, Naila
Choi, Sangdun
Modeling structure–activity relationships with machine learning to identify GSK3-targeted small molecules as potential COVID-19 therapeutics
title Modeling structure–activity relationships with machine learning to identify GSK3-targeted small molecules as potential COVID-19 therapeutics
title_full Modeling structure–activity relationships with machine learning to identify GSK3-targeted small molecules as potential COVID-19 therapeutics
title_fullStr Modeling structure–activity relationships with machine learning to identify GSK3-targeted small molecules as potential COVID-19 therapeutics
title_full_unstemmed Modeling structure–activity relationships with machine learning to identify GSK3-targeted small molecules as potential COVID-19 therapeutics
title_short Modeling structure–activity relationships with machine learning to identify GSK3-targeted small molecules as potential COVID-19 therapeutics
title_sort modeling structure–activity relationships with machine learning to identify gsk3-targeted small molecules as potential covid-19 therapeutics
topic Endocrinology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10025526/
https://www.ncbi.nlm.nih.gov/pubmed/36950681
http://dx.doi.org/10.3389/fendo.2023.1084327
work_keys_str_mv AT pirzadarameezhassan modelingstructureactivityrelationshipswithmachinelearningtoidentifygsk3targetedsmallmoleculesaspotentialcovid19therapeutics
AT ahmadbilal modelingstructureactivityrelationshipswithmachinelearningtoidentifygsk3targetedsmallmoleculesaspotentialcovid19therapeutics
AT qayyumnaila modelingstructureactivityrelationshipswithmachinelearningtoidentifygsk3targetedsmallmoleculesaspotentialcovid19therapeutics
AT choisangdun modelingstructureactivityrelationshipswithmachinelearningtoidentifygsk3targetedsmallmoleculesaspotentialcovid19therapeutics