Cargando…
Modeling structure–activity relationships with machine learning to identify GSK3-targeted small molecules as potential COVID-19 therapeutics
Coronaviruses induce severe upper respiratory tract infections, which can spread to the lungs. The nucleocapsid protein (N protein) plays an important role in genome replication, transcription, and virion assembly in SARS-CoV-2, the virus causing COVID-19, and in other coronaviruses. Glycogen syntha...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10025526/ https://www.ncbi.nlm.nih.gov/pubmed/36950681 http://dx.doi.org/10.3389/fendo.2023.1084327 |
_version_ | 1784909353154772992 |
---|---|
author | Pirzada, Rameez Hassan Ahmad, Bilal Qayyum, Naila Choi, Sangdun |
author_facet | Pirzada, Rameez Hassan Ahmad, Bilal Qayyum, Naila Choi, Sangdun |
author_sort | Pirzada, Rameez Hassan |
collection | PubMed |
description | Coronaviruses induce severe upper respiratory tract infections, which can spread to the lungs. The nucleocapsid protein (N protein) plays an important role in genome replication, transcription, and virion assembly in SARS-CoV-2, the virus causing COVID-19, and in other coronaviruses. Glycogen synthase kinase 3 (GSK3) activation phosphorylates the viral N protein. To combat COVID-19 and future coronavirus outbreaks, interference with the dependence of N protein on GSK3 may be a viable strategy. Toward this end, this study aimed to construct robust machine learning models to identify GSK3 inhibitors from Food and Drug Administration–approved and investigational drug libraries using the quantitative structure–activity relationship approach. A non-redundant dataset consisting of 495 and 3070 compounds for GSK3α and GSK3β, respectively, was acquired from the ChEMBL database. Twelve sets of molecular descriptors were used to define these inhibitors, and machine learning algorithms were selected using the LazyPredict package. Histogram-based gradient boosting and light gradient boosting machine algorithms were used to develop predictive models that were evaluated based on the root mean square error and R-squared value. Finally, the top two drugs (selinexor and ruboxistaurin) were selected for molecular dynamics simulation based on the highest predicted activity (negative log of the half-maximal inhibitory concentration, pIC(50) value) to further investigate the structural stability of the protein-ligand complexes. This artificial intelligence-based virtual high-throughput screening approach is an effective strategy for accelerating drug discovery and finding novel pharmacological targets while reducing the cost and time. |
format | Online Article Text |
id | pubmed-10025526 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-100255262023-03-21 Modeling structure–activity relationships with machine learning to identify GSK3-targeted small molecules as potential COVID-19 therapeutics Pirzada, Rameez Hassan Ahmad, Bilal Qayyum, Naila Choi, Sangdun Front Endocrinol (Lausanne) Endocrinology Coronaviruses induce severe upper respiratory tract infections, which can spread to the lungs. The nucleocapsid protein (N protein) plays an important role in genome replication, transcription, and virion assembly in SARS-CoV-2, the virus causing COVID-19, and in other coronaviruses. Glycogen synthase kinase 3 (GSK3) activation phosphorylates the viral N protein. To combat COVID-19 and future coronavirus outbreaks, interference with the dependence of N protein on GSK3 may be a viable strategy. Toward this end, this study aimed to construct robust machine learning models to identify GSK3 inhibitors from Food and Drug Administration–approved and investigational drug libraries using the quantitative structure–activity relationship approach. A non-redundant dataset consisting of 495 and 3070 compounds for GSK3α and GSK3β, respectively, was acquired from the ChEMBL database. Twelve sets of molecular descriptors were used to define these inhibitors, and machine learning algorithms were selected using the LazyPredict package. Histogram-based gradient boosting and light gradient boosting machine algorithms were used to develop predictive models that were evaluated based on the root mean square error and R-squared value. Finally, the top two drugs (selinexor and ruboxistaurin) were selected for molecular dynamics simulation based on the highest predicted activity (negative log of the half-maximal inhibitory concentration, pIC(50) value) to further investigate the structural stability of the protein-ligand complexes. This artificial intelligence-based virtual high-throughput screening approach is an effective strategy for accelerating drug discovery and finding novel pharmacological targets while reducing the cost and time. Frontiers Media S.A. 2023-03-06 /pmc/articles/PMC10025526/ /pubmed/36950681 http://dx.doi.org/10.3389/fendo.2023.1084327 Text en Copyright © 2023 Pirzada, Ahmad, Qayyum and Choi https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Endocrinology Pirzada, Rameez Hassan Ahmad, Bilal Qayyum, Naila Choi, Sangdun Modeling structure–activity relationships with machine learning to identify GSK3-targeted small molecules as potential COVID-19 therapeutics |
title | Modeling structure–activity relationships with machine learning to identify GSK3-targeted small molecules as potential COVID-19 therapeutics |
title_full | Modeling structure–activity relationships with machine learning to identify GSK3-targeted small molecules as potential COVID-19 therapeutics |
title_fullStr | Modeling structure–activity relationships with machine learning to identify GSK3-targeted small molecules as potential COVID-19 therapeutics |
title_full_unstemmed | Modeling structure–activity relationships with machine learning to identify GSK3-targeted small molecules as potential COVID-19 therapeutics |
title_short | Modeling structure–activity relationships with machine learning to identify GSK3-targeted small molecules as potential COVID-19 therapeutics |
title_sort | modeling structure–activity relationships with machine learning to identify gsk3-targeted small molecules as potential covid-19 therapeutics |
topic | Endocrinology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10025526/ https://www.ncbi.nlm.nih.gov/pubmed/36950681 http://dx.doi.org/10.3389/fendo.2023.1084327 |
work_keys_str_mv | AT pirzadarameezhassan modelingstructureactivityrelationshipswithmachinelearningtoidentifygsk3targetedsmallmoleculesaspotentialcovid19therapeutics AT ahmadbilal modelingstructureactivityrelationshipswithmachinelearningtoidentifygsk3targetedsmallmoleculesaspotentialcovid19therapeutics AT qayyumnaila modelingstructureactivityrelationshipswithmachinelearningtoidentifygsk3targetedsmallmoleculesaspotentialcovid19therapeutics AT choisangdun modelingstructureactivityrelationshipswithmachinelearningtoidentifygsk3targetedsmallmoleculesaspotentialcovid19therapeutics |