Cargando…

Bio-activity prediction of drug candidate compounds targeting SARS-Cov-2 using machine learning approaches

The SARS-CoV-2 3CLpro protein is one of the key therapeutic targets of interest for COVID-19 due to its critical role in viral replication, various high-quality protein crystal structures, and as a basis for computationally screening for compounds with improved inhibitory activity, bioavailability,...

Descripción completa

Detalles Bibliográficos
Autores principales: Ashraf, Faisal Bin, Akter, Sanjida, Mumu, Sumona Hoque, Islam, Muhammad Usama, Uddin, Jasim
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10479925/
https://www.ncbi.nlm.nih.gov/pubmed/37669264
http://dx.doi.org/10.1371/journal.pone.0288053
_version_ 1785101696776536064
author Ashraf, Faisal Bin
Akter, Sanjida
Mumu, Sumona Hoque
Islam, Muhammad Usama
Uddin, Jasim
author_facet Ashraf, Faisal Bin
Akter, Sanjida
Mumu, Sumona Hoque
Islam, Muhammad Usama
Uddin, Jasim
author_sort Ashraf, Faisal Bin
collection PubMed
description The SARS-CoV-2 3CLpro protein is one of the key therapeutic targets of interest for COVID-19 due to its critical role in viral replication, various high-quality protein crystal structures, and as a basis for computationally screening for compounds with improved inhibitory activity, bioavailability, and ADMETox properties. The ChEMBL and PubChem database contains experimental data from screening small molecules against SARS-CoV-2 3CLpro, which expands the opportunity to learn the pattern and design a computational model that can predict the potency of any drug compound against coronavirus before in-vitro and in-vivo testing. In this study, Utilizing several descriptors, we evaluated 27 machine learning classifiers. We also developed a neural network model that can correctly identify bioactive and inactive chemicals with 91% accuracy, on CheMBL data and 93% accuracy on combined data on both CheMBL and Pubchem. The F1-score for inactive and active compounds was 93% and 94%, respectively. SHAP (SHapley Additive exPlanations) on XGB classifier to find important fingerprints from the PaDEL descriptors for this task. The results indicated that the PaDEL descriptors were effective in predicting bioactivity, the proposed neural network design was efficient, and the Explanatory factor through SHAP correctly identified the important fingertips. In addition, we validated the effectiveness of our proposed model using a large dataset encompassing over 100,000 molecules. This research employed various molecular descriptors to discover the optimal one for this task. To evaluate the effectiveness of these possible medications against SARS-CoV-2, more in-vitro and in-vivo research is required.
format Online
Article
Text
id pubmed-10479925
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-104799252023-09-06 Bio-activity prediction of drug candidate compounds targeting SARS-Cov-2 using machine learning approaches Ashraf, Faisal Bin Akter, Sanjida Mumu, Sumona Hoque Islam, Muhammad Usama Uddin, Jasim PLoS One Research Article The SARS-CoV-2 3CLpro protein is one of the key therapeutic targets of interest for COVID-19 due to its critical role in viral replication, various high-quality protein crystal structures, and as a basis for computationally screening for compounds with improved inhibitory activity, bioavailability, and ADMETox properties. The ChEMBL and PubChem database contains experimental data from screening small molecules against SARS-CoV-2 3CLpro, which expands the opportunity to learn the pattern and design a computational model that can predict the potency of any drug compound against coronavirus before in-vitro and in-vivo testing. In this study, Utilizing several descriptors, we evaluated 27 machine learning classifiers. We also developed a neural network model that can correctly identify bioactive and inactive chemicals with 91% accuracy, on CheMBL data and 93% accuracy on combined data on both CheMBL and Pubchem. The F1-score for inactive and active compounds was 93% and 94%, respectively. SHAP (SHapley Additive exPlanations) on XGB classifier to find important fingerprints from the PaDEL descriptors for this task. The results indicated that the PaDEL descriptors were effective in predicting bioactivity, the proposed neural network design was efficient, and the Explanatory factor through SHAP correctly identified the important fingertips. In addition, we validated the effectiveness of our proposed model using a large dataset encompassing over 100,000 molecules. This research employed various molecular descriptors to discover the optimal one for this task. To evaluate the effectiveness of these possible medications against SARS-CoV-2, more in-vitro and in-vivo research is required. Public Library of Science 2023-09-05 /pmc/articles/PMC10479925/ /pubmed/37669264 http://dx.doi.org/10.1371/journal.pone.0288053 Text en © 2023 Ashraf et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Ashraf, Faisal Bin
Akter, Sanjida
Mumu, Sumona Hoque
Islam, Muhammad Usama
Uddin, Jasim
Bio-activity prediction of drug candidate compounds targeting SARS-Cov-2 using machine learning approaches
title Bio-activity prediction of drug candidate compounds targeting SARS-Cov-2 using machine learning approaches
title_full Bio-activity prediction of drug candidate compounds targeting SARS-Cov-2 using machine learning approaches
title_fullStr Bio-activity prediction of drug candidate compounds targeting SARS-Cov-2 using machine learning approaches
title_full_unstemmed Bio-activity prediction of drug candidate compounds targeting SARS-Cov-2 using machine learning approaches
title_short Bio-activity prediction of drug candidate compounds targeting SARS-Cov-2 using machine learning approaches
title_sort bio-activity prediction of drug candidate compounds targeting sars-cov-2 using machine learning approaches
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10479925/
https://www.ncbi.nlm.nih.gov/pubmed/37669264
http://dx.doi.org/10.1371/journal.pone.0288053
work_keys_str_mv AT ashraffaisalbin bioactivitypredictionofdrugcandidatecompoundstargetingsarscov2usingmachinelearningapproaches
AT aktersanjida bioactivitypredictionofdrugcandidatecompoundstargetingsarscov2usingmachinelearningapproaches
AT mumusumonahoque bioactivitypredictionofdrugcandidatecompoundstargetingsarscov2usingmachinelearningapproaches
AT islammuhammadusama bioactivitypredictionofdrugcandidatecompoundstargetingsarscov2usingmachinelearningapproaches
AT uddinjasim bioactivitypredictionofdrugcandidatecompoundstargetingsarscov2usingmachinelearningapproaches