Cargando…
Bio-activity prediction of drug candidate compounds targeting SARS-Cov-2 using machine learning approaches
The SARS-CoV-2 3CLpro protein is one of the key therapeutic targets of interest for COVID-19 due to its critical role in viral replication, various high-quality protein crystal structures, and as a basis for computationally screening for compounds with improved inhibitory activity, bioavailability,...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10479925/ https://www.ncbi.nlm.nih.gov/pubmed/37669264 http://dx.doi.org/10.1371/journal.pone.0288053 |
_version_ | 1785101696776536064 |
---|---|
author | Ashraf, Faisal Bin Akter, Sanjida Mumu, Sumona Hoque Islam, Muhammad Usama Uddin, Jasim |
author_facet | Ashraf, Faisal Bin Akter, Sanjida Mumu, Sumona Hoque Islam, Muhammad Usama Uddin, Jasim |
author_sort | Ashraf, Faisal Bin |
collection | PubMed |
description | The SARS-CoV-2 3CLpro protein is one of the key therapeutic targets of interest for COVID-19 due to its critical role in viral replication, various high-quality protein crystal structures, and as a basis for computationally screening for compounds with improved inhibitory activity, bioavailability, and ADMETox properties. The ChEMBL and PubChem database contains experimental data from screening small molecules against SARS-CoV-2 3CLpro, which expands the opportunity to learn the pattern and design a computational model that can predict the potency of any drug compound against coronavirus before in-vitro and in-vivo testing. In this study, Utilizing several descriptors, we evaluated 27 machine learning classifiers. We also developed a neural network model that can correctly identify bioactive and inactive chemicals with 91% accuracy, on CheMBL data and 93% accuracy on combined data on both CheMBL and Pubchem. The F1-score for inactive and active compounds was 93% and 94%, respectively. SHAP (SHapley Additive exPlanations) on XGB classifier to find important fingerprints from the PaDEL descriptors for this task. The results indicated that the PaDEL descriptors were effective in predicting bioactivity, the proposed neural network design was efficient, and the Explanatory factor through SHAP correctly identified the important fingertips. In addition, we validated the effectiveness of our proposed model using a large dataset encompassing over 100,000 molecules. This research employed various molecular descriptors to discover the optimal one for this task. To evaluate the effectiveness of these possible medications against SARS-CoV-2, more in-vitro and in-vivo research is required. |
format | Online Article Text |
id | pubmed-10479925 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-104799252023-09-06 Bio-activity prediction of drug candidate compounds targeting SARS-Cov-2 using machine learning approaches Ashraf, Faisal Bin Akter, Sanjida Mumu, Sumona Hoque Islam, Muhammad Usama Uddin, Jasim PLoS One Research Article The SARS-CoV-2 3CLpro protein is one of the key therapeutic targets of interest for COVID-19 due to its critical role in viral replication, various high-quality protein crystal structures, and as a basis for computationally screening for compounds with improved inhibitory activity, bioavailability, and ADMETox properties. The ChEMBL and PubChem database contains experimental data from screening small molecules against SARS-CoV-2 3CLpro, which expands the opportunity to learn the pattern and design a computational model that can predict the potency of any drug compound against coronavirus before in-vitro and in-vivo testing. In this study, Utilizing several descriptors, we evaluated 27 machine learning classifiers. We also developed a neural network model that can correctly identify bioactive and inactive chemicals with 91% accuracy, on CheMBL data and 93% accuracy on combined data on both CheMBL and Pubchem. The F1-score for inactive and active compounds was 93% and 94%, respectively. SHAP (SHapley Additive exPlanations) on XGB classifier to find important fingerprints from the PaDEL descriptors for this task. The results indicated that the PaDEL descriptors were effective in predicting bioactivity, the proposed neural network design was efficient, and the Explanatory factor through SHAP correctly identified the important fingertips. In addition, we validated the effectiveness of our proposed model using a large dataset encompassing over 100,000 molecules. This research employed various molecular descriptors to discover the optimal one for this task. To evaluate the effectiveness of these possible medications against SARS-CoV-2, more in-vitro and in-vivo research is required. Public Library of Science 2023-09-05 /pmc/articles/PMC10479925/ /pubmed/37669264 http://dx.doi.org/10.1371/journal.pone.0288053 Text en © 2023 Ashraf et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Ashraf, Faisal Bin Akter, Sanjida Mumu, Sumona Hoque Islam, Muhammad Usama Uddin, Jasim Bio-activity prediction of drug candidate compounds targeting SARS-Cov-2 using machine learning approaches |
title | Bio-activity prediction of drug candidate compounds targeting SARS-Cov-2 using machine learning approaches |
title_full | Bio-activity prediction of drug candidate compounds targeting SARS-Cov-2 using machine learning approaches |
title_fullStr | Bio-activity prediction of drug candidate compounds targeting SARS-Cov-2 using machine learning approaches |
title_full_unstemmed | Bio-activity prediction of drug candidate compounds targeting SARS-Cov-2 using machine learning approaches |
title_short | Bio-activity prediction of drug candidate compounds targeting SARS-Cov-2 using machine learning approaches |
title_sort | bio-activity prediction of drug candidate compounds targeting sars-cov-2 using machine learning approaches |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10479925/ https://www.ncbi.nlm.nih.gov/pubmed/37669264 http://dx.doi.org/10.1371/journal.pone.0288053 |
work_keys_str_mv | AT ashraffaisalbin bioactivitypredictionofdrugcandidatecompoundstargetingsarscov2usingmachinelearningapproaches AT aktersanjida bioactivitypredictionofdrugcandidatecompoundstargetingsarscov2usingmachinelearningapproaches AT mumusumonahoque bioactivitypredictionofdrugcandidatecompoundstargetingsarscov2usingmachinelearningapproaches AT islammuhammadusama bioactivitypredictionofdrugcandidatecompoundstargetingsarscov2usingmachinelearningapproaches AT uddinjasim bioactivitypredictionofdrugcandidatecompoundstargetingsarscov2usingmachinelearningapproaches |