Cargando…

Target-Specific Machine Learning Scoring Function Improved Structure-Based Virtual Screening Performance for SARS-CoV-2 Drugs Development

Leveraging machine learning has been shown to improve the accuracy of structure-based virtual screening. Furthermore, a tremendous amount of empirical data is publicly available, which further enhances the performance of the machine learning approach. In this proof-of-concept study, the 3CL(pro) enz...

Descripción completa

Detalles Bibliográficos
Autores principales: Tahir ul Qamar, Muhammad, Zhu, Xi-Tong, Chen, Ling-Ling, Alhussain, Laila, Alshiekheid, Maha A., Theyab, Abdulrahman, Algahtani, Mohammad
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9570399/
https://www.ncbi.nlm.nih.gov/pubmed/36232307
http://dx.doi.org/10.3390/ijms231911003
_version_ 1784810096656646144
author Tahir ul Qamar, Muhammad
Zhu, Xi-Tong
Chen, Ling-Ling
Alhussain, Laila
Alshiekheid, Maha A.
Theyab, Abdulrahman
Algahtani, Mohammad
author_facet Tahir ul Qamar, Muhammad
Zhu, Xi-Tong
Chen, Ling-Ling
Alhussain, Laila
Alshiekheid, Maha A.
Theyab, Abdulrahman
Algahtani, Mohammad
author_sort Tahir ul Qamar, Muhammad
collection PubMed
description Leveraging machine learning has been shown to improve the accuracy of structure-based virtual screening. Furthermore, a tremendous amount of empirical data is publicly available, which further enhances the performance of the machine learning approach. In this proof-of-concept study, the 3CL(pro) enzyme of SARS-CoV-2 was used. Structure-based virtual screening relies heavily on scoring functions. It is widely accepted that target-specific scoring functions may perform more effectively than universal scoring functions in real-world drug research and development processes. It would be beneficial to drug discovery to develop a method that can effectively build target-specific scoring functions. In the current study, the bindingDB database was used to retrieve experimental data. Smina was utilized to generate protein-ligand complexes for the extraction of InteractionFingerPrint (IFP) and SimpleInteractionFingerPrint SIFP fingerprints via the open drug discovery tool (oddt). The present study found that randomforestClassifier and randomforestRegressor performed well when used with the above fingerprints along the Molecular ACCess System (MACCS), Extended Connectivity Fingerprint (ECFP4), and ECFP6. It was found that the area under the precision-recall curve was 0.80, which is considered a satisfactory level of accuracy. In addition, our enrichment factor analysis indicated that our trained scoring function ranked molecules correctly compared to smina’s generic scoring function. Further molecular dynamics simulations indicated that the top-ranked molecules identified by our developed scoring function were highly stable in the active site, supporting the validity of our developed process. This research may provide a template for developing target-specific scoring functions against specific enzyme targets.
format Online
Article
Text
id pubmed-9570399
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-95703992022-10-17 Target-Specific Machine Learning Scoring Function Improved Structure-Based Virtual Screening Performance for SARS-CoV-2 Drugs Development Tahir ul Qamar, Muhammad Zhu, Xi-Tong Chen, Ling-Ling Alhussain, Laila Alshiekheid, Maha A. Theyab, Abdulrahman Algahtani, Mohammad Int J Mol Sci Article Leveraging machine learning has been shown to improve the accuracy of structure-based virtual screening. Furthermore, a tremendous amount of empirical data is publicly available, which further enhances the performance of the machine learning approach. In this proof-of-concept study, the 3CL(pro) enzyme of SARS-CoV-2 was used. Structure-based virtual screening relies heavily on scoring functions. It is widely accepted that target-specific scoring functions may perform more effectively than universal scoring functions in real-world drug research and development processes. It would be beneficial to drug discovery to develop a method that can effectively build target-specific scoring functions. In the current study, the bindingDB database was used to retrieve experimental data. Smina was utilized to generate protein-ligand complexes for the extraction of InteractionFingerPrint (IFP) and SimpleInteractionFingerPrint SIFP fingerprints via the open drug discovery tool (oddt). The present study found that randomforestClassifier and randomforestRegressor performed well when used with the above fingerprints along the Molecular ACCess System (MACCS), Extended Connectivity Fingerprint (ECFP4), and ECFP6. It was found that the area under the precision-recall curve was 0.80, which is considered a satisfactory level of accuracy. In addition, our enrichment factor analysis indicated that our trained scoring function ranked molecules correctly compared to smina’s generic scoring function. Further molecular dynamics simulations indicated that the top-ranked molecules identified by our developed scoring function were highly stable in the active site, supporting the validity of our developed process. This research may provide a template for developing target-specific scoring functions against specific enzyme targets. MDPI 2022-09-20 /pmc/articles/PMC9570399/ /pubmed/36232307 http://dx.doi.org/10.3390/ijms231911003 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Tahir ul Qamar, Muhammad
Zhu, Xi-Tong
Chen, Ling-Ling
Alhussain, Laila
Alshiekheid, Maha A.
Theyab, Abdulrahman
Algahtani, Mohammad
Target-Specific Machine Learning Scoring Function Improved Structure-Based Virtual Screening Performance for SARS-CoV-2 Drugs Development
title Target-Specific Machine Learning Scoring Function Improved Structure-Based Virtual Screening Performance for SARS-CoV-2 Drugs Development
title_full Target-Specific Machine Learning Scoring Function Improved Structure-Based Virtual Screening Performance for SARS-CoV-2 Drugs Development
title_fullStr Target-Specific Machine Learning Scoring Function Improved Structure-Based Virtual Screening Performance for SARS-CoV-2 Drugs Development
title_full_unstemmed Target-Specific Machine Learning Scoring Function Improved Structure-Based Virtual Screening Performance for SARS-CoV-2 Drugs Development
title_short Target-Specific Machine Learning Scoring Function Improved Structure-Based Virtual Screening Performance for SARS-CoV-2 Drugs Development
title_sort target-specific machine learning scoring function improved structure-based virtual screening performance for sars-cov-2 drugs development
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9570399/
https://www.ncbi.nlm.nih.gov/pubmed/36232307
http://dx.doi.org/10.3390/ijms231911003
work_keys_str_mv AT tahirulqamarmuhammad targetspecificmachinelearningscoringfunctionimprovedstructurebasedvirtualscreeningperformanceforsarscov2drugsdevelopment
AT zhuxitong targetspecificmachinelearningscoringfunctionimprovedstructurebasedvirtualscreeningperformanceforsarscov2drugsdevelopment
AT chenlingling targetspecificmachinelearningscoringfunctionimprovedstructurebasedvirtualscreeningperformanceforsarscov2drugsdevelopment
AT alhussainlaila targetspecificmachinelearningscoringfunctionimprovedstructurebasedvirtualscreeningperformanceforsarscov2drugsdevelopment
AT alshiekheidmahaa targetspecificmachinelearningscoringfunctionimprovedstructurebasedvirtualscreeningperformanceforsarscov2drugsdevelopment
AT theyababdulrahman targetspecificmachinelearningscoringfunctionimprovedstructurebasedvirtualscreeningperformanceforsarscov2drugsdevelopment
AT algahtanimohammad targetspecificmachinelearningscoringfunctionimprovedstructurebasedvirtualscreeningperformanceforsarscov2drugsdevelopment